Skip to content

Very large number of nodes remain in component #24

@jnwoodhouse

Description

@jnwoodhouse

I am analysing some freshwater metagenomes and we really have no idea what to expect when it comes to plasmids (ie size),

I am somewhat stupidly stuck on the "first" sample which takes a very long time to run. What is obvious is that as it goes through what im guessing is the "peeling off" process that the time taken to consider each component is relative to the number of nodes in the graph.

For instance initially it reports (quite quickly)

252 nodes remain in component
243 nodes remain in component
217 nodes remain in component
201 nodes remain in component
194 nodes remain in component
182 nodes remain in component
182 nodes remain in component

but then much later (around 48h later)

its displaying

2355264 nodes remain in component
2355264 nodes remain in component
2355264 nodes remain in component
2355264 nodes remain in component
2355264 nodes remain in component
2355262 nodes remain in component
2355262 nodes remain in component
2355262 nodes remain in component

A single "peeling" process takes around 5 h and you see the number of nodes remaining often is not reduced, only 2 nodes in total.

Im wondering: Is this something that should be discarded (super high number of nodes), is there a setting that im missing that would circumvent such graph components being profiled? Or should i be getting super excited and ready to write a Nature paper??

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions