-
Notifications
You must be signed in to change notification settings - Fork 5
Description
I am analysing some freshwater metagenomes and we really have no idea what to expect when it comes to plasmids (ie size),
I am somewhat stupidly stuck on the "first" sample which takes a very long time to run. What is obvious is that as it goes through what im guessing is the "peeling off" process that the time taken to consider each component is relative to the number of nodes in the graph.
For instance initially it reports (quite quickly)
252 nodes remain in component
243 nodes remain in component
217 nodes remain in component
201 nodes remain in component
194 nodes remain in component
182 nodes remain in component
182 nodes remain in component
but then much later (around 48h later)
its displaying
2355264 nodes remain in component
2355264 nodes remain in component
2355264 nodes remain in component
2355264 nodes remain in component
2355264 nodes remain in component
2355262 nodes remain in component
2355262 nodes remain in component
2355262 nodes remain in component
A single "peeling" process takes around 5 h and you see the number of nodes remaining often is not reduced, only 2 nodes in total.
Im wondering: Is this something that should be discarded (super high number of nodes), is there a setting that im missing that would circumvent such graph components being profiled? Or should i be getting super excited and ready to write a Nature paper??