Hi @constantinpape, thank you for this great project!
I tried to compute a multicut segmentation with the provided example data and multicut.py on a SLURM cluster. However, after many attempts, the task always fails with a similar error message, recorded in initial_sub_graphs_failed.log. Here's a snippet of the log:
2025-04-15 11:31:32.660748: created tmp-folder and log dirs @ /gpfs/share/home/t2024041003/cluster_tools/tmp_mc
2025-04-15 11:31:32.660863: Start task initial_sub_graphs
2025-04-15 11:31:32.660898: reading global config from configs/global.config
2025-04-15 11:31:32.698030: copied python script from /gpfs/share/home/t2024041003/cluster_tools/cluster_tools/graph/initial_sub_graphs.py to /gpfs/share/home/t2024041003/cluster_tools/tmp_mc/initial_sub_graphs.py
2025-04-15 11:31:32.698085: reading default task config
2025-04-15 11:31:32.724259: written config for 8 jobs
2025-04-15 11:31:32.724307: reading global config from configs/global.config
2025-04-15 11:31:32.724388: reading default task config
2025-04-15 11:31:42.784707: task failed in run_impl with [Errno 2] No such file or directory: '/gpfs/share/home/t2024041003/cluster_tools/tmp_mc/logs/initial_sub_graphs_0.log'
2025-04-15 11:31:42.785505: move log from /gpfs/share/home/t2024041003/cluster_tools/tmp_mc/initial_sub_graphs.log to /gpfs/share/home/t2024041003/cluster_tools/tmp_mc/initial_sub_graphs_failed.log
The error seems to indicate a missing log file, but when I check afterwards the file does exist. I encountered the same issue when running with my own dataset.
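In case it helps narrow things down, here is a minimal sketch of a check I could run on the submitting node. It assumes (this is only a guess, not something the log confirms) that the per-job log is written on a compute node and becomes visible on the shared filesystem only after a delay. The helper name `wait_for_file` and the timeout values are hypothetical and not part of cluster_tools.

```python
import os
import time


def wait_for_file(path, timeout=60.0, interval=1.0):
    """Poll until `path` exists or `timeout` seconds have passed.

    Returns True if the file became visible in time, False otherwise.
    Hypothetical helper for debugging, not a cluster_tools function.
    """
    deadline = time.monotonic() + timeout
    while True:
        if os.path.exists(path):
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)


if __name__ == "__main__":
    # Path taken from the error message in the log above.
    log_path = (
        "/gpfs/share/home/t2024041003/cluster_tools/"
        "tmp_mc/logs/initial_sub_graphs_0.log"
    )
    # Assumed timeout; adjust to how long the SLURM job waits in the queue.
    print("log visible:", wait_for_file(log_path, timeout=30.0))
```

If this prints True only after several seconds, that would point to a timing issue between job submission and the log check rather than a genuinely missing file.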
Could you please help me understand what's going wrong or suggest how to debug this?
Thank you very much!