parallelcmd

A lightweight Python CLI for queueing and executing shell commands in parallel. Inspired by GNU Parallel, parallelcmd provides:

  • Command generation from argument combinations
  • Concurrent execution with live output and progress tracking
  • Job management: inspect, reset, delete, and update queued jobs
  • Flexible workflows: resume interrupted runs and scale workers on demand

Requirements

  • Python 3.8+
  • Standard library only (no external Python dependencies)

Quick start

From this directory:

python3 parallelcmd.py --help

Create a job database:

python3 parallelcmd.py init "echo {}" ::: a b c

Run queued jobs with 4 workers:

python3 parallelcmd.py exec -j 4 --progress

Or do both in one command:

python3 parallelcmd.py run -j 4 --progress "echo {}" ::: a b c

If you omit the subcommand entirely, parallelcmd.py defaults to run:

python3 parallelcmd.py -j 4 --progress "echo {}" ::: a b c

Check status:

python3 parallelcmd.py check

Command model

init builds commands and stores them in pardb.sqlite by default. Use --db <name> to target <name>.sqlite, or set the PARDB environment variable.

  • ::: starts an inline argument list.
  • :::: starts an argument list loaded from a file (one value per line; empty lines and # comments are ignored).
  • :::: - reads the argument list from stdin.
  • Multiple lists are combined with Cartesian product.
  • If the command has no {} placeholders, placeholders are appended automatically.
  • If no ::: or :::: separator is given and stdin is a pipe, stdin lines are used as the argument list automatically.
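The expansion rules above can be sketched in Python with itertools.product. This is an illustration, not parallelcmd's actual implementation; the function name expand is hypothetical:

```python
import itertools

def expand(template, *arg_lists):
    """Expand a command template against argument lists (Cartesian product)."""
    n_placeholders = template.count("{}")
    commands = []
    for combo in itertools.product(*arg_lists):
        if n_placeholders == 0:
            # No {} in the template: append the arguments instead.
            commands.append(template + " " + " ".join(combo))
        else:
            # Fill placeholders left to right, one value each.
            cmd = template
            for value in combo:
                cmd = cmd.replace("{}", value, 1)
            commands.append(cmd)
    return commands

jobs = expand("python train.py --lr {} --seed {}",
              ["1e-3", "1e-4"], ["1", "2", "3"])
print(len(jobs))   # 6
print(jobs[0])     # python train.py --lr 1e-3 --seed 1
```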

Example:

python3 parallelcmd.py init "python train.py --lr {} --seed {}" ::: 1e-3 1e-4 ::: 1 2 3

This creates 6 jobs.

Subcommands

init

Initialize/append job queue.

python3 parallelcmd.py init [options] <command ...> [ ::: <args ...> ]* [ :::: <argfile ...> ]*

Options:

  • -a, --append append to existing table instead of recreating
  • -f, --force drop the existing parjob table and recreate it
  • --check_dup skip commands that already exist
  • -v, --verbose

exec

Execute queued jobs in parallel.

python3 parallelcmd.py exec [options]

Options:

  • -j, --nworkers number of workers (default: 4)
  • --id <id ...> run only these specific job IDs
  • --progress show aggregate progress line
  • --dashboard compact live dashboard mode
  • --dryrun print commands without running
  • -v, --verbose
  • --timeskip <seconds> throttle displayed output updates
  • --randomorder fetch pending jobs in random order
  • --prefix <cmd> prefix each command (example: srun -N1 -n1)
  • --max_jobs <n> max jobs per worker
  • --check_timeleft <seconds> stop taking new jobs when SLURM time left is below threshold
  • --wait <seconds> when no job is available, wait this many seconds and retry instead of exiting (useful when another process is still adding jobs)
  • --timeout <seconds> kill a task and move on to the next job if it runs longer than this; timed-out jobs are recorded with exit code -124
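The timeout behavior can be approximated with the standard subprocess module. A sketch only; in the real tool the exit code is recorded in the SQLite table, and the helper name here is hypothetical:

```python
import subprocess

def run_with_timeout(command, timeout=None):
    """Run a shell command; return its exit code, or -124 on timeout."""
    try:
        proc = subprocess.run(command, shell=True, timeout=timeout)
        return proc.returncode
    except subprocess.TimeoutExpired:
        # Mirrors GNU timeout's 124 status, stored negated.
        return -124

print(run_with_timeout("sleep 5", timeout=0.1))  # -124
print(run_with_timeout("true"))                  # 0
```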

run

Initialize and execute in one step (init + exec).

python3 parallelcmd.py run [options] <command ...> [ ::: <args ...> ]* [ :::: <argfile ...> ]*

Common options include:

  • init side: --append, -f/--force, --check_dup
  • exec side: -j/--nworkers, --id, --progress, --dashboard, --dryrun, --randomorder, --prefix, --max_jobs, --check_timeleft, --wait, --timeout

check

Inspect queue summary or list all rows.

python3 parallelcmd.py check [options]

Options:

  • -l, --list list all matching rows instead of the summary
  • --nonzero filter to only jobs with non-zero exit value
  • --where <sql> arbitrary SQL WHERE clause
  • --like <pattern> filter by Command LIKE <pattern>
  • --id <id ...> filter by specific job IDs

reset

Reset selected jobs to pending (Starttime, JobRuntime, Exitval set to NULL).

python3 parallelcmd.py reset [--all | --nonzero | --like <pattern> | --id <id ...> | --where <sql>]

Options:

  • -a, --all reset all jobs
  • --nonzero reset only jobs with non-zero exit value
  • --where <sql> arbitrary SQL WHERE clause
  • --like <pattern> filter by Command LIKE <pattern>
  • --id <id ...> filter by specific job IDs

Prompts for confirmation before changing rows.

delete

Delete selected jobs.

python3 parallelcmd.py delete [options]

Options:

  • -a, --all delete all jobs
  • --like <pattern> filter by SQL LIKE pattern on command text
  • --id <id ...> filter by job ID(s)

Prompts for confirmation before deleting rows.

update

Find/replace command text for selected jobs.

python3 parallelcmd.py update [options]

Options:

  • --replace "old,new" comma-separated find-and-replace pair (replaces old with new)
  • --like <pattern> filter by SQL LIKE pattern on command text
  • --id <id ...> filter by job ID(s)

Prompts for confirmation before updating rows.

Global options

  • --db <name> SQLite DB basename; the file on disk is <name>.sqlite
  • --db_retries <n> max retries when SQLite is locked (default: 10)
  • --log_level {debug,info} logging level (default: info)
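The --db_retries behavior corresponds to a generic retry-on-locked pattern. A sketch under the assumption that lock errors surface as sqlite3.OperationalError; parallelcmd's internal handling may differ:

```python
import sqlite3
import time

def with_retries(operation, max_retries=10, delay=0.1):
    """Retry an operation while SQLite reports the database as locked."""
    for attempt in range(max_retries):
        try:
            return operation()
        except sqlite3.OperationalError as exc:
            # Re-raise immediately for non-lock errors or on the last attempt.
            if "locked" not in str(exc) or attempt == max_retries - 1:
                raise
            time.sleep(delay)
```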

Useful examples

Pipe arguments from stdin (auto-detected when no ::: or :::: is given):

cat cases.txt | python3 parallelcmd.py -j 4 "bash run.sh {}"
seq 10 | python3 parallelcmd.py "echo {}"

Pipe stdin explicitly with :::: - (combinable with other arg lists):

cat cases.txt | python3 parallelcmd.py run "bash run.sh {} {}" :::: - ::: seed1 seed2

Run scripts from values in a file:

python3 parallelcmd.py init "bash run_case.sh {}" :::: cases.txt
python3 parallelcmd.py exec -j 8 --progress

Use a custom DB file:

python3 parallelcmd.py --db jobs init "echo {}" ::: x y z
python3 parallelcmd.py --db jobs exec -j 2

Kill tasks that exceed a time limit and continue to the next job:

python3 parallelcmd.py exec -j 4 --timeout 300

Timed-out jobs are recorded with exit code -124. Find them with:

python3 parallelcmd.py check -l --where "Exitval = -124"

Reset timed-out jobs to retry with a longer timeout:

python3 parallelcmd.py reset --where "Exitval = -124"
python3 parallelcmd.py exec -j 4 --timeout 600

Keep workers alive while another process appends jobs later:

python3 parallelcmd.py exec -j 4 --wait 10
python3 parallelcmd.py init -a "echo {}" ::: later1 later2

Retry failed jobs only (reset defaults to jobs with non-zero exit values):

python3 parallelcmd.py reset
python3 parallelcmd.py exec -j 4 --progress

Overwrite the queue with a new set of jobs (drop and recreate):

python3 parallelcmd.py init -f "echo {}" ::: x y z
python3 parallelcmd.py exec -j 4 --progress

Notes

  • Job output is streamed to stdout while running.
  • Queue state is persisted in SQLite, so you can stop and resume workflows.
  • reset, delete, and update are interactive (confirmation required).
  • With --wait, workers poll for newly appended jobs instead of exiting as soon as the queue is empty.

Troubleshooting

  • database is locked

    • Usually temporary when multiple workers/processes access SQLite.
    • Retry the command; avoid running multiple exec sessions against the same DB at once.
  • No jobs are executed

    • Check queue state: python3 parallelcmd.py check --list.
    • If jobs are already completed or marked in-progress, reset them: python3 parallelcmd.py reset.
  • Workers exit before later jobs are appended

    • Start exec with --wait <seconds> so workers keep polling.
    • Append work with init -a ... from another process or terminal.
  • Unexpected shell behavior / quoting issues

    • Commands are executed through bash -c.
    • Wrap complex commands in quotes and test one command manually before init.
  • --check_timeleft fails with missing SLURM_JOB_ID

    • This option requires a SLURM job environment.
    • Run inside a SLURM allocation or omit --check_timeleft.
  • Some jobs have exit code -124

    • These jobs were killed by --timeout.
    • Reset and retry them: python3 parallelcmd.py reset --where "Exitval = -124", then re-run exec with a larger --timeout or without it.
  • update --replace does not parse as expected

    • Use exactly one comma-separated pair: --replace "old,new".
    • If your text contains commas, run multiple updates with simpler replacement pairs.
  • Argument file (::::) seems ignored

    • Ensure one argument per line.
    • Blank lines and lines starting with # are intentionally skipped.

FAQ

  • How do I resume after interruption?

    • Just run python3 parallelcmd.py exec -j 4 --progress again.
    • Completed jobs (exit code 0) stay done; pending jobs continue.
  • How do I retry only failed jobs?

    • Failed jobs are those with non-zero exit values.
    • Run python3 parallelcmd.py reset (default filter resets jobs with Exitval <> 0), then run exec again.
    • Use --nonzero to be explicit: python3 parallelcmd.py reset --nonzero.
  • What does exit code -124 mean?

    • The job was killed by --timeout. This matches the GNU timeout convention.
    • Reset and rerun: python3 parallelcmd.py reset --where "Exitval = -124", then exec with a longer --timeout.
  • Can I have multiple queues?

    • Yes. Use different database basenames with --db.
    • Example: python3 parallelcmd.py --db exp1 init ... then exec using the same --db.
  • Is it safe to run two exec commands on the same DB?

    • It is not recommended.
    • SQLite coordination can work, but contention/locking increases and behavior is harder to reason about.
  • Can I inspect/edit queued commands before running?

    • Inspect: python3 parallelcmd.py check --list
    • Bulk edit text: python3 parallelcmd.py update --replace "old,new" --like "%pattern%"
    • Remove unwanted rows: python3 parallelcmd.py delete --id 12 13 14
