feat!: SPRAS revision by tristan-f-r · Pull Request #320 · Reed-CompBio/spras

tristan-f-r · 2025-07-09T20:51:39Z

With this change, output files will not be reused after SPRAS is updated if osdf_immutable is true, furthering the immutability goal necessary to get OSDF integration working for SPRAS benchmarking. (Whether SPRAS counts as 'updated' depends on the git commit hash or the actual SPRAS release version.)

This adds the unique spras_revision to every single parameter combination (before hashing) and the dataset label, to provide OSDF support on the level of deterministic, non-seeded algorithms when datasets are immutable.
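A rough sketch of the idea, not the SPRAS implementation: the helper name `hash_params`, the `_spras_rev` key, the use of SHA-1 over sorted JSON, and the digest length are all assumptions made for illustration. It only shows why folding a revision string into a parameter combination before hashing gives each SPRAS revision its own output name while staying deterministic for a fixed revision.

```python
import hashlib
import json
from typing import Optional


def hash_params(params: dict, spras_revision: Optional[str] = None, length: int = 7) -> str:
    """Hash a parameter combination, optionally tagged with a revision.

    Hypothetical helper for illustration; the real SPRAS hashing may differ.
    """
    tagged = dict(params)  # copy so the caller's dict is left untouched
    if spras_revision is not None:
        # Attach the revision *before* hashing so that it changes the digest.
        tagged["_spras_rev"] = spras_revision
    # sort_keys makes the serialization independent of dict insertion order.
    encoded = json.dumps(tagged, sort_keys=True).encode("utf-8")
    return hashlib.sha1(encoded).hexdigest()[:length]


params = {"k": 10, "alpha": 0.5}
hash_for_rev_a = hash_params(params, spras_revision="rev-a")
hash_for_rev_b = hash_params(params, spras_revision="rev-b")
```

Under these assumptions, the same parameters under two revisions map to two different directory names, so outputs written by an older revision are never picked up by a newer one.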

This has the added benefit of allowing SPRAS users to simply upgrade their SPRAS version without needing to clear output, which complements #380. The refactored test also partially covers #165 and #45. (The test refactor is also where the majority of the code comes from: the actual feature patch here is a 50-line change.)

See #321 implemented by #335 for handling nondeterministic algorithms / seeded algorithms.

To make this change, a significant test refactor in test/analysis was needed to remove hardcoded paths (which contained the hashes being modified per-commit in this PR). It turns out that whenever we make any change to the hash, this test breaks! That's why so many other PRs depend on this one.

This adds the unique spras_revision to every single parameter combination (before hashing) and the dataset label, to provide OSDF support on the level of deterministic algorithms.

agitter

I finished another partial revision. I still haven't thought about the testing implications carefully.

spras/config/config.py

Snakefile

test/analysis/input/egfr.yaml

whoops! accidentally feature-regressed

agitter

A few more comments. I still haven't looked through all the test code.

test/analysis/input/egfr.yaml

spras/config/config.py

tristan-f-r · 2026-01-31T05:08:43Z

Since both past approaches do not scale well, I've decided to only focus on the RECORD file.

This fails specifically in the case where SPRAS is somehow run without being installed as a Python module, and I can't think of a plausible scenario where this happens.
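For readers unfamiliar with the RECORD file: it is the list of installed files (with hashes and sizes) that an installer writes into a distribution's .dist-info directory. The sketch below is one way a revision could be derived from it, assuming the standard library's `importlib.metadata` is used for the lookup. The function name `revision_from_record`, the choice of SHA-256, and the digest length are assumptions, and this PR's code may locate RECORD differently.

```python
import hashlib
from importlib import metadata
from typing import Optional


def revision_from_record(dist_name: str, length: int = 8) -> Optional[str]:
    """Return a short digest of an installed distribution's RECORD file.

    Returns None when the distribution is not installed or has no RECORD,
    which is the failure case described above (not installed as a module).
    Hypothetical helper for illustration only.
    """
    try:
        record = metadata.distribution(dist_name).read_text("RECORD")
    except metadata.PackageNotFoundError:
        return None
    if record is None:
        return None
    # RECORD changes whenever an installed file changes, so its digest
    # can act as a revision identifier for the installed code.
    return hashlib.sha256(record.encode("utf-8")).hexdigest()[:length]
```

A caller would need to decide what to do on `None`, for example raising an error when the revision is required.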

agitter · 2026-02-08T17:45:17Z

As a follow-up to our meeting discussion, I'm wondering if this type of output file versioning should be optional. Then when running in CHTC and writing to OSDF (or running locally and opting in) it could be enabled. By making it opt-in, we would have simpler filenames by default and ensure the user knows they have to install and run SPRAS a specific way for this feature to work.
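A minimal sketch of what opt-in tagging could look like, assuming a boolean osdf_immutable setting that defaults to off. The helper name `tag_label`, the `-` separator, and the example revision string are hypothetical and not taken from this PR.

```python
from functools import partial


def tag_label(label: str, revision: str, osdf_immutable: bool = False) -> str:
    """Append the revision to a label only when osdf_immutable is enabled.

    Hypothetical helper; the separator and signature are assumptions.
    """
    if not osdf_immutable:
        # Default: keep the simpler, untagged name.
        return label
    return f"{label}-{revision}"


# Stand-in for a parsed configuration file.
config = {"osdf_immutable": True}

# Bind the revision and the flag once, then reuse the tagger for every label.
tag = partial(
    tag_label,
    revision="abc1234",
    osdf_immutable=config.get("osdf_immutable", False),
)
print(tag("egfr"))  # egfr-abc1234
```

With the flag absent or false, `tag("egfr")` would return the plain label, which matches the simpler default filenames suggested above.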

tristan-f-r · 2026-02-08T22:44:28Z

That makes the most sense to me as well 👍

what is going on in ci???

okay - sysconfig.get_path("purelib") is correct
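For context, `sysconfig.get_path("purelib")` returns the directory where pure-Python packages are installed (typically site-packages). A sketch of looking for a RECORD file under it follows, assuming the usual `<name>-<version>.dist-info` directory layout. The helper name `find_record` and the glob pattern are assumptions, not the code in this PR.

```python
import sysconfig
from pathlib import Path
from typing import Optional


def find_record(dist_prefix: str) -> Optional[Path]:
    """Locate <purelib>/<dist_prefix>-*.dist-info/RECORD, if it exists.

    Hypothetical helper for illustration only.
    """
    purelib = Path(sysconfig.get_path("purelib"))
    # Sort so the result is stable if several versions are somehow present.
    for candidate in sorted(purelib.glob(f"{dist_prefix}-*.dist-info")):
        record = candidate / "RECORD"
        if record.is_file():
            return record
    return None
```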

we need better typing :/

tristan-f-r · 2026-02-11T22:20:34Z

config/config.yaml

+#
+# By default, this is disabled, as it can make output file names confusing. Here, it's set to true since we use this
+# configuration file for testing.
+osdf_immutable: true


This is a little annoying. We use this config for testing, so it's nice to enable this, but this is also our documentation config. I can write some extra code to enable this during testing, but that seems strange as well.

For now, I'm okay with keeping this and writing more documentation later (especially as we start focusing more on the COMBINE25 tutorial).

feat: spras_revision

b0327a2

This adds the unique spras_revision to every single parameter combination (before hashing) and the dataset label, to provide OSDF support on the level of deterministic algorithms.

tristan-f-r marked this pull request as ready for review July 9, 2025 20:51

tristan-f-r added the enhancement (New feature or request) and needed for benchmarking (Priority PRs needed for the benchmarking paper) labels Jul 9, 2025

style: fmt

8cec738

tristan-f-r changed the title ~~feat: spras_revision~~ feat: SPRAS revision Jul 9, 2025

This comment was marked as outdated.

tristan-f-r marked this pull request as draft July 9, 2025 21:37

tristan-f-r mentioned this pull request Jul 10, 2025

fix: custom installation of DOMINO #235

Open

1 task

tristan-f-r added 2 commits July 10, 2025 19:32

test: summary

5683392

docs(test_summary): mention preprocessing motivation

af90ce0

tristan-f-r marked this pull request as ready for review July 10, 2025 19:34

tristan-f-r changed the title ~~feat: SPRAS revision~~ feat!: SPRAS revision Jul 10, 2025

tristan-f-r added 7 commits July 10, 2025 12:44

test(analysis/summary): use input from /input instead

6141874

docs(test/analysis): mention dual integration testing

440a2d4

test(analysis/summary): use test/analysis provided gold standard

d9e852b

style: fmt

abb0eb9

chore: don't repeat docs inside analysis configs

60185fc

feat: get working with cytoscape

e6bd6a0

style: fmt

f9a3081

tristan-f-r mentioned this pull request Jul 11, 2025

Guaranteed immutable output #323

Open

test: remove nondet from analysis

77fc3b4

This comment was marked as outdated.

fix: get input pathways at runtime

0592850

This was referenced Jul 11, 2025

Update summary.py to include parameter combinations #194

Merged

feat!: typed PRA#run #329

Merged

This was referenced Jul 21, 2025

chore: bump dependencies #310

Merged

Integration testing instead of artifacts #339

Open

feat: algorithm attributions #345

Open

tristan-f-r added the P-high (This is a blocker for many PRs/issues/features) label Jul 24, 2025

tristan-f-r added the tuning (Workflow-spanning algorithm tuning) label Jan 13, 2026

agitter reviewed Jan 17, 2026

spras/config/config.py (outdated)

spras/config/config.py

Snakefile (outdated)

test/analysis/input/egfr.yaml

tristan-f-r added 5 commits January 17, 2026 17:14

apply suggestions

e12fc75

clean, fix: strip project_directory

977bf5a

fix: correct equality on not SPRAS pyproject.toml

8500bcb

chore: grammar

112db39

chore: move attach_spras_revision out of Snakefile

c7262ed

github-actions bot added the merge-conflict (This PR has merge conflicts.) label Jan 31, 2026

tristan-f-r added 2 commits January 30, 2026 20:19

Merge branch 'main' into hash

f69a0f3

fix: properly resolve merge conflict

72e30bf

github-actions bot removed the merge-conflict (This PR has merge conflicts.) label Jan 31, 2026

tristan-f-r added 2 commits January 31, 2026 04:28

fix: undo mistaken merge conflict

c71b652

whoops! accidentally feature-regressed

chore: drop unnecessary self.datasets initialization

6b941e0

agitter reviewed Jan 31, 2026

test/analysis/input/egfr.yaml

spras/config/config.py (outdated)

spras/config/config.py (outdated)

spras/config/config.py (outdated)

tristan-f-r added 2 commits January 31, 2026 05:04

feat: dynamic spras versioning

fbf0ceb

chore: error handling on setup.pu

edc0369

tristan-f-r added 4 commits January 30, 2026 21:10

docs: note on git commit hashes

3a1251d

chore: drop git magic

d330d6a

feat: correctly parse RECORD

5e31d06

style: fmt

dba2b45

tristan-f-r added 6 commits February 11, 2026 17:32

feat: optional spras revision

90b4e1f

docs: osdf_immutable info; ci: debug

fd5a490

what is going on in ci???

ci: ??????

210897b

okay - sysconfig.get_path("purelib") is correct

fix: don't use distribution files, opt for purepath

816dd28

style: fmt

cd78a2a

fix: tag iff osdf immutable, correct functools.partial sig

b025b7d

we need better typing :/

feat!: SPRAS revision #320
tristan-f-r wants to merge 51 commits into Reed-CompBio:main from tristan-f-r:hash
