fix: low-risk correctness audit batch (defensive copies, crash fixes, validation, config) by breimanntools · Pull Request #346 · breimanntools/aaanalysis

breimanntools · 2026-07-04T16:13:57Z

Part of #342. Implements the low-risk, non-output-changing portion of the July 2026
correctness audit plus the config.options group. No CPP regression-anchor regolden
required. (The output/behavior-changing findings are tracked separately — see Deferred.)

Included fixes

Defensive copies (shared-object mutation)

load_dataset(non_canonical_aa="gap") and load_features no longer return/mutate the
shared lru_cache DataFrame; get_df_feat no longer rewrites the caller's df_parts
in place.

Crash-on-edge-input

- read_fasta raises a clear ValueError on pre-header text (was a cryptic KeyError).
- SeqMut.mutate(df_feat=…) scores in mutation order and aligns results positionally, so
  duplicate / label-colliding mutation rows no longer desync or crash.
- CPP.eval clamps the cluster search so a single-feature set can't reach KMeans(0).
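As an illustration of the read_fasta item, here is a minimal sketch of a parser that rejects text before the first header with an explicit ValueError. `parse_fasta_lines` is a hypothetical helper written for this example; the actual parser in the repository may differ in structure and message wording.

```python
def parse_fasta_lines(lines):
    """Return {header: sequence} for a list of FASTA lines (illustrative only)."""
    records = {}
    header = None
    for line_no, raw in enumerate(lines, start=1):
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            header = line[1:].strip()
            records[header] = ""
        elif header is None:
            # Sequence text with no header yet: fail early with a clear message
            # instead of letting a later lookup raise a KeyError.
            raise ValueError(
                f"Line {line_no}: sequence text found before the first '>' header: {line!r}"
            )
        else:
            records[header] += line
    return records


print(parse_fasta_lines([">p1", "ACD", "EFG", ">p2", "MK"]))
```

Checking `header is None` at the point of use keeps the error next to its cause, which is what makes the message actionable.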

Validation / messages / config

- display_df row/col selector is 0..n-1; the marker_size length check is reachable
  (ValueError, not a later IndexError); plot_legend reports the real parameter names;
  check_metric / check_match_X_n_clusters messages corrected; load_scales name=;
  comp_seq_sim f-string paren; explicit categories on previously bare warnings.warn.
- config.options (CONFIRM-FIRST, approved): validate the incoming verbose/random_state
  candidate (not the current global); validate name_tmd; check_n_jobs saves and restores
  the user's LOKY_MAX_CPU_COUNT instead of leaving it stuck at 1 after re-enabling
  multiprocessing.

Intentional value correction

comp_seq_sim self-similarity diagonal is 100 (matching the [0, 100] scale of every
off-diagonal cell), not 1. Pro function, no regression anchor, no test pinned the value.
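To show why a diagonal of 1 is inconsistent on a [0, 100] scale, here is a small sketch that uses plain percent identity over equal-length sequences. This measure is an assumption made for illustration; comp_seq_sim itself may compute similarity differently (for example via alignment).

```python
def percent_identity(a, b):
    # Percent of positions with identical residues; equal lengths assumed.
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def similarity_matrix(seqs):
    # Every cell, including the diagonal, comes from the same function,
    # so self-similarity lands on 100 rather than a hard-coded 1.
    return [[percent_identity(a, b) for b in seqs] for a in seqs]


matrix = similarity_matrix(["ACDE", "ACDF", "WYWY"])
for row in matrix:
    print(row)
```

With a diagonal of 1, a sequence would appear less similar to itself than to a 75%-identical neighbour.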

Deferred (output/behavior-changing — need a decision, not in this PR)

- Sliding-window over-large-window guard (raise vs the current, test-encoded silent-empty).
- Three rendered-plot fixes (feature-map importance-bar ticks, profile y-padding, weight_bold
  spine/tick widths); they change committed figures and need notebook re-execution.

These join the output-changing set tracked in #343 (CPP redundancy filter, TreeModel
per-round seeding, BH p-value monotonicity).

Tests

New tests/unit/test_correctness_batch_342.py plus a SeqMut duplicate-row regression in
test_seqmut_mutate.py. Touched-area suites pass locally (data_handling, config, plotting,
cpp_plot, aaclust, seq_analysis_pro, protein_engineering, sequence_feature) and the smoke gate.

Review

xhigh multi-agent /code-review (its findings — the SeqMut re-join edge, the config LOKY
handling, and three output-changing plot changes — were fixed or reverted) and
/security-review (clean).

🤖 Generated with Claude Code

…lidation gaps, messages)

Part of #342. None of these change published numeric output.

- Defensive copies (shared-object mutation): load_dataset(gap) and load_features no longer return/mutate the shared lru_cache frame; get_df_feat no longer rewrites the caller's df_parts in place.
- Crash-on-edge-input: read_fasta raises a clear ValueError on pre-header text (was KeyError); SeqMut.mutate re-joins scored rows on (entry, mutation) so duplicate labels across entries no longer crash; CPP.eval clamps the cluster search so a single-feature set can't hit KMeans(0).
- comp_seq_sim self-similarity diagonal is 100 (the [0,100] scale), not 1.
- Plotting/display: marker_size length check is reachable (ValueError, not a later IndexError); display_df row/col selector is 0..n-1; weight_bold no longer renders thinner than normal; feature-map show_only_max uses imp_bar_label_type; profile y-padding uses max(0, ...); plot_legend validation errors report the real parameter names.
- Validator/message hygiene: check_metric and check_match_X_n_clusters messages corrected; load_scales name=; comp_seq_sim paren; explicit categories on previously bare warnings.warn calls.

Regression tests: tests/unit/test_correctness_batch_342.py (+ updated the check_match_X_n_clusters message assertion in test_aac_branch.py).

Held (not in this commit): config.py options-validation group (verbose/random_state/name_tmd/LOKY); CONFIRM-FIRST surface, awaiting sign-off.

Deferred: sliding-window over-large-window guard; needs a silent-empty-vs-raise decision (current silent-empty behavior is test-encoded).

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

Completes the config.options group of #342 (CONFIRM-FIRST, approved):

- _check_option validates the incoming verbose/random_state candidate directly instead of routing through the runtime resolvers, which read the current global and skipped validating a new value once one was set.
- name_tmd is now validated like name_jmd_n / name_jmd_c (any name_* -> str).
- check_n_jobs undoes only the loky CPU cap it set itself when allow_multiprocessing is re-enabled, so it is not left stuck at 1 and a user's own LOKY_MAX_CPU_COUNT is never clobbered.

Tests added to tests/unit/test_correctness_batch_342.py.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

- SeqMut.mutate: score in mutation order (build_scan_output/_delta_table gain a sort=False path) and assign positionally, so duplicate mutation rows no longer desync/crash the previous (entry, mutation) re-join.
- config check_n_jobs: on allow_multiprocessing=False remember the user's prior LOKY_MAX_CPU_COUNT and restore it on re-enable (was overwritten with '1' then popped, losing the user's value).
- Revert three rendered-output plot changes that don't belong in a no-output-change batch (feature_map importance-bar ticks, profile y-axis padding, weight_bold spine/tick widths); deferred to the output-changing decision set.

comp_seq_sim's diagonal correction (1 -> 100 on the [0,100] scale) is kept as the sole intentional value fix (pro function, unanchored, no test pinned it).

Tests: SeqMut duplicate-row regression + LOKY save/restore assertion.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
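The difference between a label-keyed re-join and positional assignment can be sketched in a few lines. The rows and scores below are made up for illustration (the duplicate rows get different scores only to make the collapse visible), and none of the names correspond to SeqMut internals.

```python
# Mutation rows as (entry, mutation); the first and last rows share a label.
rows = [("P1", "A10G"), ("P2", "A10G"), ("P1", "A10G")]
# Hypothetical scores, returned by the scorer in the same order as `rows`.
scores = [0.5, -0.2, 0.7]

# Label-keyed re-join: duplicate keys collapse, the last score wins.
by_label = {row: score for row, score in zip(rows, scores)}
rejoined = [by_label[row] for row in rows]

# Positional assignment: row i simply receives score i.
aligned = [(entry, mutation, score) for (entry, mutation), score in zip(rows, scores)]

print(len(by_label), rejoined)
print(aligned)
```

Positional assignment only works if the scorer preserves input order, which is why the commit pairs it with a sort=False path.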

… disabled window

Re-review follow-up: check_n_jobs only undoes its own cap on re-enable if the value is still '1'; if the user set their own LOKY_MAX_CPU_COUNT (e.g. for another loky/joblib library) while multiprocessing was disabled, it is left untouched.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
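The save/restore behavior described in this and the previous commit can be sketched as a small state holder. `LokyCap` is a hypothetical class written for this example; the real logic lives in check_n_jobs and may be organized differently. The environment is passed in as a mapping so the sketch can run against a plain dict.

```python
import os


class LokyCap:
    KEY = "LOKY_MAX_CPU_COUNT"

    def __init__(self, env=None):
        self.env = os.environ if env is None else env
        self.active = False   # True while our own cap is in place
        self.saved = None     # the user's prior value (None if it was unset)

    def disable(self):
        # Remember the user's value once, then cap loky at a single CPU.
        if not self.active:
            self.saved = self.env.get(self.KEY)
            self.active = True
        self.env[self.KEY] = "1"

    def enable(self):
        if not self.active:
            return  # we never set a cap, so there is nothing to undo
        # Undo only our own cap; a value the user changed meanwhile is left alone.
        if self.env.get(self.KEY) == "1":
            if self.saved is None:
                self.env.pop(self.KEY, None)
            else:
                self.env[self.KEY] = self.saved
        self.active = False
        self.saved = None


env = {"LOKY_MAX_CPU_COUNT": "8"}
cap = LokyCap(env)
cap.disable()
print(env)
cap.enable()
print(env)
```

Tracking `active` separately from `saved` distinguishes "the variable was unset before" from "no cap was ever applied".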

codecov · 2026-07-04T16:42:10Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 94.84%. Comparing base (7dcc8d8) to head (dc4289c).

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #346      +/-   ##
==========================================
+ Coverage   94.83%   94.84%   +0.01%     
==========================================
  Files         196      196              
  Lines       18767    18783      +16     
  Branches     3175     3181       +6     
==========================================
+ Hits        17797    17815      +18     
+ Misses        633      632       -1     
+ Partials      337      336       -1

| Files with missing lines | Coverage Δ | |
|---|---|---|
| aaanalysis/_utils/plotting.py | `92.98% <100.00%> (+1.16%)` | ⬆️ |
| aaanalysis/config.py | `100.00% <100.00%> (ø)` | |
| aaanalysis/data_handling/_backend/parse_fasta.py | `100.00% <100.00%> (ø)` | |
| aaanalysis/data_handling/_load_dataset.py | `100.00% <100.00%> (ø)` | |
| aaanalysis/data_handling/_load_features.py | `100.00% <100.00%> (ø)` | |
| aaanalysis/data_handling/_load_scales.py | `95.77% <100.00%> (ø)` | |
| aaanalysis/data_handling/_read_fasta.py | `100.00% <100.00%> (ø)` | |
| aaanalysis/feature_engineering/_aaclust.py | `97.48% <100.00%> (ø)` | |
| ...ysis/feature_engineering/_backend/check_aaclust.py | `100.00% <100.00%> (ø)` | |
| ...ysis/feature_engineering/_backend/check_feature.py | `93.45% <100.00%> (+0.02%)` | ⬆️ |

... and 7 more

| Components | Coverage Δ |
|---|---|
| cpp_core | `94.95% <ø> (ø)` |


…test comments

Per PR review: the test comments no longer cite the issue number or internal audit finding IDs; each test's comment now plainly states what it pins.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

breimanntools and others added 4 commits July 4, 2026 15:42

breimanntools merged commit 2e2311c into master Jul 4, 2026
16 checks passed

breimanntools deleted the fix/audit-correctness-batch branch July 4, 2026 21:09

breimanntools merged 5 commits into master from fix/audit-correctness-batch
