You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Usability gaps surfaced while building a real linear + conformational B-cell epitope-prediction
pipeline on aaanalysis master (__version__ still reads 1.0.3) — using CPP, aap.find_features, EmbeddingPreprocessor, StructurePreprocessor, ShapModel, CPPPlot, dPULearn. Grouping the
findings here; the sub-issues below are the concrete, independently-shippable subtasks.
Already tracked (this project confirms the need in practice)
__version__ stays 1.0.3 on master (unbumped) → master install is indistinguishable from the PyPI
release; a .devN/git-hash suffix would disambiguate.
Discoverability: re-implemented things that already exist (SequenceFeature.get_labels_quantile/ get_labels_tiered, prune_by_correlation/variance, get_df_parts_from_windows) — a task→method
cookbook would help (docs epic epic: documentation architecture (Getting Started · Examples · API) #106).
Context
Usability gaps surfaced while building a real linear + conformational B-cell epitope-prediction
pipeline on aaanalysis master (
__version__still reads1.0.3) — using CPP,aap.find_features,EmbeddingPreprocessor,StructurePreprocessor,ShapModel,CPPPlot,dPULearn. Grouping thefindings here; the sub-issues below are the concrete, independently-shippable subtasks.
Already tracked (this project confirms the need in practice)
hand-build a 20×20 one-hot identity scale set (
Segment(1,1)= AAC) and a 400-token dipeptidealphabet for DPC. Not on master yet.
aa.eval_features— re-implemented across_val_predict+metric wrapper in ~every script.feature_matrix(df_seq=…)— confirmed present on master, resolved. 🎉wins linear Z1; positional CPP + 3D structure wins conformational Z2).
New subtasks (not yet tracked)
NumericalFeature.feature_matrixfor numeric CPP (run_num) outputsfind_features/ Pattern splits fail on free peptides (no flanks)CPP.run(n_jobs>1)mp.Manager()EOFErroroutside notebooksStructurePreprocessorper-feature failure isolation +msms/backend routingMinor
__version__stays1.0.3on master (unbumped) → master install is indistinguishable from the PyPIrelease; a
.devN/git-hash suffix would disambiguate.SequenceFeature.get_labels_quantile/ get_labels_tiered,prune_by_correlation/variance,get_df_parts_from_windows) — a task→methodcookbook would help (docs epic epic: documentation architecture (Getting Started · Examples · API) #106).