All notable changes to md-python are documented in this file.
The format follows Keep a Changelog and the project loosely adheres to Semantic Versioning.
- Updated
client.entities.mappings.peptide_to_protein_same_datasetandclient.entities.mappings.protein_to_protein_via_peptidesto accept a list of datasets instead of a single dataset.
client.entities.mappings.protein_to_protein_via_peptidesmap protein groups to protein groups through their shared peptides across datasets.
client.entities.mappings.peptide_to_protein_same_datasetmap peptides to their protein groups within a single dataset.
client.entities.mappings.protein_to_peptide_same_datasetmap protein groups to their peptides within a single dataset.
NormalisationImputationDatasetnow exposes the full surface of the MD Converternormalisation_and_imputationflow:- Filtration for
entity_type="protein"viafiltration_method="by missing values"(previously blocked). knn_tnimputation withknn_tn_k(1–10, default 5) andknn_tn_distance(truncation|correlation, defaulttruncation).mindetimputation parameterq(0–0.5, default 0.01).- Batch-correction sub-technique selector
batch_correction_technique(limma remove batch effect|combat|combat seq—combat seqis gene-only) with the matching parameter blocks (batch_variablesfor limma,batch_variable_combatmean_only+reference_batch_combatfor ComBat,batch_variable_combatonly for ComBat-Seq, plusdesign_variablesandexperiment_design).
include_imputed_values(defaultFalse) on median / quantile / sum / batch correction.median_normalisation_centre_at_zero(defaultTrue) on median normalisation.- Shared filter block for
by missing valuesandby minimum abundance:filter_valid_values_criteria(percentage|count),filter_threshold_proportion,filter_threshold_count,filter_valid_values_logic,filter_based_on_condition. NormalisationImputationDataset.filter_only(...)classmethod for filtration-only jobs (normalisation=skip,imputation=skip, output remains INTENSITY).NormalisationImputationDataset.help()classmethod with an in-process method-and-parameter reference.
- Filtration for
- Wire-format strings emitted by
NormalisationImputationDatasetnow use the MD Converter canonical (spaced) form:"batch correction","by missing values","by ptm localization probability","by minimum abundance","limma remove batch effect","combat seq". Underscored variants ("batch_correction","minimum_abundance","ptm_localization_probability","by_missing_values","limma_remove_batch_effect","combat_seq") are still accepted on input and normalised to canonical on output. NormalisationImputationDatasetconstructor moved from the legacy nestednormalisation_methods/imputation_methodsdicts to flat typed kwargs (normalisation_method,imputation_method,entity_type,filtration_method, plus method-specific kwargs). Theextra_paramsescape hatch is retained and is merged last, so caller-supplied keys override typed defaults.
- The output dataset type for the NI flow is
INTENSITY— including the filter-only pattern viafilter_only(...). An upload may therefore have multiple INTENSITY datasets after running NI;find_initial_datasetdoes not yet disambiguate this and will raise. A follow-up release will refine this.
- Prior releases — see git history.