-
Notifications
You must be signed in to change notification settings - Fork 0
User Guide
Jorge Miguel Silva edited this page Oct 10, 2024
·
1 revision
Process a single file:
phenoqc --input examples/sample_data.csv --output ./reports/ --schema schemas/pheno_schema.json --mapping examples/sample_mapping.json --impute meanBatch process multiple files:
phenoqc --input data1.csv data2.json data3.tsv --output ./reports/ --schema schemas/pheno_schema.json --mapping examples/sample_mapping.json --impute median-
--input: Input file(s) to process -
--output: Directory to save output reports -
--schema: JSON schema file for validation -
--mapping: Custom mapping file for ontology mapping -
--impute: Strategy for imputing missing data (mean, median)
To launch the GUI:
streamlit run src/gui.pyFollow the on-screen instructions to upload files, select options, and generate reports.
PhenoQC can be configured using YAML or JSON configuration files. Example:
input_files:
- data1.csv
- data2.json
output_directory: ./reports
schema_file: schemas/pheno_schema.json
mapping_file: examples/sample_mapping.json
imputation_strategy: meanSave this as config.yaml and use it with the CLI:
phenoqc --config config.yamlPhenoQC generates the following outputs:
- QC Report (PDF)
- Visual Summary (PNG)
- Processed Data (CSV)
These files will be saved in the specified output directory.