Improved Validation #9

Merged
jorgeMFS merged 2 commits into ieeta-pt:main from jorgeMFS:main
Dec 18, 2025

@jorgeMFS (Collaborator)

Pull Request Checklist

Thank you for contributing to VCFX! Before submitting your pull request, please confirm the following:

  • [x] I ran pre-commit run --files <changed files> to execute ruff, flake8, mypy, and pytest.
  • [x] All C++ tests pass via ctest --output-on-failure from the build directory.
  • [x] All Python tests pass via pytest tests/python.
  • [x] Documentation has been updated where applicable (e.g. README.md, docs/*).

Provide a brief description of the changes below.

Add comprehensive validation features matching GATK ValidateVariants:
- ALT allele observation check (ALLELES validation)
- Empty VCF detection with --allow-empty override
- Header Type/Number field validation
- Variant sorting check (disable with -S)
- AN/AC consistency check in strict mode (CHR_COUNTS)
- REF validation against FASTA reference (-R)
- dbSNP ID validation (-D)
- GVCF format validation (-g)
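
The ALLELES, CHR_COUNTS, and sorting checks listed above can be illustrated with a minimal Python sketch. This is not VCFX's C++ implementation: the function names (`check_record`, `is_sorted`) and the message wording are hypothetical, and the sketch assumes tab-separated VCF data lines with GT as the first FORMAT key, counting only called (non-missing) alleles.

```python
import re


def called_alleles(sample_field):
    """Return the called allele indices from a sample's GT (assumed first subfield)."""
    gt = sample_field.split(":")[0]
    return [int(a) for a in re.split(r"[/|]", gt) if a != "."]


def check_record(line):
    """Illustrative ALLELES and CHR_COUNTS checks for one VCF data line."""
    cols = line.rstrip("\n").split("\t")
    # Without genotype columns (or without GT) there is nothing to compare against.
    if len(cols) < 10 or cols[8].split(":")[0] != "GT":
        return []
    alts = [a for a in cols[4].split(",") if a != "."]
    info = dict(kv.split("=", 1) for kv in cols[7].split(";") if "=" in kv)
    called = [a for sample in cols[9:] for a in called_alleles(sample)]

    issues = []
    # ALLELES: every ALT allele should appear in at least one genotype.
    for index, alt in enumerate(alts, start=1):
        if index not in called:
            issues.append(f"ALT allele {alt} is not observed in any genotype")
    # CHR_COUNTS: AN is the number of called alleles, AC the per-ALT counts.
    if "AN" in info and int(info["AN"]) != len(called):
        issues.append(f"AN={info['AN']} but {len(called)} alleles are called")
    if "AC" in info:
        expected = [called.count(i) for i in range(1, len(alts) + 1)]
        if [int(x) for x in info["AC"].split(",")] != expected:
            issues.append(f"AC={info['AC']} but genotypes give {expected}")
    return issues


def is_sorted(records):
    """Check (chrom, pos) pairs: contigs stay contiguous, positions never decrease."""
    seen, prev_chrom, prev_pos = set(), None, 0
    for chrom, pos in records:
        if chrom != prev_chrom:
            if chrom in seen:  # contig reappears after another one
                return False
            seen.add(chrom)
            prev_chrom = chrom
        elif pos < prev_pos:
            return False
        prev_pos = pos
    return True
```

For instance, a record with ALT `G,T` whose genotypes only ever use allele 1 would be reported for the unobserved `T`, while its AN and AC values are still compared independently against the genotype counts.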

Additional improvements:
- Add -i/--input flag for mmap file input
- Update documentation with all new options
- Add test cases for new validations
- Maintain ~110 MB/s throughput
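
As a rough illustration of memory-mapped input, the sketch below counts data records in a VCF file through a read-only map using Python's standard `mmap` module. It is not the code behind `-i/--input`; the function name `count_records` is hypothetical, and treating "zero data records" as an empty VCF is an assumption about what the empty-file check looks for.

```python
import mmap
import os


def count_records(path):
    """Count non-header, non-blank lines of a VCF file via a read-only memory map."""
    if os.path.getsize(path) == 0:
        return 0  # a zero-length file cannot be memory-mapped
    records = 0
    with open(path, "rb") as fh:
        with mmap.mmap(fh.fileno(), 0, access=mmap.ACCESS_READ) as mm:
            # readline() returns b"" once the end of the map is reached.
            for line in iter(mm.readline, b""):
                if line.strip() and not line.startswith(b"#"):
                    records += 1
    return records
```

A caller could treat a count of zero as an empty VCF and only accept it when an override such as `--allow-empty` is set.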

jorgeMFS merged commit 8547a08 into ieeta-pt:main on Dec 18, 2025
4 of 6 checks passed