This project is a VCFv4.2 Python parser developed for the exam of the course "Bioinformatics".
Project's specifications are taken from samtools version 4.2.
A vcf file is used to specifies the format of a text file used in bioinformatics for storing gene sequence variations. This parser checks if the file is well-formed and respect the requirements of version 4.2.
Some sample .vcf files could be find here
Others could be find at:
- https://ftp-trace.ncbi.nlm.nih.gov/ReferenceSamples/giab/release/NA12878_HG001/latest/GRCh38/
- https://www.internationalgenome.org/data#download
An example of use can be found in testing.py