Whole Genome Files Explained
Whole genome sequencing gives you a complete genome file. It can be used to fill in the gaps of tests like 23andMe and AncestryDNA. However, the files are very large and can be confusing to handle.
Difference Between 1x and 30x Whole Genome Files:
30x WGS files are sequenced 30 times. This is to ensure accuracy; however, they can be more expensive. 1x WGS files are more cost-effective, but are not accurate enough to be used on GeneticLifehacks.
Types of Whole Genome Files:
- BAM: Is a popular whole-genome file format. A typical BAM whole genome file is between 50 to 100 GBs of data. It is more efficient for storage and faster processing.
- CRAM: Is a more compressed version of a BAM file.
- FASTQ: This is the standard format for raw sequencing data directly from the machine. It stores both the nucleotide sequence and a corresponding quality score for each base, allowing for assessment of data reliability.
- VCF: Stands for virtual cell format and is the standard text-based format for storing genetic variations, such as single nucleotide polymorphisms (SNPs).
All of these file formats must be converted into a more useable file type, like the data provided by 23andMe, AncestryDNA, etc, to be used on GeneticLifehacks. Here is my page on how you can convert your .BAM, .CRAM, or . FASTQ Whole Genome File for free.