Small Sample DNA-Seq Workflows
SNP & Variation Suite delivers the most powerful rare variant filtering workflows with the latest annotation sources and GenomeBrowse visualization. For the first time, in a single, integrated desktop solution, you can interactively filter hundreds to millions of common and rare variants down to a handful of potentially pathogenic variants.
Data Import and Management
Next-generation sequencing poses some unique data import and management challenges. Unlike microarrays where every sample is assayed for the same SNP set, next-generation sequencing generates variant calls unique to each sample. Most of these are considered rare and are not currently cataloged, which makes conventional data import and mapping difficult. SVS makes this easy with streamlined import and mapping of common and standardized formats such as variant call files (VCF) version 4.0 and higher. Furthermore, you can combine NGS data from multiple sources without having worry about file format compatibility. If your files contain read depth and quality scores, they can be imported as well. From there, quality assurance, variant filtering, and analysis is as fast as ever.
SVS provides a wide array of quality assurance measures to ensure your data is of the highest quality and your results are accurate. Standard quality assurance measures for small sample or small family exome or whole genome workflows are supported. Including screening out variants with poor read depths and other quality scores from the variant call files, presence (or absence) in public annotation databases, minor allele (alternate allele frequency) filtering based on public catalogs, and having an effect on the protein coding. For small families, Mendelian error detection is also available.
Similar to the ANNOVAR program, variant classification examines the interactions between variants and gene transcripts in order to classify variants based on their potential effect on genes. Variants are classified according to their position in relation to gene transcripts (intronic, exonic, utr5, etc). In addition, variants in coding exons are further classified according to their effect on the gene's protein sequence (synonymous, non-synonymous, frame-shifting, etc). This gives insight into which variants are most likely to have functional effects.
After performing extensive quality assurance on your data, the next step is sorting through all your variants to find those that really matter. Though more manageable on a whole-exome scale, this process can be daunting. The 1000 Genomes Project has already cataloged more than 25 million variants, 18 million more than dbSNP 135 (the closest thing to a database for common variants). Each person is expected to have roughly 4 million variants, 20 thousand in coding regions, and 250-300 that are potentially damaging. How do you distinguish the relatively small number of damaging variants from those that are benign?
SVS makes this process easy with filtering by annotation tracks. Using gene tracks, you can filter variants outside of genes or exons, leaving only those in coding regions. Public database probe tracks, such as dbSNP, 1000 Genomes, NHLBI ESP6500 Exomes, and ClinVar enable you to exclude variants considered common.
The dbNSFP NS Functional Predictions track can be used to filter out variants that are predicted as tolerated or benign based on the following functional predictions: SIFT, PolyPhen2, MutationTaster, GERP++, PhyloP and more. One or all of these predictions can be used to measure how likely a variant is to be damaging and filter out those considered benign. Functional prediction filtering is especially helpful for targeted resequencing projects where you are trying to locate causal variants based on GWAS results. You can also use case-control or familial data to identify variants that are unique to affected individuals only.
Examining variants in GenomeBrowse
Variant map visualization provides a practical representation of variant call spreadsheets in the context of public annotations including gene and exon regions. With a quick glance at a variant map, where variants can be colored by allele or as an insertion or deletion (e.g. variant classification) researchers can immediately see the results of theirfiltering workflows. Adding annotation sources further illustrates a complete picture of variants, helping you better understand the relevance of variants that may have beenfound to be disease gene related, medically actionable, or potentially deleteriousvariants. Learn more about GenomeBrowse as a stand-alone tool »
Case Study - Hela Azaiez, University of Iowa
"It opened so many doors for my research. It allowed me to explore new ideas. "
Home Brewed Personalized Genomics
Blog - Customizing Workflows for Family Structure
Customizing de Novo Candidate and Compound Heterozygous Candidate workflows to consider family structure