Exome Capture
Exome capture is a method used to extract and sequence the exome (collection of all exons) in a genome and compare this variation across a sample of individual organisms. This allows studies to quickly focus in on the small percent of the genome that is most likely to contain variation that strongly affects phenotypes of interest.
Only a small fraction of many eukaryote genomes are protein coding exon sequences, e.g., in humans this is approximately 1%--2% of the genome spread over 180,000 exons in approximately 20,000-21,000 genes[1]. The average human exon is only 145bp in length and the average gene contains 8.8 exons[2]. By only sequencing and analysing the exons in a sample of individuals the investigation (say for genotype--phenotype associations) can be made much more efficient---to the extent that the causative variation is indeed found within the exon sequence.
Ng et al. (2009) created a shotgun library of human DNA sequences and hybridized the DNA to Agilent 244K microarrays. The microarrays were designed to contain anchored oligos matching human exon sequences. The exon sequences from the samples are expected to hybridize to the oligos on the microarrays. The remaining DNA can be washed away then the hybridized DNA eluted for sequencing. Thus, the original DNA sample has been greatly enriched for exon sequences. They used an Illumina GA2 system for sequencing the remaining post-enrichment DNA fragments and mapped the resulting 76 base-pair reads to a reference human genome (hg18 http://genome.ucsc.edu). Using their approach the average sequence coverage of each exon in the genome was 51X. The coverage and quality score criteria resulted in 78% of genes having >95% of their exon bases called. In addition to eight reference individuals they are included four unrelated individuals with Freeman-Sheldon syndrome (FSS). They excluded common variants recorded in dbSNP and were able to identify mutations in MYH3, previously considered a candidate gene as causative of FSS, establishing that an exome approach can identify causual variants from very small sample sizes.[3]
Bamshad, M. J., Ng, S. B., Bigham, A. W., Tabor, H. K., Emond, M. J., Nickerson, D. A., & Shendure, J. (2011). Exome sequencing as a tool for Mendelian disease gene discovery. Nature Reviews Genetics, 12(11), 745-755.[2]
Bi, K., Vanderpool, D., Singhal, S., Linderoth, T., Moritz, C., & Good, J. M. (2012). Transcriptome-based exon capture enables highly cost-effective comparative genomic data collection at moderate evolutionary scales. BMC genomics, 13(1), 403.[3]
Choi, M., Scholl, U. I., Ji, W., Liu, T., Tikhonova, I. R., Zumbo, P., ... & Lifton, R. P. (2009). Genetic diagnosis by whole exome capture and massively parallel DNA sequencing. Proceedings of the National Academy of Sciences, 106(45), 19096-19101.[4]
Teer, J. K., & Mullikin, J. C. (2010). Exome sequencing: the sweet spot before whole genomes. Human molecular genetics, ddq333.[5]
References
- ↑ Elizabeth Pennisi (2012). "ENCODE Project Writes Eulogy For Junk DNA". Science 337 (6099): 1159–1160. doi:10.1126/science.337.6099.1159
- ↑ Table 21 of International Human Genome Sequencing Consortium (2001). "Initial sequencing and analysis of the human genome". Nature 409 (6822): 860–921. doi:10.1038/35057062
- ↑ Ng, S. B., Turner, E. H., Robertson, P. D., Flygare, S. D., Bigham, A. W., Lee, C., ... & Shendure, J. (2009). Targeted capture and massively parallel sequencing of 12 human exomes. Nature, 461(7261), 272-276.[1]