Methylation-sensitive linking libraries enhance gene-enriched sequencing of complex genomes and map DNA methylation domains

Persistent Link:
http://hdl.handle.net/10150/610003
Title:
Methylation-sensitive linking libraries enhance gene-enriched sequencing of complex genomes and map DNA methylation domains
Author:
Nelson, William; Luo, Meizhong; Ma, Jianxin; Estep, Matt; Estill, James; He, Ruifeng; Talag, Jayson; Sisneros, Nicholas; Kudrna, David; Kim, HyeRan; Ammiraju, Jetty; Collura, Kristi; Bharti, Arvind; Messing, Joachim; Wing, Rod; SanMiguel, Phillip; Bennetzen, Jeffrey; Soderlund, Carol
Affiliation:
Arizona Genomics Computational Laboratory, BIO5 Institute, University of Arizona, Tucson, Arizona, USA; Arizona Genomics Institute, Department of Plant Sciences, BIO5 Institute, University of Arizona, Tucson, Arizona, USA; Department of Genetics, University of Georgia, Athens, Georgia, USA; Department of Plant Biology, University of Georgia, Athens, Georgia, USA; Plant Genome Initiative at Rutgers, Waksman Institute, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA; Department of Horticulture and Landscape Architecture, Purdue University, West Lafayette, Indiana, USA; College of Life Sciences and Technology, Huazhong Agricultural University, Wuhan, Hubei 430070, PR China; Department of Agronomy, Purdue University, West Lafayette, Indiana, USA; Department of Microbial & Plant Genomics, University of Minnesota, St. Paul, MN, USA
Issue Date:
2008
Publisher:
BioMed Central
Citation:
BMC Genomics 2008, 9:621 doi:10.1186/1471-2164-9-621
Journal:
BMC Genomics
Rights:
© 2008 Nelson et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0)
Collection Information:
This item is part of the UA Faculty Publications collection. For more information this item or other items in the UA Campus Repository, contact the University of Arizona Libraries at repository@u.library.arizona.edu.
Abstract:
BACKGROUND:Many plant genomes are resistant to whole-genome assembly due to an abundance of repetitive sequence, leading to the development of gene-rich sequencing techniques. Two such techniques are hypomethylated partial restriction (HMPR) and methylation spanning linker libraries (MSLL). These libraries differ from other gene-rich datasets in having larger insert sizes, and the MSLL clones are designed to provide reads localized to "epigenetic boundaries" where methylation begins or ends.RESULTS:A large-scale study in maize generated 40,299 HMPR sequences and 80,723 MSLL sequences, including MSLL clones exceeding 100 kb. The paired end reads of MSLL and HMPR clones were shown to be effective in linking existing gene-rich sequences into scaffolds. In addition, it was shown that the MSLL clones can be used for anchoring these scaffolds to a BAC-based physical map. The MSLL end reads effectively identified epigenetic boundaries, as indicated by their preferential alignment to regions upstream and downstream from annotated genes. The ability to precisely map long stretches of fully methylated DNA sequence is a unique outcome of MSLL analysis, and was also shown to provide evidence for errors in gene identification. MSLL clones were observed to be significantly more repeat-rich in their interiors than in their end reads, confirming the correlation between methylation and retroelement content. Both MSLL and HMPR reads were found to be substantially gene-enriched, with the SalI MSLL libraries being the most highly enriched (31% align to an EST contig), while the HMPR clones exhibited exceptional depletion of repetitive DNA (to ~11%). These two techniques were compared with other gene-enrichment methods, and shown to be complementary.CONCLUSION:MSLL technology provides an unparalleled approach for mapping the epigenetic status of repetitive blocks and for identifying sequences mis-identified as genes. Although the types and natures of epigenetic boundaries are barely understood at this time, MSLL technology flags both approximate boundaries and methylated genes that deserve additional investigation. MSLL and HMPR sequences provide a valuable resource for maize genome annotation, and are a uniquely valuable complement to any plant genome sequencing project. In order to make these results fully accessible to the community, a web display was developed that shows the alignment of MSLL, HMPR, and other gene-rich sequences to the BACs; this display is continually updated with the latest ESTs and BAC sequences.
EISSN:
1471-2164
DOI:
10.1186/1471-2164-9-621
Version:
Final published version
Additional Links:
http://www.biomedcentral.com/1471-2164/9/621

Full metadata record

DC FieldValue Language
dc.contributor.authorNelson, Williamen
dc.contributor.authorLuo, Meizhongen
dc.contributor.authorMa, Jianxinen
dc.contributor.authorEstep, Matten
dc.contributor.authorEstill, Jamesen
dc.contributor.authorHe, Ruifengen
dc.contributor.authorTalag, Jaysonen
dc.contributor.authorSisneros, Nicholasen
dc.contributor.authorKudrna, Daviden
dc.contributor.authorKim, HyeRanen
dc.contributor.authorAmmiraju, Jettyen
dc.contributor.authorCollura, Kristien
dc.contributor.authorBharti, Arvinden
dc.contributor.authorMessing, Joachimen
dc.contributor.authorWing, Roden
dc.contributor.authorSanMiguel, Phillipen
dc.contributor.authorBennetzen, Jeffreyen
dc.contributor.authorSoderlund, Carolen
dc.date.accessioned2016-05-20T08:56:10Z-
dc.date.available2016-05-20T08:56:10Z-
dc.date.issued2008en
dc.identifier.citationBMC Genomics 2008, 9:621 doi:10.1186/1471-2164-9-621en
dc.identifier.doi10.1186/1471-2164-9-621en
dc.identifier.urihttp://hdl.handle.net/10150/610003-
dc.description.abstractBACKGROUND:Many plant genomes are resistant to whole-genome assembly due to an abundance of repetitive sequence, leading to the development of gene-rich sequencing techniques. Two such techniques are hypomethylated partial restriction (HMPR) and methylation spanning linker libraries (MSLL). These libraries differ from other gene-rich datasets in having larger insert sizes, and the MSLL clones are designed to provide reads localized to "epigenetic boundaries" where methylation begins or ends.RESULTS:A large-scale study in maize generated 40,299 HMPR sequences and 80,723 MSLL sequences, including MSLL clones exceeding 100 kb. The paired end reads of MSLL and HMPR clones were shown to be effective in linking existing gene-rich sequences into scaffolds. In addition, it was shown that the MSLL clones can be used for anchoring these scaffolds to a BAC-based physical map. The MSLL end reads effectively identified epigenetic boundaries, as indicated by their preferential alignment to regions upstream and downstream from annotated genes. The ability to precisely map long stretches of fully methylated DNA sequence is a unique outcome of MSLL analysis, and was also shown to provide evidence for errors in gene identification. MSLL clones were observed to be significantly more repeat-rich in their interiors than in their end reads, confirming the correlation between methylation and retroelement content. Both MSLL and HMPR reads were found to be substantially gene-enriched, with the SalI MSLL libraries being the most highly enriched (31% align to an EST contig), while the HMPR clones exhibited exceptional depletion of repetitive DNA (to ~11%). These two techniques were compared with other gene-enrichment methods, and shown to be complementary.CONCLUSION:MSLL technology provides an unparalleled approach for mapping the epigenetic status of repetitive blocks and for identifying sequences mis-identified as genes. Although the types and natures of epigenetic boundaries are barely understood at this time, MSLL technology flags both approximate boundaries and methylated genes that deserve additional investigation. MSLL and HMPR sequences provide a valuable resource for maize genome annotation, and are a uniquely valuable complement to any plant genome sequencing project. In order to make these results fully accessible to the community, a web display was developed that shows the alignment of MSLL, HMPR, and other gene-rich sequences to the BACsen
dc.description.abstractthis display is continually updated with the latest ESTs and BAC sequences.en
dc.language.isoenen
dc.publisherBioMed Centralen
dc.relation.urlhttp://www.biomedcentral.com/1471-2164/9/621en
dc.rights© 2008 Nelson et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0)en
dc.titleMethylation-sensitive linking libraries enhance gene-enriched sequencing of complex genomes and map DNA methylation domainsen
dc.typeArticleen
dc.identifier.eissn1471-2164en
dc.contributor.departmentArizona Genomics Computational Laboratory, BIO5 Institute, University of Arizona, Tucson, Arizona, USAen
dc.contributor.departmentArizona Genomics Institute, Department of Plant Sciences, BIO5 Institute, University of Arizona, Tucson, Arizona, USAen
dc.contributor.departmentDepartment of Genetics, University of Georgia, Athens, Georgia, USAen
dc.contributor.departmentDepartment of Plant Biology, University of Georgia, Athens, Georgia, USAen
dc.contributor.departmentPlant Genome Initiative at Rutgers, Waksman Institute, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USAen
dc.contributor.departmentDepartment of Horticulture and Landscape Architecture, Purdue University, West Lafayette, Indiana, USAen
dc.contributor.departmentCollege of Life Sciences and Technology, Huazhong Agricultural University, Wuhan, Hubei 430070, PR Chinaen
dc.contributor.departmentDepartment of Agronomy, Purdue University, West Lafayette, Indiana, USAen
dc.contributor.departmentDepartment of Microbial & Plant Genomics, University of Minnesota, St. Paul, MN, USAen
dc.identifier.journalBMC Genomicsen
dc.description.collectioninformationThis item is part of the UA Faculty Publications collection. For more information this item or other items in the UA Campus Repository, contact the University of Arizona Libraries at repository@u.library.arizona.edu.en
dc.eprint.versionFinal published versionen
All Items in UA Campus Repository are protected by copyright, with all rights reserved, unless otherwise indicated.