A Comprehensive, Automatically Updated Fungal ITS Sequence Dataset for Reference-Based Chimera Control in Environmental Sequencing Efforts
De Sousa F
MetadataShow full item record
Subjectfungi; chimera detection; reference dataset; molecular ecology; PCR artifacts; Area 05; BIO/19
The nuclear ribosomal internal transcribed spacer (ITS) region is the most commonly chosen genetic marker for the molecular identification of fungi in environmental sequencing and molecular ecology studies. Several analytical issues complicate such efforts, one of which is the formation of chimeric-artificially joined-DNA sequences during PCR amplification or sequence assembly. Several software tools are currently available for chimera detection, but rely to various degrees on the presence of a chimera-free reference dataset for optimal performance. However, no such dataset is available for use with the fungal ITS region. This study introduces a comprehensive, automatically updated reference dataset for fungal ITS sequences based on the UNITE database for the molecular identification of fungi. This dataset supports chimera detection throughout the fungal kingdom and for full-length ITS sequences as well as partial (ITS1 or ITS2 only) datasets. The performance of the dataset on a large set of artificial chimeras was above 99.5%, and we subsequently used the dataset to remove nearly 1,000 compromised fungal ITS sequences from public circulation. The dataset is available at http://unite.ut.ee/repository.php and is subject to web-based third-party curation.
Showing items related by title, author, creator and subject.
The impact of reference panel short-read sequencing inaccessibility on genotype imputation (P14.046B), in: Abstracts from the 51st European Society of Human Genetics Conference: Posters Mitchell JS; König E; Gögele M; Pattaro C; Pramstaller PP; Fuchsberger C (2019)p. 514
Loh PR; Danecek P; Palamara PF; Fuchsberger C; A Reshef Y; K Finucane H; Schoenherr S; Forer L; McCarthy S; Abecasis GR; Durbin R; L Price A (2016)Haplotype phasing is a fundamental problem in medical and population genetics. Phasing is generally performed via statistical phasing in a genotyped cohort, an approach that can yield high accuracy in very large cohorts ...
Tkalcic M; Odic A; Kosir A; Tasic J (IEEE, 2010)This paper presents the results of a comparative study of emotion detection from human faces in posed and spontaneous expressions. The goal of the reasearch was to determine whether the algorithm used, that yielded high ...