<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Luo, Ruibang</style></author><author><style face="normal" font="default" size="100%">Sedlazeck, Fritz J</style></author><author><style face="normal" font="default" size="100%">Lam, Tak-Wah</style></author><author><style face="normal" font="default" size="100%">Schatz, Michael C</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">A multi-task convolutional deep neural network for variant calling in single molecule sequencing.</style></title><secondary-title><style face="normal" font="default" size="100%">Nat Commun</style></secondary-title><alt-title><style face="normal" font="default" size="100%">Nat Commun</style></alt-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Base Sequence</style></keyword><keyword><style  face="normal" font="default" size="100%">Computational Biology</style></keyword><keyword><style  face="normal" font="default" size="100%">DNA Mutational Analysis</style></keyword><keyword><style  face="normal" font="default" size="100%">Genome, Human</style></keyword><keyword><style  face="normal" font="default" size="100%">Genome-Wide Association Study</style></keyword><keyword><style  face="normal" font="default" size="100%">Genomics</style></keyword><keyword><style  face="normal" font="default" size="100%">Genotype</style></keyword><keyword><style  face="normal" font="default" size="100%">Genotyping Techniques</style></keyword><keyword><style  face="normal" font="default" size="100%">Humans</style></keyword><keyword><style  face="normal" font="default" size="100%">INDEL Mutation</style></keyword><keyword><style  face="normal" font="default" size="100%">Nanopores</style></keyword><keyword><style  face="normal" font="default" size="100%">Neural Networks (Computer)</style></keyword><keyword><style  face="normal" font="default" size="100%">Polymorphism, Single Nucleotide</style></keyword><keyword><style  face="normal" font="default" size="100%">Sequence Analysis, DNA</style></keyword><keyword><style  face="normal" font="default" size="100%">Software</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2019</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2019 03 01</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">10</style></volume><pages><style face="normal" font="default" size="100%">998</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;The accurate identification of DNA sequence variants is an important, but challenging task in genomics. It is particularly difficult for single molecule sequencing, which has a per-nucleotide error rate of ~5-15%. Meeting this demand, we developed Clairvoyante, a multi-task five-layer convolutional neural network model for predicting variant type (SNP or indel), zygosity, alternative allele and indel length from aligned reads. For the well-characterized NA12878 human sample, Clairvoyante achieves 99.67, 95.78, 90.53% F1-score on 1KP common variants, and 98.65, 92.57, 87.26% F1-score for whole-genome analysis, using Illumina, PacBio, and Oxford Nanopore data, respectively. Training on a second human sample shows Clairvoyante is sample agnostic and finds variants in less than 2 h on a standard server. Furthermore, we present 3,135 variants that are missed using Illumina but supported independently by both PacBio and Oxford Nanopore reads. Clairvoyante is available open-source ( https://github.com/aquaskyline/Clairvoyante ), with modules to train, utilize and visualize the model.&lt;/p&gt;</style></abstract><issue><style face="normal" font="default" size="100%">1</style></issue><custom1><style face="normal" font="default" size="100%">https://www.ncbi.nlm.nih.gov/pubmed/30824707?dopt=Abstract</style></custom1></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Sedlazeck, Fritz J</style></author><author><style face="normal" font="default" size="100%">Rescheneder, Philipp</style></author><author><style face="normal" font="default" size="100%">Smolka, Moritz</style></author><author><style face="normal" font="default" size="100%">Fang, Han</style></author><author><style face="normal" font="default" size="100%">Nattestad, Maria</style></author><author><style face="normal" font="default" size="100%">von Haeseler, Arndt</style></author><author><style face="normal" font="default" size="100%">Schatz, Michael C</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Accurate detection of complex structural variations using single-molecule sequencing.</style></title><secondary-title><style face="normal" font="default" size="100%">Nat Methods</style></secondary-title><alt-title><style face="normal" font="default" size="100%">Nat Methods</style></alt-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">DNA Mutational Analysis</style></keyword><keyword><style  face="normal" font="default" size="100%">Genome, Human</style></keyword><keyword><style  face="normal" font="default" size="100%">Genomics</style></keyword><keyword><style  face="normal" font="default" size="100%">High-Throughput Nucleotide Sequencing</style></keyword><keyword><style  face="normal" font="default" size="100%">Humans</style></keyword><keyword><style  face="normal" font="default" size="100%">Sequence Analysis, DNA</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2018</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2018 Jun</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">15</style></volume><pages><style face="normal" font="default" size="100%">461-468</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;Structural variations are the greatest source of genetic variation, but they remain poorly understood because of technological limitations. Single-molecule long-read sequencing has the potential to dramatically advance the field, although high error rates are a challenge with existing methods. Addressing this need, we introduce open-source methods for long-read alignment (NGMLR; https://github.com/philres/ngmlr ) and structural variant identification (Sniffles; https://github.com/fritzsedlazeck/Sniffles ) that provide unprecedented sensitivity and precision for variant detection, even in repeat-rich regions and for complex nested events that can have substantial effects on human health. In several long-read datasets, including healthy and cancerous human genomes, we discovered thousands of novel variants and categorized systematic errors in short-read approaches. NGMLR and Sniffles can automatically filter false events and operate on low-coverage data, thereby reducing the high costs that have hindered the application of long reads in clinical and research settings.&lt;/p&gt;</style></abstract><issue><style face="normal" font="default" size="100%">6</style></issue><custom1><style face="normal" font="default" size="100%">https://www.ncbi.nlm.nih.gov/pubmed/29713083?dopt=Abstract</style></custom1></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Nattestad, Maria</style></author><author><style face="normal" font="default" size="100%">Goodwin, Sara</style></author><author><style face="normal" font="default" size="100%">Ng, Karen</style></author><author><style face="normal" font="default" size="100%">Baslan, Timour</style></author><author><style face="normal" font="default" size="100%">Sedlazeck, Fritz J</style></author><author><style face="normal" font="default" size="100%">Rescheneder, Philipp</style></author><author><style face="normal" font="default" size="100%">Garvin, Tyler</style></author><author><style face="normal" font="default" size="100%">Fang, Han</style></author><author><style face="normal" font="default" size="100%">Gurtowski, James</style></author><author><style face="normal" font="default" size="100%">Hutton, Elizabeth</style></author><author><style face="normal" font="default" size="100%">Tseng, Elizabeth</style></author><author><style face="normal" font="default" size="100%">Chin, Chen-Shan</style></author><author><style face="normal" font="default" size="100%">Beck, Timothy</style></author><author><style face="normal" font="default" size="100%">Sundaravadanam, Yogi</style></author><author><style face="normal" font="default" size="100%">Kramer, Melissa</style></author><author><style face="normal" font="default" size="100%">Antoniou, Eric</style></author><author><style face="normal" font="default" size="100%">McPherson, John D</style></author><author><style face="normal" font="default" size="100%">Hicks, James</style></author><author><style face="normal" font="default" size="100%">McCombie, W Richard</style></author><author><style face="normal" font="default" size="100%">Schatz, Michael C</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Complex rearrangements and oncogene amplifications revealed by long-read DNA and RNA sequencing of a breast cancer cell line.</style></title><secondary-title><style face="normal" font="default" size="100%">Genome Res</style></secondary-title><alt-title><style face="normal" font="default" size="100%">Genome Res.</style></alt-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Breast Neoplasms</style></keyword><keyword><style  face="normal" font="default" size="100%">Female</style></keyword><keyword><style  face="normal" font="default" size="100%">Gene Amplification</style></keyword><keyword><style  face="normal" font="default" size="100%">Gene Rearrangement</style></keyword><keyword><style  face="normal" font="default" size="100%">Genome, Human</style></keyword><keyword><style  face="normal" font="default" size="100%">Genomic Structural Variation</style></keyword><keyword><style  face="normal" font="default" size="100%">High-Throughput Nucleotide Sequencing</style></keyword><keyword><style  face="normal" font="default" size="100%">Humans</style></keyword><keyword><style  face="normal" font="default" size="100%">MCF-7 Cells</style></keyword><keyword><style  face="normal" font="default" size="100%">Oncogenes</style></keyword><keyword><style  face="normal" font="default" size="100%">Receptor, ErbB-2</style></keyword><keyword><style  face="normal" font="default" size="100%">Repetitive Sequences, Nucleic Acid</style></keyword><keyword><style  face="normal" font="default" size="100%">Transcriptome</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2018</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2018 08</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">28</style></volume><pages><style face="normal" font="default" size="100%">1126-1135</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;The SK-BR-3 cell line is one of the most important models for HER2+ breast cancers, which affect one in five breast cancer patients. SK-BR-3 is known to be highly rearranged, although much of the variation is in complex and repetitive regions that may be underreported. Addressing this, we sequenced SK-BR-3 using long-read single molecule sequencing from Pacific Biosciences and develop one of the most detailed maps of structural variations (SVs) in a cancer genome available, with nearly 20,000 variants present, most of which were missed by short-read sequencing. Surrounding the important  oncogene (also known as ), we discover a complex sequence of nested duplications and translocations, suggesting a punctuated progression. Full-length transcriptome sequencing further revealed several novel gene fusions within the nested genomic variants. Combining long-read genome and transcriptome sequencing enables an in-depth analysis of how SVs disrupt the genome and sheds new light on the complex mechanisms involved in cancer genome evolution.&lt;/p&gt;</style></abstract><issue><style face="normal" font="default" size="100%">8</style></issue><custom1><style face="normal" font="default" size="100%">http://www.ncbi.nlm.nih.gov/pubmed/29954844?dopt=Abstract</style></custom1></record></records></xml>