<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Zarate, Samantha</style></author><author><style face="normal" font="default" size="100%">Carroll, Andrew</style></author><author><style face="normal" font="default" size="100%">Mahmoud, Medhat</style></author><author><style face="normal" font="default" size="100%">Krasheninina, Olga</style></author><author><style face="normal" font="default" size="100%">Jun, Goo</style></author><author><style face="normal" font="default" size="100%">Salerno, William J</style></author><author><style face="normal" font="default" size="100%">Schatz, Michael C</style></author><author><style face="normal" font="default" size="100%">Boerwinkle, Eric</style></author><author><style face="normal" font="default" size="100%">Gibbs, Richard A</style></author><author><style face="normal" font="default" size="100%">Sedlazeck, Fritz J</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Parliament2: Accurate structural variant calling at scale.</style></title><secondary-title><style face="normal" font="default" size="100%">Gigascience</style></secondary-title><alt-title><style face="normal" font="default" size="100%">Gigascience</style></alt-title></titles><dates><year><style  face="normal" font="default" size="100%">2020</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2020 12 21</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">9</style></volume><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;&lt;b&gt;BACKGROUND: &lt;/b&gt;Structural variants (SVs) are critical contributors to genetic diversity and genomic disease. To predict the phenotypic impact of SVs, there is a need for better estimates of both the occurrence and frequency of SVs, preferably from large, ethnically diverse cohorts. Thus, the current standard approach requires the use of short paired-end reads, which remain challenging to detect, especially at the scale of hundreds to thousands of samples.&lt;/p&gt;&lt;p&gt;&lt;b&gt;FINDINGS: &lt;/b&gt;We present Parliament2, a consensus SV framework that leverages multiple best-in-class methods to identify high-quality SVs from short-read DNA sequence data at scale. Parliament2 incorporates pre-installed SV callers that are optimized for efficient execution in parallel to reduce the overall runtime and costs. We demonstrate the accuracy of Parliament2 when applied to data from NovaSeq and HiSeq X platforms with the Genome in a Bottle (GIAB) SV call set across all size classes. The reported quality score per SV is calibrated across different SV types and size classes. Parliament2 has the highest F1 score (74.27%) measured across the independent gold standard from GIAB. We illustrate the compute performance by processing all 1000 Genomes samples (2,691 samples) in &lt;1 day on GRCH38. Parliament2 improves the runtime performance of individual methods and is open source (https://github.com/slzarate/parliament2), and a Docker image, as well as a WDL implementation, is available.&lt;/p&gt;&lt;p&gt;&lt;b&gt;CONCLUSION: &lt;/b&gt;Parliament2 provides both a highly accurate single-sample SV call set from short-read DNA sequence data and enables cost-efficient application over cloud or cluster environments, processing thousands of samples.&lt;/p&gt;</style></abstract><issue><style face="normal" font="default" size="100%">12</style></issue><custom1><style face="normal" font="default" size="100%">https://www.ncbi.nlm.nih.gov/pubmed/33347570?dopt=Abstract</style></custom1></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Regier, Allison A</style></author><author><style face="normal" font="default" size="100%">Farjoun, Yossi</style></author><author><style face="normal" font="default" size="100%">Larson, David E</style></author><author><style face="normal" font="default" size="100%">Krasheninina, Olga</style></author><author><style face="normal" font="default" size="100%">Kang, Hyun Min</style></author><author><style face="normal" font="default" size="100%">Howrigan, Daniel P</style></author><author><style face="normal" font="default" size="100%">Chen, Bo-Juen</style></author><author><style face="normal" font="default" size="100%">Kher, Manisha</style></author><author><style face="normal" font="default" size="100%">Banks, Eric</style></author><author><style face="normal" font="default" size="100%">Ames, Darren C</style></author><author><style face="normal" font="default" size="100%">English, Adam C</style></author><author><style face="normal" font="default" size="100%">Li, Heng</style></author><author><style face="normal" font="default" size="100%">Xing, Jinchuan</style></author><author><style face="normal" font="default" size="100%">Zhang, Yeting</style></author><author><style face="normal" font="default" size="100%">Matise, Tara</style></author><author><style face="normal" font="default" size="100%">Abecasis, Goncalo R</style></author><author><style face="normal" font="default" size="100%">Salerno, Will</style></author><author><style face="normal" font="default" size="100%">Zody, Michael C</style></author><author><style face="normal" font="default" size="100%">Neale, Benjamin M</style></author><author><style face="normal" font="default" size="100%">Hall, Ira M</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Functional equivalence of genome sequencing analysis pipelines enables harmonized variant calling across human genetics projects.</style></title><secondary-title><style face="normal" font="default" size="100%">Nat Commun</style></secondary-title><alt-title><style face="normal" font="default" size="100%">Nat Commun</style></alt-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Genome, Human</style></keyword><keyword><style  face="normal" font="default" size="100%">Human Genetics</style></keyword><keyword><style  face="normal" font="default" size="100%">Humans</style></keyword><keyword><style  face="normal" font="default" size="100%">Whole Genome Sequencing</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2018</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2018 10 02</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">9</style></volume><pages><style face="normal" font="default" size="100%">4038</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;Hundreds of thousands of human whole genome sequencing (WGS) datasets will be generated over the next few years. These data are more valuable in aggregate: joint analysis of genomes from many sources increases sample size and statistical power. A central challenge for joint analysis is that different WGS data processing pipelines cause substantial differences in variant calling in combined datasets, necessitating computationally expensive reprocessing. This approach is no longer tenable given the scale of current studies and data volumes. Here, we define WGS data processing standards that allow different groups to produce functionally equivalent (FE) results, yet still innovate on data processing pipelines. We present initial FE pipelines developed at five genome centers and show that they yield similar variant calling results and produce significantly less variability than sequencing replicates. This work alleviates a key technical bottleneck for genome aggregation and helps lay the foundation for community-wide human genetics studies.&lt;/p&gt;</style></abstract><issue><style face="normal" font="default" size="100%">1</style></issue><custom1><style face="normal" font="default" size="100%">http://www.ncbi.nlm.nih.gov/pubmed/30279509?dopt=Abstract</style></custom1></record></records></xml>