<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Zarate, Samantha</style></author><author><style face="normal" font="default" size="100%">Carroll, Andrew</style></author><author><style face="normal" font="default" size="100%">Mahmoud, Medhat</style></author><author><style face="normal" font="default" size="100%">Krasheninina, Olga</style></author><author><style face="normal" font="default" size="100%">Jun, Goo</style></author><author><style face="normal" font="default" size="100%">Salerno, William J</style></author><author><style face="normal" font="default" size="100%">Schatz, Michael C</style></author><author><style face="normal" font="default" size="100%">Boerwinkle, Eric</style></author><author><style face="normal" font="default" size="100%">Gibbs, Richard A</style></author><author><style face="normal" font="default" size="100%">Sedlazeck, Fritz J</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Parliament2: Accurate structural variant calling at scale.</style></title><secondary-title><style face="normal" font="default" size="100%">Gigascience</style></secondary-title><alt-title><style face="normal" font="default" size="100%">Gigascience</style></alt-title></titles><dates><year><style  face="normal" font="default" size="100%">2020</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2020 12 21</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">9</style></volume><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;&lt;b&gt;BACKGROUND: &lt;/b&gt;Structural variants (SVs) are critical contributors to genetic diversity and genomic disease. To predict the phenotypic impact of SVs, there is a need for better estimates of both the occurrence and frequency of SVs, preferably from large, ethnically diverse cohorts. Thus, the current standard approach requires the use of short paired-end reads, which remain challenging to detect, especially at the scale of hundreds to thousands of samples.&lt;/p&gt;&lt;p&gt;&lt;b&gt;FINDINGS: &lt;/b&gt;We present Parliament2, a consensus SV framework that leverages multiple best-in-class methods to identify high-quality SVs from short-read DNA sequence data at scale. Parliament2 incorporates pre-installed SV callers that are optimized for efficient execution in parallel to reduce the overall runtime and costs. We demonstrate the accuracy of Parliament2 when applied to data from NovaSeq and HiSeq X platforms with the Genome in a Bottle (GIAB) SV call set across all size classes. The reported quality score per SV is calibrated across different SV types and size classes. Parliament2 has the highest F1 score (74.27%) measured across the independent gold standard from GIAB. We illustrate the compute performance by processing all 1000 Genomes samples (2,691 samples) in &lt;1 day on GRCH38. Parliament2 improves the runtime performance of individual methods and is open source (https://github.com/slzarate/parliament2), and a Docker image, as well as a WDL implementation, is available.&lt;/p&gt;&lt;p&gt;&lt;b&gt;CONCLUSION: &lt;/b&gt;Parliament2 provides both a highly accurate single-sample SV call set from short-read DNA sequence data and enables cost-efficient application over cloud or cluster environments, processing thousands of samples.&lt;/p&gt;</style></abstract><issue><style face="normal" font="default" size="100%">12</style></issue><custom1><style face="normal" font="default" size="100%">https://www.ncbi.nlm.nih.gov/pubmed/33347570?dopt=Abstract</style></custom1></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Majidian, Sina</style></author><author><style face="normal" font="default" size="100%">Sedlazeck, Fritz J</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">PhaseME: Automatic rapid assessment of phasing quality and phasing improvement.</style></title><secondary-title><style face="normal" font="default" size="100%">Gigascience</style></secondary-title><alt-title><style face="normal" font="default" size="100%">Gigascience</style></alt-title></titles><dates><year><style  face="normal" font="default" size="100%">2020</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2020 07 01</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">9</style></volume><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;&lt;b&gt;BACKGROUND: &lt;/b&gt;The detection of which mutations are occurring on the same DNA molecule is essential to predict their consequences. This can be achieved by phasing the genomic variations. Nevertheless, state-of-the-art haplotype phasing is currently a black box in which the accuracy and quality of the reconstructed haplotypes are hard to assess.&lt;/p&gt;&lt;p&gt;&lt;b&gt;FINDINGS: &lt;/b&gt;Here we present PhaseME, a versatile method to provide insights into and improvement of sample phasing results based on linkage data. We showcase the performance and the importance of PhaseME by comparing phasing information obtained from Pacific Biosciences including both continuous long reads and high-quality consensus reads, Oxford Nanopore Technologies, 10x Genomics, and Illumina sequencing technologies. We found that 10x Genomics and Oxford Nanopore phasing can be significantly improved while retaining a high N50 and completeness of phase blocks. PhaseME generates reports and summary plots to provide insights into phasing performance and correctness. We observed unique phasing issues for each of the sequencing technologies, highlighting the necessity of quality assessments. PhaseME is able to decrease the Hamming error rate significantly by 22.4% on average across all 5 technologies. Additionally, a significant improvement is obtained in the reduction of long switch errors. Especially for high-quality consensus reads, the improvement is 54.6% in return for only a 5% decrease in phase block N50 length.&lt;/p&gt;&lt;p&gt;&lt;b&gt;CONCLUSIONS: &lt;/b&gt;PhaseME is a universal method to assess the phasing quality and accuracy and improves the quality of phasing using linkage information. The package is freely available at https://github.com/smajidian/phaseme.&lt;/p&gt;</style></abstract><issue><style face="normal" font="default" size="100%">7</style></issue><custom1><style face="normal" font="default" size="100%">https://www.ncbi.nlm.nih.gov/pubmed/32706368?dopt=Abstract</style></custom1></record></records></xml>