ConsensuSV-ONT - a modern method for accurate structural variant calling

bioRxiv [Preprint]. 2024 Jul 26:2024.07.26.605267. doi: 10.1101/2024.07.26.605267.

Abstract

The improvements in sequencing technology make the development of new tools for the detection of structural variance more and more common. However, the tools available for the long-read Oxford Nanopore sequencing are limited, and it is hard to choose one, which is the best. That is why there is a need to create a tool based on consensus that combines existing work in order to discover a set of high-quality, reliable structural variants that can be used for further downstream analysis. The field has also been subject to revolution in machine learning techniques, especially deep learning. In the spirit of the aforementioned need and developments, we propose a novel, fully automated ConsensuSV-ONT algorithm. The method uses six independent, state-of-the-art structural variant callers for long-read sequencing along with a convolutional neural network for filtering high-quality variants. We provide a runtime environment in the form of a docker image, wrapping a nextflow pipeline for efficient processing using parallel computing. The solution is complete in its form and is ready to use not only by computer scientists but accessible and easy to use for everyone working with Oxford Nanopore long-read sequencing data.

Publication types

  • Preprint