QClus: a droplet filtering algorithm for enhanced snRNA-seq data quality in challenging samples

Nucleic Acids Res. 2025 Jan 7;53(1):gkae1145. doi: 10.1093/nar/gkae1145.

Abstract

Single-nuclei RNA sequencing remains a challenge for many human tissues, as incomplete removal of background signal masks cell-type-specific signals and interferes with downstream analyses. Here, we present Quality Clustering (QClus), a droplet filtering algorithm targeted toward challenging samples. QClus uses additional metrics, such as cell-type-specific marker gene expression, to cluster nuclei and filter empty and highly contaminated droplets, providing reliable filtering of samples with varying numbers of nuclei and contamination levels. In a benchmarking analysis against seven alternative methods across six datasets, consisting of 252 samples and over 1.9 million nuclei, QClus achieved the highest quality in the greatest number of samples over all evaluated quality metrics and recorded no processing failures, while robustly retaining numbers of nuclei within the expected range. QClus combines high quality, automation and robustness with flexibility and user-adjustability, catering to diverse experimental needs and datasets.
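
The cluster-then-filter idea described in the abstract can be illustrated with a minimal sketch. This is not the QClus implementation or its API. The two per-droplet metrics (a marker-gene fraction and a contamination fraction), the small k-means helper, the cluster scoring rule, the toy values and all function names are assumptions chosen for illustration only.

```python
import random
from statistics import mean


def kmeans(points, k, n_iter=50, seed=0):
    """Plain k-means on a list of equal-length tuples (illustrative helper)."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # start from k distinct droplets
    labels = []
    for _ in range(n_iter):
        # Assign each droplet to its nearest center (squared Euclidean distance).
        labels = [
            min(range(k),
                key=lambda c: sum((p - q) ** 2 for p, q in zip(pt, centers[c])))
            for pt in points
        ]
        # Recompute each center as the mean of its members.
        new_centers = []
        for c in range(k):
            members = [pt for pt, lab in zip(points, labels) if lab == c]
            if members:
                new_centers.append(tuple(mean(col) for col in zip(*members)))
            else:
                new_centers.append(centers[c])  # keep an empty cluster's center
        if new_centers == centers:
            break  # assignments are stable
        centers = new_centers
    return labels, centers


def filter_droplets(metrics, k=2, seed=0):
    """Return a keep/drop flag per droplet.

    metrics: list of (marker_fraction, contamination_fraction) tuples.
    Assumption: the cluster whose center has the lowest
    marker_fraction - contamination_fraction is treated as low quality.
    """
    labels, centers = kmeans(metrics, k, seed=seed)
    scores = [center[0] - center[1] for center in centers]
    worst = min(range(k), key=lambda c: scores[c])
    return [lab != worst for lab in labels]


# Toy input (made-up values): four nucleus-like droplets, then three
# droplets with weak marker signal and a high contamination fraction.
droplets = [
    (0.30, 0.05), (0.28, 0.04), (0.35, 0.06), (0.32, 0.03),
    (0.05, 0.40), (0.04, 0.45), (0.06, 0.38),
]
keep = filter_droplets(droplets)
print(keep)
```

In this sketch the filter removes a whole cluster rather than applying a fixed per-metric threshold, so the cutoff adapts to each sample's own distribution of metrics. Real data would need more metrics, feature scaling and a more careful choice of the number of clusters.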

MeSH terms

  • Algorithms*
  • Benchmarking
  • Cluster Analysis
  • Data Accuracy
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • RNA, Small Nuclear / genetics
  • RNA, Small Nuclear / metabolism
  • RNA-Seq / methods
  • RNA-Seq / standards
  • Sequence Analysis, RNA / methods
  • Single-Cell Analysis / methods

Substances

  • RNA, Small Nuclear