Peak calling by Sparse Enrichment Analysis for CUT&RUN chromatin profiling

Epigenetics Chromatin. 2019 Jul 12;12(1):42. doi: 10.1186/s13072-019-0287-4.

Abstract

Background: CUT&RUN is an efficient epigenome profiling method that identifies sites of DNA-binding protein enrichment genome-wide with a high signal-to-noise ratio and low sequencing requirements. Currently, the analysis of CUT&RUN data is complicated by its exceptionally low background, which makes programs designed for ChIP-seq data prone to oversensitivity when identifying sites of protein binding.

Results: Here we introduce Sparse Enrichment Analysis for CUT&RUN (SEACR), an analysis strategy that uses the global distribution of background signal to calibrate a simple threshold for peak calling. SEACR discriminates between true and false-positive peaks with near-perfect specificity in "gold standard" CUT&RUN datasets and efficiently identifies enriched regions for several different protein targets. We also introduce a web server (http://seacr.fredhutch.org) for plug-and-play analysis with SEACR, making it accessible to users of all skill levels.
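The idea of calibrating a single threshold from the global background distribution can be illustrated with a minimal Python sketch. This is not SEACR's actual implementation: the abstract does not specify the procedure, so the function names (`calibrate_threshold`, `call_peaks`), the block representation, and the selection criterion (maximizing the difference between the fraction of target blocks and the fraction of control blocks above a cutoff) are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

# Hypothetical representation of a contiguous signal block:
# (chromosome, start, end, total signal in the block).
Block = Tuple[str, int, int, float]


def calibrate_threshold(target: Sequence[float], control: Sequence[float]) -> float:
    """Choose one global cutoff from target and control block signals.

    Assumed criterion (illustrative only): pick the cutoff that maximizes
    the fraction of target blocks above it minus the fraction of control
    (background) blocks above it.
    """
    if not target or not control:
        raise ValueError("both signal lists must be non-empty")

    # Every observed signal value is a candidate cutoff.
    candidates = sorted(set(target) | set(control))
    best_cutoff = candidates[0]
    best_gap = float("-inf")
    for cutoff in candidates:
        frac_target = sum(x > cutoff for x in target) / len(target)
        frac_control = sum(x > cutoff for x in control) / len(control)
        gap = frac_target - frac_control
        # Strict comparison keeps the lowest cutoff when gaps tie.
        if gap > best_gap:
            best_cutoff, best_gap = cutoff, gap
    return best_cutoff


def call_peaks(blocks: Sequence[Block], threshold: float) -> List[Block]:
    """Keep only the blocks whose total signal exceeds the threshold."""
    return [block for block in blocks if block[3] > threshold]


if __name__ == "__main__":
    # Toy data: sparse background plus a few strongly enriched blocks.
    target_blocks = [
        ("chr1", 0, 100, 1.0),
        ("chr1", 500, 700, 60.0),
        ("chr2", 50, 90, 2.0),
        ("chr2", 900, 1200, 70.0),
    ]
    control_signal = [1.0, 2.0, 2.0, 3.0]
    cutoff = calibrate_threshold([b[3] for b in target_blocks], control_signal)
    print(cutoff, call_peaks(target_blocks, cutoff))
```

Because the background in this kind of data is sparse, a single cutoff derived from the control distribution can separate enriched blocks without a per-region statistical model, which is the intuition the sketch is meant to convey.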

Conclusions: SEACR is a highly selective peak caller that definitively validates the accuracy of CUT&RUN for datasets with known true negatives. Its ease of use and performance in comparison with existing peak calling strategies make it an ideal choice for analyzing CUT&RUN data.

Keywords: CUT&RUN; Epigenome profiling; Peak calling.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Binding Sites / genetics
  • Chromatin / genetics
  • Chromatin Immunoprecipitation / methods*
  • DNA-Binding Proteins / analysis*
  • DNA-Binding Proteins / metabolism
  • Epigenesis, Genetic / genetics
  • Epigenomics / methods*
  • Genome
  • Humans
  • Protein Binding / genetics
  • Sequence Analysis, DNA

Substances

  • Chromatin
  • DNA-Binding Proteins