bam benchmark bioinformatics epigenomics genomic life sciences long read sequencing
ONT Methylation Benchmarking Datasets are generated to benchmark existing methylation-calling tools on the Oxford Nanopore sequencing platform using their recent R10.4.1 flowcell chemistry. It spans a diverse range of species, including bacteria (E. coli, H. pylori J99, H. pylori 26695, A. variabilis, T. denticola), plants (Rice, Arabidopsis), and mammals (mouse, human).In addition, the dataset includes EMSeq data for E. coli, plant, and mouse samples, which can serve as ground truth for methylation studies. It also provides unmethylated whole-genome amplified (WGA) DNA for H. pylori 26695 and a dam- dcm- double mutant (DM) of E. coli that lacks canonical 5mC and 6mA methylation. These variants, together with their wild-type counterparts, offer value for both training and benchmarking DNA methylation calling models.
Datasets will be updated periodically as additional data is generated.
https://github.com/SowpatiLab/ont-basemod-benchmark-data/blob/main/documentation.md
CSIR-Centre for Cellular and Molecular Biology
See all datasets managed by CSIR-Centre for Cellular and Molecular Biology.
onkar.ccmb@csir.res.in; tej.ccmb@csir.res.in
ONT Methylation Benchmarking Datasets was accessed on DATE from https://registry.opendata.aws/ont_basemod_data. Please cite Kulkarni et al. Comprehensive benchmarking of tools for nanopore-based detection of DNA methylation. bioRxiv (2024). doi: https://doi.org/10.1101/2024.11.09.622763 when referencing the ONT methylation benchmarking datasets in publications.
arn:aws:s3:::ont-basemod-benchmark-dataap-south-1aws s3 ls --no-sign-request s3://ont-basemod-benchmark-data/arn:aws:sns:ap-south-1:767415906609:ont-basemod-benchmark-data-object_createdap-south-1