"Are we throwing away good data? Evaluation of chimera detection algorithms on long-read amplicons reveals high false positive rates across algorithms" (Hakimzadeh et al. 2025)

Structure

This repository contains the data and part of the analysis stack for the abovementioned paper. It is structured as follows:

Simulated data holds scripts related to the simulated dataset from generating the simulated data, chimeric sequence creation, quality filtering, and chimera filtering related to the simulated dataset. Moreover, the scripts for the simulated dataset and statistical analysis were used to calculate the F1 score.

Real data holds scripts related to real data analysis.

BlasCh contains the BLAST scripts for alignment and specific module BlasCh designed for processing XML outputs to find false positive chimeras and false negative chimeras.

Figures & tables contain the scripts used for generating graphs and tables.

The workflow we followed for the real dataset was like this:

Name		Name	Last commit message	Last commit date
Latest commit History 435 Commits
BlasCh		BlasCh
DADA2		DADA2
Figures_tables		Figures_tables
Real_data		Real_data
Simulated_data		Simulated_data
LICENSE		LICENSE
README.md		README.md
workflow.png		workflow.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

"Are we throwing away good data? Evaluation of chimera detection algorithms on long-read amplicons reveals high false positive rates across algorithms" (Hakimzadeh et al. 2025)

Structure

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

"Are we throwing away good data? Evaluation of chimera detection algorithms on long-read amplicons reveals high false positive rates across algorithms" (Hakimzadeh et al. 2025)

Structure

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages