This repository is a modified version of the core AMR++ repository, with notable changes and parameterisation for running on different compute systems (notably Super Computing Wales and Google Compute Cloud). For usage and tutorials, refer to the core AMR++ documentation.
Notable changes to the original pipeline:
- Integrated fastp & bowtie2 as QC and alignment packages
- Note: The default aligner for MEGARes remains BWA, consistent with the original AMR++, and can be changed by parameter. Bowtie2 alignment against the MEGARes database significantly changes the alignment rate.
- Based upon modified docker image (passdan/amrplusplus-update)
- Some code tweaks and fixes to work with singularity-slurm submission and to repair Nextflow channel bugs
- Bespoke job submission and result capture to fit our specific requirements
- Added Bracken processing with new results
- Note: Two new workflow parameters: 'standard_AMR_wKraken_and_bracken' and 'kraken_and_bracken' (for input that has already been filtered and host-removed)
- Coming soon: Long-read data options (HiFi/ONT). Testing is underway
The codebase is provided as-is and is hyper-locally modified for our infrastructure. If you are not concerned about Bracken output or fastp & bowtie2, you probably want to work from the original AMR++ repository. Note that the GitHub repository has three branches:
- Master - Set up for slurm queueing systems
- GCP tuned - For deployment on Google Compute Cloud with its buckets and pipelining system
- flat - For local running, with no queuing system
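Fetching the repository and switching branches might look like this (the repository URL below is a placeholder; substitute the actual URL):

```shell
# Clone the repository and check out the branch matching your setup.
# The URL is a placeholder -- use this repository's real URL.
git clone https://github.com/<user>/AMRplusplus.git
cd AMRplusplus
git checkout flat   # or stay on master for slurm systems
```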
- Download the GitHub repository with git clone and the URL above
- The pipeline is designed and configured to run with Singularity. If you are using this, no further installation preparation is required. You may choose to pre-build the Singularity images in advance if wanted. Note: config/singularity_slurm.conf is where the Singularity images are defined.
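As a rough sketch, the entries in config/singularity_slurm.conf follow standard Nextflow config syntax; the scope settings and image tag below are illustrative assumptions, not copied from this repository (the image name itself comes from the docker image noted above):

```groovy
// Hypothetical sketch of config/singularity_slurm.conf --
// settings and the ":latest" tag are illustrative assumptions.
singularity {
    enabled    = true
    autoMounts = true
}
process {
    executor  = 'slurm'
    container = 'docker://passdan/amrplusplus-update:latest'
}
```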
- Optionally, download newer or more appropriate reference files
Either download pre-built indexes directly, or build the indexes to be filtered against from user-supplied fasta files. [Recommended] Download human genome indexes directly from: https://bowtie-bio.sourceforge.net/bowtie2/index.shtml
By default, AMR++ downloads the minikraken2 (2020) database, which covers Bacteria, Archaea and Viruses; this happens the first time you run the pipeline. However, you will likely want a more up-to-date version, perhaps one that also includes fungi and Plasmodium. These can be downloaded from https://benlangmead.github.io/aws-indexes/k2 and placed in a location defined in the params.config file.
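Fetching the pre-built references might look like this (the archive file names are examples only; check the index pages linked above for current versions):

```shell
# Human bowtie2 index (file name is an example -- check the index page)
wget https://genome-idx.s3.amazonaws.com/bt/GRCh38_noalt_as.zip
unzip GRCh38_noalt_as.zip

# Kraken2 database (file name is an example -- check the k2 index page)
wget https://genome-idx.s3.amazonaws.com/kraken/k2_standard_08gb.tar.gz
mkdir -p kraken_db && tar -xzf k2_standard_08gb.tar.gz -C kraken_db
```

Point the relevant entries in params.config at wherever you extract these.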
- An example slurm script defines these parameters:
- workdir: Location where processing will be performed (advice: use a high-speed location on your processing node, e.g. /tmp)
- installdir: Location of this github repo on your system
- resultsdir: Where the main outputs will be transferred to (not the full working folders & outputs)
- run: Name of the sequencing run, used for annotation and as the name of the folder containing the fastq files
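A minimal sketch of such a script, with hypothetical paths and SBATCH settings (adjust both to your cluster):

```shell
#!/bin/bash
#SBATCH --job-name=AMRplusplus
#SBATCH --ntasks=1
#SBATCH --cpus-per-task=16
#SBATCH --mem=64G

# The four parameters described above (values are examples only):
workdir=/tmp/amrpp               # high-speed scratch on the processing node
installdir=$HOME/AMRplusplus     # where this repository was cloned
resultsdir=$HOME/amrpp_results   # where the main outputs are copied
run=run01                        # sequencing-run name = fastq folder name
```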
Default input read location is:
$workdir/$run/fastq
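So, to stage a run, create that layout and place the paired-end fastqs inside it (paths and sample names below are examples):

```shell
# Create the default input layout (example values for workdir and run)
workdir=/tmp/amrpp
run=run01
mkdir -p "$workdir/$run/fastq"

# Paired-end reads go in the fastq folder, e.g.:
touch "$workdir/$run/fastq/sampleA_R1.fastq.gz" \
      "$workdir/$run/fastq/sampleA_R2.fastq.gz"
```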
The default pipeline is standard_AMR_wKraken_and_bracken, which runs from raw fastqs to the endpoint. Alternatives, such as starting from mid-process data or running Kraken only, can be selected with the --pipeline parameter.
Available pipelines:
- demo: Run a demonstration of AMR++
- standard_AMR: Run the standard AMR++ pipeline
- fast_AMR: Run the fast AMR++ pipeline without host removal
- standard_AMR_wKraken: Run the standard AMR++ pipeline with Kraken
- **NEW** standard_AMR_wKraken_and_bracken: Run the standard AMR++ pipeline with Kraken AND Bracken
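Selecting one of the pipelines above might look like this when invoking Nextflow directly (the main script name, profile, and reads glob are assumptions; check the repository and your slurm script for the actual values):

```shell
nextflow run main_AMR++.nf -profile singularity \
    --pipeline standard_AMR_wKraken_and_bracken \
    --reads "$workdir/$run/fastq/*_R{1,2}.fastq.gz"
```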
Available subworkflows:
- eval_qc: Run FastQC analysis
- trim_qc: Run trimming and quality control
- rm_host: Remove host reads
- resistome: Perform resistome analysis
- align: Perform alignment to MEGARes database
- kraken: Perform Kraken analysis
- **NEW** kraken_and_bracken: Perform Kraken and Bracken analysis
- qiime2: Perform QIIME 2 analysis
- bam_resistome: Perform resistome analysis on BAM files
Submit as a slurm & singularity job with:
sbatch AMRplusplus_full.sh
You're finished!