Kraken2
What does it do?
Kraken2 is a program for taxonomic sequence classification.
For more information, see the Kraken2 github, and/or the protocol paper describing use of Kraken2.
If you publish data using this automator, don't forget to cite the authors of the tool Wood and Salzberg, 2014 and the Kraken2 paper Wood et al., 2019
How do I use it?
Subject
In the Subject
field, put kraken2
. Spelling counts, but case sensitivity doesn't.
Description
Required Components
You must include a list of SEQIDs one per line.
Optional Components
In order to customise your analyses, additional settings can be optionally modified:
-
seqtype - By default, the Kraken2 will assume you are running an analysis on paired-end raw sequence data.
- default is
paired
- If you want to analyse long-read nanopore or assembly data, add the following line:
seqtype=nanopore
seqtype=assembly
- default is
-
database - the default analysis will utilize the Kraken2 standard database (9/26/2022) from Kraken2 index zone.
- default is
kraken2
- If you want to analyse using a different database, the following options are currently supported:
database=greengenes
database=plusPF
database=rdp
database=silva
-The kraken2 and plusPF databases are meant for metagenomes, all others are 16S based databases
- default is
Example
For an example kraken2 analysis, see issue 29355. The zip file has been attached to this request as an example (the ftp links expire after approx. 2 weeks).
Interpreting Results
Kraken2 will upload a zipped folder called kraken2_output_redmineID.zip
to the ftp once it has completed. This will contain report files for kraken2 and bracken analyses.
How long does it take?
Kraken2 is much faster than the original kraken, however the time required for analysis will depend on the analysis type and number of sequences requested. If too many sequences are requested at once, the automator may run out of memory and fail. Please try to keep it to 10 metagenomes maximum.
What can go wrong?
1) Requested SEQIDs are not available. If we can't find some of the SEQIDs that you request, you will get a warning message informing you of it.
Version
Version 2.1.2 is currently available at the OLC. (as of 2024-07-4)