usage: stec.py profile_reduce [-h] [-version] [-v verbosity] [-p profile_file]
                              [-g gene_names] [-o output_folder]

Reduce full wgMLST profile from Enterobase using genes of interest

optional arguments:
  -h, --help            show this help message and exit
  -version, --version   show program's version number and exit
  -v verbosity, --verbosity verbosity
                        Set the logging level. Options are debug, info, warning, error, and critical. Default is info.
  -p profile_file, --profile_file profile_file
                        Specify name and path of profile file. If not provided, the default "profiles.list" in the current working directory will be used
  -g gene_names, --gene_names gene_names
                        Name and path of text file containing gene names to use to filter the profile file (one per line). If not provided, the default "genes.txt" in the current working directory will be used. If the file does not exist, the program will attempt to create a file using the .fasta files in the current working directory
  -o output_folder, --output_folder output_folder
                        Name and path of folder into which the reduced profile and notes are to be placed. If not provided, the default "nt_profile" folder in the current working directory will be used
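
The help text for -g says that a missing gene names file is created from the .fasta files in the current working directory. A minimal sketch of what that fallback could look like is shown below. It assumes each gene name is simply the FASTA file name without its extension; the function names gene_names_from_fasta and write_gene_names are hypothetical and are not taken from stec.py.

```python
from pathlib import Path


def gene_names_from_fasta(folder):
    """Return sorted gene names, assumed here to be the stems of the .fasta files."""
    return sorted(path.stem for path in Path(folder).glob("*.fasta"))


def write_gene_names(folder, gene_file="genes.txt"):
    """Write one gene name per line to gene_file inside folder and return its path."""
    names = gene_names_from_fasta(folder)
    target = Path(folder) / gene_file
    target.write_text("".join(f"{name}\n" for name in names))
    return target
```

The one-name-per-line layout matches the format the -g option expects; how the real tool derives names from the FASTA files may differ.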

Outputs

The reduced profile is written to profile.txt in the supplied output folder. It contains every unique profile extracted from the full profile file.

A notes file is written to reducing_notes.txt in the supplied output folder. It contains a note for every sequence type processed from the full profile. If a profile duplicates an earlier profile, its ReducedSequenceType is 0 and the Notes column records that the profile is a duplicate.
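
The reduction and duplicate handling described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the script's actual implementation: it assumes the sequence type sits in the first column of the full profile, that unique reduced profiles are numbered sequentially from 1, and that the function name reduce_profiles and the wording of the duplicate note are hypothetical. Only the use of 0 for duplicates comes from the description above.

```python
def reduce_profiles(header, rows, genes):
    """Keep only the gene columns of interest and collapse duplicate profiles.

    header: column names of the full profile (sequence type assumed first).
    rows: one list of strings per sequence type in the full profile.
    genes: names of the genes of interest.
    Returns (reduced_rows, notes).
    """
    wanted = set(genes)
    # Indices of the requested gene columns, skipping the sequence type column
    keep = [i for i, name in enumerate(header) if i > 0 and name in wanted]

    seen = {}     # allele tuple -> reduced sequence type (assumed sequential)
    reduced = []  # (reduced sequence type, allele, allele, ...)
    notes = []    # (original sequence type, reduced sequence type, note)

    for row in rows:
        alleles = tuple(row[i] for i in keep)
        if alleles in seen:
            # Duplicates get a reduced sequence type of 0, as in reducing_notes.txt
            notes.append((row[0], 0, f"duplicate of reduced profile {seen[alleles]}"))
            continue
        seen[alleles] = len(seen) + 1
        reduced.append((seen[alleles], *alleles))
        notes.append((row[0], seen[alleles], ""))
    return reduced, notes
```

Using the tuple of retained alleles as a dictionary key makes the duplicate check a constant-time lookup per row, which matters for wgMLST profiles with many sequence types.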
