FilterSeq¶
Filters sequences in FASTA/FASTQ files
usage: FilterSeq [-h] [–version] ...
-
-h,--help¶ show this help message and exit
-
--version¶ show program’s version number and exit
- output files:
- <command>-pass
- reads passing filtering operation and modified accordingly, where <command> is the name of the filtering operation that was run.
- <command>-fail
- raw reads failing filtering criteria, where <command> is the name of the filtering operation.
- output annotation fields:
- None
FilterSeq length¶
Filters reads by length.
usage: FilterSeq [-h] [–version] ...
-
-h,--help¶ show this help message and exit
-
-s<seq_files>¶ A list of FASTA/FASTQ files containing sequences to process.
-
--fasta¶ Specify to force output as FASTA rather than FASTQ.
-
--failed¶ If specified create files containing records that fail processing.
-
--log<log_file>¶ Specify to write verbose logging to a file. May not be specified with multiple input files.
-
--nproc<nproc>¶ The number of simultaneous computational processes to execute (CPU cores to utilized).
-
--outdir<out_dir>¶ Specify to changes the output directory to the location specified. The input file directory is used if this is not specified.
-
--outname<out_name>¶ Changes the prefix of the successfully processed output file to the string specified. May not be specified with multiple input files.
-
-n<min_length>¶ Minimum sequence length to retain.
-
--inner¶ If specified exclude consecutive missing characters at either end of the sequence.
FilterSeq maskqual¶
Masks low quality positions.
usage: FilterSeq [-h] [–version] ...
-
-h,--help¶ show this help message and exit
-
-s<seq_files>¶ A list of FASTA/FASTQ files containing sequences to process.
-
--fasta¶ Specify to force output as FASTA rather than FASTQ.
-
--failed¶ If specified create files containing records that fail processing.
-
--log<log_file>¶ Specify to write verbose logging to a file. May not be specified with multiple input files.
-
--nproc<nproc>¶ The number of simultaneous computational processes to execute (CPU cores to utilized).
-
--outdir<out_dir>¶ Specify to changes the output directory to the location specified. The input file directory is used if this is not specified.
-
--outname<out_name>¶ Changes the prefix of the successfully processed output file to the string specified. May not be specified with multiple input files.
-
-q<min_qual>¶ Quality score threshold.
FilterSeq missing¶
Filters reads by N or gap character count.
usage: FilterSeq [-h] [–version] ...
-
-h,--help¶ show this help message and exit
-
-s<seq_files>¶ A list of FASTA/FASTQ files containing sequences to process.
-
--fasta¶ Specify to force output as FASTA rather than FASTQ.
-
--failed¶ If specified create files containing records that fail processing.
-
--log<log_file>¶ Specify to write verbose logging to a file. May not be specified with multiple input files.
-
--nproc<nproc>¶ The number of simultaneous computational processes to execute (CPU cores to utilized).
-
--outdir<out_dir>¶ Specify to changes the output directory to the location specified. The input file directory is used if this is not specified.
-
--outname<out_name>¶ Changes the prefix of the successfully processed output file to the string specified. May not be specified with multiple input files.
-
-n<max_missing>¶ Threshold for fraction of gap or N nucleotides.
-
--inner¶ If specified exclude consecutive missing characters at either end of the sequence.
FilterSeq quality¶
Filters reads by quality score.
usage: FilterSeq [-h] [–version] ...
-
-h,--help¶ show this help message and exit
-
-s<seq_files>¶ A list of FASTA/FASTQ files containing sequences to process.
-
--fasta¶ Specify to force output as FASTA rather than FASTQ.
-
--failed¶ If specified create files containing records that fail processing.
-
--log<log_file>¶ Specify to write verbose logging to a file. May not be specified with multiple input files.
-
--nproc<nproc>¶ The number of simultaneous computational processes to execute (CPU cores to utilized).
-
--outdir<out_dir>¶ Specify to changes the output directory to the location specified. The input file directory is used if this is not specified.
-
--outname<out_name>¶ Changes the prefix of the successfully processed output file to the string specified. May not be specified with multiple input files.
-
-q<min_qual>¶ Quality score threshold.
-
--inner¶ If specified exclude consecutive missing characters at either end of the sequence.
FilterSeq repeats¶
Filters reads by consecutive nucleotide repeats.
usage: FilterSeq [-h] [–version] ...
-
-h,--help¶ show this help message and exit
-
-s<seq_files>¶ A list of FASTA/FASTQ files containing sequences to process.
-
--fasta¶ Specify to force output as FASTA rather than FASTQ.
-
--failed¶ If specified create files containing records that fail processing.
-
--log<log_file>¶ Specify to write verbose logging to a file. May not be specified with multiple input files.
-
--nproc<nproc>¶ The number of simultaneous computational processes to execute (CPU cores to utilized).
-
--outdir<out_dir>¶ Specify to changes the output directory to the location specified. The input file directory is used if this is not specified.
-
--outname<out_name>¶ Changes the prefix of the successfully processed output file to the string specified. May not be specified with multiple input files.
-
-n<max_repeat>¶ Threshold for fraction of repeating nucleotides.
-
--missing¶ If specified count consecutive gap and N characters ‘ in addition to {A,C,G,T}.
-
--inner¶ If specified exclude consecutive missing characters at either end of the sequence.
FilterSeq trimqual¶
Trims sequences by quality score decay.
usage: FilterSeq [-h] [–version] ...
-
-h,--help¶ show this help message and exit
-
-s<seq_files>¶ A list of FASTA/FASTQ files containing sequences to process.
-
--fasta¶ Specify to force output as FASTA rather than FASTQ.
-
--failed¶ If specified create files containing records that fail processing.
-
--log<log_file>¶ Specify to write verbose logging to a file. May not be specified with multiple input files.
-
--nproc<nproc>¶ The number of simultaneous computational processes to execute (CPU cores to utilized).
-
--outdir<out_dir>¶ Specify to changes the output directory to the location specified. The input file directory is used if this is not specified.
-
--outname<out_name>¶ Changes the prefix of the successfully processed output file to the string specified. May not be specified with multiple input files.
-
-q<min_qual>¶ Quality score threshold.
-
--win<window>¶ Nucleotide window size for moving average calculation.
-
--reverse¶ Specify to trim the head of the sequence rather than the tail.