OpenMS
|
Digests a protein database in-silico.
pot. predecessor tools | → Digestor → | pot. successor tools |
---|---|---|
none (FASTA input) | IDFilter (peptide blacklist) |
This application is used to digest a protein database to get all peptides given a cleavage enzyme.
The output can be used e.g. as a blacklist filter input to IDFilter, to remove certain peptides.
The command line parameters of this tool are:
Digestor -- Digests a protein database in-silico. Full documentation: http://www.openms.de/doxygen/nightly/html/TOPP_Digestor.html Version: 3.3.0-pre-nightly-2024-11-20 Nov 21 2024, 02:34:56, Revision: decb5c8 To cite OpenMS: + Pfeuffer, J., Bielow, C., Wein, S. et al.. OpenMS 3 enables reproducible analysis of large-scale mass spec trometry data. Nat Methods (2024). doi:10.1038/s41592-024-02197-7. Usage: Digestor <options> Options (mandatory options marked with '*'): -in <file>* Input file (valid formats: 'fasta') -out <file>* Output file (peptides) (valid formats: 'idXML', 'fasta') -out_type <type> Set this if you cannot control the filename of 'out', e.g., in TOPPAS. (valid: 'idXML', 'fasta') -missed_cleavages <number> The number of allowed missed cleavages (default: '1') (min: '0') -min_length <number> Minimum length of peptide (default: '6') -max_length <number> Maximum length of peptide (default: '40') -enzyme <string> The type of digestion enzyme (default: 'Trypsin') (valid: 'Arg-C/P', 'Asp-N', 'Asp-N/B', 'Clostripain/P', 'elastase-trypsin-chymotrypsin', 'no cleavage', 'unspecific cleavage', 'Trypsin', 'Arg-C', 'staphylococcal protease/D', 'proli ne-endopeptidase/HKR', 'Glu-C+P', 'PepsinA + P', 'cyanogen-bromide', 'leukocyt e elastase', 'proline endopeptidase', 'Asp-N_ambic', 'Chymotrypsin', 'Chymotry psin/P', 'CNBr', 'Formic_acid', 'Lys-C', 'Lys-N', 'Lys-C/P', 'PepsinA', 'TrypC hymo', 'Trypsin/P', 'V8-DE', 'V8-E', 'glutamyl endopeptidase', 'Alpha-lytic protease', '2-iodobenzoate', 'iodosobenzoate') Options for FASTA output files: -FASTA:ID <option> Identifier to use for each peptide: copy from parent protein (parent); a conse cutive number (number); parent ID + consecutive number (both) (default: 'paren t') (valid: 'parent', 'number', 'both') -FASTA:description <option> Keep or remove the (possibly lengthy) FASTA header description. Keeping it can increase resulting FASTA file significantly. (default: 'remove') (valid: 'remove', 'keep') Common TOPP options: -ini <file> Use the given TOPP INI file -threads <n> Sets the number of threads allowed to be used by the TOPP tool (default: '1') -write_ini <file> Writes the default configuration file --help Shows options --helphelp Shows all options (including advanced)
INI file documentation of this tool: