OpenMS
|
Digests a protein database in-silico.
pot. predecessor tools | → Digestor → | pot. successor tools |
---|---|---|
none (FASTA input) | IDFilter (peptide blacklist) |
This application is used to digest a protein database to get all peptides given a cleavage enzyme.
The output can be used e.g. as a blacklist filter input to IDFilter, to remove certain peptides.
The command line parameters of this tool are:
stty: 'standard input': Inappropriate ioctl for device Digestor -- Digests a protein database in-silico. Full documentation: http://www.openms.de/doxygen/nightly/html/TOPP_Digestor.html Version: 3.4.0-pre-nightly-2025-01-20 Jan 21 2025, 02:20:01, Revision: 91e1ce6 To cite OpenMS: + Pfeuffer, J., Bielow, C., Wein, S. et al.. OpenMS 3 enables reproducible analysis of large-scale mass spectrometry data. Nat Methods (2024). doi:10.1038/s41592-024-02197-7. Usage: Digestor <options> Options (mandatory options marked with '*'): -in <file>* Input file (valid formats: 'fasta') -out <file>* Output file (peptides) (valid formats: 'idXML', 'fasta') -out_type <type> Set this if you cannot control the filename of 'out', e.g., in TOPPAS. (valid: 'idXML', 'fasta') -missed_cleavages <number> The number of allowed missed cleavages (default: '1') (min: '0') -min_length <number> Minimum length of peptide (default: '6') -max_length <number> Maximum length of peptide (default: '40') -enzyme <string> The type of digestion enzyme (default: 'Trypsin') (valid: 'glutamyl endopeptidase', 'Alpha-lytic protease', 'no cleavage', 'unspecific cleavage', 'Trypsin', 'Arg-C', 'Arg-C/P', 'Asp-N', 'Asp-N/B', 'Glu-C+P', 'PepsinA + P', 'cyanogen-bromide', 'Clostripain/P', 'elastase-trypsin-chymotrypsin', 'Asp-N_ambic', 'Chymotrypsin', 'Chymotrypsin/P', 'CNBr', 'Formic_acid', 'Lys-C', 'Lys-N', 'Lys-C/P', 'PepsinA', 'TrypChymo', 'Trypsin/P', 'V8-DE', 'V8-E', 'leukocyte elastase', 'proline endopeptidase', '2-iodobenzoate', 'iodosobenzoate', 'staphylococcal protease/D', 'proline-endopeptidase/HKR') Options for FASTA output files: -FASTA:ID <option> Identifier to use for each peptide: copy from parent protein (parent); a consecutive number (number); parent ID + consecutive number (both) (default: 'parent') (valid: 'parent', 'number', 'both') -FASTA:description <option> Keep or remove the (possibly lengthy) FASTA header description. Keeping it can increase resulting FASTA file significantly. (default: 'remove') (valid: 'remove', 'keep') Common TOPP options: -ini <file> Use the given TOPP INI file -threads <n> Sets the number of threads allowed to be used by the TOPP tool (default: '1') -write_ini <file> Writes the default configuration file --help Shows options --helphelp Shows all options (including advanced)
INI file documentation of this tool: