OpenMS
2.7.0
|
Export MS/MS data in .MGF format for GNPS (http://gnps.ucsd.edu).
GNPS (Global Natural Products Social Molecular Networking, http://gnps.ucsd.edu) is an open-access knowledge base for community-wide organization and sharing of raw, processed or identified tandem mass (MS/MS) spectrometry data. The GNPS web-platform makes possible to perform spectral library search against public MS/MS spectral libraries, as well as to perform various data analysis such as MS/MS molecular networking, network annotation propagation (http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1006089), and the Dereplicator-based annotation (https://www.nature.com/articles/nchembio.2219). The GNPS manuscript is available here: https://www.nature.com/articles/nbt.3597
This tool was developed for the Feature Based Molecular Networking (FBMN) workflow on GNPS (https://gnps.ucsd.edu/ProteoSAFe/static/gnps-splash2.jsp)
Please cite our preprint: Nothias, LF., Petras, D., Schmid, R. et al. Feature-based molecular networking in the GNPS analysis environment. Nat Methods 17, 905–908 (2020). https://doi.org/10.1038/s41592-020-0933-6
See the FBMN workflow documentation here (https://ccms-ucsd.github.io/GNPSDocumentation/featurebasedmolecularnetworking/)
In brief, after running an OpenMS "metabolomics" pipeline, the GNPSExport TOPP tool can be used on the consensusXML file and corresponding mzML files to generate the files needed for FBMN on GNPS. These two files are:
For each consensusElement in the consensusXML file, the GNPSExport produces one representative consensus MS/MS spectrum (named peptide annotation in OpenMS jargon) outputted in the MS/MS spectral file (.MGF file). Several modes for the generation of the consensus MS/MS spectrum are available and described below. Note that these parameters are defined in the GNPSExport INI parameters file.
Representative command:
The GNPSExport TOPP tool can be run on a consensusXML file and the corresponding mzML files to generate a MS/MS spectral file (MGF format) and corresponding feature quantification table (.TXT format) that contains the LC-MS peak area intensity.
Requirements:
Parameters:
Note that mass accuracy and the retention time window for the pairing between MS/MS scans and a LC-MS feature or consensusElement is defined at the IDMapper tool step.
A representative OpenMS-GNPS workflow would sequentially use these OpenMS TOPP tools:
The GitHub for that ProteoSAFe workflow and an OpenMS python wrappers is available here: https://github.com/Bioinformatic-squad-DorresteinLab/openms-gnps-workflow
An online version of the OpenMS-GNPS pipeline for FBMN running on CCMS server (http://proteomics.ucsd.edu/) is available on GNPS: https://ccms-ucsd.github.io/GNPSDocumentation/featurebasedmolecularnetworking-with-openms/
GNPS (Global Natural Products Social Molecular Networking, https://gnps.ucsd.edu/ProteoSAFe/static/gnps-splash2.jsp) is an open-access knowledge base for community-wide organization and sharing of raw, processed or identified tandem mass (MS/MS) spectrometry data. The GNPS web-platform makes possible to perform spectral library search against public MS/MS spectral libraries, as well as to perform various data analysis such as MS/MS molecular networking, Network Annotation Propagation Network Annotation Propagation (http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1006089) and the DEREPLICATOR (https://www.nature.com/articles/nchembio.2219) The GNPS paper is available here (https://www.nature.com/articles/nbt.3597)
The command line parameters of this tool are:
GNPSExport -- Tool to export representative consensus MS/MS scan per consensusElement into a .MGF file format . See the documentation on https://ccms-ucsd.github.io/GNPSDocumentation/featurebasedmolecularnetworking_with_ openms Full documentation: http://www.openms.de/doxygen/release/2.7.0/html/TOPP_GNPSExport.html Version: 2.7.0 Sep 13 2021, 20:58:47, Revision: 9110e58 To cite OpenMS: Rost HL, Sachsenberg T, Aiche S, Bielow C et al.. OpenMS: a flexible open-source software platform for mass spectrometry data analysis. Nat Meth. 2016; 13, 9: 741-748. doi:10.1038/nmeth.3959. To cite GNPSExport: Nothias L.F. et al.. Feature-based Molecular Networking in the GNPS Analysis Environment. bioRxiv 812404 (2019). doi:10.1101/812404. Usage: GNPSExport <options> Options (mandatory options marked with '*'): -in_cm <file>* Input consensusXML file containing only consensusElements with "peptide" annotations. (valid formats: 'consensusXML') -in_mzml <files>* Original mzml files containing the ms2 spectra (aka peptide annotation). Must be in order that the consensusXML file maps the origi nal mzML files. (valid formats: 'mzML') -out <file>* Output MGF file (valid formats: 'mgf') -output_type <choice> Specificity of mgf output information (default: 'most_inten se' valid: 'merged_spectra', 'most_intense') -ms2_bin_size <num> Bin size (Da) for fragment ions when merging ms2 scans (def ault: '0.019999999552965' min: '0.0') Options for exporting mgf file with merged spectra per consensusElement: -merged_spectra:precursor_mass_tolerance <num> Precursor mass tolerance (Da) for ms annotations (default: '0.5' min: '0.0') -merged_spectra:cos_similarity <num> Cosine similarity threshold for merged_spectra output (defa ult: '0.9' min: '0.0') Common TOPP options: -ini <file> Use the given TOPP INI file -threads <n> Sets the number of threads allowed to be used by the TOPP tool (default: '1') -write_ini <file> Writes the default configuration file --help Shows options --helphelp Shows all options (including advanced)
INI file documentation of this tool: