Executes a peak-picking and scoring algorithm on MRM/SRM data.

potential predecessor tools	→ OpenSwathAnalyzer →	potential successor tools
OpenSwathChromatogramExtractor		OpenSwathFeatureXMLToTSV
MRMMapper		OpenSwathConfidenceScoring

The idea of the OpenSwath Analyzer is to analyze a series of chromatograms together with the associated meta information (stored in TraML format) in order to determine likely places of elution of a peptide in targeted proteomics data (derived from SWATH-MS or MRM/SRM). This tool will perform peak picking on the chromatograms and scoring in a single tool, if you only want the peak picking look at UTILS_MRMTransitionGroupPicker tool.

The command line parameters of this tool are:

OpenSwathAnalyzer -- Picks peaks and finds features in an SWATH-MS or SRM experiment.
Full documentation: http://www.openms.de/doxygen/release/3.0.0/html/TOPP_OpenSwathAnalyzer.html
Version: 3.0.0 Jul 14 2023, 11:57:33, Revision: be787e9
To cite OpenMS:
 + Rost HL, Sachsenberg T, Aiche S, Bielow C et al.. OpenMS: a flexible open-source software platform for 
   mass spectrometry data analysis. Nat Meth. 2016; 13, 9: 741-748. doi:10.1038/nmeth.3959.

Usage:
  OpenSwathAnalyzer <options>

This tool has algorithm parameters that are not shown here! Please check the ini file for a detailed descript
ion or use the --helphelp option

Options (mandatory options marked with '*'):
  -in <file>*                    Input file containing the chromatograms. (valid formats: 'mzML')
  -tr <file>*                    Transition file (valid formats: 'traML')
  -rt_norm <file>                RT normalization file (how to map the RTs of this run to the ones stored in 
                                 the library) (valid formats: 'trafoXML')
  -out <file>*                   Output file (valid formats: 'featureXML')
  -no-strict                     Run in non-strict mode and allow some chromatograms to not be mapped.
                                 
  -swath_files <files>           [applies only if you have full MS2 spectra maps] Swath files that were used 
                                 to extract the transitions. If present, SWATH specific scoring will be used.
                                  (valid formats: 'mzML')
  -min_upper_edge_dist <double>  [applies only if you have full MS2 spectra maps] Minimal distance to the 
                                 edge to still consider a precursor, in Thomson (only in SWATH) (default: 
                                 '0.0')
                                 
Common TOPP options:
  -ini <file>                    Use the given TOPP INI file
  -threads <n>                   Sets the number of threads allowed to be used by the TOPP tool (default: 
                                 '1')
  -write_ini <file>              Writes the default configuration file
  --help                         Shows options
  --helphelp                     Shows all options (including advanced)

The following configuration subsections are valid:
 - algorithm   Algorithm parameters section

You can write an example INI file using the '-write_ini' option.
Documentation of subsection parameters can be found in the doxygen documentation or the INIFileEditor.
For more information, please consult the online documentation for this tool:
  - http://www.openms.de/doxygen/release/3.0.0/html/TOPP_OpenSwathAnalyzer.html

INI file documentation of this tool:

Legend:

required parameter

advanced parameter

+OpenSwathAnalyzerPicks peaks and finds features in an SWATH-MS or SRM experiment.

version3.0.0 Version of the tool that generated this parameters file.

++1Instance '1' section for 'OpenSwathAnalyzer'

in input file containing the chromatograms.input file*.mzML

tr transition fileinput file*.traML

rt_norm RT normalization file (how to map the RTs of this run to the ones stored in the library)input file*.trafoXML

out output fileoutput file*.featureXML

no-strictfalse run in non-strict mode and allow some chromatograms to not be mapped.true, false

swath_files[] [applies only if you have full MS2 spectra maps] Swath files that were used to extract the transitions. If present, SWATH specific scoring will be used.input file*.mzML

min_upper_edge_dist0.0 [applies only if you have full MS2 spectra maps] Minimal distance to the edge to still consider a precursor, in Thomson (only in SWATH)

log Name of log file (created only when specified)

debug0 Sets the debug level

threads1 Sets the number of threads allowed to be used by the TOPP tool

no_progressfalse Disables progress logging to command linetrue, false

forcefalse Overrides tool-specific checkstrue, false

testfalse Enables the test mode (needed for internal use only)true, false

+++modelOptions to control the modeling of retention time transformations from data

typelinear Type of modellinear, b_spline, interpolated, lowess

symmetric_regressionfalse Only for 'linear' model: Perform linear regression on 'y - x' vs. 'y + x', instead of on 'y' vs. 'x'.true, false

+++algorithmAlgorithm parameters section

stop_report_after_feature-1 Stop reporting after feature (ordered by quality; -1 means do not stop).

rt_extraction_window-1.0 Only extract RT around this value (-1 means extract over the whole range, a value of 500 means to extract around +/- 500 s of the expected elution). For this to work, the TraML input file needs to contain normalized RT values.

rt_normalization_factor1.0 The normalized RT is expected to be between 0 and 1. If your normalized RT has a different range, pass this here (e.g. it goes from 0 to 100, set this value to 100)

quantification_cutoff0.0 Cutoff in m/z below which peaks should not be used for quantification any more0.0:∞

write_convex_hullfalse Whether to write out all points of all features into the featureXMLtrue, false

spectrum_addition_methodsimple For spectrum addition, either use simple concatenation or use peak resamplingsimple, resample

add_up_spectra1 Add up spectra around the peak apex (needs to be a non-even integer)1:∞

spacing_for_spectra_resampling5.0e-03 If spectra are to be added, use this spacing to add them up0.0:∞

uis_threshold_sn-1 S/N threshold to consider identification transition (set to -1 to consider all)

uis_threshold_peak_area0 Peak area threshold to consider identification transition (set to -1 to consider all)

scoring_modeldefault Scoring model to usedefault, single_transition

im_extra_drift0.0 Extra drift time to extract for IM scoring (as a fraction, e.g. 0.25 means 25% extra on each side)0.0:∞

stricttrue Whether to error (true) or skip (false) if a transition in a transition group does not have a corresponding chromatogram.true, false

++++TransitionGroupPicker

stop_after_feature-1 Stop finding after feature (ordered by intensity; -1 means do not stop).

stop_after_intensity_ratio1.0e-04 Stop after reaching intensity ratio

min_peak_width1.0e-03 Minimal peak width (s), discard all peaks below this value (-1 means no action).

peak_integrationoriginal Calculate the peak area and height either the smoothed or the raw chromatogram data.original, smoothed

background_subtractionnone Remove background from peak signal using estimated noise levels. The 'original' method is only provided for historical purposes, please use the 'exact' method and set parameters using the PeakIntegrator: settings. The same original or smoothed chromatogram specified by peak_integration will be used for background estimation.none, original, exact

recalculate_peaksfalse Tries to get better peak picking by looking at peak consistency of all picked peaks. Tries to use the consensus (median) peak border if the variation within the picked peaks is too large.true, false

use_precursorsfalse Use precursor chromatogram for peak picking (note that this may lead to precursor signal driving the peak picking)true, false

use_consensustrue Use consensus peak boundaries when computing transition group picking (if false, compute independent peak boundaries for each transition)true, false

recalculate_peaks_max_z1.0 Determines the maximal Z-Score (difference measured in standard deviations) that is considered too large for peak boundaries. If the Z-Score is above this value, the median is used for peak boundaries (default value 1.0).

minimal_quality-1.0e04 Only if compute_peak_quality is set, this parameter will not consider peaks below this quality threshold

resample_boundary15.0 For computing peak quality, how many extra seconds should be sample left and right of the actual peak

compute_peak_qualityfalse Tries to compute a quality value for each peakgroup and detect outlier transitions. The resulting score is centered around zero and values above 0 are generally good and below -1 or -2 are usually bad.true, false

compute_peak_shape_metricsfalse Calculates various peak shape metrics (e.g., tailing) that can be used for downstream QC/QA.true, false

compute_total_mifalse Compute mutual information metrics for individual transitions that can be used for OpenSWATH/IPF scoring.true, false

boundary_selection_methodlargest Method to use when selecting the best boundaries for peaks.largest, widest

+++++PeakPickerMRM

sgolay_frame_length15 The number of subsequent data points used for smoothing.
This number has to be uneven. If it is not, 1 will be added.

sgolay_polynomial_order3 Order of the polynomial that is fitted.

gauss_width50.0 Gaussian width in seconds, estimated peak size.

use_gausstrue Use Gaussian filter for smoothing (alternative is Savitzky-Golay filter)false, true

peak_width-1.0 Force a certain minimal peak_width on the data (e.g. extend the peak at least by this amount on both sides) in seconds. -1 turns this feature off.

signal_to_noise1.0 Signal-to-noise threshold at which a peak will not be extended any more. Note that setting this too high (e.g. 1.0) can lead to peaks whose flanks are not fully captured.0.0:∞

sn_win_len1000.0 Signal to noise window length.

sn_bin_count30 Signal to noise bin count.

write_sn_log_messagesfalse Write out log messages of the signal-to-noise estimator in case of sparse windows or median in rightmost histogram bintrue, false

remove_overlapping_peaksfalse Try to remove overlapping peaks during peak pickingfalse, true

methodcorrected Which method to choose for chromatographic peak-picking (OpenSWATH legacy on raw data, corrected picking on smoothed chromatogram or Crawdad on smoothed chromatogram).legacy, corrected, crawdad

+++++PeakIntegrator

integration_typeintensity_sum The integration technique to use in integratePeak() and estimateBackground() which uses either the summed intensity, integration by Simpson's rule or trapezoidal integration.intensity_sum, simpson, trapezoid

baseline_typebase_to_base The baseline type to use in estimateBackground() based on the peak boundaries. A rectangular baseline shape is computed based either on the minimal intensity of the peak boundaries, the maximum intensity or the average intensity (base_to_base).base_to_base, vertical_division, vertical_division_min, vertical_division_max

fit_EMGfalse Fit the chromatogram/spectrum to the EMG peak model.false, true

++++DIAScoring

dia_extraction_window0.05 DIA extraction window in Th or ppm.0.0:∞

dia_extraction_unitTh DIA extraction window unitTh, ppm

dia_centroidedfalse Use centroided DIA data.true, false

dia_byseries_intensity_min300.0 DIA b/y series minimum intensity to consider.0.0:∞

dia_byseries_ppm_diff10.0 DIA b/y series minimal difference in ppm to consider.0.0:∞

dia_nr_isotopes4 DIA number of isotopes to consider.0:∞

dia_nr_charges4 DIA number of charges to consider.0:∞

peak_before_mono_max_ppm_diff20.0 DIA maximal difference in ppm to count a peak at lower m/z when searching for evidence that a peak might not be monoisotopic.0.0:∞

++++EMGScoring

interpolation_step0.2 Sampling rate for the interpolation of the model function.

tolerance_stdev_bounding_box3.0 Bounding box has range [minimim of data, maximum of data] enlarged by tolerance_stdev_bounding_box times the standard deviation of the data.

max_iteration500 Maximum number of iterations using by Levenberg-Marquardt algorithm.

init_momfalse Initialize parameters using method of moments estimators.true, false

+++++statistics

mean1.0 Centroid position of the model.

variance1.0 Variance of the model.

++++Scores

use_shape_scoretrue Use the shape score (this score measures the similarity in shape of the transitions using a cross-correlation)true, false

use_coelution_scoretrue Use the coelution score (this score measures the similarity in coelution of the transitions using a cross-correlation)true, false

use_rt_scoretrue Use the retention time score (this score measure the difference in retention time)true, false

use_library_scoretrue Use the library scoretrue, false

use_elution_model_scoretrue Use the elution model (EMG) score (this score fits a gaussian model to the peak and checks the fit)true, false

use_intensity_scoretrue Use the intensity scoretrue, false

use_nr_peaks_scoretrue Use the number of peaks scoretrue, false

use_total_xic_scoretrue Use the total XIC scoretrue, false

use_total_mi_scorefalse Use the total MI scoretrue, false

use_sn_scoretrue Use the SN (signal to noise) scoretrue, false

use_mi_scorefalse Use the MI (mutual information) scoretrue, false

use_dia_scorestrue Use the DIA (SWATH) scores. If turned off, will not use fragment ion spectra for scoring.true, false

use_ms1_correlationfalse Use the correlation scores with the MS1 elution profilestrue, false

use_sonar_scoresfalse Use the scores for SONAR scans (scanning swath)true, false

use_ion_mobility_scoresfalse Use the scores for Ion Mobility scanstrue, false

use_ms1_fullscanfalse Use the full MS1 scan at the peak apex for scoring (ppm accuracy of precursor and isotopic pattern)true, false

use_ms1_mifalse Use the MS1 MI scoretrue, false

use_uis_scoresfalse Use UIS scores for peptidoform identificationtrue, false

use_ionseries_scorestrue Use MS2-level b/y ion-series scores for peptidoform identificationtrue, false

use_ms2_isotope_scorestrue Use MS2-level isotope scores (pearson & manhattan) across product transitions (based on ID if annotated or averagine)true, false