![]() |
OpenMS
|
Export MSExperiment data to Apache Arrow format. More...
#include <OpenMS/FORMAT/MSExperimentArrowExport.h>
Static Public Member Functions | |
| static std::vector< std::string > | getSpectraArrowColumnNames (const MSExperiment &exp, const ArrowSpectraExportConfig &config=ArrowSpectraExportConfig{}) |
| Get available column names for spectra Arrow export. | |
| static std::vector< std::string > | getChromatogramArrowColumnNames (const MSExperiment &exp, const ArrowChromatogramExportConfig &config=ArrowChromatogramExportConfig{}) |
| Get available column names for chromatogram Arrow export. | |
| static bool | exportSpectraToArrowCDataInterface (const MSExperiment &exp, const ArrowSpectraExportConfig &config, ::ArrowSchema *out_schema, ::ArrowArray *out_array) |
| Export spectra to Arrow via C Data Interface (zero-copy to Python) | |
| static bool | exportChromatogramsToArrowCDataInterface (const MSExperiment &exp, const ArrowChromatogramExportConfig &config, ::ArrowSchema *out_schema, ::ArrowArray *out_array) |
| Export chromatograms to Arrow via C Data Interface (zero-copy to Python) | |
| static bool | exportSpectraToParquet (const MSExperiment &exp, const String &filename, const ArrowSpectraExportConfig &config=ArrowSpectraExportConfig{}, const ParquetWriteConfig &parquet_config=ParquetWriteConfig{}) |
| Export MSExperiment spectra to Parquet file. | |
| static bool | exportChromatogramsToParquet (const MSExperiment &exp, const String &filename, const ArrowChromatogramExportConfig &config=ArrowChromatogramExportConfig{}, const ParquetWriteConfig &parquet_config=ParquetWriteConfig{}) |
| Export MSExperiment chromatograms to Parquet file. | |
Export MSExperiment data to Apache Arrow format.
This class provides static methods to export MSExperiment spectra and chromatograms to Apache Arrow Tables and Parquet files.
|
static |
Export chromatograms to Arrow via C Data Interface (zero-copy to Python)
| [in] | exp | The MSExperiment to export |
| [in] | config | Export configuration |
| [out] | out_schema | Pointer to ArrowSchema struct |
| [out] | out_array | Pointer to ArrowArray struct |
|
static |
Export MSExperiment chromatograms to Parquet file.
Exports chromatogram data to Apache Parquet format. See exportSpectraToParquet() for details on Parquet benefits and options.
| [in] | exp | The MSExperiment to export |
| [in] | filename | Output file path |
| [in] | config | Arrow export configuration |
| [in] | parquet_config | Parquet writing options |
|
static |
Export spectra to Arrow via C Data Interface (zero-copy to Python)
Exports the Arrow schema and array to C Data Interface format, which allows zero-copy transfer to PyArrow via pyarrow.Table._import_from_c().
| [in] | exp | The MSExperiment to export |
| [in] | config | Export configuration |
| [out] | out_schema | Pointer to ArrowSchema struct (caller must allocate) |
| [out] | out_array | Pointer to ArrowArray struct (caller must allocate) |
|
static |
Export MSExperiment spectra to Parquet file.
Exports spectra data to Apache Parquet format, which provides:
Long format schema (one row per peak):
Semi-wide format schema (one row per spectrum):
Performance notes:
| [in] | exp | The MSExperiment to export |
| [in] | filename | Output file path (.parquet extension recommended) |
| [in] | config | Arrow export configuration (filtering, format, columns) |
| [in] | parquet_config | Parquet writing options (compression, row groups) |
Example:
|
static |
Get available column names for chromatogram Arrow export.
| [in] | exp | The MSExperiment to analyze |
| [in] | config | Export configuration |
|
static |
Get available column names for spectra Arrow export.
Returns the list of column names that would be included in the export based on the configuration and the actual data in the experiment.
| [in] | exp | The MSExperiment to analyze |
| [in] | config | Export configuration |