![]() |
OpenMS
|
Import and export ProteinIdentification data to/from Apache Arrow format. More...
#include <OpenMS/FORMAT/ProteinIdentificationArrowIO.h>
Static Public Member Functions | |
| static std::shared_ptr< arrow::Table > | exportProteinsToArrow (const std::vector< ProteinIdentification > &protein_identifications) |
| Export protein hits to Apache Arrow Table. | |
| static bool | exportProteinsToParquet (const std::vector< ProteinIdentification > &protein_identifications, const String &filename, const ParquetWriteConfig &config=ParquetWriteConfig{}) |
| Export protein hits to Parquet file. | |
| static std::shared_ptr< arrow::Table > | exportProteinGroupsToArrow (const std::vector< ProteinIdentification > &protein_identifications) |
| Export protein groups to Apache Arrow Table. | |
| static bool | exportProteinGroupsToParquet (const std::vector< ProteinIdentification > &protein_identifications, const String &filename, const ParquetWriteConfig &config=ParquetWriteConfig{}) |
| Export protein groups to Parquet file. | |
| static std::shared_ptr< arrow::Table > | exportSearchParamsToArrow (const std::vector< ProteinIdentification > &protein_identifications) |
| Export search parameters to Apache Arrow Table. | |
| static bool | exportSearchParamsToParquet (const std::vector< ProteinIdentification > &protein_identifications, const String &filename, const ParquetWriteConfig &config=ParquetWriteConfig{}) |
| Export search parameters to Parquet file. | |
| static bool | importFromParquet (const String &proteins_filename, const String &protein_groups_filename, const String &search_params_filename, std::vector< ProteinIdentification > &protein_identifications) |
| Import all three Parquet files and reconstruct ProteinIdentifications. | |
| static bool | importSearchParamsFromArrow (const std::shared_ptr< arrow::Table > &table, std::vector< ProteinIdentification > &protein_identifications) |
| Import search parameters from Arrow Table. | |
| static bool | importProteinsFromArrow (const std::shared_ptr< arrow::Table > &table, std::vector< ProteinIdentification > &protein_identifications) |
| Import protein hits from Arrow Table. | |
| static bool | importProteinGroupsFromArrow (const std::shared_ptr< arrow::Table > &table, std::vector< ProteinIdentification > &protein_identifications) |
| Import protein groups from Arrow Table. | |
| static bool | importSearchParamsFromParquet (const String &filename, std::vector< ProteinIdentification > &protein_identifications) |
| Import search parameters from Parquet file. | |
| static bool | importProteinsFromParquet (const String &filename, std::vector< ProteinIdentification > &protein_identifications) |
| Import protein hits from Parquet file. | |
| static bool | importProteinGroupsFromParquet (const String &filename, std::vector< ProteinIdentification > &protein_identifications) |
| Import protein groups from Parquet file. | |
Import and export ProteinIdentification data to/from Apache Arrow format.
This class provides static methods to export and import ProteinIdentification data to/from Apache Arrow Tables and Parquet files. Separate tables are provided for protein hits, protein groups, and search parameters.
|
static |
Export protein groups to Apache Arrow Table.
Each ProteinGroup becomes one row with group probability and member accessions.
| [in] | protein_identifications | Vector of protein identifications |
|
static |
Export protein groups to Parquet file.
| [in] | protein_identifications | Vector of protein identifications |
| [in] | filename | Output file path |
| [in] | config | Parquet writing options |
|
static |
Export protein hits to Apache Arrow Table.
Each ProteinHit becomes one row with identification, score, and metadata.
| [in] | protein_identifications | Vector of protein identifications |
|
static |
Export protein hits to Parquet file.
| [in] | protein_identifications | Vector of protein identifications |
| [in] | filename | Output file path |
| [in] | config | Parquet writing options |
|
static |
Export search parameters to Apache Arrow Table.
Each ProteinIdentification's SearchParameters becomes one row.
| [in] | protein_identifications | Vector of protein identifications |
|
static |
Export search parameters to Parquet file.
| [in] | protein_identifications | Vector of protein identifications |
| [in] | filename | Output file path |
| [in] | config | Parquet writing options |
|
static |
Import all three Parquet files and reconstruct ProteinIdentifications.
Reads the three Parquet files and reconstructs a vector of ProteinIdentification objects with hits, groups, and search parameters.
| [in] | proteins_filename | Path to proteins Parquet file |
| [in] | protein_groups_filename | Path to protein groups Parquet file |
| [in] | search_params_filename | Path to search params Parquet file |
| [out] | protein_identifications | Reconstructed protein identifications |
|
static |
Import protein groups from Arrow Table.
Adds ProteinGroups and IndistinguishableProteins to matching ProteinIdentifications by run_identifier.
| [in] | table | Arrow Table with protein groups |
| [out] | protein_identifications | Protein identifications to populate |
|
static |
Import protein groups from Parquet file.
| [in] | filename | Path to Parquet file |
| [out] | protein_identifications | Protein identifications to populate |
|
static |
Import protein hits from Arrow Table.
Adds ProteinHits to matching ProteinIdentifications by run_identifier. If no matching ProteinIdentification exists, creates new ones.
| [in] | table | Arrow Table with protein hits |
| [out] | protein_identifications | Protein identifications to populate |
|
static |
Import protein hits from Parquet file.
| [in] | filename | Path to Parquet file |
| [out] | protein_identifications | Protein identifications to populate |
|
static |
Import search parameters from Arrow Table.
Each row becomes a ProteinIdentification shell with run-level metadata and SearchParameters populated.
| [in] | table | Arrow Table with search parameters |
| [out] | protein_identifications | Reconstructed protein identifications |
|
static |
Import search parameters from Parquet file.
| [in] | filename | Path to Parquet file |
| [out] | protein_identifications | Reconstructed protein identifications |