![]() |
OpenMS
|
Public helpers for writing and concatenating Arrow tables to Parquet files. More...
Functions | |
| String | generateUuidV4 () |
| Generate a lowercase hyphenated RFC 4122 version-4 UUID string. | |
| bool | writeTableToParquet (const std::shared_ptr< arrow::Table > &table, const String &filename, const ParquetWriteConfig &config=ParquetWriteConfig{}) |
| Write an Arrow table to a Parquet file. | |
| bool | concatenateAndWriteToParquet (const std::vector< std::shared_ptr< arrow::Table > > &tables, const String &filename, const ParquetWriteConfig &config=ParquetWriteConfig{}) |
| Concatenate a vector of Arrow tables and write the result to a Parquet file. | |
Public helpers for writing and concatenating Arrow tables to Parquet files.
TOPP tools link against libOpenMS (which exports these helpers) but not directly against Arrow/Parquet. These wrappers keep all Arrow/Parquet API calls inside libOpenMS so downstream binaries don't need to import Arrow symbols.
| bool concatenateAndWriteToParquet | ( | const std::vector< std::shared_ptr< arrow::Table > > & | tables, |
| const String & | filename, | ||
| const ParquetWriteConfig & | config = ParquetWriteConfig{} |
||
| ) |
Concatenate a vector of Arrow tables and write the result to a Parquet file.
All tables must share the same schema. An empty input vector is a no-op (returns true without writing).
| [in] | tables | Vector of Arrow tables to concatenate (must share schema) |
| [in] | filename | Output file path |
| [in] | config | Parquet writer configuration |
tables is empty), false on error | String generateUuidV4 | ( | ) |
Generate a lowercase hyphenated RFC 4122 version-4 UUID string.
Used by QPX Parquet exporters when attaching file metadata.
| bool writeTableToParquet | ( | const std::shared_ptr< arrow::Table > & | table, |
| const String & | filename, | ||
| const ParquetWriteConfig & | config = ParquetWriteConfig{} |
||
| ) |
Write an Arrow table to a Parquet file.
| [in] | table | The Arrow table to write (must not be null) |
| [in] | filename | Output file path |
| [in] | config | Parquet writer configuration (compression, row group size, ...) |