HDF5

  • HDF stands for Hierarchical Data Format.
  • Pandas provides a convenient wrapper via pd.HDFStore; an opened file can be accessed like a dict-like object, with stored objects looked up by key.
  • HDF5 is suitable for write-once, read-many applications. It is not a database; simultaneous writes may corrupt the file.
  • If the file is stored online, consider Apache Parquet.
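
The dict-like access and the write-once, read-many pattern can be sketched as follows. This is a minimal illustration, not a recommended layout: the DataFrame contents, the key "measurements", and the temporary file path are hypothetical, and it assumes the PyTables package (tables), which pd.HDFStore relies on, is installed.

```python
import os
import tempfile

import pandas as pd

# Hypothetical example data; any DataFrame would do.
df = pd.DataFrame({"a": [1, 2, 3], "b": [0.1, 0.2, 0.3]})

# Hypothetical file location in a fresh temporary directory.
path = os.path.join(tempfile.mkdtemp(), "example.h5")

# Write once: assign to a key, as with a dictionary.
# mode="w" creates the file (overwriting any existing one).
with pd.HDFStore(path, mode="w") as store:
    store["measurements"] = df

# Read many: reopen read-only so readers cannot modify the file.
with pd.HDFStore(path, mode="r") as store:
    keys = store.keys()             # names of the stored objects
    loaded = store["measurements"]  # dictionary-style lookup
```

Opening the file with mode="r" for every reader keeps all writing in a single, earlier step, which avoids the simultaneous-write situation described above.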