Add collective metadata functions to the low level API #2224
base: master
Conversation
The HDF5 documentation says H5P[get/set]_all_coll_metadata_ops can take file, group, dataset, datatype, link, or attribute access property list identifiers. Only file, link, and dataset access property lists appear to be exposed by h5py.
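For the property list classes that are exposed, per-object control could look like the following sketch (this assumes the new method is mirrored on h5p.PropDAID exactly as described in the PR description below; the dataset name is illustrative):

```python
# Sketch: enable collective metadata reads for a single dataset through
# its dataset access property list (h5p.PropDAID), assuming the PR's
# set_all_coll_metadata_ops() is mirrored there.
import h5py

dapl = h5py.h5p.create(h5py.h5p.DATASET_ACCESS)
dapl.set_all_coll_metadata_ops(True)
# The property list can then be supplied when opening the dataset, e.g.:
#   dsid = h5py.h5d.open(fid, b"data", dapl)
```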
Codecov Report: Base: 90.01% // Head: 89.30% // Decreases project coverage by -0.72%.
Additional details and impacted files:

@@            Coverage Diff             @@
##           master    #2224      +/-   ##
==========================================
- Coverage   90.01%   89.30%    -0.72%
==========================================
  Files          17       17
  Lines        2394     2394
==========================================
- Hits         2155     2138       -17
- Misses        239      256       +17
Just driving by to show my support. These optimizations are critical for any non-trivial level of scaling, and I hope they can be added to h5py soon.
Hi, thanks @jchelly for implementing this! May I ask what the status of this PR is? I was looking for exactly these low-level API functions in h5py and found this PR. I am more than happy to contribute code for any remaining work so that this will be available in h5py in the future.
I'd like to add the following functions to the low level API:

- H5Pget_all_coll_metadata_ops / H5Pset_all_coll_metadata_ops
- H5Pget_coll_metadata_write / H5Pset_coll_metadata_write
For context: I'm using h5py on an HPC cluster to process simulation outputs stored as HDF5. The code is distributed over multiple compute nodes with 128 cores each. To make use of all of the CPU cores I run Python under mpi4py with one process per core, and I use collective I/O to read the input simulation data and write out the results.
This puts quite a load on the Lustre parallel file system, and I think it's probably because every process accesses the files independently for metadata operations. I'm hoping that can be alleviated by having HDF5 do all file access in collective mode so that only a few processes per node need to access the file system.
For my use case I just need to put the whole file in collective metadata mode. To do that I've added get/set_all_coll_metadata_ops() and get/set_coll_metadata_write() methods to h5p.PropFAID. The HDF5 documentation says that H5Pset_all_coll_metadata_ops() can also be called on group, dataset, datatype, link, or attribute access property lists. Of those, I think h5py only exposes link and dataset access property lists, so I also added get/set_all_coll_metadata_ops() to h5p.PropLAID and h5p.PropDAID.
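For illustration, here is a minimal sketch of how these new PropFAID methods might be used together with the MPI-IO driver, assuming an h5py build against parallel HDF5 (the file and dataset names are made up):

```python
# Sketch: open a file with collective metadata operations enabled, using
# the get/set methods this PR adds to h5p.PropFAID.
# Assumes h5py was built with parallel HDF5; names are illustrative.
from mpi4py import MPI
import h5py

comm = MPI.COMM_WORLD

# File access property list: MPI-IO driver plus collective metadata.
fapl = h5py.h5p.create(h5py.h5p.FILE_ACCESS)
fapl.set_fapl_mpio(comm, MPI.Info())
fapl.set_all_coll_metadata_ops(True)  # collective metadata reads (this PR)
fapl.set_coll_metadata_write(True)    # collective metadata writes (this PR)

# Open through the low-level API, then wrap in a high-level File object.
fid = h5py.h5f.open(b"snapshot.hdf5", h5py.h5f.ACC_RDWR, fapl=fapl)
with h5py.File(fid) as f:
    # Metadata operations on this file are now collective, so every rank
    # must perform them, in the same order.
    dset = f["data"]
```

One caveat worth noting: once collective metadata reads are enabled, all ranks must participate in every metadata operation, so code paths where only some ranks open a group or dataset will hang.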