Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cythonize slow parts of the code #204

Open
asmeurer opened this issue Aug 10, 2021 · 3 comments
Open

Cythonize slow parts of the code #204

asmeurer opened this issue Aug 10, 2021 · 3 comments
Assignees

Comments

@asmeurer
Copy link
Collaborator

For #198 and other performance issues, it may be worth seeing if we can Cythonize key parts of the library to improve the performance. It's still not clear yet whether this can work, or if the performance gains will be significant enough to warrant the additional complexity of having non-pure Python parts of the code.

@ericdatakelly ericdatakelly added this to the August 2021 milestone Aug 12, 2021
@ericdatakelly ericdatakelly modified the milestones: August 2021, September 2021 Sep 29, 2021
@ArvidJB
Copy link
Collaborator

ArvidJB commented Oct 11, 2021

Not sure if you have done this already, but it's probably worth trying to isolate the performance at the various levels:

  1. versioned_hdf5
  2. h5py
  3. libhdf5
  4. file io

When I run a slightly modified repro from #193

import h5py
import numpy as np
from versioned_hdf5 import VersionedHDF5File

if __name__ == '__main__':
    data = np.arange(3*74*3944).reshape((3, 74, 3944))

    with h5py.File('/tmp/foo_versioned.h5', 'w') as f:
        vf = VersionedHDF5File(f)
        with vf.stage_version('r0') as sv:
            sv.create_dataset('bar', data=data, chunks=(900, 3, 3))

    with h5py.File('/tmp/foo_unversioned.h5', 'w') as f:
        f.create_dataset('bar', data=data, chunks=(900, 3, 3), maxshape=(None, None, None))

    # takes about 19 seconds for me
    for _ in range(10):
        with h5py.File('/tmp/foo_versioned.h5', 'r') as f:
            vf = VersionedHDF5File(f)
            a = vf[vf.current_version]['bar'][:]
            assert a.shape == (3, 74, 3944)

    # takes about 9 seconds
    for _ in range(10):
        with h5py.File('/tmp/foo_unversioned.h5', 'r') as f:
            a = f['bar'][:]
            assert a.shape == (3, 74, 3944)

It seems that the versioned_hdf5 layer adds significant overhead over plain h5py. Let's try to bring that down.

It's probably worth writing the equivalent HDF5 code in C to compare it with h5py, to see how much overhead h5py adds. Maybe there are some significant savings there as well?

@peytondmurray
Copy link
Collaborator

I can look into optimizations that can be done here. Without knowing exactly what the cause of the slow performance is, I can spend 10h investigating what can be done and pursuing simpler optimizations. If I need more time on it, I can report back here with a more in-depth assessment. Does that sounds good?

@peytondmurray peytondmurray self-assigned this Jun 6, 2023
@crusaderky
Copy link
Collaborator

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants