
WIP: Refactor inference modules #143

Draft · kaiserls wants to merge 60 commits into main

Conversation

@kaiserls (Collaborator) commented Nov 14, 2024

Description

This pull request addresses architectural limitations highlighted in issues #134 and #42. While the current design is effective for a stable prototype and generating results, it lacks flexibility for future extensions. Building on the refactoring efforts in #141 and #142, this PR restructures the framework to allow more flexible and modular usage.

Key Changes

  1. New Subpackages:
  • evaluation: Handles kernel density estimation and the mapping of densities between data and parameter spaces.
  • inferences: Manages different approaches to selecting points for density evaluation:
    • sampling: Dynamically generates points based on the density of previous points.
    • grids: Uses a predefined set of points, enabling straightforward operations like chunking for multiprocessing.
  • samplers: Encapsulates dependencies on specific samplers for sampling-based inference.
  2. Implementation of the Dependency Inversion Principle:
  • The dense_grid no longer depends on InferenceTypes or calc_kernel_width. Instead, it relies on the abstract KDE class, which serves as a callable interface.
  • The evaluate_density function no longer requires direct access to data, kernel_width, or the hardcoded eval_gauss_kde function. Instead, it operates via the KDE interface.
  • Enums such as InferenceType and DenseGridType are no longer used globally but only locally, in inference and dense_grid respectively.
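To make the dependency inversion concrete, here is a minimal sketch of what a callable KDE interface could look like. GaussKDE and this version of evaluate_density are illustrative stand-ins, not the package's actual implementations:

```python
# Sketch of a callable KDE interface (illustrative, not the actual eulerpi API).
from abc import ABC, abstractmethod

import numpy as np


class KDE(ABC):
    """Abstract callable interface: maps a data-space point to a density estimate."""

    @abstractmethod
    def __call__(self, x: np.ndarray) -> float: ...


class GaussKDE(KDE):
    """Toy Gaussian KDE; a stand-in for a concrete implementation."""

    def __init__(self, data: np.ndarray, kernel_width: float):
        self.data = data  # shape (num_samples, data_dim)
        self.kernel_width = kernel_width

    def __call__(self, x: np.ndarray) -> float:
        diffs = (self.data - x) / self.kernel_width
        kernels = np.exp(-0.5 * np.sum(diffs**2, axis=1))
        dim = self.data.shape[1]
        norm = (2 * np.pi) ** (dim / 2) * self.kernel_width**dim
        return float(np.mean(kernels) / norm)


def evaluate_density(point: np.ndarray, kde: KDE) -> float:
    # Depends only on the abstract interface, not on data or kernel_width.
    return kde(point)
```

Any other density estimator can then be swapped in by subclassing KDE, without touching evaluate_density or the grid code.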

This pull request fixes #134 and allows #42 to be implemented quickly.

The dependency graph of eulerpi before the refactoring at commit 627ecab. Many of the dependencies are not visible because this graph contains only modules, not classes.
[image: eulerpi_old]
The current dependency graph; the larger number of nodes results from splitting modules with multiple classes into separate files.
[image: eulerpi]
An overview of the external dependencies:
[image: eulerpi_external]

Question

  • Do we want to / should we support evaluating multiple slices at once, directly?
  • Should we reintroduce num_runs in the sampler? The reason for the sub_runs was to have some fault tolerance. I am not sure whether our package should take care of that or whether the details should be handled by the sampler. The user can still loop over the sampling by using continue_sampling.
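For illustration, the user-side fault-tolerance loop mentioned above could look roughly like this. The function name, the continue_sampling callable, and its num_steps parameter are all hypothetical here, not the package's actual API:

```python
# Hypothetical sketch of user-side fault tolerance via repeated sub-runs.
# continue_sampling and its signature are assumptions, not the actual eulerpi API.
def run_with_sub_runs(continue_sampling, num_sub_runs: int, steps_per_run: int) -> int:
    """Split one long sampling run into sub-runs so a crash loses at most one chunk.

    Returns the number of sub-runs that completed successfully.
    """
    completed = 0
    for _ in range(num_sub_runs):
        try:
            continue_sampling(num_steps=steps_per_run)
            completed += 1
        except RuntimeError:
            # A failed sub-run is skipped; results of earlier sub-runs
            # are assumed to be persisted already (e.g. by a result manager).
            continue
    return completed
```

This keeps the fault-tolerance policy in user code, which is the trade-off the question is about.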

Help wanted

  • Update / Rewrite the very complex result manager
  • Update the usage of the result manager

TODOs:

  • Finish the refactoring:
    • Simplify and unify the different inference functions
    • Bring the dependency on the concrete KDE and Grid to the inference call.
  • Fix the documentation
  • Check for performance regression
  • Write the changelog

Type of change

  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

How Has This Been Tested?

Please describe at least one test that you ran to verify your changes.

  • Local tests

Checklist For Contributor

  • My code follows the style guidelines of this project (I installed pre-commit)
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I wrote my changes in the changelog in the [unreleased] section
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • New and existing unit tests pass locally with my changes

Checklist For Maintainers

  • Continuous Integration (CI) is successfully running
  • Do we want to release/tag a new version? [✔️ / ❌]
    • If yes, add a release to the changelog and set the new version in pyproject.toml. After the merge, tag the release!

@castellinilinguini (Collaborator) commented Nov 18, 2024

I believe this is a great idea, this PR should greatly improve readability, maintainability & separation of concerns. Just to make sure I got this right: In the end, dense grids, sparse grids, and MCMC samplers would be implemented in the sampling submodule, right?

Edit: Nevermind, I just saw you did two different submodules for that, which makes sense ofc!

@kaiserls kaiserls changed the title Refactor inference modules WIP: Refactor inference modules Nov 19, 2024
@kaiserls kaiserls added enhancement New feature or request help wanted Extra attention is needed question Further information is requested and removed help wanted Extra attention is needed labels Nov 22, 2024
@kaiserls kaiserls mentioned this pull request Dec 11, 2024
12 tasks
Comment on lines 59 to 84

    def combined_evaluation(param, model, data_transformation, kde, slice):
        param, sim_res, logdensity = evaluate_log_density(
            param, model, data_transformation, kde, slice
        )
        combined_result = np.concatenate([param, sim_res, np.array([logdensity])])
        return logdensity, combined_result


    def get_LogDensityEvaluator(
        model: BaseModel,
        data_transformation: DataTransformation,
        kde: KDE,
        slice: np.ndarray,
    ):
        logdensity_blob_function = partial(
            combined_evaluation,
            model=model,
            data_transformation=data_transformation,
            kde=kde,
            slice=slice,
        )

        param_dim = slice.shape[0]
        output_dim = (1, param_dim + model.data_dim + 1)
        return FunctionWithDimensions(
            logdensity_blob_function, param_dim, output_dim
        )

kaiserls (Collaborator, Author):

Discussion Result: Refactor as a class with name LogDensityEvaluator(FunctionWithDimensions)
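The agreed refactor could look roughly like the following sketch. FunctionWithDimensions and evaluate_log_density are minimal stand-ins here, since only their call sites are visible in the diff:

```python
# Sketch of the class-based refactor agreed above.
# FunctionWithDimensions and evaluate_log_density are toy stand-ins,
# since only their call sites appear in the diff.
import numpy as np


class FunctionWithDimensions:
    """Minimal stand-in: a callable bundled with its input/output dimensions."""

    def __init__(self, function, param_dim, output_dim):
        self.function = function
        self.param_dim = param_dim
        self.output_dim = output_dim

    def __call__(self, param):
        return self.function(param)


def evaluate_log_density(param, model, data_transformation, kde, slice):
    """Toy stand-in for the real evaluation function in the diff."""
    sim_res = np.zeros(model.data_dim)
    return param, sim_res, 0.0


class LogDensityEvaluator(FunctionWithDimensions):
    """Class replacing combined_evaluation + get_LogDensityEvaluator."""

    def __init__(self, model, data_transformation, kde, slice):
        self.model = model
        self.data_transformation = data_transformation
        self.kde = kde
        self.slice = slice
        param_dim = slice.shape[0]
        output_dim = (1, param_dim + model.data_dim + 1)
        super().__init__(self._evaluate, param_dim, output_dim)

    def _evaluate(self, param):
        # Same body as combined_evaluation in the diff, using stored state.
        param, sim_res, logdensity = evaluate_log_density(
            param, self.model, self.data_transformation, self.kde, self.slice
        )
        blob = np.concatenate([param, sim_res, np.array([logdensity])])
        return logdensity, blob
```

This removes the partial() plumbing and keeps the dimension bookkeeping next to the state it depends on.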

Comment on lines 20 to 46

    def combined_evaluation(param, model, data_transformation, kde, slice):
        param, sim_res, density = evaluate_density(
            param, model, data_transformation, kde, slice
        )
        combined_result = np.concatenate([param, sim_res, np.array([density])])
        return combined_result


    def get_DensityEvaluator(
        model: BaseModel,
        data_transformation: DataTransformation,
        kde: KDE,
        slice: np.ndarray,
    ):
        combined_evaluation_function = partial(
            combined_evaluation,
            model=model,
            data_transformation=data_transformation,
            kde=kde,
            slice=slice,
        )

        param_dim = slice.shape[0]
        output_dim = param_dim + model.data_dim + 1
        return FunctionWithDimensions(
            combined_evaluation_function, param_dim, output_dim
        )

kaiserls (Collaborator, Author):

Discussion Result: Refactor as a class with name DensityEvaluator(FunctionWithDimensions)

kaiserls (Collaborator, Author):
Discussion Result: Offer a functional interface for inference, but use an object-oriented approach internally
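The agreed pattern (functional facade over an object-oriented core) could be sketched as follows; all names here besides the general idea are hypothetical, not the package's actual API:

```python
# Hypothetical sketch: a functional entry point wrapping an internal class.
from dataclasses import dataclass, field


@dataclass
class _SamplingInference:
    """Internal object that owns the inference state; users never construct it."""

    num_steps: int = 100
    results: list = field(default_factory=list)

    def run(self, logdensity):
        # Placeholder for the real sampling loop.
        self.results = [logdensity(i) for i in range(self.num_steps)]
        return self.results


def inference(logdensity, num_steps: int = 100):
    """Public functional interface: wires up the internal object and runs it."""
    return _SamplingInference(num_steps=num_steps).run(logdensity)
```

Users keep the simple one-call API, while the internal class can hold state (samplers, result managers) and be extended without breaking that API.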

Comment on lines 101 to 113

    def sampling_inference(
        model: BaseModel,
        data_transformation: DataTransformation,
        kde: KDE,
        slice: np.ndarray,
        result_manager: ResultManager,
        num_processes: int,
        sampler: Sampler = None,
        num_walkers: int = 10,
        num_steps: int = 2500,
        num_burn_in_samples: Optional[int] = None,
        thinning_factor: Optional[int] = None,
        get_walker_acceptance: bool = False,

kaiserls (Collaborator, Author):

Discussion Result: Reintroduce subruns

@kaiserls kaiserls removed the enhancement New feature or request label Jan 8, 2025
@kaiserls kaiserls added enhancement New feature or request and removed question Further information is requested labels Jan 9, 2025
Labels: enhancement (New feature or request)
Successfully merging this pull request may close these issues.

Other density estimation possibilities