-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Model template #104
Comments
This template work only for the functions which doesn't require fitting methods. |
@bthirion @jpaillard @AngelReyero. |
Therefore is it only for the Permutation Feature Importance? What other methods do not need refitting? The only difference with the other methods is the fit function? |
The knockoff, clustered_inference, dcrt_zero, desparsified_lasso are functions which don't need fitting. For me, the fit function means that the algorithm is required to keep an internal state for applying other functions. This is not the case for most algorithms instead because there is fast or their output contains all information. |
I agree with the general outline. |
In the case of the knockoffs for instance, wouldn't we also need a similar internal state to keep the information estimated to generate the knockoffs? For example, if using the generating method from https://arxiv.org/abs/2407.06892 there is need to fit multiple regressors. Similarly for other generating methods based on covariance estimation, where in order to generate knockoffs it is necessary to keep the covariance estimate. |
Yes, I released that some methods require having access to the estimator, such as knockoffs or permutation_test. This can be generalised to most of the methods. If we stay with functions, which signatures are the most interesting for you: def acn( In the second case, 'y' is computed at the beginning of the function. However, if we have the estimators as parameters, I will prefer a class implementation for getting track of the estimators, especially if we need to fit them. |
Based on the PR #58, #73, #100, #101, #102, I propose the following requirement for each model:
acn: acronym
name: full name of the model
acn.py (python file)
test/test_acn.py (test for the methods)
example/inference_models/acn.py (example of the methods)
The text was updated successfully, but these errors were encountered: