-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Selection of model examples #106
Comments
What are the criteria for creating an example associated with a specific model? |
A combo of : usefulness, and advocate the methods with good empirical behavior (error control, power...) |
The usefulness is quite difficult to estimate and it's subjective. For empirical behaviour, not all the models can propose, theoretically, a boundary for error control and power. |
You're right to some extent. Still, we know understand why some methods fail. For instance, basic permutation importance does not estimate a proper variable importance measure. We can include it for historical reference, but clearly, we should not advocate it. |
For the failure of a method, it's highly dependent on the context. I have two problems with this formulation. To avoid these problems and if we still want to base our evaluation on empirical behaviour, I will need to know what the context is for this library, i.e. properties of data (characteristic of the distribution, number of features, number of samples, linearity, correlation, ...). |
We don't want to point out particular datasets, but classes of problems:
Some assumptions are always unreasonable, because they're too restrictive : e.g. independence of columns of X and Gaussianity. They have been introduced historically for mathematical convenience, but nobody wants to rely on that. |
I don't know this type of classification. |
Ha, ha there is no formal definition.
|
If I correctly understand, we can summarise that broadly that some models:
For starting example, if you need to choose one model by class, what will be the model? If I correctly understand, the evaluation of these three types of problems will be different. |
|
Do you want to use all these dataset? For starting example, if you need to choose one model by class, what will be the model? Do you have some metrics for these three classes of problems, which can be used for comparing models from the same class? |
Probably not, which means that we have to try and see. what matters is that we can showcase the methods properly. |
One aspect is that we won't create one example for each function of the library. Examples are here to guide users toward the main information. They should not be exhaustive.
Please also remember that structure is here to help by providing guidelines and a common understanding. What matters first is functionality, clarity of the material and making maintenance easy.
Originally posted by @bthirion in #104 (comment)
The text was updated successfully, but these errors were encountered: