-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Automated selection of models via a best_chi2_worse_phi2
algorithm
#1962
Comments
Thanks @Cmurilochem for this nice analysis. Something like this would indeed help us better visualize and select the "best" models. Regarding your questions:
|
Thanks @Cmurilochem.
|
Thanks @Radonirinaunimi and @RoyStegeman for the extremely useful comments. I will start its implementation based on you suggestions in a small separated PR branched from |
There is no Pandoc is not in our dependencies because we depend on it through |
Thanks @RoyStegeman. I think I managed to make it work. I will proceed as planned then. |
As a continuation of #1943, I managed to automate the selection of best models via the @juanrojochacon's hyperopt algorithm wherein data of 1/$\varphi^{2}$ is used to decide on the best $\chi^{2}$ hyperpoint. Here I am just referring to it as
best_chi2_worse_phi2
algorithm.To this end, I made a post-fit script which is primarily based on the validphys vp_hyperoptplot.py module. I did so in such a way to make our implementation easier later. Just in case I attach it here analysis_hyperopt.zip.
The core of the idea is presented in the code snippet below:
Here, I define an internal between the chi2
minimum
and 1 standard deviationstd
from which I will monitor later on the corresponding 1/phi2 values. For these, I get the nth lowest 1/phi2 hyperpoints and save the selected models intobest_trial
. In the zip attached file I take as example the runs I discussed on Monday using 10 replicas (because I have much more points to test the algorithm). The final plot is show below:The yellow region defines the interval between chi2
minimum
(grey circle) and 1 standard deviationstd
of the loss data. I also asked the script to give me 10 models within this region which show the lowest 1/phi2's (cyan circles).Questions
std
sufficient for our purposes ? Note that for the analysis I selected a loss threshold of 3. So, all models showing higher losses were excluded from the DataFrame and analysis.validphys
? I tried to run thevp-hyperoptplot
but it always complains about the need forpandoc
(even if I havepandoc
installed).I would appreciate any comments and idea to improve are always welcome.
The text was updated successfully, but these errors were encountered: