Training Data preparation #10

Open
GuptaVishu2002 opened this issue Jan 8, 2025 · 2 comments
GuptaVishu2002 commented Jan 8, 2025

Hi, would it be possible to describe how to prepare custom training data, or to provide the CSV files used to preprocess the datasets in train_model.ipynb? The preprocessing seems to require several columns beyond those provided in example_input.csv. Thank you.

@ch4perone
Collaborator

Hi, the preprocessing steps can be found in the lib_loader directory. For example, you can look at the publicly available data from MSnLib, MSDial and CASMI. In particular, for MSnLib, you should be able to download the data, adjust the file path, and run the script to obtain a csv with the required metadata. All details are in the msnlib_loader.ipynb file. In the future, I may be able to provide a mock-up training example. Alas, due to other projects, I won't get to that very soon.

@ch4perone
Collaborator

For processing the latest MSnLib version (msnlib_loader.ipynb), you may also want to check out the dev branch.
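
For anyone preparing a custom CSV in the meantime, a quick way to see which columns are still missing is to compare the file's header against the columns the preprocessing expects. The following is only a minimal sketch: the column names are hypothetical placeholders and are not taken from train_model.ipynb or example_input.csv, so the actual list has to be read from the notebook or from the output of msnlib_loader.ipynb.

```python
import csv
import io


def read_header(handle):
    """Return the column names from the first row of an open CSV handle."""
    reader = csv.reader(handle)
    return next(reader, [])


def missing_columns(required, available):
    """Return the required column names that are absent from `available`,
    preserving the order in which they were requested."""
    available_set = set(available)
    return [name for name in required if name not in available_set]


# Hypothetical stand-in for a custom input file; with a real file use
# `open("my_data.csv", newline="")` instead of io.StringIO.
example_csv = io.StringIO("name,smiles,precursor_type\nA,CCO,[M+H]+\n")

# Hypothetical column names, for illustration only. Replace them with the
# columns the preprocessing cells actually access.
required = ["name", "smiles", "precursor_type", "collision_energy", "instrument_type"]

header = read_header(example_csv)
missing = missing_columns(required, header)
print("Missing columns:", missing)
```

Running the same check on a CSV produced by one of the loader notebooks should show which metadata columns those scripts add.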
