Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error occurs when loading datasets, seems a gzip error #102

Open
CandyClass opened this issue May 17, 2022 · 11 comments
Open

Error occurs when loading datasets, seems a gzip error #102

CandyClass opened this issue May 17, 2022 · 11 comments

Comments

@CandyClass
Copy link

train = moses.get_dataset('train')
the error shows:
Not a gzipped file (b've')
Seems a quite basic question but I just can't figure it out.

@BWyueyue
Copy link

Have you solved it?

@CandyClass
Copy link
Author

Have you solved it?

nope, I give up on this package..

@Liar-zzy
Copy link

Liar-zzy commented Sep 6, 2022

Have you checked the integrity of the file?
I just solve it.

@daniil-polykovskiy-insilico
Copy link
Contributor

Hi all, Insilico team will review this issue. We'll keep you posted

@BWyueyue
Copy link

BWyueyue commented Sep 9, 2022

Have you solved it?

It's fine now

@yangzhao1230
Copy link

I met the same error. Could you please tell me how you solved it? Thank you in advance.

@yangzhao1230
Copy link

Have you checked the integrity of the file? I just solve it.

Hi, I met the same error. Could you please explain the solution in more details? Thank you!

@D-coban
Copy link

D-coban commented Nov 1, 2022

Have you checked the integrity of the file? I just solve it.

Hi, I am also having the same issue. Would it be possible to explain how did you manage to solve the issue? Kind regards.

@filipekstrm
Copy link

Anyone solved this and would like to share how they did? I seem to have an issues also with moses.datasets.get_statistics and I am thinking it could be related

@filipekstrm
Copy link

Anyone solved this and would like to share how they did? I seem to have an issues also with moses.datasets.get_statistics and I am thinking it could be related

If anyone else stumbles upon this problem now, I had installed by cloning the repo and running python setup.py install, and by replacing the files in moses/dataset/data with files that I had on another system where the package didn't give this error and rerunning the installation script solved my problem

@sei0o
Copy link

sei0o commented Jun 25, 2024

On my environment this error message was shown when I forgot to fetch the files served via LFS. The dataset files (*.gz) are usually larger than code or documents so they are managed on the external storage. When LFS is enabled for a file, its content in the repository is replaced with a pointer to where the original content (dataset) is saved and it means git has to resolve those pointers and download the original content. You might want to try Installing Git Large File Storage and execute git lfs fetch .

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants