General error/missing files #3843

Open
ahamelers opened this issue Dec 23, 2024 · 4 comments
Labels: bug (Something isn't working), Download, uploads

Comments

@ahamelers
Collaborator

ahamelers commented Dec 23, 2024

We've gotten a lot of "General error" "The file is not available for download" error report emails this month.

One of these emails concerns a dataset brought to my attention by curator Evangeline:

While downloading the files for DOI: 10.5061/dryad.6hdr7sr9m (22.64 GB) I'm receiving an error in VE as per the screenshot, can you please check it and let me know.

[screenshot of the error]

The system is unable to find file woodBetaCCR1.rds. I also cannot find this file anywhere. It does not have a permanent storage path and I must be looking in the wrong place for the temporary path. It has no digest. How was the dataset sent through processing without creating a permanent storage path for all files, and without a digest for this file? Is the file lost permanently?

<StashEngine::DataFile:0x00007f057f385b40
 id: 3664511,
 upload_file_name: "woodBetaCCR1.rds",
 upload_content_type: "",
 upload_file_size: 22604677308,
 resource_id: 330823,
 storage_version_id: nil,
 upload_updated_at: Mon, 25 Nov 2024 21:36:29.000000000 UTC +00:00,
 created_at: Mon, 25 Nov 2024 21:36:29.000000000 UTC +00:00,
 updated_at: Mon, 25 Nov 2024 21:36:29.000000000 UTC +00:00,
 validated_at: nil,
 file_state: "created",
 url: nil,
 status_code: nil,
 timed_out: false,
 original_url: nil,
 cloud_service: nil,
 digest: nil,
 digest_type: nil,
 description: nil,
 original_filename: "woodBetaCCR1.rds",
 type: "StashEngine::DataFile",
 compressed_try: 0>

I am concerned these other emails indicate a similar problem.
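To check whether other files are in the same state, a scan along these lines might help. This is a minimal plain-Ruby sketch, not actual Dryad code: `FileRecord` is a hypothetical stand-in for `StashEngine::DataFile` that models only fields shown in the record above, `suspect_files` is an illustrative helper, and the rule that a missing `digest` means "never verified in permanent storage" is an assumption drawn from this one file. The second record is made up for contrast.

```ruby
# Hypothetical stand-in for StashEngine::DataFile; only fields that appear
# in the record above are modelled.
FileRecord = Struct.new(:id, :upload_file_name, :resource_id, :digest,
                        :storage_version_id, keyword_init: true)

# Assumption: a file with no digest was never verified in permanent storage.
def suspect_files(files)
  files.select { |f| f.digest.nil? }
end

files = [
  # Values copied from the record in this issue.
  FileRecord.new(id: 3664511, upload_file_name: 'woodBetaCCR1.rds',
                 resource_id: 330823, digest: nil, storage_version_id: nil),
  # Made-up record representing a file that was processed normally.
  FileRecord.new(id: 1, upload_file_name: 'example.csv',
                 resource_id: 330823, digest: 'abc123', storage_version_id: 1)
]

suspect_files(files).each do |f|
  puts "resource #{f.resource_id}: #{f.upload_file_name} (file id #{f.id})"
end
```

In a Rails console the same filter would presumably be a query on the real model, but the exact scope and column semantics would need to be confirmed against the codebase.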

@ahamelers ahamelers added the bug Something isn't working label Dec 23, 2024
@ahamelers ahamelers moved this from Backlog to In progress in Dryad product board Dec 23, 2024
@ryscher
Member

ryscher commented Dec 23, 2024

This particular file is large (22 GB) and has not transferred from temporary storage into the permanent store. It is still in temporary storage at https://us-west-2.console.aws.amazon.com/s3/buckets/dryad-s3-prd?region=us-west-2&bucketType=general&prefix=330823/data/&showversions=false

@ahamelers
Collaborator Author

ahamelers commented Dec 23, 2024

Okay, since the files are not lost, I will reduce the priority of this ticket. I will copy this file to permanent storage. The question to answer then is why this dataset (and potentially others) moved out of 'processing' status before the files were fully transferred, and how we can stop that from happening in the future.
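One possible shape for such a safeguard is a check that refuses to leave 'processing' until every file is confirmed in permanent storage. This is only a sketch under stated assumptions, not the existing Dryad logic: `StoredFile`, its `in_permanent_store` field, the two helper methods, and the 'submitted' status name are all hypothetical; only the 'processing' status and the `digest` field come from this thread.

```ruby
# Hypothetical, simplified file record; in_permanent_store is an invented
# field standing in for however the real system confirms the transfer.
StoredFile = Struct.new(:name, :digest, :in_permanent_store, keyword_init: true)

# True only when every file is confirmed in permanent storage and has a digest.
def ready_to_leave_processing?(files)
  files.all? { |f| f.in_permanent_store && !f.digest.nil? }
end

# Returns the status the dataset should have after the check.
# 'submitted' is an assumed name for the status that follows 'processing'.
def next_status(current_status, files)
  return current_status unless current_status == 'processing'

  ready_to_leave_processing?(files) ? 'submitted' : 'processing'
end

files = [
  StoredFile.new(name: 'README.md', digest: 'abc123', in_permanent_store: true),
  # Mirrors the situation in this issue: no digest, still in temporary storage.
  StoredFile.new(name: 'woodBetaCCR1.rds', digest: nil, in_permanent_store: false)
]

puts next_status('processing', files)
```

With a guard like this, a dataset containing a large file that is still transferring would stay in 'processing' and could be retried or reported, rather than being published with a broken download link.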

@ahamelers ahamelers moved this from In progress to Backlog in Dryad product board Dec 23, 2024
@ahamelers
Collaborator Author

ahamelers commented Dec 23, 2024

I've been through around 2 months of these emails and found only 5 problematic (not removed on purpose) files.

These showed the expected error type, the one mentioned in the email. I copied each file to the location our system expects, from other locations (later Merritt versions) within our buckets.

@ahamelers ahamelers self-assigned this Jan 3, 2025
@ahamelers
Collaborator Author

ahamelers commented Jan 3, 2025

We should probably consider doing something to reduce the noise from withdrawn or purposefully deleted files, which send tons of these emails to our system and make it difficult to notice actual issues. Maybe when files are deleted as part of the withdrawal check, we can put a flag on each file itself, and then change the UI so they are not displayed with download links but have some kind of "we deleted this!" hover text.
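As a rough illustration of the flag idea, the listing logic could branch on a per-file flag. Everything here is hypothetical: `ListedFile`, the `deleted_on_withdrawal` flag, `file_listing_entry`, the hover text wording, and the example URL are placeholders, not existing Dryad fields or views.

```ruby
# Hypothetical record; deleted_on_withdrawal is the proposed per-file flag.
ListedFile = Struct.new(:name, :url, :deleted_on_withdrawal, keyword_init: true)

# Placeholder wording for the hover text.
DELETED_HOVER_TEXT = 'This file was deleted when the dataset was withdrawn.'

# Builds the data a view would need: flagged files get no download link,
# so clicking them can no longer trigger a "file not available" error email.
def file_listing_entry(file)
  if file.deleted_on_withdrawal
    { label: file.name, link: nil, hover_text: DELETED_HOVER_TEXT }
  else
    { label: file.name, link: file.url, hover_text: nil }
  end
end

kept = ListedFile.new(name: 'data.csv', url: 'https://example.org/data.csv',
                      deleted_on_withdrawal: false)
gone = ListedFile.new(name: 'old.csv', url: nil, deleted_on_withdrawal: true)

[kept, gone].each { |f| p file_listing_entry(f) }
```

The same flag could also be used to filter the error report emails, so that only failures on unflagged files reach the inbox.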
