Fixes for Andersen lab #56

Merged
merged 2 commits into from
Jun 11, 2024

Conversation

joverlee521
Contributor

Description of proposed changes

  1. Remove spaces from strain names
  2. Update data source to sra-via-andersen-lab

See commits for details.

Related issue(s)

Follow up on #50 + #54

Checklist

  • Checks pass

Saw a series of warning messages from seqkit in a recent ingest run
that flagged spaces in strain names.¹ These spaces cause the affected
sequences to be skipped when the two data sources are merged.

We could use the `--by-name` option for seqkit, as the warning message
suggested, but this reminded me that we do not support spaces in
strain names in our downstream phylogenetic analysis anyway. So just
remove spaces from the strain names.

¹ https://github.com/nextstrain/avian-flu/actions/runs/9456655781/job/26048985198#step:13:1469
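
As a rough illustration of the change described above, here is a minimal Python sketch of stripping spaces from strain names in FASTA headers. It is not the code from the commits in this PR: the function names and the example strain name are hypothetical, and it assumes the whole header line after `>` is the strain name.

```python
from typing import Iterable, Iterator


def remove_spaces_from_strain(strain: str) -> str:
    """Return the strain name with every space character removed."""
    return strain.replace(" ", "")


def normalize_fasta_headers(lines: Iterable[str]) -> Iterator[str]:
    """Yield FASTA lines, removing spaces from header lines only.

    Assumption: the full header after ">" is the strain name, and the
    input lines carry no trailing newline. Sequence lines pass through
    unchanged.
    """
    for line in lines:
        if line.startswith(">"):
            yield ">" + remove_spaces_from_strain(line[1:])
        else:
            yield line


if __name__ == "__main__":
    # Hypothetical record; not taken from the actual ingest data.
    records = [">A/example host/Place/1/2024", "ACGTACGT"]
    for out_line in normalize_fasta_headers(records):
        print(out_line)
```

Applying the same function to the strain column of the metadata would keep sequence and metadata names in agreement, so records from both sources can still be matched by name.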
@joverlee521 requested a review from a team June 11, 2024 15:58
@trvrb
Member

trvrb commented Jun 11, 2024

Looks great. Thank you!

@joverlee521 merged commit de7805e into master Jun 11, 2024
8 checks passed
@joverlee521 deleted the fixes-for-andersen-source branch June 11, 2024 17:24