-
-
Notifications
You must be signed in to change notification settings - Fork 189
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Grep output doesn't match what's in the lesson in a way that breaks the example #316
Comments
I have identified that grep behaves in the expected way (no |
Further update: On the latest version of MacOS, the |
Okay, I was mis-reading the lesson. We don't need the lines in bad_reads.txt to be divisible by 4. But I'm still getting 537 instead of 802. And I don't really feel that asking novices to pipe |
I have encountered the same error as well. I am running on WSL2 and when I run the command;
I get the output |
I also saw 537 vs 802 when I was running this on the Amazon instance. I do think the double grep with inverted 2nd grep is a bit hard for novices to understand. Especially without seeing an inverted grep first. There does seem to be an option If doing this has them practicing the pipe not enough, then we could add more practice too the data wrangling section. When I recently taught data wrangling, I showed learners my most common pipe combo where I pipe ls into wc to check the number of input and output files. Edit: Just checked and the Amazon Web Instance does have the |
So the actually number of bad reads is 536 and not 802? |
It seems so from the grep output. I'll be able to double check once PR #357 is merged. |
I'm trying to run this lesson with the data files downloaded from FigShare. In the Redirection, lesson, the output of
returns
537
rather than the expected 802.This is a problem because 537 is not a multiple of 4. This is happening because some of the reads with the string NNNNNNNNNN are non-contiguous in the file, so
grep
is inserting a--
line between groups of contiguous results. I think the lesson as written will mislead learners about how they can use grep, since it doesn't mention this behaviour (which I have replicated on multiple machines, so it's not just a quirk of one system).Has anyone encountered this problem? What do you do about it?
The text was updated successfully, but these errors were encountered: