ds000172 - dataset_description.json contains "non-breaking space" (c2 a0) characters after last names #5
Comments
that is some really remarkable brittleness for a stock module…
— |
I ran into something like this yesterday... are there any web forms involved in getting this data? The HTML spec uses CR LF pairs for line breaks: http://www.w3.org/TR/html401/interact/forms.html#h-17.13.4.1 which means '\r\n', and those can be replaced easily when parsing the data before writing the JSON. |
You can also set strict=False when loading, or just replace the characters in the string you read from open() inside the json.loads call! |
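A minimal sketch of the "replace the characters" idea, assuming the non-breaking spaces sit outside the quoted strings (between a closing quote and the next comma, say). The sample text and the helper name `loads_without_nbsp` are made up for illustration and are not taken from the dataset. Note that `strict=False` only relaxes the handling of control characters inside strings, so it would not be expected to help in this case.

```python
import json

NBSP = "\u00a0"  # U+00A0, encoded in UTF-8 as the bytes c2 a0


def loads_without_nbsp(text):
    """Parse JSON after swapping non-breaking spaces for plain spaces.

    Caveat: this also rewrites any NBSP that sits inside a string value.
    """
    return json.loads(text.replace(NBSP, " "))


# Hypothetical sample: an NBSP follows the closing quote of a name.
raw = '{"Authors": ["Doe, J."\u00a0, "Roe, R."]}'

try:
    json.loads(raw, strict=False)
except ValueError as err:  # json.JSONDecodeError is a ValueError subclass
    print("stock parser rejected it:", err)

print(loads_without_nbsp(raw))
```

If the characters are inside the string values instead, the stock parser should accept them and this replacement would only change the values, so it is worth checking where they sit first.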
I doubt those characters are part of the JSON spec... imho it is unreasonable to demand that JSON parsers understand all those screwy UTF-8 symbols used as delimiters. I guess the validator should also check those JSON files more thoroughly. I will file an issue about at least one more later ;-) On August 29, 2016 12:57:29 PM EDT, Russ Poldrack [email protected] wrote:
|
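One way such a validator check could look, as a rough sketch rather than anything an existing validator is known to do: scan the raw bytes of the file for the c2 a0 sequence and report the offsets. The function name `find_nbsp_offsets` and the sample bytes are hypothetical.

```python
from typing import List


def find_nbsp_offsets(data: bytes) -> List[int]:
    """Return the byte offset of every UTF-8 non-breaking space (c2 a0)."""
    needle = b"\xc2\xa0"
    offsets = []
    start = data.find(needle)
    while start != -1:
        offsets.append(start)
        start = data.find(needle, start + len(needle))
    return offsets


# Hypothetical sample bytes with one NBSP after a closing quote.
sample = '{"Authors": ["Doe, J."\u00a0]}'.encode("utf-8")
print(find_nbsp_offsets(sample))
```

In practice the bytes would come from something like `open(path, "rb").read()`, and a non-empty result could be surfaced as a warning.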
Excellent colors btw, is that the "Dreaming of McDonalds hamburger" terminal theme? +1! 🍟🍔 |
More of "I was so cool when I was young" :-) On August 29, 2016 2:03:18 PM EDT, Vanessa Sochat [email protected] wrote:
|
@vsoch apparently strict=False wasn't enough. So will do fixups manually |
Yeah, I wound up just getting rid of them entirely before writing the file. |
So would it be helpful for us to upload a new revision of this dataset that fixes this issue? Or is it easy enough to work around? Is it something we should consider with future datasets? |
I just worked around it for now, so as for me, no rush ;-) |
which breaks stock json modules in python