-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Extract 'Rodent or Trash Violations' Feature from Restaurant Inspection Data #18
Comments
Rodent or Trash Violations
Feature from Restaurant Inspection Data
working on this |
Finished this. Uploading now |
Working on this now... I'll upload and post when finished |
I recommend using the most recent data from That set contains a 'violation description' column as well as a 'violation number' to deal with the fact that violation numbers have different meanings over time. |
I finished this. I uploaded the files to dropbox. |
Migrated this issue to codefordc/the-rat-hack repository as issue_13. The migrated issue simply checks the hackathon code and modifies it to run from the command line. |
Start with the DC DOH Food Service Establishment Inspection report data in the
/Data Sets/Restaurant Inspections/
folder in Dropbox.Develop a script to extract the number of food establishment inspections that found rodent or trash-related violations (violations 38 or 54). More details on violations can be found here
Note that this issue depends upon the geocoding results from Issue #13
Input:
CSV files with inspection summary and violation details
Output:
A CSV file with
feature_id
: The ID for the feature, in this case, "restaurant_violations_rodent_or_trash"feature_type
: Theestablishment_type
from the restaurant data setfeature_subtype
: Therisk_category
from1
-5
year
: The ISO-8601 year of the feature valueweek
: The ISO-8601 week number of the feature valuecensus_block_2010
: The 2010 Census Block of the feature valuevalue
: The value of the feature, i.e. the number of inspections that found rodent or trash-related violations in establishments with the given types and risk categories in the specified week, year, and census block.When you are finished
Submit a pull request on GitHub (or upload your scripts)
Upload any files to Dropbox
Need more information?
Flag Jason or Elizabeth, or ask your question in the comments below and we'll respond as soon as we can!
The text was updated successfully, but these errors were encountered: