Extract 'Rodent or Trash Violations' Feature from Restaurant Inspection Data #18

Open
jasonasher opened this issue Sep 22, 2017 · 6 comments

Comments

@jasonasher
Owner

Start with the DC DOH Food Service Establishment Inspection report data in the /Data Sets/Restaurant Inspections/ folder in Dropbox.

Develop a script to count the food establishment inspections that found rodent- or trash-related violations (violation numbers 38 or 54). More details on violations can be found here.

Note that this issue depends on the geocoding results from Issue #13.

Input:
CSV files with inspection summary and violation details

Output:
A CSV file with

  • 1 row for each establishment type and risk category, and each week, year, and census block
  • The following columns:
      • feature_id: The ID for the feature, in this case, "restaurant_violations_rodent_or_trash"
      • feature_type: The establishment_type from the restaurant data set
      • feature_subtype: The risk_category from 1-5
      • year: The ISO-8601 year of the feature value
      • week: The ISO-8601 week number of the feature value
      • census_block_2010: The 2010 Census Block of the feature value
      • value: The value of the feature, i.e. the number of inspections that found rodent or trash-related violations in establishments with the given types and risk categories in the specified week, year, and census block.
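
The output described above could be produced along these lines. This is only a minimal sketch, not the script that was submitted for this issue: the input keys (inspection_id, inspection_date, violation_number) are hypothetical names for illustration, and it assumes the census block has already been attached to each inspection by the geocoding step. Only the output column names come from the list above.

```python
from collections import Counter
from datetime import date

FEATURE_ID = "restaurant_violations_rodent_or_trash"
TARGET_VIOLATIONS = {"38", "54"}  # violation numbers named in this issue


def build_feature_rows(inspections, violations):
    """Count flagged inspections per type, risk category, ISO year/week, and block.

    inspections: dicts with hypothetical keys inspection_id, inspection_date
        (YYYY-MM-DD), establishment_type, risk_category, census_block_2010.
    violations: dicts with hypothetical keys inspection_id, violation_number.
    """
    # An inspection counts once, even if it recorded both violation 38 and 54.
    flagged = {
        v["inspection_id"]
        for v in violations
        if v["violation_number"] in TARGET_VIOLATIONS
    }

    counts = Counter()
    for insp in inspections:
        if insp["inspection_id"] not in flagged:
            continue
        # isocalendar() yields the ISO-8601 year and week, not the calendar year.
        iso_year, iso_week, _ = date.fromisoformat(insp["inspection_date"]).isocalendar()
        key = (
            insp["establishment_type"],
            insp["risk_category"],
            iso_year,
            iso_week,
            insp["census_block_2010"],
        )
        counts[key] += 1

    return [
        {
            "feature_id": FEATURE_ID,
            "feature_type": est_type,
            "feature_subtype": risk,
            "year": year,
            "week": week,
            "census_block_2010": block,
            "value": n,
        }
        for (est_type, risk, year, week, block), n in sorted(counts.items())
    ]
```

The rows can then be written out with csv.DictWriter. Note that the ISO-8601 year can differ from the calendar year near January 1: an inspection dated 2017-01-01 falls in ISO week 52 of 2016.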

When you are finished
Submit a pull request on GitHub (or upload your scripts)
Upload any files to Dropbox

Need more information?
Flag Jason or Elizabeth, or ask your question in the comments below and we'll respond as soon as we can!

@jasonasher changed the title from "Extract Rodent or Trash Violations Feature from Restaurant Inspection Data" to "Extract 'Rodent or Trash Violations' Feature from Restaurant Inspection Data" on Sep 22, 2017
@xavier-gutierrez
Contributor

working on this

@kelsonSS
Contributor

Finished this. Uploading now

@zacharyclement

Working on this now... I'll upload and post when finished

@jasonasher
Owner Author

I recommend using the most recent data from dc_restaurant_inspections.

That set contains a 'violation description' column as well as a 'violation number' column, which helps deal with the fact that violation numbers have had different meanings over time.
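
One way to use the description column instead of relying on the number alone is a keyword match. This is a rough sketch only: the keyword list below is an assumption for illustration, not taken from the data set, and it should be checked against the actual description wording before use.

```python
import re

# Hypothetical keywords; verify against the real 'violation description' values.
RODENT_OR_TRASH = re.compile(r"\b(rodents?|trash|garbage|refuse)\b", re.IGNORECASE)


def matches_rodent_or_trash(description):
    """Return True if a violation description mentions a rodent or trash keyword."""
    # Treat a missing description as an empty string rather than failing.
    return bool(RODENT_OR_TRASH.search(description or ""))
```

A match on the description could be combined with the violation number check, so that a number whose meaning changed between years does not silently pull in unrelated violations.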

@zacharyclement

I finished this. I uploaded the files to Dropbox.

@eclee25
Collaborator

eclee25 commented Jan 9, 2018

Migrated this issue to the codefordc/the-rat-hack repository as issue_13. The migrated issue simply checks the hackathon code and modifies it to run from the command line.
