kaggle-bike-share

my attempt at the Kaggle bike sharing demand competition http://www.kaggle.com/c/bike-sharing-demand

description of files and order of execution to generate results

clean_combine - cleans the test and train data sets and combines them into a single csv
add_features - adds additional features used by the regression algorithms
daily_trend_rf_split_predict - uses a random forest to fit the daily sums of the registered and casual users, then uses this to normalize the data, calculate typical workday and weekday hourly trends for the two categories, and combine it all together to make hourly predictions for the entire 2 year time span
the regression classifiers could be run in any order

4a. random_forest_hourly_predict - uses a random forest and features consisting of weather, etc. to predict the difference between the log of the rf daily trend prediction and the log of the actual counts

4b. gb_trees_hourly_predict - uses gradient boosted trees and features consisting of weather, etc. to predict the difference between the log of the rf daily trend prediction and the log of the actual counts

utility - file with function and class definitions

feature_selection - uses a random forest and a single weather, etc. time shifted feature to predict the difference between the log of the rf daily trend prediction and the log of the actual counts, output the feature importance to determine which time shifted feature should be included in the main results

ada_bag_hourly_predict, ada_hourly_predict - attempt to use adaboost or bagged adaboost to predict the hourly counts based on the rf daily trend prediction, didn't work as well as rf or gbt

daily_trend_gp_split_predict - uses a Gaussian process to fit the daily sums of the registered and casual users, then uses this to normalize the data, calculate typical workday and weekday hourly trends for the two categories, and combine it all together to make hourly predictions for the entire 2 year time span, replaced by rf trend predict

stack_gbt, stack_rf, stack_ridge - attempt to combine the gbt and rf hourly predictions using one of several classifiers, didn't work well

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.idea		.idea
__pycache__		__pycache__
README.md		README.md
ada_bag_hourly_pred.csv		ada_bag_hourly_pred.csv
ada_bag_hourly_predict.py		ada_bag_hourly_predict.py
ada_bag_hourly_submit.csv		ada_bag_hourly_submit.csv
ada_hourly_predict.py		ada_hourly_predict.py
add_features.py		add_features.py
atemp_feature_importance.png		atemp_feature_importance.png
clean_combine.py		clean_combine.py
combined_data.csv		combined_data.csv
daily_trend_gp_split_predict.py		daily_trend_gp_split_predict.py
daily_trend_rf_split_predict.py		daily_trend_rf_split_predict.py
feature_selection.py		feature_selection.py
gb_trees_hourly_pred.csv		gb_trees_hourly_pred.csv
gb_trees_hourly_predict.py		gb_trees_hourly_predict.py
gb_trees_hourly_submit.csv		gb_trees_hourly_submit.csv
gp_submit.csv		gp_submit.csv
gs_score_log.txt		gs_score_log.txt
humidity_feature_importance.png		humidity_feature_importance.png
random_forest_hourly_predict.py		random_forest_hourly_predict.py
rf_daily_submit.csv		rf_daily_submit.csv
rf_hourly_pred.csv		rf_hourly_pred.csv
rf_hourly_submit.csv		rf_hourly_submit.csv
sampleSubmission.csv		sampleSubmission.csv
stack_gbt.py		stack_gbt.py
stack_gbt_submit.csv		stack_gbt_submit.csv
stack_rf.py		stack_rf.py
stack_rf_submit.csv		stack_rf_submit.csv
stack_ridge.py		stack_ridge.py
stack_ridge_submit.csv		stack_ridge_submit.csv
submission_notes.txt		submission_notes.txt
temp_feature_importance.png		temp_feature_importance.png
test.csv		test.csv
train.csv		train.csv
utility.py		utility.py
weather_feature_importance.png		weather_feature_importance.png
windspeed_feature_importance.png		windspeed_feature_importance.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

kaggle-bike-share

About

Releases

Packages

Languages

vzaretsk/kaggle-bike-share

Folders and files

Latest commit

History

Repository files navigation

kaggle-bike-share

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages