Final Project in Course 22100 R For Bio Data Science, Spring 2021

The goal of this collaborative bio data science project is to demonstrate our ability to use Tidyverse R to perform data cleansing, transformation, visualization, and communication. The focus is on reproducibility and following the tidyverse style guide.

The Dataset

In this project, we are working with the COVID-19 World Vaccine Adverse Reactions Dataset, which is available on Kaggle: https://www.kaggle.com/ayushggarg/covid19-vaccine-adverse-reactions. The dataset comes from The Vaccine Adverse Event Reporting System (VAERS), established by The U.S. Department of Health and Human Services, and co-managed by the Centers for Disease Control and Prevention (CDC) and the U.S. Food and Drug Administration (FDA). It should be noted that VAERS data come from a passive surveillance system and represent unverified reports of health events that occur after vaccination with U.S.-licensed vaccines. Therefore, reports may include incomplete, inaccurate and coincidental information.

The raw data consists of 3 CSV files:

2021VAERSDATA.csv (VAERS DATA)
2021VAERSSYMPTOMS.csv (VAERS Symptoms)
2021VAERSVAX.csv (VAERS Vaccine)

Project Description

The flowchart illustrates the data flow from the raw data to the final results. A big part of the project consists of data cleansing for which the VAERS Data Use Guide (https://vaers.hhs.gov/docs/VAERSDataUseGuide_November2020.pdf) was useful. All visualizations were made with ggplot, and the modeling includes PCA, logistic regression, and chi-squared contingency table tests.

00_doit.R runs all scripts at once and produces an ioslides presentation from R Markdown in HTML format.

Name		Name	Last commit message	Last commit date
Latest commit History 287 Commits
.Rproj.user		.Rproj.user
R		R
data		data
doc		doc
results		results
.Rhistory		.Rhistory
.gitignore		.gitignore
DTU_Logo.jpg		DTU_Logo.jpg
DTU_Logo_Corporate_Red_CMYK.png		DTU_Logo_Corporate_Red_CMYK.png
README.md		README.md
project.Rproj		project.Rproj

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Final Project in Course 22100 R For Bio Data Science, Spring 2021

The Dataset

Project Description

About

Releases

Packages

Contributors 4

Languages

celia-b/Corona-Vaccine-Adverse-Effects

Folders and files

Latest commit

History

Repository files navigation

Final Project in Course 22100 R For Bio Data Science, Spring 2021

The Dataset

Project Description

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages