cs224u-project

Final project for the Stanford class CS224U: Natural Language Understanding.

We analyzed thematic and stylistic trends in a corpus of 355 popular and critically acclaimed 20th century English-language novels. First, we applied and adapted "the semantic cohort method", a vector space model meant to surface thematically similar words in a corpus; this method was originally proposed by the Stanford Literary Lab. Next, we studied trends in (1) the occurrence of words in these cohorts and (2) stylistic traits of novels, with the goal of demonstrating quantitative analysis’ usefulness as a tool to enrich existing literary scholarship as well as surface new patterns in literature.

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
data		data
src		src
.gitignore		.gitignore
README.md		README.md
cs224u-final-paper.pdf		cs224u-final-paper.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

cs224u-project

About

Releases

Packages

Languages

vivekchoksi/cs224u-project

Folders and files

Latest commit

History

Repository files navigation

cs224u-project

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages