Skip to content

Latest commit

 

History

History
16 lines (11 loc) · 921 Bytes

README.md

File metadata and controls

16 lines (11 loc) · 921 Bytes

Graph Topic Modeling for Semantic Search

The Jupyter notebooks in this repository accompany the blog post Topic extraction with Neo4j Graph Data Science for better semantic search.

To following along with the blog content, work through the notebooks in this order:

  1. Download_TMDB_movies.ipynb
  2. Extract themes.ipynb
  3. Clean up themes and get embeddings.ipynb
  4. Cluster themes.ipynb
  5. Summarize theme groups.ipynb
  6. Compare retrievers.ipynb

You will need a Neo4j environment with GDS installed. You can create an Aura DS instance or download Neo4j Desktop.

You will also need an API key for the Large Language Model of your choice. The notebooks use Anthropic and OpenAI, but you can adapt the code to use others.