mlx-gpt

This learning project implements a GPT language model using Apple's MLX library, following Andrej Karpathy's Let's build GPT video.

🚀 Getting Started

I tried to stay as close as possible to the original material, so that it's easy to follow.
I recommend watching the walkthrough if you haven't yet!

Installation

# Setup the environment
python -m venv .venv
source .venv/bin/activate

# Install dependencies
pip install -r requirements.txt

🤖 Usage

Train and run the Bigram model

At the moment the command below trains and runs the model straight away.
It will also download and cache the data if needed.

python bigram.py
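If you are curious what a bigram language model does conceptually, here is a minimal count-based sketch in plain Python. Note this is only an illustration of the idea: the repo's `bigram.py` trains a neural lookup-table version with MLX, as in the video, and the toy corpus below is a stand-in for the Shakespeare data the script downloads.

```python
import random
from collections import defaultdict

# Toy corpus; bigram.py downloads the tiny Shakespeare dataset instead.
text = "hello world"

# Count character-to-character transitions: the essence of a bigram model.
counts = defaultdict(lambda: defaultdict(int))
for a, b in zip(text, text[1:]):
    counts[a][b] += 1

def sample_next(ch, rng=random):
    """Sample the next character proportionally to observed bigram counts."""
    followers = counts[ch]
    chars, weights = zip(*followers.items())
    return rng.choices(chars, weights=weights)[0]

# Generate a few characters starting from 'h'.
out = "h"
for _ in range(5):
    out += sample_next(out[-1])
print(out)
```

The neural version learns the same next-character distribution, but as trainable logits in an embedding table rather than raw counts.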

Validation

As a rough validation, I compared the loss values reported in the video against the results from this implementation.

(Side-by-side screenshots: training loss from the video vs. this MLX implementation.)

Both converge to a similar loss value (please ignore the formatting issues).

Train and run the GPT model

Coming soon...

Other

You can inspect the experimental notebook I created while following the video at experiment.ipynb. It is easier to understand if you follow along with the video.
Tested on a MacBook Air M1.

📦 Dependencies

All dependencies are listed in requirements.txt.

📜 License

MIT License
