GitHub - dtdo90/Llama3.2_python_dataset: Finetune Llama 3.2 1B model on the python code dataset

Fine-tuning LLAMA 3.2-1B on Python Dataset

This repository demonstrates the process of fine-tuning LLAMA 3.2 1B on a Python instruction dataset from hugging face https://huggingface.co/datasets/iamtarun/python_code_instructions_18k_alpaca. The goal is to enhance the model's capability in generating and understanding Python code.

Training Details

Training Framework: The training uses the SFTTrainer from the trl (Transformer Reinforcement Learning) library.
Parameter Optimization: QLoRA (Low-Rank Adaptation) is applied to reduce the number of parameters and improve efficiency during the fine-tuning process.

Evaluation

Run eval_ollama_8B.ipynb to score the model's performance.

Interactive API with chainlit

Interact with the fine-tuned model through a web API by running the command chainlit run app.py. This will launch an interactive interface for the model.

Example

Below is an example of the model solving a leetcode question

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
experiments		experiments
.gitignore		.gitignore
README.md		README.md
app.py		app.py
data-test.json		data-test.json
eval_ollama_8B.ipynb		eval_ollama_8B.ipynb
llama32_python_code.ipynb		llama32_python_code.ipynb
requirements.txt		requirements.txt
test-data-with-response.json		test-data-with-response.json
utils_for_app.py		utils_for_app.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fine-tuning LLAMA 3.2-1B on Python Dataset

Training Details

Evaluation

Interactive API with chainlit

Example

About

Releases

Packages

Languages

dtdo90/Llama3.2_python_dataset

Folders and files

Latest commit

History

Repository files navigation

Fine-tuning LLAMA 3.2-1B on Python Dataset

Training Details

Evaluation

Interactive API with chainlit

Example

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages