- Between the radio ether and you.
LLM
A framework for few-shot evaluation of language models.
📋 A list of open LLMs available for commercial use.
A quick guide (especially) for trending instruction finetuning datasets
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
A framework for Claude Opus to intelligently orchestrate subagents.
Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer
You like pytorch? You like micrograd? You love tinygrad! ❤️
LM Studio JSON configuration file format and a collection of example config files.
A Gradio web UI for Large Language Models with support for multiple inference backends.
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Fast and memory-efficient exact attention
A fast inference library for running LLMs locally on modern consumer-class GPUs
Universal LLM Deployment Engine with ML Compilation
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Interact with your documents using the power of GPT, 100% privately, no data leaks
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured…