TOPLOC is a novel method for verifiable inference that enables users to verify that LLM providers are using the correct model configurations and settings. It leverages locality-sensitive hashing of intermediate activations to detect unauthorized modifications.
- Detect unauthorized modifications to models, prompts, and precision settings
- 1000x reduction in storage requirements compared to full activation storage
- Validation runs up to 100x faster than the original inference
- Robust across different hardware configurations and implementations
- Zero false positives/negatives in empirical testing
**Proof generation** (sketched in code below):
- Extracts the top-k values from the last hidden state
- Uses a polynomial encoding for compact storage
- Generates a verifiable proof during inference
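To make the encoding step concrete, here is a minimal sketch of the idea under stated assumptions: commit to the top-k activations of the last hidden state by treating each (feature index, bfloat16 bit pattern) pair as a point and interpolating a polynomial through those points over a toy prime field. The field modulus, `k = 128`, and the packing below are illustrative choices, not TOPLOC's actual parameters or wire format.

```python
import torch

MOD = 2**31 - 1  # toy Mersenne prime field; TOPLOC's real parameters differ


def topk_points(hidden: torch.Tensor, k: int = 128):
    """Map the top-k activations of the last hidden state to (x, y) points.

    x is the feature index and y is the bfloat16 bit pattern of the value,
    so the exact stored value (not an approximation) is committed to.
    """
    vals, idxs = torch.topk(hidden.flatten().float(), k)
    ys = vals.to(torch.bfloat16).view(torch.int16).to(torch.int64) & 0xFFFF
    return idxs.tolist(), ys.tolist()


def poly_mul_linear(poly, root, mod):
    """Multiply a coefficient list (lowest degree first) by (X - root) mod p."""
    out = [0] * (len(poly) + 1)
    for i, c in enumerate(poly):
        out[i] = (out[i] - c * root) % mod
        out[i + 1] = (out[i + 1] + c) % mod
    return out


def lagrange_coeffs(xs, ys, mod=MOD):
    """Interpolate the unique degree-(k-1) polynomial through the k points.

    The coefficient list is the compact commitment: a validator recomputes
    the hidden state, rebuilds its own points, and checks that they lie on
    the same curve.
    """
    k = len(xs)
    coeffs = [0] * k
    for i in range(k):
        basis, denom = [1], 1
        for j in range(k):
            if j != i:
                basis = poly_mul_linear(basis, xs[j], mod)
                denom = denom * (xs[i] - xs[j]) % mod
        scale = ys[i] * pow(denom, -1, mod) % mod
        for d, c in enumerate(basis):
            coeffs[d] = (coeffs[d] + scale * c) % mod
    return coeffs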
**Validation** (see the threshold-check sketch after this list):
- Recalculates the top-k features
- Compares exponent and mantissa differences between the committed and recomputed values
- Validates the differences against predefined error thresholds
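A hedged sketch of what such a check could look like: split each bfloat16 value into its exponent and mantissa bits and apply the thresholds that appear in the verification log output shown later in this README (mantissa error mean below 10, median below 8, exponent agreement above 90%). The function names and the per-position pairing are assumptions for illustration, not the library's API.

```python
import torch


def split_bf16(t: torch.Tensor):
    """Split bfloat16 values into their 8-bit exponent and 7-bit mantissa fields."""
    bits = t.to(torch.bfloat16).flatten().view(torch.int16).to(torch.int32) & 0xFFFF
    return (bits >> 7) & 0xFF, bits & 0x7F


def passes_thresholds(committed: torch.Tensor, recomputed: torch.Tensor,
                      mean_thresh: float = 10.0, median_thresh: float = 8.0,
                      exp_thresh: float = 90.0) -> bool:
    """Accept iff mantissa errors are small and exponents mostly agree."""
    exp_c, man_c = split_bf16(committed)
    exp_r, man_r = split_bf16(recomputed)
    # Percentage of positions whose exponents agree exactly -- the
    # "exp intersections" figure in the verification log output.
    exp_match = (exp_c == exp_r).float().mean().item() * 100.0
    mantissa_err = (man_c - man_r).abs().float()
    return (mantissa_err.mean().item() < mean_thresh
            and mantissa_err.median().item() < median_thresh
            and exp_match > exp_thresh)
```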
A proof adds only 258 bytes per 32 new tokens, compared to roughly 262 KB for the full token embeddings of Llama-3.1-8B-Instruct.
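That figure follows directly from the model dimensions: 32 tokens × 4096 hidden dimensions × 2 bytes per bfloat16 value = 262,144 bytes ≈ 262 KB, so the 258-byte proof is roughly the 1000x reduction quoted above.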
This repository integrates TOPLOC with vLLM for efficient inference and validation. The integration lets you leverage vLLM's optimized inference pipeline while maintaining verification capabilities.
We maintain a fork of SGLang that includes TOPLOC integration, enabling verifiable inference with SGLang's framework.
```bash
git clone https://github.com/PrimeIntellect/toploc.git
cd toploc
pip install -r requirements.txt
```
This is an example of running validation with Llama-3.1-8B-Instruct on the ultrachat dataset.
First, generate the polynomial encodings for the model using:
```bash
python vllm_generate_poly.py --model_name meta-llama/Llama-3.1-8B-Instruct --tp 1 --n_samples 4 --save_dir signatures --max_decode_tokens 512 --dataset_name stingning/ultrachat --dtype bfloat16
```
This should create a directory called `signatures` containing the polynomial encodings for the model.
You can then run validation with:
```bash
python vllm_validate_poly.py --decode_model_name meta-llama/Llama-3.1-8B-Instruct --validate_model_name meta-llama/Llama-3.1-8B-Instruct --tp 1 --n_samples 4 --save_dir signatures --max_decode_tokens 512 --dataset_name stingning/ultrachat --dtype bfloat16 --attn flash
```
If the verification passes, you should see:
```
VERIFICATION PASSED: Mantissa error mean: 0.000000 below 10 and median: 0.000000 below 8 and exp intersections: 100 above 90
```
And if it fails, you should see something like:
```
VERIFICATION FAILED: Mantissa error mean: 11.000000 above 10 or median: 10.000000 above 8 or exp intersections: 0 below 90
```
```bibtex
@misc{ong2025toploclocalitysensitivehashing,
  title={TOPLOC: A Locality Sensitive Hashing Scheme for Trustless Verifiable Inference},
  author={Jack Min Ong and Matthew Di Ferrante and Aaron Pazdera and Ryan Garner and Sami Jaghouar and Manveer Basra and Johannes Hagemann},
  year={2025},
  eprint={2501.16007},
  archivePrefix={arXiv},
  primaryClass={cs.CR},
  url={https://arxiv.org/abs/2501.16007},
}
```