-
-
optimum-quanto Public
Forked from huggingface/optimum-quantoA pytorch quantization backend for optimum
Python Apache License 2.0 UpdatedJun 30, 2024 -
marlin-scaled-zero-point Public
Forked from IST-DASLab/marlinModified version of Marlin (https://github.com/IST-DASLab/marlin) with scaled zero point as input
Python Apache License 2.0 UpdatedJun 25, 2024 -
flash-attention Public
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
-
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedApr 12, 2023 -
redis-py Public
Forked from redis/redis-pyRedis Python Client
Python MIT License UpdatedMar 3, 2023 -
-
-
-