Skip to content

Latest commit

 

History

History
11 lines (7 loc) · 291 Bytes

README.md

File metadata and controls

11 lines (7 loc) · 291 Bytes

tow

info

Implementing ByteT5 model and trainer using PyTorch. Exploring various position encoding methods (ALiBi, RoPE) and UTF-8-Unicode encoding.

run

CUDA_VISIBLE_DEVICES=1,2,3,4,5,6 nohup torchrun --standalone --nproc_per_node=6 train.py test_models/byt5-base test &