Zero-dependency implementation of BitNet neural network training and BPE tokenization in C
-
Updated
Aug 3, 2024 - C
8000
Zero-dependency implementation of BitNet neural network training and BPE tokenization in C
🐍This is a fast, lightweight, and clean CPython extension for the Byte Pair Encoding (BPE) algorithm, which is commonly used in LLM tokenization and NLP tasks.
Information Retreival Course Repository, 7th Semester, IITD, 2023-24
Add a description, image, and links to the bpe topic page so that developers can more easily learn about it.
To associate your repository with the bpe topic, visit your repo's landing page and select "manage topics."