8000 GitHub - itsAnanth/Transformers: Lightweight implementation of GPT-2 decoder only transformer architecture
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Lightweight implementation of GPT-2 decoder only transformer architecture

Notifications You must be signed in to change notification settings

itsAnanth/Transformers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

37 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Transformers

Lightweight implementation of GPT-2 decoder only transformer architecture based on the makemore series by Andrej Karpathy.

The main file is gpt.py

The model was trained on google colab, checkpoint file is model.pt

NOTE:

gpt.py contains a lot of comments, this file is purely for educational purposes, its not intended to be production ready by any means

About

Lightweight implementation of GPT-2 decoder only transformer architecture

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
0