Jan 3, 2026

[deep dive llm (gpt-2)]

Reference:
https://huggingface.co/spaces/HuggingFaceFW/blogpost-fineweb-v1
Let's reproduce GPT-2 (1.6B): one 8XH100 node, 24 hours, $672, in llm.c

Tool:
https://tiktokenizer.vercel.app/

Library:
https://github.com/google/sentencepiece


Flow



No comments:

Post a Comment

Note: Only a member of this blog may post a comment.