BANANDRE
NO ONE CARES ABOUT CODE

Navigation

HomeCategories

Categories

Artificial Intelligence(567)
Software Architecture(304)
Software Development(284)
Data Engineering(159)
Engineering Management(85)
Enterprise Architecture(67)
Product Management(29)
Uncategorized(7)
Software Engineering(1)
tech(1)

Tagged with

#from-scratch

1 article found

The 1.8M-Parameter Language Model That Questions Everything We Know About Scale
data-curation
Featured

The 1.8M-Parameter Language Model That Questions Everything We Know About Scale

An enthusiast’s journey training a minimal-scale LLM from scratch reveals how architectural innovation and obsessive data curation can squeeze GPT-2 level quality into 25MB.

#data-curation#from-scratch#retention-mechanism...
Read More
BANANDRE
NO ONE CARES ABOUT CODE

Connect

2026 BANANDRE
Privacy PolicyTermsImpressum
Built with 🍌