BANANDRE
NO ONE CARES ABOUT CODE

Navigation

HomeCategories

Categories

Artificial Intelligence(567)
Software Architecture(304)
Software Development(284)
Data Engineering(159)
Engineering Management(85)
Enterprise Architecture(67)
Product Management(29)
Uncategorized(7)
Software Engineering(1)
tech(1)

Tagged with

#sparse-activation

1 article found

Step-3.5-Flash: The 196B Parameter Model That Makes Giants Look Wasteful
efficiency
Featured

Step-3.5-Flash: The 196B Parameter Model That Makes Giants Look Wasteful

Stepfun’s sparse MoE model activates only 11B parameters yet outperforms models 3-5x larger on coding and agentic tasks, delivering 100-300 tok/s on consumer hardware and forcing a reckoning with the parameter count arms race.

#efficiency#moe#sparse-activation...
Read More
BANANDRE
NO ONE CARES ABOUT CODE

Connect

2026 BANANDRE
Privacy PolicyTermsImpressum
Built with 🍌