BANANDRE
NO ONE CARES ABOUT CODE

Navigation

HomeCategories

Categories

Artificial Intelligence(530)
Software Architecture(298)
Software Development(279)
Data Engineering(149)
Engineering Management(80)
Enterprise Architecture(60)
Product Management(28)
tech(1)

Tagged with

#Inference Optimization

2 articles found

Blackwell’s 99KB Cage: How One Developer Jailbroke Qwen3.5 Performance with a 64-Line Kernel Patch
blackwell
Featured

Blackwell’s 99KB Cage: How One Developer Jailbroke Qwen3.5 Performance with a 64-Line Kernel Patch

Technical deep dive into unlocking 2x inference speed on RTX PRO 6000 Blackwell GPUs by fixing CUTLASS SMEM overflow bugs for MoE models

#blackwell#cuda#CUTLASS...
Read More
The Qwen Brain Drain: Why Alibaba’s Loss Is Your Local Inference Gain
Fine-tuning

The Qwen Brain Drain: Why Alibaba’s Loss Is Your Local Inference Gain

Alibaba’s Qwen team is imploding just as they released their best models yet. Here’s how to exploit the chaos using Unsloth to fine-tune Qwen3.5 on consumer hardware.

#Fine-tuning#Inference Optimization#qwen...
Read More
BANANDRE
NO ONE CARES ABOUT CODE

Connect

2026 BANANDRE
Privacy PolicyTermsImpressum
Built with 🍌