KaniTTS2’s 3GB Voice Cloning Promise: Open Source Revolution or Clever Hardware Marketing?
A new 400M parameter TTS model claims real-time voice synthesis in 3GB VRAM with full pretraining code. We dissect the architecture, benchmark the claims, and question what ‘open source’ really means in the age of AI voice cloning.