Jan-v2-VL’s 10x Breakthrough: Why Thinking Models Outlast Instruct Models on Long-Horizon Tasks
An 8B vision-language model executes 49 steps without failure while competitors fail at 5. The secret? Reasoning models, not instruct tuning, hold the key to long-horizon agentic capabilities.