5 articles found
PaddleOCR-VL delivers SOTA performance with 80x fewer parameters than competitors, redefining OCR capabilities
The open-source vision model that’s exposing how bad traditional OCR actually is at preparing documents for LLMs
China’s vision-language model outperforms GPT-5 Mini and Claude Sonnet while running locally – and developers are taking notice
Moondream 3 promises frontier-level reasoning with blazing speed, but does it deliver or just exploit benchmark shortcuts?
Apple’s FastVLM and MobileCLIP2 models running on WebGPU prove on-device AI doesn’t need cloud servers anymore