5 articles found

PaddleOCR-VL delivers SOTA performance with 80x fewer parameters than competitors, redefining OCR capabilities

The open-source vision model that’s exposing how bad traditional OCR actually is at preparing documents for LLMs

China’s vision-language model outperforms GPT-5 Mini and Claude Sonnet while running locally – and developers are taking notice

Moondream 3 promises frontier-level reasoning with blazing speed, but does it deliver or just exploit benchmark shortcuts?

Apple’s FastVLM and MobileCLIP2 models running on WebGPU prove on-device AI doesn’t need cloud servers anymore