Explore the new agentic loop pipeline using Gemma 4 and Falcon Perception for highly accurate, locally hosted image ...
Vision language models (VLMs) have made impressive strides over the past year, but can they handle real-world enterprise challenges? All signs point to yes, with one caveat: They still need maturing ...
As I highlighted in my last article, two decades after the DARPA Grand Challenge, the autonomous vehicle (AV) industry is still waiting for breakthroughs, particularly in addressing the “long tail ...
H2O.ai, a provider of open-source AI platforms, today announced two new vision-language models designed to improve document analysis and optical character recognition (OCR) tasks. The models, named ...
The field of optical image processing is undergoing a transformation driven by the rapid development of vision-language models (VLMs). A new review article published in iOptics details how these ...
Biomedical data analysis has evolved rapidly from convolutional neural network-based systems toward transformer architectures and large-scale foundation ...
If India’s AI ambitions needed a pre-India AI Impact Summit flex, Sarvam AI delivered it loud and clear. Days before the India AI Impact Summit 2026 kicks off in New Delhi, the Bengaluru-based startup ...
Figure AI has unveiled HELIX, a pioneering Vision-Language-Action (VLA) model that integrates vision, language comprehension, and action execution into a single neural network. This innovation allows ...
Hugging Face Inc. today open-sourced SmolVLM-256M, a new vision-language model with the lowest parameter count in its category. The model’s small footprint allows it to run on devices such as ...