Computer Vision Training Models

UMaine researcher aims to open the ‘black box’ of AI

The five-year, $584,034 project — “CAREER: Opening the Black Box: Advancing Interpretable Machine Learning for Computer Vision” — aims to bring greater transparency and accountability to AI-powered ...

EurekAlert!

Beyond bigger models: How efficient multimodal AI is redefining the future of intelligence

Multimodal large language models have shown powerful abilities to understand and reason across text and images, but their ...

Beyond automation: Physical AI ushers in a new era of smart machines

Nvidia CEO Jensen Huang declared physical AI as enabling “a new era of AI,” a bold proclamation now backed by concrete ...

Some elite AI researchers say language is limiting. Here's the new kind of model they are building instead.

Top AI researchers like Fei-Fei Li and Yann LeCun are developing world models, which don't rely solely on language.

Tech Xplore on MSN

New computer vision method links photos to floor plans with pixel-level accuracy

For people, matching what they see on the ground to a map is second nature. For computers, it has been a major challenge. A ...

12d

Yann LeCun’s New Startup AMI Labs: Can World Models Move Beyond Hype?

Instead of building yet another LLM, LeCun is focused on something he sees as more broadly applicable. He wants AI to learn ...

Scientific Research Publishing

On the Utility of Pose Estimation Models for Golf Swing Understanding ()

Pose Estimation, Golf Swing Analysis, Computer Vision, YOLO Pose, MediaPipe Pose, Sports Analytics, OKS Metric, Human Motion Analysis Share and Cite: Yuan, A. and Ndongmo, B. (2026) On the Utility of ...

Department of Computer Science - University of Texas at Austin

Adaptive Anatomy: 3D Models That Fit Every Form

Researchers at The University of Texas at Austin recently received support from the National Science Foundation (NSF) to ...

VentureBeat

Z.ai debuts open source GLM-4.6V, a native tool-calling vision model for multimodal reasoning

Chinese AI startup Zhipu AI aka Z.ai has released its GLM-4.6V series, a new generation of open-source vision-language models (VLMs) optimized for multimodal reasoning, frontend automation, and ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results