Visual Spatial Clip Art

VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction

VLM-3R is a unified Vision-Language Model (VLM) framework integrating 3D reconstructive instruction tuning for deep spatial understanding from monocular video. The rapid advancement of Large ...

AI Must Master Spatial Intelligence To Fully Operate In The Real World

Spatial intelligence, a frontier technology, is essential for AI to enable humanoids and autonomous vehicles to operate in the physical world.

Frontiers

Applications of AI to single-cell and spatial transcriptomics: current state-of-the-art and challenges

The final, formatted version of the article will be published soon. Artificial intelligence (AI) has become a common tool for bioinformatics, with hundreds of methods published in recent years. Due to ...

GitHub

The official repo for SpaceVista: All-Scale Visual Spatial Reasoning from $mm$ to $km$.

Spatial reasoning is the ability to perceive, interpret, and act across spatial scales, from millimeter-sized components to distant aerial scenes. All-scale spatial reasoning is fundamental to ...

Spatial Source

Tactile mapping for people with visual impairments

Navigating an unfamiliar city can be challenging, especially if your vision is impaired. While navigation apps can guide you from doorstep to doorstep and adapt your route on-the-go, they often fail ...

Microsoft

Magma: A Foundation Model for Multimodal AI Agents

We present Magma, a foundation model that serves multimodal AI agentic tasks in both the digital and physical worlds. Magma is a significant extension of vision-language (VL) models in that it not ...

IEEE

LIX: Implicitly Infusing Spatial Geometric Prior Knowledge Into Visual Semantic Segmentation for Autonomous Driving

Abstract: Despite the impressive performance achieved by data-fusion networks with duplex encoders for visual semantic segmentation, they become ineffective when spatial geometric data are not ...

IEEE

Visual-Guided Dual-Spatial Interaction Network for Fine-Grained Brain Semantic Decoding

Abstract: Brain semantic decoding has received a surge of attention in the computer vision and neuroscience disciplines. However, existing techniques ignore the sparse and implicit semantic analysis ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results