Chinese large language model (LLM) start-ups including DeepSeek and Moonshot AI have rapidly open-sourced their latest models ...
Libby and Kanopy recently released the results of their 2025 Higher Education Survey. The findings reveal a significant shift in student expectations, with nearly three-quarters of undergraduates ...
Abstract: Remote sensing multimodal data integrates information from various sensors, providing detailed characterization and reliable monitoring capabilities for Earth observation tasks. This data is ...
Abstract: Integrating the Artificial Intelligence (AI) vision module into the robot grasping system can significantly improve its generalizability, thereby enhancing the efficiency of Human-Robot ...
Multimodal chain-of-thought (MCoT) reasoning has garnered attention for its ability to enhance step-by-step reasoning in multimodal contexts, particularly within multimodal large language models ...
MCiteBench is a benchmark for evaluating how well Multimodal Large Language Models (MLLMs) generate text with citations. It includes data from academic papers and review-rebuttal interactions, ...