Unity Visual Script Official Tutorial

Imagination Helps Visual Reasoning, But Not Yet in Latent Space

This repository hosts the implementation and analysis details of CapImagine. It starts from systematic investigation on the internal mechanisms of latent-space visual reasoning methods through causal ...

IEEE

VR-Robo: A Real-to-Sim-to-Real Framework for Visual Robot Navigation and Locomotion

Abstract: Recent success in legged robot locomotion is attributed to the integration of reinforcement learning and physical simulators. However, these policies often encounter challenges when deployed ...

IEEE

Visual Boundary-Guided Pseudo-Labeling for Weakly Supervised 3D Point Cloud Segmentation in Indoor Environments

Abstract: Accurate segmentation of 3D point clouds in indoor scenes remains a challenging task, often hindered by the labor-intensive nature of data annotation. While weakly supervised learning ...

GitHub

TwiFF (Think With Future Frames): A Large-Scale Dataset for Dynamic Visual Reasoning

We present TwiFF, a unified model fine-tuned on a high-quality dynamic visual Chain-of-Thought (VCoT) dataset comprising 2.7 million samples. In dynamic multimodal question-answering tasks involving ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results