JavaScript Object Model

Vision-language-action models are the next leap in autonomous robotics

Explore how vision-language-action models like Helix, GR00T N1, and RT-1 are enabling robots to understand instructions and ...

7 ways Nano Banana 2 just got better and faster - how to try Google's latest image model

Google's new default model for generating images, Nano Banana 2 offers faster speeds, better text rendering, and higher ...

IEEE

MambaEVT: Event Stream based Visual Object Tracking using State Space Model

Abstract: Event camera-based visual tracking has drawn more and more attention in recent years due to the unique imaging principle and advantages of low energy consumption, high dynamic range, and ...

IEEE

MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model

Abstract: Object pose estimation is a core means for robots to understand and interact with their environment. For this task, monocular category-level methods are attractive as they require only a ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results