Articles
- Vol.23, No.1, 2025
- Vol.22, No.6, 2024
- Vol.22, No.5, 2024
- Vol.22, No.4, 2024
- Vol.22, No.3, 2024
- Vol.22, No.2, 2024
- Vol.22, No.1, 2024
- Vol.21, No.6, 2023
- Vol.21, No.5, 2023
- Vol.21, No.4, 2023
- Vol.21, No.3, 2023
- Vol.21, No.2, 2023
- Vol.21, No.1, 2023
- Vol.20, No.6, 2022
- Vol.20, No.5, 2022
- Vol.20, No.4, 2022
- Vol.20, No.3, 2022
- Vol.20, No.2, 2022
- Vol.20, No.1, 2022
- Vol.19, No.6, 2021
- Vol.19, No.5, 2021
- Vol.19, No.4, 2021
- Vol.19, No.3, 2021
- Vol.19, No.2, 2021
- Vol.19, No.1, 2021
This study proposes a sign recognition system that withstands weather distortion. Using domain-robust transformers and multi-scale visual modeling, it ensures classification accuracy under blur, occlusion, and lighting variability in real-world conditions.
Chen Min Zhi, Xu Zhi Lin, Liu Hao Jie, Zhang Wen Rong, Gao Ping Fang
Paper ID: 32119601 | ✅ Access Request |
This paper introduces a hierarchical visual reasoning framework. It leverages object-level graphs and attention-based scene reconstruction to enable autonomous agents to interpret complex environments, supporting decision-making in unstructured and semantically rich autonomous navigation and robotic manipulation scenarios.
Akash Deepan Menon, Leonardo Marco Vargas, Haruto Jinsei Kobayashi, Ella Rose Whitman, Omar Jalil Farouq
Paper ID: 32119602 | ✅ Access Request |
This research presents a model that aligns language commands with visual inputs to guide autonomous scene exploration. Temporal attention layers enhance reasoning over multiple frames, allowing agents to navigate based on natural instructions in real-world settings.
Siddharth Varun Iyer, Tobias Alan McAllister, Mei Lin Fang, Juliana Hope Simmons, Victor Enrique D’Souza
Paper ID: 32119603 | ✅ Access Request |
This paper introduces a long-term visual place recognition framework. The model adapts to seasonal and lighting changes using weather-invariant feature encoding and cyclic temporal memory, enabling robust localization for lifelong autonomous navigation tasks.
Tanmay Rajesh Kulkarni, Chloe Frances Morgan, Hiroki Renji Nakamura, Matteo Javier de Luca, Nora Isabelle Hoffman
Paper ID: 32119604 | ✅ Access Request |
This study proposes a real-time monocular pose estimation system. Attention cascades refine keypoint localization while inverse kinematic networks reconstruct 3D human skeletons, supporting responsive human-robot collaboration and interaction in service robotics.
Kiran Dileep Natarajan, Isabelle Renee Winters, Yuki Shoji Tanaka, Patrick Elias Armstrong, Marta Sofia Jimenez
Paper ID: 32119605 | ✅ Access Request |
This work introduces a traffic scene reasoning framework using interaction graphs. Vehicles are modeled as nodes with motion features propagated across time, enabling agents to understand dynamic road environments for coordinated behavior prediction.
Devansh Arvind Pillai, Emilia Kate Reynolds, Sho Wen Matsuda, Lucien Thomas Walker, Aisha Carmen Mendes
Paper ID: 32119606 | ✅ Access Request |
This paper introduces a power-efficient depth perception system for embedded agents. Disparity maps are computed using optimized convolutional kernels, while motion-aligned attention enhances 3D consistency, enabling real-time awareness in wearable or mobile vision systems.
Parthasarathi Vikram Shenoy, Helena Alice Browne, Zhang Li Hao, Cheng Rui Wen, Koji Masaki Fujita
Paper ID: 32119607 | ✅ Access Request |
This study proposes a behavior prediction model for drivers. Scene context is used to adapt cross-domain features, and recurrent encoders track action states, improving anticipation of lane changes, braking, or acceleration in diverse traffic conditions.
Bhavik Ramesh Kanade, Emilie Jane Barrington, Yuhao Zhen Ming, Tomas Julian Frost, Carla Beatriz Mendez
Paper ID: 32119608 | ✅ Access Request |
Back