⏩ Volume 21, Issue No.6, 2023 (CVAS)
Cross-View Action Recognition in Surveillance Systems Using Graph-Based Relational Attention and Spatial Context Encoding

This study presents a cross-view action recognition framework that leverages graph-based relational attention with spatial context encoding, improving accuracy in multi-camera surveillance scenarios with varied viewpoints and occlusions.

Frederick Simon Langley, Huang Yaqing, Richa Kumari Deshmukh, Mateo Ignacio Paredes, Charlotte Isabelle Leclerc, Zhou Xinming

Paper ID: 32321601
✅ Access Request

Autonomous Aerial Navigation in GPS-Denied Environments Using Vision-Based Feature Matching and Global Pose Estimation

This paper develops a robust aerial navigation framework for drones in GPS-denied zones, using visual feature matching and global pose estimation to ensure precise trajectory control in unknown terrains.

Theodore Miles Hamilton, Zhang Yunqi, Shalini Vijayrao Patil, Lucas Raphael Boudreaux, Hanae Rin Nakamura, Omar Khaled Bassem

Paper ID: 32321602
✅ Access Request

Fine-Grained Material Classification for Robotic Manipulation Using Vision Transformers and Texture-Aware Attention Mechanisms

This work introduces a fine-grained material classification model for robotic grasping tasks using texture-aware attention layers in vision transformers, enabling adaptive and accurate manipulation of diverse surface types.

Quentin Louis Bernard, Xu Wenhao, Meenal Sarita Krishnan, Daniel James Hollister, Yuki Noriko Takahashi, Sofia Isabella Müller

Paper ID: 32321603
✅ Access Request

Multi-Agent Visual Coordination for Real-Time Object Handover in Human-Robot Collaborative Warehouses

This study presents a real-time visual coordination framework for multi-agent systems enabling seamless object handover in human-robot collaborative warehouses through synchronized detection, gesture parsing, and movement prediction models.

Julian Patrick Whitaker, Liu Zihan, Kavya Ramesh Nair, Thomas Philippe Verdier, Gabriela Adriana Montoya, Zhang Chengyu

Paper ID: 32321604
✅ Access Request

Interactive 3D Scene Reconstruction from Egocentric Video for Augmented Reality Assistance in Industrial Maintenance

This paper presents a 3D scene reconstruction pipeline from egocentric video footage for augmented reality tools, aiding industrial maintenance tasks with spatial overlays and real-time component identification assistance.

Leonardo Emiliano Costa, Zhang Xinyuan, Poonam Sheetal Rathi, Benjamin Charles Müller, He Wenjing, Natalia Sofía Pereira

Paper ID: 32321605
✅ Access Request

Occlusion-Robust Pedestrian Detection in Crowded Urban Environments Using Hierarchical Contextual Feature Pyramids

This study proposes a hierarchical contextual feature pyramid network to improve pedestrian detection in crowded urban environments, enhancing visibility under severe occlusions and varying human poses using refined attention layers.

Isaac Nathaniel Carmichael, Zhang Yuwei, Priyanka Deepak Nambiar, Louis Marcel Fontaine, Jin Minghao, Helena Sofia Duarte

Paper ID: 32321606
✅ Access Request

Adaptive Visual Place Recognition Using Illumination-Invariant Embeddings for Long-Term Robot Navigation

This research presents an adaptive visual place recognition system using illumination-invariant embeddings, enabling robust localization for long-term autonomous navigation across drastically changing lighting conditions and seasons.

Theodore Julian Prescott, Li Xiaowen, Anjali Mohan Iyer, Fabrizio Marco Leone, Yukiko Haruna Watanabe, Clara Isabelle Schubert

Paper ID: 32321607
✅ Access Request

Back