⏩ Volume 23, Issue No.1, 2025 (CVAS)
Visual-Inertial Odometry for Indoor Drones Using Lightweight Fusion Networks and Temporal Consistency Models

This study presents a visual-inertial odometry solution tailored for indoor drones, utilizing lightweight sensor fusion networks and temporal consistency modeling to enhance flight stability and localization accuracy.

Jonathan Paul Whitaker, Zhang Ruihong, Aarthi Suman Reddy, Étienne Louis Charron, Haruka Miki Yamamoto, Clara Teresa Delgado

Paper ID: 32523101
✅ Access Request

Generative Scene Synthesis for Simulation Environments Using Spatial Priors and Multimodal Vision Embeddings

This paper introduces a generative approach for creating synthetic scenes in simulation environments, combining spatial priors and multimodal vision embeddings for realistic and diverse visual training data.

Leonard Francis Barrett, Liu Zhaoqiang, Shruti Manasa Iyer, Antoine Michel Perrault, Keiko Nanami Suzuki, Bianca Emilia Morales

Paper ID: 32523102
✅ Access Request

Temporal Activity Segmentation in Egocentric Video Using Multi-Stage Memory Networks and Semantic Refinement

This work presents a model for temporal segmentation of egocentric activities using multi-stage memory networks and semantic refinement modules, enabling accurate and efficient event parsing in first-person video streams.

Gavin Matthew O'Connor, Zhao Yuhan, Divya Pranathi Narayan, Laurent Nicolas Marchand, Emi Hoshiko Sakamoto, Lorena Carolina Pacheco

Paper ID: 32523103
✅ Access Request

Contextual Object Tracking in Autonomous Vehicles Using Multi-Level Attention and Frame-Adaptive Learning

This paper proposes a contextual object tracking model for autonomous driving, incorporating multi-level attention and frame-adaptive learning to maintain robust performance in dynamic road scenarios.

Isaac Raymond Bennett, Zhang Qingli, Minal Sanjana Sharma, Jean-Marc André Pelletier, Yuna Riko Nakamura, Gabriela Alejandra Ortiz

Paper ID: 32523104
✅ Access Request

Scene Graph Reasoning for Visual Question Answering Using Graph Attention Networks and Language-Guided Traversals

This research introduces a visual question answering framework that applies scene graph reasoning with graph attention and language-guided traversal strategies to align visual understanding with natural language queries.

Bradley Simon Hughes, Li Wenxin, Rina Supriya Bhandarkar, Marcel Pierre Blanchet, Hana Aiko Fujimoto, Eliza Julieta Cruz

Paper ID: 32523105
✅ Access Request

Cross-Weather Semantic Segmentation for Autonomous Vehicles Using Style Normalization and Adaptive Feature Alignment

This study presents a cross-weather semantic segmentation model using style normalization and adaptive feature alignment, enabling autonomous vehicles to maintain perception accuracy in diverse weather conditions.

Julian Thomas McBride, Zhang Xiaoyan, Anushka Megha Ranganathan, Jean-Paul André Fontaine, Yuki Natsumi Sato, Mariela Cristina Varela

Paper ID: 32523106
✅ Access Request

Spatiotemporal Graph Neural Networks for Pedestrian Behavior Prediction in Complex Traffic Intersections

This paper introduces a spatiotemporal graph neural network for predicting pedestrian trajectories in traffic intersections, leveraging scene context and motion history for safe path planning in urban mobility systems.

Nathan Gregory Ellis, Zhao Minqing, Ishaani Raghavi Kapoor, Antoine Louis Renaud, Emi Sakura Tanaka, Isadora Lucia Mendes

Paper ID: 32523107
✅ Access Request

Back