⏩ Volume 21, Issue No.4, 2023 (CVAS)
Synthetic Data Generation for Rare Action Recognition Using Probabilistic Scene Composition and Transfer Learning

This paper presents a synthetic data generation pipeline using probabilistic scene composition and transfer learning to address class imbalance in rare action recognition within visual datasets for autonomous systems.

Matthew Alan Donnelly, Chen Yiran, Kavitha Suresh Chandra, Otto Karl Reinhardt, Hideo Masaki Fujimoto, Emily Jade Monroe

Paper ID: 32321401
✅ Access Request

Vision-Based Predictive Maintenance for Industrial Machinery Using Anomaly Detection in High-Frequency Image Streams

This research introduces a predictive maintenance framework that analyzes high-frequency image streams using anomaly detection models, identifying early visual signs of mechanical faults in industrial equipment to reduce downtime.

Jonathan Blake Hammond, Liang Wenxuan, Deepa Narayanan Iyer, Felix Noah Berger, Zainab Fatima Sadiq, Han Jialin

Paper ID: 32321402
✅ Access Request

Collaborative Visual SLAM Using Edge-Assisted Fusion of Inter-Robot Observations in Dynamic Workspaces

This study proposes a collaborative SLAM solution where multiple robots share visual observations via edge fusion techniques, enhancing localization accuracy and resilience in highly dynamic indoor environments.

Christian Paul Donovan, Mei Xueying, Anuradha Sree Raghavan, Francois Xavier Marchal, Carla Denise Gomez, Zhao Tiancheng

Paper ID: 32321403
✅ Access Request

Semantic Segmentation of Satellite Imagery Using Transformer-Based Multi-Scale Contextual Decoding Networks

This paper introduces a transformer-based semantic segmentation approach tailored for satellite imagery, utilizing multi-scale contextual decoding to accurately delineate urban and agricultural features across high-resolution remote sensing data.

Gregory Michael Huxley, Tang Weihao, Sneha Priya Venkataraman, Alice Caroline Foster, Peng Xiaoru, Jamal Ibrahim Al-Fayez

Paper ID: 32321404
✅ Access Request

Few-Shot Learning for Urban Object Detection Using Meta-Learning and Hierarchical Feature Generalization

This paper explores a few-shot learning framework incorporating meta-learning and hierarchical feature generalization to enable accurate urban object detection with minimal annotated data in complex cityscapes.

Arthur Nathaniel Granger, Xu Mingyu, Leena Sudha Prabhakaran, Stefan Rudolf Mertz, Natalia Veronika Kuznetsova, Kim Hyeonwoo

Paper ID: 32321405
✅ Access Request

Vision-Based Emotion Recognition in Driver Monitoring Systems Using Hybrid CNN-RNN Attention Networks

This study introduces a hybrid CNN-RNN attention-based framework for real-time emotion recognition in driver monitoring systems, enhancing safety by detecting stress, fatigue, or distraction through facial expression analysis.

Harrison George MacLeod, Zhao Liancheng, Meenal Kalyani Deshmukh, Ethan Robert Holloway, Aurora Celeste Franco, Wang Jinhai

Paper ID: 32321406
✅ Access Request

Zero-Shot Visual Question Answering in Surveillance Videos Using Cross-Modal Conceptual Graphs

This research presents a zero-shot VQA framework for surveillance videos using cross-modal conceptual graphs, enabling intelligent systems to answer questions about unseen actions, objects, and contexts without additional training data.

Logan Peter Whitaker, Lin Chunyan, Radhika Anantha Krishnan, Julien Bernard Lefevre, Fatima Noor Al-Farsi, Zhang Mingcheng

Paper ID: 32321407
✅ Access Request

Back