This two-volume set LNCS 14233-14234 constitutes the refereed proceedings of the 22nd International Conference on Image Analysis and Processing, ICIAP 2023, held in Udine, Italy, during September 1115, 2023.
The 85 full papers presented together with 7 short papers were carefully reviewed and selected from 144 submissions. The conference focuses on video analysis and understanding; pattern recognition and machine learning; deep learning; multi-view geometry and 3D computer vision; image analysis, detection and recognition; multimedia; biomedical and assistive technology; digital forensics and biometrics; image processing for cultural heritage; and robot vision.
Image Retrieval in Semiconductor Manufacturing.- Image Retrieval in
Semiconductor Manufacturing.- Self-Similarity Block for Deep Image
Denoising.- A request for clarity over the End of Sequence token in the
Self-Critical Sequence Training.- Shallow camera pipeline for night
photography enhancement.- GCK-Maps: a scene unbiased representation for
efficient Human Action Recognition.- Autism spectrum disorder identification
from visual exploration of images.- Target-Driven One-Shot Unsupervised
Domain Adaptation.- Combining identity features and artifact analysis for
Differential Morphing Attack Detection.- SynthCap: Augmenting Transformers
with Synthetic Data for Image Captioning.- An Effective CNN-Based Super
Resolution Method for Video Coding.- Medical Transformers for Boosting
Automatic Grading of Colon Carcinoma in Histological Images.- UAV
Multi-Object Tracking by combining two Deep Neural Architectures.- Consensus
Ranking for Efficient Face Image Retrieval: A Novel Method for Maximising
Precision and Recall.- Towards Explainable Navigation and
Recounting.- Towards facial expression robustness in multi-scale wild
environments.- Depth camera face recognition by normalized fractal
encodings.- Automatic Generation of Semantic Parts for Face Image
Synthesis.- Improved Bilinear Pooling For Real-Time Pose Event Camera
Relocalisation.- End-to-End Asbestos Roof Detection on Orthophotos Using
Transformer-based YOLO Deep Neural Network.- OpenFashionCLIP:
Vision-and-Language Contrastive Learning with Open-Source Fashion Data.- UAV
Multi-Object Tracking by combining two Deep Neural Architectures.- GLR:
Gradient-based Learning Rate scheduler.- A Large-scale Analysis of Athletes
Cumulative Race Time in Running Events.- Uncovering Lies: Deception Detection
in a Rolling-Dice Experiment.- Active Class Selection for Dataset Acquisition
in Sign Language Recognition.- MC-GTA: A Synthetic Benchmark for Multi-Camera
Vehicle Tracking.- A differentiable entropy model for learned image
compression.- Learning Landmarks Motion from Speech for Speaker-Agnostic 3D
Talking Heads Generation.- SCENE-pathy: Capturing the Visual Selective
Attention of People Towards Scene Elements.- Not with my name! Inferring
artists names of input strings employed by Diffusion Models.- Benchmarking
of Blind Video Deblurring Methods on Long Exposure and Resource Poor
Settings.- LieToMe: An LSTM-based Method for Deception Detection by Hand
Movements.- Spatial Transformer Generative Adversarial Network for Image
Super-Resolution.- Real-Time GAN-based Model for Underwater Image
Enhancement.- HERO: A Multi-Modal Approach on Mobile Devices for Visual-Aware
Conversational Assistance in Industrial Domains.- A Computer Vision-Based
water level monitoring system for touchless and sustainable water
dispensing.- Smoothing and Transition Matrices estimation to learn with Noisy
Labels.- Semi-supervised classification for Remote Sensing
datasets.- Exploiting Exif Data to Improve Image Classification using
Convolutional Neural Networks.- Weak Segmentation-Guided GAN for realistic
color edition.- Hand Gesture Recognition exploiting Handcrafted Features and
LSTM.- An Optimized Pipeline for Image-Based Localization in Museums from
Egocentric Images.- Annotating the Inferior Alveolar Canal: the Ultimate
Tool.- Active Class Selection for Dataset Acquisition in Sign Language
Recognition.- Enhancing PFI Prediction with GDS-MIL: A Graph-based Dual
Stream MIL Approach.