


default search action
18th VISIGRAPP 2023: Lisbon, Portugal - Volume 5: VISAPP
- Petia Radeva, Giovanni Maria Farinella, Kadi Bouatouch:

Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2023, Volume 5: VISAPP, Lisbon, Portugal, February 19-21, 2023. SCITEPRESS 2023, ISBN 978-989-758-634-7
Invited Speakers
- Alexandru C. Telea:

Beyond the Third Dimension: How Multidimensional Projections and Machine Learning Can Help Each Other. 5-16 - Ferran Argelaguet:

The Infinite Loop. VISIGRAPP 2023: 17 - Vincent Hayward:

Human Tactile Mechanics and the Design of Haptic Interfaces. VISIGRAPP 2023: 19 - Liang Zheng:

Data-Centric Computer Vision. VISIGRAPP 2023: 21
Image and Video Understanding
- Peter Lorenz, Margret Keuper, Janis Keuper:

Unfolding Local Growth Rate Estimates for (Almost) Perfect Adversarial Detection. 27-38 - Wonwoo Jo, Kyungshin Lee, Jaewon Baik, Sang-Sun Lee, Dongho Choi, Hyunkyoo Park:

DaDe: Delay-Adaptive Detector for Streaming Perception. 39-46 - Warren Jouanneau, Aurélie Bugeau, Marc Palyart, Nicolas Papadakis, Laurent Vézard:

A Patch-Based Architecture for Multi-Label Classification from Single Positive Annotations. 47-58 - Maya Antoun, Daniel C. Asmar:

Human Object Interaction Detection Primed with Context. 59-68 - Bilal Abdulrahman, Zhigang Zhu:

Absolute-ROMP: Absolute Multi-Person 3D Mesh Prediction from a Single Image. 69-79 - Annika Mütze

, Matthias Rottmann, Hanno Gottschalk:
Semi-Supervised Domain Adaptation with CycleGAN Guided by Downstream Task Awareness. 80-90 - Xuan Wang, Hao Tang

, Zhigang Zhu:
A General Context Learning and Reasoning Framework for Object Detection in Urban Scenes. 91-102 - Jinlai Ning, Haoyan Guan, Michael W. Spratling

:
Rethinking the Backbone Architecture for Tiny Object Detection. 103-114 - Floris De Feyter

, Bram Claes, Toon Goedemé
:
Rotation Equivariance for Diamond Identification. 115-123 - Cyril Li, Christophe Ducottet, Sylvain Desroziers, Maxime Moreaud:

Toward Few Pixel Annotations for 3D Segmentation of Material from Electron Tomography. 124-131 - Devashish Lohani, Carlos Fernando Crispim Junior, Quentin Barthélemy, Sarah Bertrand, Lionel Robinault, Laure Tougne Rodet:

Leveraging Unsupervised and Self-Supervised Learning for Video Anomaly Detection. 132-143 - Afshin Dini

, Esa Rahtu
:
Visual Anomaly Detection and Localization with a Patch-Wise Transformer and Convolutional Model. 144-152 - Christian Limberg, Andrew Melnik, Helge J. Ritter, Helmut Prendinger:

YOLO: You Only Look 10647 Times. 153-160 - Arun Kumar Subramanian, Anoop M. Namboodiri:

On Attribute Aware Open-Set Face Verification. 161-172 - Joaquin Palma-Ugarte, Laura Jovani Estacio Cerquin, Victor Flores-Benites, Rensso Mora Colque

:
A Lightweight Gaussian-Based Model for Fast Detection and Classification of Moving Objects. 173-184 - Ryosuke Miyake, Tetsu Matsukawa, Einoshin Suzuki:

Image Generation from a Hyper Scene Graph with Trinomial Hyperedges. 185-195 - João Soares, Luís Magalhães, Rafaela Pinho, Mehrab K. Allahdad, Manuel Ferreira:

Automatic Defect Detection in Leather. 196-204 - Saeed Bakhshi Germi

, Esa Rahtu
:
IFMix: Utilizing Intermediate Filtered Images for Domain Adaptation in Classification. 205-211 - Pouya Shiri

, Amirali Baniasadi:
DeepCaps+: A Light Variant of DeepCaps. 212-220 - Zihao Guo

, Fei Li, Rujie Liu, Ryo Ishida, Genta Suzuki:
Body Part Information Additional in Multi-decoder Transformer-Based Network for Human Object Interaction Detection. 221-229 - Mohamed Ilyes Lakhal, Oswald Lanz

, Andrea Cavallaro:
Multi-View Video Synthesis Through Progressive Synthesis and Refinement. 230-238 - Muhammad Ali

, Omar Alsuwaidi
, Salman Khan:
BGD: Generalization Using Large Step Sizes to Attract Flat Minima. 239-249 - Mridula Vijendran, Frederick W. B. Li

, Hubert P. H. Shum
:
Tackling Data Bias in Painting Classification with Style Transfer. 250-261 - Arnav Varma, Elahe Arani, Bahram Zonooz:

Dynamically Modular and Sparse General Continual Learning. 262-273 - Pedro V. V. Paiva, Josué J. G. Ramos, Marina L. Gavrilova, Marco A. G. Carvalho:

Emotion Transformer: Attention Model for Pose-Based Emotion Recognition. 274-281 - Francesco Pasti

, Nicola Bellotto
:
Evaluation of Computer Vision-Based Person Detection on Low-Cost Embedded Systems. 282-293 - Otto Brookes, Majid Mirmehdi

, Hjalmar S. Kühl, Tilo Burghardt:
Triple-stream Deep Metric Learning of Great Ape Behavioural Actions. 294-302 - David Dueñas Gaviria, Md Mostafa Kamal Saker, Petia Radeva:

Efficient Deep Learning Ensemble for Skin Lesion Classification. 303-314 - Barbara Caroline Benato, Alexandre Xavier Falcão, Alexandru Cristian Telea:

Linking Data Separation, Visual Separation, and Classifier Performance Using Pseudo-labeling by Contrastive Learning. 315-324 - Emilie Mathian, Huidong Liu, Lynnette Fernandez-Cuesta, Dimitris Samaras, Matthieu Foll, Liming Chen:

HaloAE: A Local Transformer Auto-Encoder for Anomaly Detection and Localization Based on HaloNet. 325-337 - Aditya Kallappa, Sandeep Nagar, Girish Varma:

FInC Flow: Fast and Invertible k × k Convolutions for Normalizing Flows. 338-348 - Thomas Duboudin, Emmanuel Dellandréa, Corentin Abgrall, Gilles Hénaff, Liming Chen:

Learning Less Generalizable Patterns for Better Test-Time Adaptation. 349-358 - Silas Evandro Nachif Fernandes, Leandro A. Passos

, Danilo S. Jodas, Marco Akio, André N. de Souza, João Paulo Papa:
A Multi-Class Probabilistic Optimum-Path Forest. 361-368 - Rasna A. Amit, C. Krishna Mohan:

Quantitative Analysis to Find the Optimum Scale Range for Object Representations in Remote Sensing Images. 369-379 - Pawel Majewski

, Piotr Lampa, Robert Burduk, Jacek Reiner:
Mixing Augmentation and Knowledge-Based Techniques in Unsupervised Domain Adaptation for Segmentation of Edible Insect States. 380-387 - Kirill Prokofiev, Vladislav Sovrasov:

Combining Metric Learning and Attention Heads for Accurate and Efficient Multilabel Image Classification. 388-396 - Kira Maag

, Matthias Rottmann:
False Negative Reduction in Semantic Segmentation Under Domain Shift Using Depth Estimation. 397-408 - Jonay Suárez-Ramírez, Alejandro Betancor-Del-Rosario, Daniel Santana-Cedrés

, Nelson Monzón
:
Exploring Deep Learning Capabilities for Coastal Image Segmentation on Edge Devices. 409-418 - Yuka Ogino, Yuho Shoji, Takahiro Toizumi, Ryoma Oami, Masato Tsukada:

Fast Eye Detector Using Siamese Network for NIR Partial Face Images. 419-428 - Hiroaki Minoura, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi:

Understanding of Feature Representation in Convolutional Neural Networks and Vision Transformer. 429-436 - Eduardo de O. Andrade, Igor Garcia Ballhausen Sampaio, Joris Guérin

, José Viterbo:
Combining Two Adversarial Attacks Against Person Re-Identification Systems. 437-444 - Takahiro Suzuki, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi:

1D-SalsaSAN: Semantic Segmentation of LiDAR Point Cloud with Self-Attention. 445-452 - Yuki Saito, Hideo Saito, Vincent Frémont:

Monocular Depth Estimation for Tilted Images via Gravity Rectifier. 453-463 - Michael Danner, Bakir Hadzic

, Robert Radloff, Xueping Su, Le Ping Peng, Thomas Weber, Matthias Rätsch:
Overcome Ethnic Discrimination with Unbiased Machine Learning for Facial Data Sets. 464-471 - Sayeh Gholipour Picha

, Dawood Al Chanti, Alice Caplier:
How far Generated Data Can Impact Neural Networks Performance? 472-479 - Timothée Fréville, Charles Hamesse, Benoît Pairet, Rob Haelterman:

Object Detection in Floor Plans for Automated VR Environment Generation. 480-486 - Simon Thomine, Hichem Snoussi, Mahmoud Soua:

MixedTeacher: Knowledge Distillation for Fast Inference Textural Anomaly Detection. 487-494 - Jose Huaman, Felix O. Sumari H., Luigy Machaca, Esteban Clua, Joris Guérin

:
Benchmarking Person Re-Identification Datasets and Approaches for Practical Real-World Implementations. 495-502 - Taiki Yano, Nobutaka Kimura, Kiyoto Ito:

Surface-Graph-Based 6DoF Object-Pose Estimation for Shrink-Wrapped Items Applicable to Mixed Depalletizing Robots. 503-511 - Farzan Heidari, Michael A. Bauer:

Impact of Vehicle Speed on Traffic Signs Missed by Drivers. 512-519 - Abir Fathallah, Mounim A. El-Yacoubi, Najoua Essoukri Ben Amara:

Transfer Learning for Word Spotting in Historical Arabic Documents Based Triplet-CNN. 520-527 - Zhangchi Lu, Mertcan Cokbas, Prakash Ishwar, Janusz Konrad

:
Estimating Distances Between People Using a Single Overhead Fisheye Camera with Application to Social-Distancing Oversight. 528-535 - Luis E. Chuquimarca

, Boris Xavier Vintimilla, Sergio A. Velastin:
Banana Ripeness Level Classification Using a Simple CNN Model Trained with Real and Synthetic Datasets. 536-543 - Reshawn Ramjattan, Rajeev Ratan, Shiva Ramoudith, Patrick Hosein, Daniele Mazzei:

Using Continual Learning on Edge Devices for Cost-Effective, Efficient License Plate Detection. 544-550 - Daniel Perazzo, Thiago de Souza, Pietro Masur, Eduardo de Amorim, Pedro de Oliveira, Kelvin B. da Cunha, Lucas Maggi, Francisco Simões, Veronica Teichrieb, Lucas N. Kirsten

:
FedBID and FedDocs: A Dataset and System for Federated Document Analysis. 551-558 - Yuki Hirose, Kazuaki Nakamura, Naoko Nitta, Noboru Babaguchi:

An Experimental Consideration on Gait Spoofing. 559-566 - Masaya Mizuno, Yasutomo Kawanishi, Tomohiro Fujita, Daisuke Deguchi

, Hiroshi Murase:
Subjective Baggage-Weight Estimation from Gait: Can You Estimate How Heavy the Person Feels? 567-574 - Amal El Kaid, Karim Baïna, Jamal Baïna, Vincent Barra

:
Real-World Case Study of a Deep Learning Enhanced Elderly Person Fall Video-Detection System. 575-582 - Chenyu Wang, Toshio Endo, Takahiro Hirofuchi, Tsutomu Ikegami:

Pyramid Swin Transformer: Different-Size Windows Swin Transformer for Image Classification and Object Detection. 583-590 - Ali Raza, Muhammad Haroon Yousaf, Sergio A. Velastin, Serestina Viriri:

Human Fall Detection from Sequences of Skeleton Features using Vision Transformer. 591-598 - Yuichi Kamata, Moyuru Yamada, Takayuki Okatani:

Self-Modularized Transformer: Learn to Modularize Networks for Systematic Generalization. 599-606 - Rina Tagami, Hiroki Kobayashi, Shuichi Akizuki, Manabu Hashimoto:

Fast and Reliable Template Matching Based on Effective Pixel Selection Using Color and Intensity Information. 607-614 - Jack W. Barker, Neelanjan Bhowmik, Yona Falinie A. Gaus, Toby P. Breckon:

Robust Semi-Supervised Anomaly Detection via Adversarially Learned Continuous Noise Corruption. 615-625 - Nam Tuan Ly

, Atsuhiro Takasu:
An End-to-End Multi-Task Learning Model for Image-based Table Recognition. 626-634 - Juan Pablo Lagos, Esa Rahtu

:
PanDepth: Joint Panoptic Segmentation and Depth Completion. 635-643 - Viral Parekh, Karimulla Shaik:

Multi-Scale Feature Based Fashion Attribute Extraction Using Multi-Task Learning for e-Commerce Applications. 644-651 - Patrick Feifel, Frank Bonarens, Frank Köster:

Domain Adaptive Pedestrian Detection Based on Semantic Concepts. 652-659 - Jalila Filali, Denis Laurendeau, Steeve D. Côté

:
Environmental Information Extraction Based on YOLOv5-Object Detection in Videos Collected by Camera-Collars Installed on Migratory Caribou and Black Bears in Northern Quebec. 660-667 - Souha Mansour, Saoussen Ben Jabra, Ezzeddine Zagrouba

:
A Robust Deep Learning-Based Video Watermarking Using Mosaic Generation. 668-675 - Pawel Foszner, Agnieszka Szczesna, Luca Ciampi

, Nicola Messina, Adam Cygan, Bartosz Bizon, Michal Cogiel, Dominik Golba, Elzbieta Macioszek, Michal Staniszewski:
CrowdSim2: An Open Synthetic Benchmark for Object Detectors. 676-683 - Guilherme Gadelha, Herman Martins Gomes, Leonardo Vidal Batista:

Neural Architecture Search in the Context of Deep Multi-Task Learning. 684-691 - Deisy Chaves

, Nancy Agarwal
, Eduardo Fidalgo
, Enrique Alegre:
A Data Augmentation Strategy for Improving Age Estimation to Support CSEM Detection. 692-699 - Ryouichi Furukawa, Kazuhiro Hotta:

Shuffle Mixing: An Efficient Alternative to Self Attention. 700-707 - Takahiro Mano, Sota Kato, Kazuhiro Hotta:

Semantic Segmentation by Semi-Supervised Learning Using Time Series Constraint. 708-714 - Floris De Feyter

, Toon Goedemé
:
Joint Training of Product Detection and Recognition Using Task-Specific Datasets. 715-722 - Simon Mariani, Sander R. Klomp, Rob Romijnders, Peter H. N. de With:

The Effect of Covariate Shift and Network Training on Out-of-Distribution Detection. 723-730 - Ayato Takama, Sota Kato, Satoshi Kamiya, Kazuhiro Hotta:

Improvement of Vision Transformer Using Word Patches. 731-736 - Ana Paula dos Santos Dantas, Gabriel Bianchin de Oliveira, Daiane Mendes de Oliveira, Hélio Pedrini, Cid C. de Souza, Zanoni Dias:

Algorithmic Fairness Applied to the Multi-Label Classification Problem. 737-744 - Mohamed Dhouioui

, Tarek Frikha, Hassen Drira, Mohamed Abid:
A Novel 3D Face Reconstruction Model from a Multi-Image 2D Set. 745-753 - Laure Acin, Pierre Jacob, Camille Simon Chane, Aymeric Histace:

VK-SITS: Variable Kernel Speed Invariant Time Surface for Event-Based Recognition. 754-761 - Romain Guesdon

, Carlos Fernando Crispim Junior, Laure Tougne Rodet:
Synthetic Driver Image Generation for Human Pose-Related Tasks. 762-769 - Galina Zalesskaya, Bogna Bylicka, Eugene Liu:

How to Train an Accurate and Efficient Object Detection Model on any Dataset. 770-778 - Mircea Paul Muresan, Robert Schlanger, Radu Danescu, Sergiu Nedevschi:

Real-Time Obstacle Detection using a Pillar-based Representation and a Parallel Architecture on the GPU from LiDAR Measurements. 779-787 - Yuka Nokihara, Ryosuke Hori, Ryo Hachiuma, Hideo Saito:

Prediction of Shuttle Trajectory in Badminton Using Player's Position. 788-795 - Maria Pateraki, Panagiotis Sapoutzoglou

, Manolis I. A. Lourakis:
Crane Spreader Pose Estimation from a Single View. 796-805 - Nikolaos Poulopoulos, Emmanouil Z. Psarakis

:
Few-Shot Gaze Estimation via Gaze Transfer. 806-813 - João Almeida, Gonçalo Cruz, Diogo Silva

, Tiago Oliveira
:
Application of Deep Learning to the Detection of Foreign Object Debris at Aerodromes' Movement Area. 814-821 - Hadjer Boughanem, Haythem Ghazouani, Walid Barhoumi

:
YCbCr Color Space as an Effective Solution to the Problem of Low Emotion Recognition Rate of Facial Expressions In-The-Wild. 822-829 - Felipe Moreno Vera, Edgar Medina, Jorge Poco

:
WSAM: Visual Explanations from Style Augmentation as Adversarial Attacker and Their Influence in Image Classification. 830-837 - Xuehao Liu, Sarah Jane Delany

, Susan McKeever:
Applying Positional Encoding to Enhance Vision-Language Transformers. 838-845 - Odalisio L. S. Neto, Felipe G. Oliveira, João M. B. Cavalcanti, José L. S. Pio:

Brazilian Banknote Recognition Based on CNN for Blind People. 846-853 - Natália F. de C. Meira, Ricardo C. Câmara de M. Santos

, Mateus C. Silva, Eduardo José da S. Luz, Ricardo A. R. Oliveira:
Towards an Automatic System for Generating Synthetic and Representative Facial Data for Anonymization. 854-861 - Chintan Tundia, Rajiv Kumar

, Om P. Damani, G. Sivakumar:
FPCD: An Open Aerial VHR Dataset for Farm Pond Change Detection. 862-869 - Rajiv Kumar

, G. Sivakumar:
DEff-GAN: Diverse Attribute Transfer for Few-Shot Image Synthesis. 870-877 - Poulami Sinhamahapatra, Lena Heidemann, Maureen Monnet, Karsten Roscher:

Towards Human-Interpretable Prototypes for Visual Assessment of Image Classification Models. 878-887 - Wafa Aissa

, Marin Ferecatu, Michel Crucianu:
Curriculum Learning for Compositional Visual Reasoning. 888-897 - Hayato Yumiya, Daisuke Deguchi

, Yasutomo Kawanishi, Hiroshi Murase:
End-to-End Gaze Grounding of a Person Pictured from Behind. 898-905 - Mattias Billast, Kevin Mets, Tom De Schepper

, José Oramas
, Steven Latré:
Human Motion Prediction on the IKEA-ASM Dataset. 906-914
Motion, Tracking and Stereo Vision
- Léo Renaut, Heike Frei, Andreas Nüchter

:
Smoothed Normal Distribution Transform for Efficient Point Cloud Registration During Space Rendezvous. 919-930 - Norio Tagawa, Ming Yang:

On Computing Three-Dimensional Camera Motion from Optical Flow Detected in Two Consecutive Frames. 931-942 - Alexander Dolokov, Niek Andresen, Katharina Hohlbaum, Christa Thöne-Reineke, Lars Lewejohann, Olaf Hellwich:

Upper Bound Tracker: A Multi-Animal Tracking Solution for Closed Laboratory Settings. 945-952 - Dominik Penk, Maik Horn, Christoph Strohmeyer

, Frank Bauer, Marc Stamminger:
DeNos22: A Pipeline to Learn Object Tracking Using Simulated Depth. 953-962 - Mahmoud Z. Khairallah, Abanob Soliman

, Fabien Bonardi, David Roussel, Samia Bouchafa:
Flow-Based Visual-Inertial Odometry for Neuromorphic Vision Sensors Using non-Linear Optimization with Online Calibration. 963-973 - Isabella de Andrade

, João Paulo Lima:
Multi-Camera 3D Pedestrian Tracking Using Graph Neural Networks. 974-981 - Ritsuki Hasegawa, Fumihiko Sakaue, Jun Sato:

3D Human Body Reconstruction from Head-Mounted Omnidirectional Camera and Light Sources. 982-989 - Akira Nagatsu, Fumihiko Sakaue, Jun Sato:

3D Reconstruction of Occluded Luminous Objects. 990-996 - Johannes Künzel, Darko Vehar, Rico Nestler, Karl-Heinz Franke, Anna Hilsmann, Peter Eisert:

System for 3D Acquisition and 3D Reconstruction Using Structured Light for Sewer Line Inspection. 997-1006 - João Marcelo X. N. Teixeira, Narjara Pimentel, Eder Barbier

, Enrico Bernard, Veronica Teichrieb, Gimena Chaves:
Low-Cost 3D Reconstruction of Caves. 1007-1014 - Junesuk Lee, Soon-Yong Park

:
3D Mapping of Indoor Parking Space Using Edge Consistency Census Transform Stereo Odometry. 1015-1020 - Ilias Lazarou, Anastasios L. Kesidis, Andreas Tsatsaris:

Real-Time Monitoring of Crowd Panic Based on Biometric and Spatiotemporal Data. 1021-1027

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














