


default search action
MMM 2023, Bergen, Norway - Part II
- Duc-Tien Dang-Nguyen

, Cathal Gurrin
, Martha A. Larson
, Alan F. Smeaton, Stevan Rudinac
, Minh-Son Dao
, Christoph Trattner
, Phoebe Chen
:
MultiMedia Modeling - 29th International Conference, MMM 2023, Bergen, Norway, January 9-12, 2023, Proceedings, Part II. Lecture Notes in Computer Science 13834, Springer 2023, ISBN 978-3-031-27817-4
Multimedia Processing and Applications
- Shuo Chen, Di Li, Bobo Ju, Linhua Jiang, Dongfang Zhao:

Transparent Object Detection with Simulation Heatmap Guidance and Context Spatial Attention. 3-15 - Tianrun Chen

, Chenglong Fu, Ying Zang, Lanyun Zhu
, Jia Zhang, Papa Mao, Lingyun Sun:
Deep3DSketch+: Rapid 3D Modeling from Single Free-Hand Sketches. 16-28 - Yi-Ting Yang, Wei-Ta Chu:

Manga Text Detection with Manga-Specific Data Augmentation and Its Applications on Emotion Analysis. 29-40 - ShanShan Zhong

, Wushao Wen, Jinghui Qin:
SPEM: Self-adaptive Pooling Enhanced Attention Module for Image Recognition. 41-53 - Patrik Veselý

, Ladislav Peska
:
Less Is More: Similarity Models for Content-Based Video Retrieval. 54-65 - Wanliang Wang, Fangsen Xing, Jiacheng Chen, Hangyao Tu:

Edge Assisted Asymmetric Convolution Network for MR Image Super-Resolution. 66-78 - Weiyan Chen

, Changjian Zhu
, Shan Zhang
, Sen Xiang
:
An Occlusion Model for Spectral Analysis of Light Field Signal. 79-90 - Tianxing Feng, Zhe Zhang, Kaiqiang Xiong, Ronggang Wang:

Context-Guided Multi-view Stereo with Depth Back-Projection. 91-102 - Wei Wang, Peng Lu, Xujun Peng, Wang Yin

, Zhaoran Zhao:
RLSCNet: A Residual Line-Shaped Convolutional Network for Vanishing Point Detection. 103-114 - Jiajun Ouyang, Qingxuan Lv, Shu Zhang, Junyu Dong:

Energy Transfer Contrast Network for Unsupervised Domain Adaption. 115-126 - Xuran Deng, Chuanbin Liu, Zhiying Lu:

Recombining Vision Transformer Architecture for Fine-Grained Visual Categorization. 127-138 - Ming Gao, Shilian Wu, Zengfu Wang:

A Length-Sensitive Language-Bound Recognition Network for Multilingual Text Recognition. 139-150 - Yuan Zhang, Xiang Tian, Ziyang Zhang, Xiangmin Xu:

Lightweight Multi-level Information Fusion Network for Facial Expression Recognition. 151-163 - Duc-Tien Dang-Nguyen

, Vegard Velle Sjøen, Dinh-Hai Le, Thien-Phu Dao, Anh-Duy Tran
, Minh-Triet Tran:
Practical Analyses of How Common Social Media Platforms and Photo Storage Services Handle Uploaded Images. 164-176 - Kai Ye, Haoqin Ji, Yuan Li, Lei Wang, Peng Liu, Linlin Shen:

CCF-Net: A Cascade Center-Based Framework Towards Efficient Human Parts Detection. 177-189 - Yuhang Li, Feifan Cai, Yifei Tu, Youdong Ding:

Low-Light Image Enhancement Under Non-uniform Dark. 190-201 - Fucai Gong

, Yuchen Xie
, Le Jiang
, Keming Chen
, Yunxin Liu
, Xiaozhou Ye
, Ye Ouyang
:
A Proposal-Improved Few-Shot Embedding Model with Contrastive Learning. 202-214 - Haoqi Xu, Jian Hou, Huaqiang Yuan:

Weighted Multi-view Clustering Based on Internal Evaluation. 215-227 - Zhiqi Yan, Shuang Liang:

BENet: Boundary Enhance Network for Salient Object Detection. 228-239 - Trong-Hieu Nguyen Mau

, Quoc-Huy Trinh
, Nhat-Tan Bui
, Phuoc-Thao Vo Thi
, Minh-Van Nguyen
, Xuan-Nam Cao
, Minh-Triet Tran
, Hai-Dang Nguyen
:
PEFNet: Positional Embedding Feature for Polyp Segmentation. 240-251 - Daniele Lorenzi

, Farzad Tashtarian
, Hadi Amirpour
, Christian Timmerer
, Hermann Hellwagner
:
MCOM-Live: A Multi-Codec Optimization Model at the Edge for Live Streaming. 252-264 - Jinxin Guo, Jiaqiang Zhang, Xiaojing Zhang, Ming Ma:

LAE-Net: Light and Efficient Network for Compressed Video Action Recognition. 265-276 - Yunhong Li

, Shuai Li
, Zhenhua Yu
:
DARTS-PAP: Differentiable Neural Architecture Search by Polarization of Instance Complexity Weighted Architecture Parameters. 277-288 - Song Chen

, Chong Wang
, Weijie Liu
, Zhengjie Ye
, Jiacheng Deng
:
Pseudo-label Diversity Exploitation for Few-Shot Object Detection. 289-300 - Xinjia Xie

, Feng Liu, Shun Gai, Zhen Huang, Minghao Hu, Ankun Wang:
HSS: A Hierarchical Semantic Similarity Hard Negative Sampling Method for Dense Retrievers. 301-312 - Jingsen Fang

, Shoudong Shi, Yi Fang, Zheng Huo:
Realtime Sitting Posture Recognition on Embedded Device. 313-324 - Georgios Loupas, Theodora Pistola, Sotiris Diplaris, Konstantinos Ioannidis, Stefanos Vrochidis, Ioannis Kompatsiaris:

Comparison of Deep Learning Techniques for Video-Based Automatic Recognition of Greek Folk Dances. 325-336 - Yingnan Fu, Shu Zheng, Wenyuan Cai, Ming Gao, Cheqing Jin, Aoying Zhou:

Dynamic Feature Selection for Structural Image Content Recognition. 337-349 - Ke Dong, Hao Peng, Jie Che:

Dynamic-Static Cross Attentional Feature Fusion Method for Speech Emotion Recognition. 350-361 - Aimei Dong, Sidi Liu:

Research on Multi-task Semantic Segmentation Based on Attention and Feature Fusion Method. 362-373 - Minyan Zheng

, Jianping Luo
:
Space-Time Video Super-Resolution 3D Transformer. 374-385 - Despoina Touska

, Konstantinos Gkountakos
, Theodora Tsikrika, Konstantinos Ioannidis, Stefanos Vrochidis, Ioannis Kompatsiaris:
Graph-Based Data Association in Multiple Object Tracking: A Survey. 386-398 - Chaoqun Niu, Yuan Li, Jian Wang, Jizhe Zhou, Tu Xiong, Dong Yu, Huili Guo, Lin Zhang, Weibo Liang

, Jiancheng Lv:
Multi-view Adaptive Bone Activation from Chest X-Ray with Conditional Adversarial Nets. 399-410 - Wei Luo, Mengying Xu, Hanjiang Lai

:
Multimodal Reconstruct and Align Net for Missing Modality Problem in Sentiment Analysis. 411-422 - Ping Feng, Hanyun Zhang, Yingying Sun, Zhenjun Tang:

Lightweight Image Hashing Based on Knowledge Distillation and Optimal Transport for Face Retrieval. 423-434 - Shengwei Zhao, Yuying Liu, Shaoyi Du, Zhiqiang Tian, Ting Qu, Linhai Xu:

CMFG: Cross-Model Fine-Grained Feature Interaction for Text-Video Retrieval. 435-445 - Xiaoqiong Liu, Yuewei Lin, Qing Yang, Heng Fan:

Transferable Adversarial Attack on 3D Object Tracking in Point Cloud. 446-458 - Xiangqi Gan

, Changjian Zhu
, Mengqin Bai
, Ying Wei
, Weiyan Chen
:
A Spectrum Dependent Depth Layered Model for Optimization Rendering Quality of Light Field. 459-470 - Jing Yang

, Junwen Chen
, Keiji Yanai
:
Transformer-Based Cross-Modal Recipe Embeddings with Large Batch Training. 471-482 - Yuanhang Yin, Yang Hua

, Tao Song
, Ruhui Ma, Haibing Guan:
Self-supervised Multi-object Tracking with Cycle-Consistency. 483-495 - Chih-Wei Lin, Zhongsheng Chen, Xiuping Huang, Suhui Yang:

Video-Based Precipitation Intensity Recognition Using Dual-Dimension and Dual-Scale Spatiotemporal Convolutional Neural Network. 496-509 - Elissavet Batziou, Konstantinos Ioannidis, Ioannis Patras, Stefanos Vrochidis, Ioannis Kompatsiaris:

Low-Light Image Enhancement Based on U-Net and Haar Wavelet Pooling. 510-522 - Vijay John, Yasutomo Kawanishi

:
Audio-Visual Sensor Fusion Framework Using Person Attributes Robust to Missing Visual Modality for Person Recognition. 523-535 - Xinxin Zhang

, Shanliang Pan, Chengwu Qian, Jiadong Yuan:
Rumor Detection on Social Media by Using Global-Local Relations Encoding Network. 536-548 - Jinmeng Wu, Pengcheng Shu, Hanyu Hong, Xingxun Li, Lei Ma, Yaozong Zhang, Ying Zhu, Lei Wang:

Unsupervised Encoder-Decoder Model for Anomaly Prediction Task. 549-561 - Hongfeng Han, Zhiwu Lu, Ji-Rong Wen:

CTDA: Contrastive Temporal Domain Adaptation for Action Segmentation. 562-574 - Zhaoyong Yan, Liyan Ma, Xiangfeng Luo, Yan Sun:

Multi-scale and Multi-stage Deraining Network with Fourier Space Loss. 575-586 - Wenhua Gao, Lanju Zhang, Hao Yang, Yuan Zhang, Jinyao Yan, Tao Lin:

DHP: A Joint Video Download and Dynamic Bitrate Adaptation Algorithm for Short Video Streaming. 587-598 - Ting Pan, Fei Wang, Junzhou Xie, Weifeng Liu:

Generating New Paintings by Semantic Guidance. 599-610 - Maria Siopi, Giorgos Kordopatis-Zilos

, Polychronis Charitidis, Ioannis Kompatsiaris, Symeon Papadopoulos:
A Multi-Stream Fusion Network for Image Splicing Localization. 611-622 - Alexandros Oikonomidis

, Maria Pegia
, Anastasia Moumtzidou
, Ilias Gialampoukidis
, Stefanos Vrochidis
, Ioannis Kompatsiaris
:
Fusion of Multiple Classifiers Using Self Supervised Learning for Satellite Image Change Detection. 623-634 - Kazutoshi Shinoda, Yuki Takezawa

, Masahiro Suzuki, Yusuke Iwasawa, Yutaka Matsuo:
Improving the Robustness to Variations of Objects and Instructions with a Neuro-Symbolic Approach for Interactive Instruction Following. 635-646 - Jiaqin Lin, Shaoyi Du, Yuying Liu, Zhiqiang Tian, Ting Qu, Nanning Zheng:

Interpretable Driver Fatigue Estimation Based on Hierarchical Symptom Representations. 647-658 - Ly-Duyen Tran

, Dongyun Nie, Liting Zhou, Binh T. Nguyen, Cathal Gurrin
:
VAISL: Visual-Aware Identification of Semantic Locations in Lifelog. 659-670 - Xin Zhao, Zhihang Ren:

Multi-scale Gaussian Difference Preprocessing and Dual Stream CNN-Transformer Hybrid Network for Skin Lesion Segmentation. 671-682 - Peijie Dong, Xin Niu, Zimian Wei, Hengyue Pan, Dongsheng Li, Zhen Huang:

AutoRF: Auto Learning Receptive Fields with Spatial Pooling. 683-694 - Zhihong Wu, Xiwen Qu, Jun Huang, Xuangou Wu:

In-Air Handwritten Chinese Text Recognition with Attention Convolutional Recurrent Network. 695-707
BNI: Brave New Ideas
- Thu Nguyen, Andrea M. Storås, Vajira Thambawita, Steven Alexander Hicks, Pål Halvorsen, Michael A. Riegler:

Multimedia Datasets: Challenges and Future Possibilities. 711-717 - Zhengyu Zhao, Nga Dang, Martha A. Larson:

The Importance of Image Interpretation: Patterns of Semantic Misclassification in Real-World Adversarial Images. 718-725
Research2Biz
- Fredrik Håland Jensen, Oda Elise Nordberg, Andy Opel, Lars Nyre:

Students Take Charge of Climate Communication. 729-735
Demo
- Yibo Hu, Chenghao Yan, Chenyu Cao, Haorui Wang, Bin Wu:

Social Relation Graph Generation on Untrimmed Video. 739-744 - Jonathan Geffen

:
Improving Parent-Child Co-play in a Roblox Game. 745-750 - Victor Adriel de Jesus Oliveira

, Gernot Rottermanner
, Magdalena Boucher
, Stefanie Größbacher
, Peter Judmaier
, Werner Bailer
, Georg Thallinger
, Thomas Kurz
, Jakob Frank, Christoph Bauer, Gabriele Fröschl, Michael Batlogg:
Taylor - Impersonation of AI for Audiovisual Content Documentation and Search. 751-757 - Daiki Shimizu, Keiji Yanai:

Virtual Try-On Considering Temporal Consistency for Videoconferencing. 758-763

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














