


25th DICTA 2024: Perth, Australia
- International Conference on Digital Image Computing: Techniques and Applications, DICTA 2024, Perth, Australia, November 27-29, 2024. IEEE 2024, ISBN 979-8-3503-7903-7

- Qi Zhong, Yun Ye, Xian-Feng Han:

EFITFormer: Enhanced Feature Interaction Transformer for 3D Point Cloud Understanding. 1-8 - Uchitha Rajapaksha, Hamid Laga, Dean Diepeveen, Mohammed Bennamoun, Ferdous Sohel:
Dynamic View Synthesis of Thin Structures with Short-term Movements from Monocular Videos Using Neural Radiance Fields. 9-16 - Zhixuan Gu, Sheng Ao, Minglin Chen, Yan Liu, Ye Zhang, Yulan Guo:

Unified Retrieval and Reranking Paradigm for Aerial-Ground Cross-Source 3D Place Recognition. 17-24 - Pasa Ciceklidag, Muhammad Ibrahim, Haitian Wang, Yumeng Miao, Jin B. Hong, Ghulam Mubashar Hassan, Ajmal S. Mian:
High-Definition 3D Point Cloud Mapping of the City of Subiaco in Western Australia. 25-32 - Yexing Xu, Minglin Chen, Longguang Wang, Ye Zhang, Yulan Guo:

Warp Consistent Neural Radiance Fields for Sparse Novel View Synthesis. 33-39 - Zeji Hui, Weiqin Chuah, Amirali Khodadadian Gostar, Alireza Bab-Hadiashar, Ruwan B. Tennakoon:
LAF-NeRF: Learning Artifact-Free Neural Radiance Fields from Un-Curated Image Collections with Corruptions. 40-46 - Matthieu Delmas, Renaud Séguier:

LatentForensics: Towards Frugal Deepfake Detection in the StyleGAN Latent Space. 47-53 - Miaohua Zhang, Rodrigo Santa Cruz, Yulia Arzhaeva, Xun Li, Brendan Do, Jeremy Oorloff, Mohammad Ali Armin, Zeeshan Hayder, David Ahmedt-Aristizabal:

Point-Supervised Seagrass Segmentation for 3D Underwater Habitat Mapping. 54-61 - Melvin Hartley, Nigel Brand, Christabel Brand, Melinda Hodkiewicz:
Characterising Heavy Mineral Concentrate Grain Morphology and Mineralogy with Computer Vision. 62-69 - Qi Bing, Chaoyi Zhang, Weidong Cai:

DeepIcon: A Hierarchical Network for Layer-Wise Icon Vectorization. 70-77 - Edwin Kwadwo Tenagyei, Yongsheng Gao, Andrew Lewis, Nick Nikzad, Jun Zhou:
LinCNNFormer: Hybrid Linear Vision Transformer Based on Convolution Neural Network and Linear Attention. 78-84 - Muhammad Umer Ramzan, Ali Zia, Abdelwahed Khamis, Ayman Elgharabawy, Ahmad Liaqat, Usman Ali:
Locally-Focused Face Representation for Sketch-to-Image Generation Using Noise-Induced Refinement. 85-92 - Muhammad Sohail Danish, Javed Iqbal, Mohsen Ali, M. Saquib Sarfraz, Salman H. Khan, Muhammad Haris Khan:
Perturbing Dominant Feature Modes for Single Domain-Generalized Object Detection. 93-100 - Avraham Chapman, Haiming Xu, Lingqiao Liu:

Enhancing Fine-Grained Visual Recognition in the Low-Data Regime Through Feature Magnitude Regularization. 101-108 - Vinith Kugathasan, Honglu Zhou, Zachary Izzo, Gayal Kuruppu, Sanoojan Baliah, Muhammad Haris Khan:

Matching Confidences and Softened Target Occurrences for Calibration. 109-116 - Jianyu Zhao, Yukun Wang, Ye Zhang, Hanyun Wang, Yulan Guo:

Uncertainty-Aware Cross-Modality Fusion for Visible-Infrared Object Detection. 117-125 - Javier Ureña Santiago, Thomas Ströhle, Antonio Rodríguez-Sánchez, Ruth Breu:

Vision Transformers for Weakly-Supervised Microorganism Enumeration. 126-133 - Tobias Ziegler, Marcel Müller, Abdelmajid Khelil:

ESCal: Efficient and Scalable Calibration of Camera Networks Using a Top View. 134-141 - Rahm Ranjan, David Ahmedt-Aristizabal, Mohammad Ali Armin, Juno Kim:

Developing Normative Gait Cycle Parameters for Clinical Analysis Using Human Pose Estimation. 142-149 - Adnan Munir, Abdul Jabbar Siddiqui, Aoubaida M. Al Sabbagh:
DustRobust-YOLO: Enhanced UAV Detection in Dusty Conditions. 150-157 - Sharjeel Tahir, Nima Mirnateghi, Syed Afaq Ali Shah, Ferdous Sohel:

DEER: Deep Emotion-Sets for Fine-Grained Emotion Recognition. 158-165 - Jiahao Ma, Jinguang Tong, Shan Wang, Zicheng Duan, Chuong Nguyen:
Voxelized 3D Feature Aggregation for Multiview Detection. 166-173 - Khalil Mathieu Hannouch, Stephan K. Chalup:
Generating Topologically and Geometrically Diverse Manifold Data in Dimensions Four and Below. 174-181 - Banu Wirawan Yohanes, Philip Ogunbona, Wanqing Li:
Joint Task of Image Segmentation and Classification for Object Detection. 182-189 - Zhongsui Guo, Bahman Javadi, Sonit Singh, Arcot Sowmya:
Masked-Enhanced Food Segment Anything Model for Automatic Dietary Intake Monitoring. 190-197 - Heui Yeon Bae, Morteza Saberi, Sahar Shariflou, Michael Kalloniatis, Jack Phu, Ashish Agar, Ali Cheraghian, S. Mojtaba Golzan:
Enhancing Glaucoma Diagnosis through Vision-Language Models and Large Language Model Descriptions. 198-205 - Mohsin Ali, Haider Raza, John Q. Gan, Muhammad Haris:

Integrating Spatial Information into Global Context: Summary Vision Transformer (S-ViT). 206-213 - Mahrukh Siddiqui, Shahzaib Iqbal, Bandar Alhaqbani, Bandar AlShammari, Tariq Mahmood Khan, Imran Razzak:

A Robust Algorithm for Contactless Fingerprint Enhancement and Matching. 214-220 - Xinwen Liu, Jing Wang, S. Kevin Zhou, Craig Engstrom, Shekhar S. Chandra:
Evidence-Aware Multi-Modal Data Fusion and its Application to Total Knee Replacement Prediction. 221-228 - Maryam Mehdizadeh, Cara MacNish, David Alonso-Caneiro, Ashley Gillman, Sajib Saha, Fred K. Chen:
Improving OCT Image Reconstruction Through Multi-Input GANs with Gated Attention. 229-237 - Md. Zakir Hossain, Patrick Buckley, Himadri Shekhar Mondal, Md. Rakibul Hasan, Tom Gedeon:
Predicting and Staging Hepatocellular Carcinoma from Contrast CT Scans. 238-243 - Sonit Singh, Gordon N. Stevenson, Brendan Mian, Alec Welsh, Arcot Sowmya:
Automatic Segmentation of Human Placenta from 3D Multimodal Ultrasound Data. 244-251 - Amrijit Biswas, Md. Zakir Hossain, Yan Yang, Syed Mohammed Shamsul Islam, Tom Gedeon, Shafin Rahman:

Domain Adaptation for Classifying Spontaneous Smile Videos. 252-259 - Md Mustakim Musully Pias, Tarek Hasan Al Mahmud, Md Shafiqul Islam, Khandaker Takdir Ahmed, Mohammed Jashim Uddin, Md. Alamgir Hossain, Md Zahidul Islam:

Deep Neural Network Based Adaptive Beamforming for Real-Time Speech Enhancement. 260-267 - Yuxiang An, Dongnan Liu, Weidong Cai:

Multi-Source Unsupervised Domain Adaptation for Neuron Membrane Segmentation via Feature Enhancement. 268-275 - Yiran Shi, Xiaona Yang, Xuefeng Zhou, Jun Zhou, Bo Ding:

Multi-Branch Instance Segmentation of Cervical Cells. 276-283 - Namrah Rehman, Ahmad Khan, Zia Ur Rehaman:

Parameter-Efficient Diabetic Retinopathy Grading with LoRA-Based Fine-Tuning of Vision Foundational Models. 284-291 - Mohammad Javad Shokri, Nandakishor Desai, Aravinda S. Rao, Angelos Sharobeam, Bernard Yan, Marimuthu Palaniswami:
Decoding Stroke Patterns: A Novel Deep Learning Approach to Atrial Fibrillation Risk Stratification. 292-299 - Yiheng Lyu, Lian Xu, Mohammed Bennamoun, Farid Boussaïd, Girish Dwivedi:
Importance-Aware Transformer: Addressing Intra-Class Heterogeneity in Weakly Supervised Brain Tumor Segmentation. 300-307 - Maryam Mehdizadeh, Janardhan Vignarajan, Ashu Gupta, Sajib Saha:
A Fully Automated System for Localization and Classification of Foot Bones in X-Rays. 308-312 - Shahzaib Iqbal, Muhammad Zeeshan, Mehwish Mehmood, Tariq Mahmood Khan, Imran Razzak:
TESL-Net: A Transformer-Enhanced CNN for Accurate Skin Lesion Segmentation. 313-320 - Hassan Mahmood, Farah Nawar, Syed Mohammed Shamsul Islam, Asim Iqbal:

NeuroAtlas: An Artificial Intelligence-Based Framework for Annotation, Segmentation and Registration of Large Scale Biomedical Imaging Data. 321-327 - Yue Xia, Yuan Yuan, Euijoon Ahn, Jinman Kim:
Multi-Phase and Hierarchical Unsupervised Learning Framework for Glioblastoma Sub-Region Segmentation in MRI Sequences. 328-333 - Xuesong Li, Zeeshan Hayder, Ali Zia, Connor Cassidy, Shiming Liu, Warwick Stiller, Eric A. Stone, Warren Conaty, Lars Petersson, Vivien Rolland:
MMCBE: Multi-Modality Dataset for Crop Biomass Estimation and Beyond. 334-342 - Abdul Hannan Khan, Syed Tahseen Raza Rizvi, Dheeraj Varma Chittari Macharavtu, Andreas Dengel:

oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving. 343-350 - Mona Alzahrani, Muhammad Usman, Randah Alharbi, Saeed Anwar, Ajmal Mian, Tarek Helmy:
3D Object Classification with Selective Multi-View Fusion and Shape Rendering. 351-358 - Muhammad Usman, Abdullah Almulhim, Mohammad Alaseri, Mona Alzahrani, Hamzah Luqman, Saeed Anwar:

Sketch-to-3D: Transforming Hand-Sketched Floorplans into 3D Layouts. 359-366 - Zilong Chen, Shengyun Zhao, Rui Zhong:

An Automatically Annotated Spacecraft Intelligent Perception Dataset Based on Segment Anything Model. 367-373 - Sonain Jamil, Kasem Amnuayrotchanachinda, Mengstab Abadi Amare:

LightDepthMagic: An Advanced Deep Learning and Computer Vision Framework for Realistic 3D Object Embedding in RGB Images. 374-381 - Syed Tahseen Raza Rizvi, Abdul Hannan Khan, Andreas Dengel:

DTA: Detect Them All for Safe and Reliable Autonomous Driving. 382-388 - Donghao Qiao, Farhana H. Zulkernine, Aman Anand:

CoBEVFusion Cooperative Perception with LiDAR-Camera Bird's Eye View Fusion. 389-396 - Liyana Wijayathunga, Dulitha Dabare, Alexander Rassau, Douglas Chai, Syed Mohammed Shamsul Islam:

OUTBACK: A Multimodal Synthetic Dataset for Rural Australian Off-road Robot Navigation. 397-402 - Zhiheng Tang, Chuong Nguyen, Sundaram Muthu:
Dynamic SLAM Using Video Object Segmentation: A Low Cost Setup for Mobile Robots. 403-410 - Saif Ur Rehman Khan, Zia Khan, Md. Zakir Hossain, Nicanor Mayumu, Farhana Yasmin, Younas Aziz:
Boosting the Prediction of Brain Tumor Using Two Stage BiGait Architecture. 411-418 - Mahdi Heravian Shandiz, David Alonso-Caneiro, Scott A. Read, Michael J. Collins:
A Variational Autoencoder Approach for Blink Detection in Mobile Eye Tracking Devices. 419-426 - Boyuan Tan, Yuxin Xue, Lei Bi, Jinman Kim:
A Reverse Method of Data Augmentation for High Quality PET Image Synthesis. 427-434 - Kh Tohidul Islam, Syed Mohammed Shamsul Islam, Md. Moniruzzaman, Abdul Ihdayhid:
A Hybrid Transformer-Deep Learning Model for Improved Cardiac MRI Left Ventricle Segmentation. 435-441 - Samaneh Hashemi, David Alonso-Caneiro, Michael J. Collins, Scott A. Read, Zhiyong Li:
Speckle Feature Classification for Optical Coherence Tomography Flow Rate Assessment. 442-448 - Mohammad Belal, Taimur Hassan, Abdelfatah Hassan, Nael Alsheikh, Noureldin Elhendawi, Irfan Hussain:
Integrating Features for Recognizing Human Activities through Optimized Parameters in Graph Convolutional Networks and Transformer Architectures. 449-453 - Md Rakibul Islam, Abdullah Nazib, Riad Hassan, Abu Rumman Refat, Kien Nguyen Thanh, Clinton Fookes, Md Zahidul Islam:
Automated Radiomics Based Clinically Significant Prostate Cancer (csPCa) Grade Classification from Biparametric MRI. 454-461 - Meng Li, Chaoyi Li, Can Peng, Brian C. Lovell:
Unified Framework for Histopathology Image Augmentation and Classification via Generative Models. 462-469 - Pronab Sarker, Anirudh Atmakuru, Subrata Chakraborty, Manoranjan Paul, Prabal Datta Barua, Biswajeet Pradhan:
Leveraging Convolutional Neural Networks for Precise Diagnosis of Autism Through Transfer Learning and Ensemble Model. 470-476 - Darren Chong, Sonit Singh, Arcot Sowmya:
Spectrogram-Based Imagification Applying Deep Learning on Omics Data. 477-484 - Afsah Saleem, Muhammad Sulman, Arooba Maqsood, Shiraz Bashir, Syed Zulqarnain Gilani:

Deep-Attention Feature Fusion Network for Automated Diagnosis of Diabetic Retinopathy Using Fundus Photographs. 485-492 - Pervaiz Iqbal Khan, Andreas Dengel, Sheraz Ahmed:

Improving Medical Image Classification via Representation Fusion and Contrastive Learning. 493-499 - Jing-Hong Liu, Yi Chen, Zer-Wei Lee, Chih-Yuan Hsu, Yu-Lun Yen, Pei-Yung Hsiao, Li-Chen Fu:

Enhancing Lightweight Face Information Detection Network with Multi-Clue Interaction and Part-Aware Supervision. 500-507 - Yaping Jing, Di Shao, Shang Gao, Xuequan Lu:
3D Face Recognition on Low-Quality Data via Dual Contrastive Learning. 508-514 - Muhammad Zeshan Alam, Javeria Shabbir, M. Umair Mukati:

Light Field Resolution Enhancement Framework. 515-521 - Chayan Mondal, Duc-Son Pham, Tele Tan, Tom Gedeon, Ashu Gupta:
Pursuing an Effective Vision Encoder for Enhancing Explainable X-Ray Report Generation. 522-529 - Wafa Qaiser Khan, Michael B. Farrar, Mohammad Awrangjeb, Shahla Hosseini Bai, Stephen J. Trueman, Helen M. Wallace, Tarran E. Richards, Waqas Arshid:
Assessment of Macadamia Nutrients Using Hyperspectral Data and Machine Learning. 530-537 - Felix Obunguta, Souvikhane Hanpasith, Kotaro Sasai, Kiyoyuki Kaito:
Segregation Method for Pothole and Manhole Features Segmented in Pavement Smartphone Images Through Deep Learning. 538-544 - Nima Mirnateghi, Syed Mohammed Shamsul Islam, Syed Afaq Ali Shah:

Towards Explainability of Affordance Learning in Robot Vision. 545-552 - Farah Afifah Binti Mohd Nawayai, Md Kislu Noman, Syed Mohammed Shamsul Islam, Riaz-ul-haque Mian:
Hierarchical Active Learning for Efficient Semi-Supervised Seagrass Image Classification. 553-560 - Aresha Arshad, Momina Moetesum, Adnan Ul-Hasan, Faisal Shafait:

Enhancing Multimodal Information Extraction from Visually Rich Documents with 2D Positional Embeddings. 561-568 - Zekun Long, Ali Zia, Jordi Nelis, Vivien Rolland, Jun Zhou:

A New Hyperspectral Unmixing Benchmark for Weak Signal Meat Contamination Detection. 569-576 - Hassan Mahmood, Syed Mohammed Shamsul Islam, Asim Iqbal:

Multimodal 3D Image Registration for Mapping Brain Disorders. 577-582 - Iyyakutti Iyappan Ganapathi, Syed Sadaf Ali, Sajid Javed, Neha Gour, Naoufel Werghi:

OSMGE: One-Shot Multiscale Geometric Encoding for Texture Segmentation in 3D Meshes. 583-592 - Muhammad Zafar Iqbal, Anwaar Ulhaq, Imran Razzak:

Unsupervised Nonlinear Deformable Registration Network for 4D CT Lung Imaging. 593-599 - Xinyu Wang, Muhammad Ibrahim, Atif Bin Mansoor, Hasnein Tareque, Ajmal Mian:
Automated Road Extraction and Centreline Fitting in LiDAR Point Clouds. 600-607 - Xinjie Wang, Yifan Zhang, Ke Xu, Jianwei Wan, Yulan Guo, Hanyun Wang:
CEM-DIT: Context Entropy Model with Dual Interactive Transformer for Point Cloud Geometry Compression. 608-615 - Sameeruddin Muhammad, Wei Xiang, Scott Mann, Kang Han, Supriya Nair:

TemporalSwin-FPN Net: A Novel Pipeline for Metadata-Driven Sequence Classification in Camera Trap Imagery. 616-623 - Haitian Wang, Muhammad Ibrahim, Yumeng Miao, Dustin Severtson, Atif Bin Mansoor, Ajmal S. Mian:
Multispectral Remote Sensing for Weed Detection in West Australian Agricultural Lands. 624-631 - Miaohua Zhang, Ali Cheraghian, Yi Qin, David Benn, Therese Rollan, Nariman Habili:

Efficient Atmospheric Correction for Onboard Processing Using Knowledge Distillation and Model Compression. 632-639 - Mohammad Arifur Rahman, Shyh Wei Teng, M. Manzur Murshed, Manoranjan Paul, David Brennan:
Addressing Limitations of Common Methods in Attention-Based Hyperspectral Band Selection Algorithms. 640-647 - Waqas Arshid, Mohammad Awrangjeb, Alan Wee-Chung Liew, Yongsheng Gao:
CAMVOS: Leveraging Context and Memory for Advanced Video Object Segmentation. 648-654 - Muhammad Zia Ur Rehman, Syed Mohammed Shamsul Islam, Anwaar Ulhaq, Naeem Janjua, David Blake:
Multimodal Land Use Classification: Harnessing HSI and LiDAR Integration. 655-661 - Bodan Liu, Koji Tanaka, Md. Zakir Hossain:

Paraconsistent Abductive Learning for Processing Inconsistent Information. 662-669 - Fatima Khalid, Muhammad Hanif, Qurat Ul Ain:

Maize EfficientNet Fusion: Advancing Maize Disease Detection with MF-NET. 670-676 - Moritz Bergemann, Tanmay Singha, Duc-Son Pham, Aneesh Krishna:
Domain Adversarial SegFormer. 677-684 - Austin Bevacqua, Tanmay Singha, Duc-Son Pham:
Enhancing Semantic Segmentation with Synthetic Image Generation: A Novel Approach Using Stable Diffusion and ControlNet. 685-692 - Rashidul Hasan Nabil, M. Manzur Murshed, Manoranjan Paul, Wei Luo:
360-Degree Point Cloud Compression with Adaptive Rate Control Optimisation for Regions of Interest. 693-699 - Md. Zahirul Islam, Tanvir Ahmed Redoy, Ashek Ahmmed, Manoranjan Paul, M. Manzur Murshed:
Leveraging the Cuboidal Partitioning for Low Complexity CTU Structure Prediction in Versatile Video Coding. 700-705 - Hirra Anwar, Muhammad Jawad Khan, Muhammad Fayyaz, Ajmal Saeed Mian, Faisal Shafait:

Wheat Rust Disease Segmentation from Ground Imagery. 706-713 - Ans Munir, Faisal Z. Qureshi, Muhammad Haris Khan, Mohsen Ali:

Attention Based Simple Primitives for Open-World Compositional Zero-Shot Learning. 714-721 - Madison Wright, Karlym Nam, Jinguang Tong, Sundaram Muthu, Lars Andersson, Chuong Nguyen:
Improved Safety and 3D Scanning with Human-Robot Collaboration. 722-729 - Muhammad Zaman, Tanzila Kehkashan, Adnan Akhunzada, Hashem Alaidaros, Mueen Uddin, Muhammad Azeem:
EQCNN: Enhanced Remote Sensing Imagery Classification with Circuit-Based Error-Corrected Quantum Convolutional Neural Networks. 730-737 - Boxue Hou, Zekun Long:

WaveSamba: A Wavelet Transform SSM Zero-Shot Depth Estimation Decoder. 738-744 - Sarder Tazul Islam, Sajib Saha, Sirui Li, G. M. Atiqur Rahaman, Kok Wai Wong, Shaun Frost:
Retinal Image Registration with Haar-Optimized Local Binary Descriptors for Bifurcation Points. 745-751 - Inzela Mirza, Shahzaib Iqbal, Bandar Alhaqbani, Bandar AlShammari, Tariq Mahmood Khan, Imran Razzak:

ShadowNets: Efficient and Accurate Face Recognition for Resource-Constrained Devices. 752-758 - Manoj Kumar M, T. Kishore Kumar:

Knee Joint Health Care Monitoring System using AI and IoT - Classification Approach. 759-763















