default search action
Athanasios Mouchtaris
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2018
- [j18]Anastasios Alexandridis, Athanasios Mouchtaris:
Multiple Sound Source Location Estimation in Wireless Acoustic Sensor Networks Using DOA Estimates: The Data-Association Problem. IEEE ACM Trans. Audio Speech Lang. Process. 26(2): 342-356 (2018) - 2017
- [j17]Nikolaos Stefanakis, Despoina Pavlidi, Athanasios Mouchtaris:
Perpendicular Cross-Spectra Fusion for Sound Source Localization With a Planar Microphone Array. IEEE ACM Trans. Audio Speech Lang. Process. 25(9): 1821-1835 (2017) - [j16]Nikolaos Stefanakis, Despoina Pavlidi, Athanasios Mouchtaris:
Corrections to "Perpendicular Cross-Spectra Fusion for Sound Source Localization With a Planar Microphone Array". IEEE ACM Trans. Audio Speech Lang. Process. 25(11): 2251 (2017) - [j15]Maximo Cobos, Fabio Antonacci, Anastasios Alexandridis, Athanasios Mouchtaris, Bowon Lee:
A Survey of Sound Source Localization Methods in Wireless Acoustic Sensor Networks. Wirel. Commun. Mob. Comput. 2017 (2017) - [j14]Maximo Cobos, Fabio Antonacci, Athanasios Mouchtaris, Bowon Lee:
Wireless Acoustic Sensor Networks and Applications. Wirel. Commun. Mob. Comput. 2017 (2017) - 2015
- [j13]Anthony Griffin, Anastasios Alexandridis, Despoina Pavlidi, Yiannis Mastorakis, Athanasios Mouchtaris:
Localizing multiple audio sources in a wireless acoustic sensor network. Signal Process. 107: 54-67 (2015) - [j12]Veronica Morfi, Gilles Degottex, Athanasios Mouchtaris:
Speech Analysis and Synthesis with a Computationally Efficient Adaptive Harmonic Model. IEEE ACM Trans. Audio Speech Lang. Process. 23(11): 1950-1962 (2015) - 2013
- [j11]Anastasios Alexandridis, Anthony Griffin, Athanasios Mouchtaris:
Capturing and Reproducing Spatial Audio Based on a Circular Microphone Array. J. Electr. Comput. Eng. 2013: 718574:1-718574:16 (2013) - [j10]Despoina Pavlidi, Anthony Griffin, Matthieu Puigt, Athanasios Mouchtaris:
Real-Time Multiple Sound Source Localization and Counting Using a Circular Microphone Array. IEEE Trans. Speech Audio Process. 21(10): 2193-2206 (2013) - 2011
- [j9]Anthony Griffin, Toni Hirvonen, Christos Tzagkarakis, Athanasios Mouchtaris, Panagiotis Tsakalides:
Single-Channel and Multi-Channel Sinusoidal Audio Coding Using Compressed Sensing. IEEE Trans. Speech Audio Process. 19(5): 1382-1395 (2011) - 2009
- [j8]Christos Tzagkarakis, Athanasios Mouchtaris, Panagiotis Tsakalides:
A Multichannel Sinusoidal Model Applied to Spot Microphone Signals for Immersive Audio. IEEE Trans. Speech Audio Process. 17(8): 1483-1497 (2009) - 2008
- [j7]Demetrios Cantzos, Athanasios Mouchtaris, Chris Kyriakakis:
Quality Enhancement of Compressed Audio Based on Statistical Conversion. EURASIP J. Audio Speech Music. Process. 2008 (2008) - [j6]Athanasios Mouchtaris, Kiki Karadimou, Panagiotis Tsakalides:
Multiresolution Source/Filter Model for Low Bitrate Coding of Spot Microphone Signals. EURASIP J. Audio Speech Music. Process. 2008 (2008) - 2007
- [j5]Athanasios Mouchtaris, Jan Van der Spiegel, Paul Mueller, Panagiotis Tsakalides:
A Spectral Conversion Approach to Single-Channel Speech Enhancement. IEEE Trans. Speech Audio Process. 15(4): 1180-1193 (2007) - 2006
- [j4]Athanasios Mouchtaris, Jan Van der Spiegel, Paul Mueller:
Nonparallel training for voice conversion based on a parameter adaptation approach. IEEE Trans. Speech Audio Process. 14(3): 952-963 (2006) - 2005
- [j3]Athanasios Mouchtaris, Shrikanth S. Narayanan, Chris Kyriakakis:
Multichannel audio synthesis by subband-based spectral conversion and parameter adaptation. IEEE Trans. Speech Audio Process. 13(2): 263-274 (2005) - 2003
- [j2]Athanasios Mouchtaris, Shrikanth S. Narayanan, Chris Kyriakakis:
Virtual Microphones for Multichannel Audio Resynthesis. EURASIP J. Adv. Signal Process. 2003(10): 968-979 (2003) - 2000
- [j1]Athanasios Mouchtaris, Panagiotis Reveliotis, Chris Kyriakakis:
Inverse Filter Design for Immersive Audio Rendering Over Loudspeakers. IEEE Trans. Multim. 2(2): 77-87 (2000)
Conference and Workshop Papers
- 2024
- [c98]Rupak Vignesh Swaminathan, Grant P. Strimel, Ariya Rastrow, Sri Harish Mallidi, Kai Zhen, Hieu Duy Nguyen, Nathan Susanj, Athanasios Mouchtaris:
Max-Margin Transducer Loss: Improving Sequence-Discriminative Training Using a Large-Margin Learning Strategy. ICASSP 2024: 12226-12230 - 2023
- [c97]Anastasios Alexandridis, Kanthashree Mysore Sathyendra, Grant P. Strimel, Feng-Ju Chang, Ariya Rastrow, Nathan Susanj, Athanasios Mouchtaris:
Gated Contextual Adapters For Selective Contextual Biasing In Neural Transducers. ICASSP 2023: 1-5 - [c96]Xuandi Fu, Kanthashree Mysore Sathyendra, Ankur Gandhe, Jing Liu, Grant P. Strimel, Ross McGowan, Athanasios Mouchtaris:
Robust Acoustic And Semantic Contextual Biasing In Neural Transducers For Speech Recognition. ICASSP 2023: 1-5 - [c95]Markus Müller, Anastasios Alexandridis, Zach Trozenski, Joel Whiteman, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris, Siegfried Kunzmann:
Multilingual End-To-End Spoken Language Understanding For Ultra-Low Footprint Applications. ICASSP 2023: 1-5 - [c94]Saumya Y. Sahai, Jing Liu, Thejaswi Muniyappa, Kanthashree Mysore Sathyendra, Anastasios Alexandridis, Grant P. Strimel, Ross McGowan, Ariya Rastrow, Feng-Ju Chang, Athanasios Mouchtaris, Siegfried Kunzmann:
Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech Recognition. ICASSP 2023: 1-5 - [c93]Grant P. Strimel, Yi Xie, Brian John King, Martin Radfar, Ariya Rastrow, Athanasios Mouchtaris:
Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers. ICML 2023: 32654-32676 - [c92]Martin Radfar, Paulina Lyskawa, Brandon Trujillo, Yi Xie, Kai Zhen, Jahn Heymann, Denis Filimonov, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris:
Conmer: Streaming Conformer Without Self-attention for Interactive Voice Assistants. INTERSPEECH 2023: 2198-2202 - 2022
- [c91]Kai Wei, Dillon Knox, Martin Radfar, Thanh Tran, Markus Müller, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris, Maurizio Omologo:
A Neural Prosody Encoder for End-to-End Dialogue Act Classification. ICASSP 2022: 7047-7051 - [c90]Bhuvan Agrawal, Markus Müller, Samridhi Choudhary, Martin Radfar, Athanasios Mouchtaris, Ross McGowan, Nathan Susanj, Siegfried Kunzmann:
Tie Your Embeddings Down: Cross-Modal Latent Spaces for End-to-end Spoken Language Understanding. ICASSP 2022: 7157-7161 - [c89]Anastasios Alexandridis, Kanthashree Mysore Sathyendra, Grant P. Strimel, Pavel Kveton, Jon Webb, Athanasios Mouchtaris:
TINYS2I: A Small-Footprint Utterance Classification Model with Contextual Support for On-Device SLU. ICASSP 2022: 7492-7496 - [c88]Anastasios Alexandridis, Grant P. Strimel, Ariya Rastrow, Pavel Kveton, Jon Webb, Maurizio Omologo, Siegfried Kunzmann, Athanasios Mouchtaris:
Caching Networks: Capitalizing on Common Speech for ASR. ICASSP 2022: 8412-8416 - [c87]Kanthashree Mysore Sathyendra, Thejaswi Muniyappa, Feng-Ju Chang, Jing Liu, Jinru Su, Grant P. Strimel, Athanasios Mouchtaris, Siegfried Kunzmann:
Contextual Adapters for Personalized Speech Recognition in Neural Transducers. ICASSP 2022: 8537-8541 - [c86]Kai Zhen, Hieu Duy Nguyen, Raviteja Chinta, Nathan Susanj, Athanasios Mouchtaris, Tariq Afzal, Ariya Rastrow:
Sub-8-Bit Quantization Aware Training for 8-Bit Neural Network Accelerator with On-Device Speech Recognition. INTERSPEECH 2022: 3033-3037 - [c85]Yi Xie, Jonathan Macoskey, Martin Radfar, Feng-Ju Chang, Brian John King, Ariya Rastrow, Athanasios Mouchtaris, Grant P. Strimel:
Compute Cost Amortized Transformer for Streaming ASR. INTERSPEECH 2022: 3043-3047 - [c84]Martin Radfar, Rohit Barnwal, Rupak Vignesh Swaminathan, Feng-Ju Chang, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris:
ConvRNN-T: Convolutional Augmented Recurrent Neural Network Transducers for Streaming Speech Recognition. INTERSPEECH 2022: 4431-4435 - [c83]Kaiqi Zhao, Hieu Nguyen, Animesh Jain, Nathan Susanj, Athanasios Mouchtaris, Lokesh Gupta, Ming Zhao:
Knowledge Distillation via Module Replacing for Automatic Speech Recognition with Recurrent Neural Network Transducer. INTERSPEECH 2022: 4436-4440 - [c82]Kai Zhen, Martin Radfar, Hieu Duy Nguyen, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris:
Sub-8-Bit Quantization for On-Device Speech Recognition: A Regularization-Free Approach. SLT 2022: 15-22 - [c81]Suhaila M. Shakiah, Rupak Vignesh Swaminathan, Hieu Duy Nguyen, Raviteja Chinta, Tariq Afzal, Nathan Susanj, Athanasios Mouchtaris, Grant P. Strimel, Ariya Rastrow:
Accelerator-Aware Training for Transducer-Based Speech Recognition. SLT 2022: 100-107 - 2021
- [c80]Feng-Ju Chang, Jing Liu, Martin Radfar, Athanasios Mouchtaris, Maurizio Omologo, Ariya Rastrow, Siegfried Kunzmann:
Context-Aware Transformer Transducer for Speech Recognition. ASRU 2021: 503-510 - [c79]Markus Müller, Samridhi Choudhary, Clement Chung, Athanasios Mouchtaris, Siegfried Kunzmann:
In Pursuit of Babel - Multilingual End-to-End Spoken Language Understanding. ASRU 2021: 1042-1049 - [c78]Feng-Ju Chang, Martin Radfar, Athanasios Mouchtaris, Brian John King, Siegfried Kunzmann:
End-to-End Multi-Channel Transformer for Speech Recognition. ICASSP 2021: 5884-5888 - [c77]Kai Zhen, Hieu Duy Nguyen, Feng-Ju Chang, Athanasios Mouchtaris, Ariya Rastrow:
Sparsification via Compressed Sensing for Automatic Speech Recognition. ICASSP 2021: 6009-6013 - [c76]Surabhi Punjabi, Harish Arsikere, Zeynab Raeesy, Chander Chandak, Nikhil Bhave, Ankish Bansal, Markus Müller, Sergio Murillo, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Sri Garimella, Roland Maas, Mat Hans, Athanasios Mouchtaris, Siegfried Kunzmann:
Joint ASR and Language Identification Using RNN-T: An Efficient Approach to Dynamic Language Switching. ICASSP 2021: 7218-7222 - [c75]Feng-Ju Chang, Martin Radfar, Athanasios Mouchtaris, Maurizio Omologo:
Multi-Channel Transformer Transducer for Speech Recognition. Interspeech 2021: 296-300 - [c74]Muhammad A. Shah, Joseph Szurley, Markus Müller, Athanasios Mouchtaris, Jasha Droppo:
Evaluating the Vulnerability of End-to-End Automatic Speech Recognition Models to Membership Inference Attacks. Interspeech 2021: 891-895 - [c73]Martin Radfar, Athanasios Mouchtaris, Siegfried Kunzmann, Ariya Rastrow:
FANS: Fusing ASR and NLU for On-Device SLU. Interspeech 2021: 1224-1228 - [c72]Vasileios Papadourakis, Markus Müller, Jing Liu, Athanasios Mouchtaris, Maurizio Omologo:
Phonetically Induced Subwords for End-to-End Speech Recognition. Interspeech 2021: 1992-1996 - [c71]Rupak Vignesh Swaminathan, Brian John King, Grant P. Strimel, Jasha Droppo, Athanasios Mouchtaris:
CoDERT: Distilling Encoder Representations with Co-Learning for Transducer-Based Speech Recognition. Interspeech 2021: 4543-4547 - [c70]Michael Saxon, Samridhi Choudhary, Joseph P. McKenna, Athanasios Mouchtaris:
End-to-End Spoken Language Understanding for Generalized Voice Assistants. Interspeech 2021: 4738-4742 - [c69]Jing Liu, Rupak Vignesh Swaminathan, Sree Hari Krishnan Parthasarathi, Chunchuan Lyu, Athanasios Mouchtaris, Siegfried Kunzmann:
Exploiting Large-Scale Teacher-Student Training for On-Device Acoustic Models. TDS 2021: 413-424 - 2020
- [c68]Mingzhi Yu, Hieu Duy Nguyen, Alex Sokolov, Jack Lepird, Kanthashree Mysore Sathyendra, Samridhi Choudhary, Athanasios Mouchtaris, Siegfried Kunzmann:
Multilingual Grapheme-To-Phoneme Conversion with Byte Representation. ICASSP 2020: 8234-8238 - [c67]Martin Radfar, Athanasios Mouchtaris, Siegfried Kunzmann:
End-to-End Neural Transformer Based Spoken Language Understanding. INTERSPEECH 2020: 866-870 - [c66]Hieu Duy Nguyen, Anastasios Alexandridis, Athanasios Mouchtaris:
Quantization Aware Training with Absolute-Cosine Regularization for Automatic Speech Recognition. INTERSPEECH 2020: 3366-3370 - [c65]Joseph P. McKenna, Samridhi Choudhary, Michael Saxon, Grant P. Strimel, Athanasios Mouchtaris:
Semantic Complexity in End-to-End Spoken Language Understanding. INTERSPEECH 2020: 4273-4277 - 2018
- [c64]Nikolaos Stefanakis, Symeon Delikaris-Manias, Athanasios Mouchtaris:
Acoustic Beamforming in Front of a Reflective Plane. EUSIPCO 2018: 26-30 - [c63]Nikolaos Stefanakis, Athanasios Mouchtaris:
Normalization of Partly Overlapping Audio Recordings from the Same Event Based on Relative Signal Powers. ICASSP 2018: 3141-3145 - [c62]Anastasios Alexandridis, Anthony Griffin, Athanasios Mouchtaris:
Multiple Source Location Estimation on a Dataset of Real Recordings in a Wireless Acoustic Sensor Network. MMSP 2018: 1-6 - 2017
- [c61]Nikolaos Stefanakis, Menelaos Viskadouros, Athanasios Mouchtaris:
A subjective evaluation on mixtures of crowdsourced audio recordings. EUSIPCO 2017: 1819-1823 - [c60]Symeon Delikaris-Manias, Despoina Pavlidi, Athanasios Mouchtaris, Ville Pulkki:
DOA estimation with histogram analysis of spatially constrained active intensity vectors. ICASSP 2017: 526-530 - [c59]Nikolaos Stefanakis, Stavros Chonianakis, Athanasios Mouchtaris:
Automatic matching and synchronization of user generated videos from a large scale sport event. ICASSP 2017: 3016-3020 - [c58]Anastasios Alexandridis, Nikolaos Stefanakis, Athanasios Mouchtaris:
Towards wireless acoustic sensor networks for location estimation and counting of multiple speakers in real-life conditions. ICASSP 2017: 6140-6144 - [c57]Nikolaos Stefanakis, Athanasios Mouchtaris:
Maximum component elimination in mixing of user generated audio recordings. MMSP 2017: 1-6 - 2016
- [c56]Anastasios Alexandridis, Stefanos Papadakis, Despoina Pavlidi, Athanasios Mouchtaris:
Development and evaluation of a digital MEMS microphone array for spatial audio. EUSIPCO 2016: 612-616 - [c55]Nikolaos Stefanakis, Athanasios Mouchtaris:
Direction of arrival estimation in front of a reflective plane using a circular microphone array. EUSIPCO 2016: 622-626 - [c54]Anastasios Alexandridis, Athanasios Mouchtaris:
Improving narrowband DOA estimation of sound sources using the complex Watson distribution. EUSIPCO 2016: 1468-1472 - [c53]Symeon Delikaris-Manias, Despoina Pavlidi, Ville Pulkki, Athanasios Mouchtaris:
3D localization of multiple audio sources utilizing 2D DOA histograms. EUSIPCO 2016: 1473-1477 - [c52]Nikolaos Stefanakis, Athanasios Mouchtaris:
Capturing and reproduction of a crowded sound scene using a circular microphone array. EUSIPCO 2016: 1673-1677 - [c51]Despoina Pavlidi, Symeon Delikaris-Manias, Ville Pulkki, Athanasios Mouchtaris:
3D DOA estimation of multiple sound sources based on spatially constrained beamforming driven by intensity vectors. ICASSP 2016: 96-100 - 2015
- [c50]Anastasios Alexandridis, Giorgos Borboudakis, Athanasios Mouchtaris:
Addressing the data-association problem for multiple sound source localization using DOA estimates. EUSIPCO 2015: 1551-1555 - [c49]Despoina Pavlidi, Symeon Delikaris-Manias, Ville Pulkki, Athanasios Mouchtaris:
3D localization of multiple sound sources with intensity vector estimates in single source zones. EUSIPCO 2015: 1556-1560 - [c48]Nikolaos Stefanakis, Athanasios Mouchtaris:
A multi-sensor approach for real-time detection and classification of impact sounds. EUSIPCO 2015: 2038-2042 - [c47]Nikolaos Stefanakis, Athanasios Mouchtaris:
Foreground suppression for capturing and reproduction of crowded acoustic environments. ICASSP 2015: 51-55 - [c46]Demosthenes Akoumianakis, Chrisoula Alexandraki, V. Alexiou, Christina Anagnostopoulou, A. Eleftheriadis, Vasiliki Lalioti, Yiannis Mastorakis, Apostolos Modas, Athanasios Mouchtaris, Despoina Pavlidi, George C. Polyzos, Panagiotis Tsakalides, George Xylomenos, Panagiotis Zervas:
The MusiNet project: Addressing the challenges in Networked Music Performance systems. IISA 2015: 1-6 - [c45]Anastasios Alexandridis, Athanasios Mouchtaris:
Multiple sound source location estimation and counting in a wireless acoustic sensor network. WASPAA 2015: 1-5 - 2014
- [c44]Anthony Griffin, Anastasios Alexandridis, Despoina Pavlidi, Athanasios Mouchtaris:
Real-time localization of multiple audio sources in a wireless acoustic sensor network. EUSIPCO 2014: 306-310 - [c43]Anastasios Alexandridis, Anthony Griffin, Athanasios Mouchtaris:
Breaking down the cocktail party: Capturing and isolating sources in a soundscape. EUSIPCO 2014: 1118-1122 - [c42]Christos Tzagkarakis, Stephen Becker, Athanasios Mouchtaris:
Joint low-rank representation and matrix completion under a singular value thresholding framework. EUSIPCO 2014: 1202-1206 - [c41]George P. Kafentzis, Theodora Yakoumaki, Athanasios Mouchtaris, Yannis Stylianou:
Analysis of emotional speech using an adaptive sinusoidal model. EUSIPCO 2014: 1492-1496 - [c40]Veronica Morfi, Gilles Degottex, Athanasios Mouchtaris:
A computationally efficient refinement of the fundamental frequency estimate for the Adaptive Harmonic Model. ICASSP 2014: 1478-1482 - [c39]Nikolaos Stefanakis, Yiannis Mastorakis, Athanasios Mouchtaris:
Instantaneous Detection and Classification of Impact Sound: Turning Simple Objects into Powerful Musical Control Interfaces. ICMC 2014 - [c38]Demosthenes Akoumianakis, Chrisoula Alexandraki, V. Alexiou, C. Anagnostopoulou, A. Eleftheriadis, V. Lalioti, Athanasios Mouchtaris, Despoina Pavlidi, George C. Polyzos, Panagiotis Tsakalides, George Xylomenos, Panagiotis Zervas:
The MusiNet project: Towards unraveling the full potential of Networked Music Performance systems. IISA 2014: 1-6 - [c37]Christos Tzagkarakis, Athanasios Mouchtaris:
Reconstruction of missing features based on a low-rank assumption for robust speaker identification. IISA 2014: 432-437 - 2013
- [c36]Marcelo F. Caetano, George P. Kafentzis, Athanasios Mouchtaris, Yannis Stylianou:
Adaptive sinusoidal modeling of percussive musical instrument sounds. EUSIPCO 2013: 1-5 - [c35]Christos Tzagkarakis, Athanasios Mouchtaris:
Sparsity based robust speaker identification using a discriminative dictionary learning approach. EUSIPCO 2013: 1-5 - [c34]Anastasios Alexandridis, Anthony Griffin, Athanasios Mouchtaris:
Directional coding of audio using a circular microphone array. ICASSP 2013: 296-300 - [c33]Andreas I. Koutrouvelis, Aki Härmä, Athanasios Mouchtaris:
Compressive sensing in footstep sounds, hand tremors and speech using K-SVD dictionaries. DSP 2013: 1-6 - [c32]Marcelo F. Caetano, George P. Kafentzis, Gilles Degottex, Athanasios Mouchtaris, Yannis Stylianou:
Evaluating how well filtered white noise models the residual from sinusoidal modeling of musical instrument sounds. WASPAA 2013: 1-4 - [c31]Anthony Griffin, Athanasios Mouchtaris:
Localizing multiple audio sources from DOA estimates in a wireless acoustic sensor network. WASPAA 2013: 1-4 - 2012
- [c30]Marcelo F. Caetano, Athanasios Mouchtaris, Frans Wiering:
The Role of Time in Music Emotion Recognition: Modeling Musical Emotions from Time-Varying Music Features. CMMR 2012: 171-196 - [c29]Anthony Griffin, Despoina Pavlidi, Matthieu Puigt, Athanasios Mouchtaris:
Real-time multiple speaker DOA estimation in a circular microphone array based on Matching Pursuit. EUSIPCO 2012: 2303-2307 - [c28]Despoina Pavlidi, Matthieu Puigt, Anthony Griffin, Athanasios Mouchtaris:
Real-time multiple sound source localization using a circular microphone array based on single-source confidence measures. ICASSP 2012: 2625-2628 - [c27]Matthieu Puigt, Anthony Griffin, Athanasios Mouchtaris:
Nonlinear blind mixture identification using local source sparsity and functional data clustering. SAM 2012: 481-484 - [c26]Despoina Pavlidi, Anthony Griffin, Matthieu Puigt, Athanasios Mouchtaris:
Source counting in real-time sound source localization using a circular microphone array. SAM 2012: 521-524 - 2011
- [c25]Matthieu Puigt, Anthony Griffin, Athanasios Mouchtaris:
Post-nonlinear speech mixture identification using single-source temporal zones & curve clustering. EUSIPCO 2011: 1844-1848 - [c24]Demetrios Cantzos, Athanasios Mouchtaris, Chris Kyriakakis:
Perceptually-Driven Scalable MDCT Enhancement of Compressed Audio Based on Statistical Conversion. ISM 2011: 41-46 - [c23]Georgina Tryfou, Aki Härmä, Athanasios Mouchtaris:
Tempo Estimation Based on Linear Prediction and Perceptual Modelling. ISMIR 2011: 197-202 - 2010
- [c22]Christos Tzagkarakis, Athanasios Mouchtaris:
Robust text-independent speaker identification using short test and training sessions. EUSIPCO 2010: 586-590 - [c21]Anthony Griffin, Toni Hirvonen, Athanasios Mouchtaris, Panagiotis Tsakalides:
Multichannel audio coding using sinusoidal modelling and compressed sensing. EUSIPCO 2010: 1439-1443 - [c20]Anthony Griffin, Eleni Karamichali, Athanasios Mouchtaris:
Speaker identification using sparsely excited speech signals and compressed sensing. EUSIPCO 2010: 1444-1448 - [c19]Toni Hirvonen, Athanasios Mouchtaris:
Top-down strategies in parameter selection of sinusoidal modeling of audio. ICASSP 2010: 273-276 - [c18]Toni Hirvonen, Athanasios Mouchtaris:
Sinusoidal spatial audio coding for low-bitrate binaural reproduction. ICASSP 2010: 389-392 - 2009
- [c17]Demetrios Cantzos, Athanasios Mouchtaris, Chris Kyriakakis:
Bandwidth extension of low bitrate compressed audio based on statistical conversion. ICME 2009: 97-100 - [c16]Anthony Griffin, Toni Hirvonen, Athanasios Mouchtaris, Panagiotis Tsakalides:
Encoding the sinusoidal model of an audio signal using compressed sensing. ICME 2009: 153-156 - 2008
- [c15]Demetrios Cantzos, Athanasios Mouchtaris, Chris Kyriakakis:
Synthesis of enhanced audio from low bitrate compressed audio based on unit selection and statistical conversion methods. ACSCC 2008: 2174-2179 - [c14]Christos Tzagkarakis, Athanasios Mouchtaris, Panagiotis Tsakalides:
Modeling and coding of spot microphone signals for immersive audio based on the sinusoidal model. EUSIPCO 2008: 1-5 - 2007
- [c13]Christos Tzagkarakis, Athanasios Mouchtaris, Panagiotis Tsakalides:
Sinusoidal modeling of spot microphone signals based on noise transplantation for multichannel audio coding. EUSIPCO 2007: 1362-1366 - [c12]Athanasios Mouchtaris, Yannis Agiomyrgiannakis, Yannis Stylianou:
Conditional Vector Quantization for Voice Conversion. ICASSP (4) 2007: 505-508 - [c11]Demetrios Cantzos, Athanasios Mouchtaris, Chris Kyriakakis:
Enhanced Multichannel Audio Resynthesis Through Residual Processing and Features Alignment. ICME 2007: 1267-1270 - 2006
- [c10]Athanasios Mouchtaris, Kiki Karadimou, Panagiotis Tsakalides:
Multiband source/filter representation of multichannel audio for reduction of inter-channel redundancy. EUSIPCO 2006: 1-5 - [c9]Christos Tzagkarakis, Athanasios Mouchtaris, Panagiotis Tsakalides:
Musical Genre Classification VIA Generalized Gaussian and Alpha-Stable Modeling. ICASSP (5) 2006: 217-220 - 2005
- [c8]Athanasios Mouchtaris, Jan Van der Spiegel, Paul Mueller, Panagiotis Tsakalides:
A spectral conversion approach to feature denoising and speech enhancement. INTERSPEECH 2005: 2057-2060 - 2004
- [c7]Athanasios Mouchtaris, Jan Van der Spiegel, Paul Mueller:
Non-parallel training for voice conversion by maximum likelihood constrained adaptation. ICASSP (1) 2004: 1-4 - [c6]Athanasios Mouchtaris, Jan Van der Spiegel, Paul Mueller:
A spectral conversion approach to the iterative Wiener filter for speech enhancement. ICME 2004: 1971-1974 - 2002
- [c5]Athanasios Mouchtaris, Shrikanth S. Narayanan, Chris Kyriakakis:
Effcient multichannel audio resynthesis by subband-based spectral conversion. EUSIPCO 2002: 1-4 - [c4]Athanasios Mouchtaris, Shrikanth S. Narayanan, Chris Kyriakakis:
Multiresolution spectral conversion for multichannel audio resynthesis. ICME (2) 2002: 273-276 - 2000
- [c3]Chris Kyriakakis, Athanasios Mouchtaris:
Virtual Microphones for Multichannel Audio Applications. IEEE International Conference on Multimedia and Expo (I) 2000: 11-14 - 1999
- [c2]Athanasios Mouchtaris, Panagiotis Reveliotis, Chris Kyriakakis:
Non-minimum phase inverse filter methods for immersive audio rendering. ICASSP 1999: 3077-3080 - 1998
- [c1]Athanasios Mouchtaris, Jong-soong Lim, Tomlinson Holman, Chris Kyriakakis:
Head-related transfer function synthesis for immersive audio. MMSP 1998: 155-160
Parts in Books or Collections
- 2008
- [p1]Athanasios Mouchtaris, Christos Tzagkarakis, Panagiotis Tsakalides:
Low Bitrate Coding of Spot Audio Signals for Interactive and Immersive Audio Applications. New Directions in Intelligent Interactive Multimedia 2008: 155-164
Informal and Other Publications
- 2024
- [i25]Yifan Yang, Kai Zhen, Ershad Banijamal, Athanasios Mouchtaris, Zheng Zhang:
AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning. CoRR abs/2406.18060 (2024) - 2023
- [i24]Saumya Y. Sahai, Jing Liu, Thejaswi Muniyappa, Kanthashree Mysore Sathyendra, Anastasios Alexandridis, Grant P. Strimel, Ross McGowan, Ariya Rastrow, Feng-Ju Chang, Athanasios Mouchtaris, Siegfried Kunzmann:
Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech Recognition. CoRR abs/2304.01905 (2023) - [i23]Xuandi Fu, Kanthashree Mysore Sathyendra, Ankur Gandhe, Jing Liu, Grant P. Strimel, Ross McGowan, Athanasios Mouchtaris:
Robust Acoustic and Semantic Contextual Biasing in Neural Transducers for Speech Recognition. CoRR abs/2305.05271 (2023) - [i22]Suhaila M. Shakiah, Rupak Vignesh Swaminathan, Hieu Duy Nguyen, Raviteja Chinta, Tariq Afzal, Nathan Susanj, Athanasios Mouchtaris, Grant P. Strimel, Ariya Rastrow:
Accelerator-Aware Training for Transducer-Based Speech Recognition. CoRR abs/2305.07778 (2023) - 2022
- [i21]Kai Wei, Dillon Knox, Martin Radfar, Thanh Tran, Markus Müller, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris, Maurizio Omologo:
A neural prosody encoder for end-ro-end dialogue act classification. CoRR abs/2205.05590 (2022) - [i20]Kanthashree Mysore Sathyendra, Thejaswi Muniyappa, Feng-Ju Chang, Jing Liu, Jinru Su, Grant P. Strimel, Athanasios Mouchtaris, Siegfried Kunzmann:
Contextual Adapters for Personalized Speech Recognition in Neural Transducers. CoRR abs/2205.13660 (2022) - [i19]Kai Zhen, Hieu Duy Nguyen, Raviteja Chinta, Nathan Susanj, Athanasios Mouchtaris, Tariq Afzal, Ariya Rastrow:
Sub-8-Bit Quantization Aware Training for 8-Bit Neural Network Accelerator with On-Device Speech Recognition. CoRR abs/2206.15408 (2022) - [i18]Yi Xie, Jonathan Macoskey, Martin Radfar, Feng-Ju Chang, Brian John King, Ariya Rastrow, Athanasios Mouchtaris, Grant P. Strimel:
Compute Cost Amortized Transformer for Streaming ASR. CoRR abs/2207.02393 (2022) - [i17]Martin Radfar, Rohit Barnwal, Rupak Vignesh Swaminathan, Feng-Ju Chang, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris:
ConvRNN-T: Convolutional Augmented Recurrent Neural Network Transducers for Streaming Speech Recognition. CoRR abs/2209.14868 (2022) - [i16]Kai Zhen, Martin Radfar, Hieu Duy Nguyen, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris:
Sub-8-bit quantization for on-device speech recognition: a regularization-free approach. CoRR abs/2210.09188 (2022) - 2021
- [i15]Feng-Ju Chang, Martin Radfar, Athanasios Mouchtaris, Brian John King, Siegfried Kunzmann:
End-to-End Multi-Channel Transformer for Speech Recognition. CoRR abs/2102.03951 (2021) - [i14]Kai Zhen, Hieu Duy Nguyen, Feng-Ju Chang, Athanasios Mouchtaris, Ariya Rastrow:
Sparsification via Compressed Sensing for Automatic Speech Recognition. CoRR abs/2102.04932 (2021) - [i13]Jing Liu, Rupak Vignesh Swaminathan, Sree Hari Krishnan Parthasarathi, Chunchuan Lyu, Athanasios Mouchtaris, Siegfried Kunzmann:
Exploiting Large-scale Teacher-Student Training for On-device Acoustic Models. CoRR abs/2106.06126 (2021) - [i12]Rupak Vignesh Swaminathan, Brian John King, Grant P. Strimel, Jasha Droppo, Athanasios Mouchtaris:
CoDERT: Distilling Encoder Representations with Co-learning for Transducer-based Speech Recognition. CoRR abs/2106.07734 (2021) - [i11]Michael Saxon, Samridhi Choudhary, Joseph P. McKenna, Athanasios Mouchtaris:
End-to-End Spoken Language Understanding for Generalized Voice Assistants. CoRR abs/2106.09009 (2021) - [i10]Feng-Ju Chang, Martin Radfar, Athanasios Mouchtaris, Maurizio Omologo:
Multi-Channel Transformer Transducer for Speech Recognition. CoRR abs/2108.12953 (2021) - [i9]Martin Radfar, Athanasios Mouchtaris, Siegfried Kunzmann, Ariya Rastrow:
FANS: Fusing ASR and NLU for on-device SLU. CoRR abs/2111.00400 (2021) - [i8]Feng-Ju Chang, Jing Liu, Martin Radfar, Athanasios Mouchtaris, Maurizio Omologo, Ariya Rastrow, Siegfried Kunzmann:
Context-Aware Transformer Transducer for Speech Recognition. CoRR abs/2111.03250 (2021) - 2020
- [i7]Surabhi Punjabi, Harish Arsikere, Zeynab Raeesy, Chander Chandak, Nikhil Bhave, Ankish Bansal, Markus Müller, Sergio Murillo, Ariya Rastrow, Sri Garimella, Roland Maas, Mat Hans, Athanasios Mouchtaris, Siegfried Kunzmann:
Streaming End-to-End Bilingual ASR Systems with Joint Language Identification. CoRR abs/2007.03900 (2020) - [i6]Joseph P. McKenna, Samridhi Choudhary, Michael Saxon, Grant P. Strimel, Athanasios Mouchtaris:
Semantic Complexity in End-to-End Spoken Language Understanding. CoRR abs/2008.02858 (2020) - [i5]Martin Radfar, Athanasios Mouchtaris, Siegfried Kunzmann:
End-to-End Neural Transformer Based Spoken Language Understanding. CoRR abs/2008.10984 (2020) - [i4]Bhuvan Agrawal, Markus Müller, Martin Radfar, Samridhi Choudhary, Athanasios Mouchtaris, Siegfried Kunzmann:
Tie Your Embeddings Down: Cross-Modal Latent Spaces for End-to-end Spoken Language Understanding. CoRR abs/2011.09044 (2020) - 2018
- [i3]Michalis Giannopoulos, Grigorios Tsagkatakis, Saverio G. Blasi, Farzad Toutounchi, Athanasios Mouchtaris, Panagiotis Tsakalides, Marta Mrak, Ebroul Izquierdo:
Convolutional Neural Networks for Video Quality Assessment. CoRR abs/1809.10117 (2018) - 2012
- [i2]Matthieu Puigt, Anthony Griffin, Athanasios Mouchtaris:
Post-Nonlinear Sparse Component Analysis Using Single-Source Zones and Functional Data Clustering. CoRR abs/1204.1085 (2012) - 2009
- [i1]Athanasios Mouchtaris, Panagiotis Tsakalides:
The ASPIRE Project - Sensor Networks for Immersive Multimedia Environments. ERCIM News 2009(78) (2009)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-08-08 19:18 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint