


default search action
MMSP 2004: Siena, Italy
- IEEE 6th Workshop on Multimedia Signal Processing, MMSP 2004, 29 September-1 October 2004, Siena, Italy. IEEE 2004, ISBN 0-7803-8578-0

- Marco Bertini, Alberto Del Bimbo, Walter Nunziati:

Automatic annotation of video streams. 1-4 - Thomas S. Huang:

Multimedia/multimodal signal processing, analysis, and understanding. 5- - Ton Kalker:

A marriage made in heaven: cryptography and watermarking? 6 - Hasan Ertan Çetingül, Engin Erzin, Yücel Yemez, A. Murat Tekalp:

On optimal selection of lip-motion features for speaker identification. 7-10 - Yubo Ge, Jun Song, Lingnan Ge, Katsuhiko Shirai:

Approach of feature with confident weight for robust speech recognition. 11-14 - Yongjin Wang, Ling Guan:

An investigation of speech-based human emotion recognition. 15-18 - Li Deng, Zicheng Liu, Zhengyou Zhang, Alejandro Acero:

Nonlinear information fusion in multi-sensor processing - extracting and exploiting hidden dynamics of speech captured by a bone-conductive microphone. 19-22 - L. Sahafi, Tejinder S. Randhawa, R. H. Stephen Hardy:

Context-based complexity reduction of H.264 in video over wireless applications. 23-26 - Haruhisa Kato

, Yasuyuki Nakajima:
Weighting factor determination algorithm for H.264/MPEG-4 AVC weighted prediction. 27-30 - Yuh-Ruey Lee, Chia-Wen Lin, Sung-Hung Yeh, Yung-Chang Chen:

Low-complexity DCT-domain video transcoders for arbitrary-size downscaling. 31-34 - Gianluca Monaci

, Philippe Jost, Pierre Vandergheynst:
Image compression with learnt tree-structured dictionaries. 35-38 - Gernot Ziegler, Hendrik P. A. Lensch, Marcus A. Magnor, Hans-Peter Seidel:

Multi-video compression in texture space using 4D SPIHT. 39-42 - Hiroshi Hasegawa, Toshiyuki Ono, Isao Yamada, Kohichi Sakaniwa:

An iterative MPEG super-resolution with an outer approximation of framewise quantization constraint. 43-46 - Bertrand Rivet, Laurent Girin, Christian Jutten, Jean-Luc Schwartz:

Using audiovisual speech processing to improve the robustness of the separation of convolutive speech mixtures. 47-50 - Tue Lehn-Schiøler:

Multimedia mapping using continuous state space models. 51-54 - Carlo Bonamico, Carlo Braccini, Fabio Lavagetto, Maurizio Costa:

A system for real-time synthesis of subtle expressivity for life-like MPEG-4 based virtual characters. 55-58 - Nicola Adami, Marzia Corvaglia, Riccardo Leonardi:

Inferring semantics from structural annotations of audio-visual documents. 59-62 - Nishiura Denda, Takanobu Nishiura, Hideki Kawahara, Toshio Irino:

A design of audio-visual talker tracking system based on CSP analysis and frame difference in real noisy environments. 63-66 - Dietrich Van der Weken, Mike Nachtegael, Etienne E. Kerre:

Improved image quality measures using ordered histograms. 67-70 - Silvia Corchs, Gianluigi Ciocca, Raimondo Schettini:

Video summarization using a neurodynamical model of visual attention. 71-74 - Alessandro Mecocci, Gregorio Molinari:

Color recovery in outdoor environments: a novel integrated approach using retinex, gray world and stretching. 75-78 - Fabio De Vito, Davide Quaglia, Juan Carlos De Martin:

Model-based distortion estimation for perceptual classification of video packets. 79-82 - Konstantinos Rapantzikos, Nicolas Tsapatsoulis, Yannis Avrithis:

Spatiotemporal visual attention architecture for video analysis. 83-86 - Cyril Bergeron, Catherine Lamy-Bergot:

Soft-input decoding of variable-length codes applied to the H.264 standard. 87-90 - Marco Cagnazzo, Thomas André, Marc Antonini, Michel Barlaud:

A smoothly scalable and fully JPEG2000-compatible video coder. 91-94 - Teodora Petrisor, Christophe Tillier, Béatrice Pesquet-Popescu, Jean-Christophe Pesquet:

Redundant multiresolution analysis for multiple description video coding. 95-98 - Taesang Yoo, Eric Setton, Xiaoqing Zhu, Andrea J. Goldsmith, Bernd Girod:

Cross-layer design for video streaming over wireless ad hoc networks. 99-102 - Yao-Win Hong, Anna Scaglione:

Content-based multiple access: combining source and multiple access coding for sensor networks. 103-106 - Dilip Krishnaswamy, Mihaela van der Schaar:

Adaptive modulated scalable video transmission over wireless networks with a game theoretic approach. 107-110 - Claus Bauer, Mark Vinton:

Joint optimization of scale factors and Huffman code books for MPEG-4 AAC. 111-114 - Kei Kikuiri, Nobuhiko Naka, Tomoyuki Ohya:

Phase-modification for improving efficiency of audio coders. 115-118 - Johan Eilert, Andreas Ehliar, Dake Liu:

Using low precision floating point numbers to reduce memory cost for MP3 decoding. 119-122 - Tsung-Han Tsai, Yung-Tsung Wang:

Content-based retrieval of audio example on MP3 compression domain. 123-126 - Darko Kirovski, Zeph Landau:

Generalized Lempel-Ziv compression for audio. 127-130 - Jan-Mark Batke, Gunnar Eisenberg, Gunnar Weishaupt, Thomas Sikora:

Evaluation of distance measures for MPEG-7 melody contours. 131-134 - José-Emilio Vila-Forcén, Sviatoslav Voloshynovskiy, Oleksiy J. Koval, Thierry Pun, Fernando Pérez-González:

Worst case additive attack against quantization-based watermarking techniques. 135-138 - Fernando Pérez-González, Mauro Barni, Andrea Abrardo, Carlos Mosquera:

Rational dither modulation: a novel data-hiding method robust to value-metric scaling attacks. 139-142 - Bian Yang, Martin Schmucker, Xiamu Niu, Christoph Busch, Sheng-He Sun:

Reversible image watermarking by histogram modification for integer DCT coefficients. 143-146 - Chin Kiong Wang, Matthew L. Miller, Ingemar J. Cox:

Using perceptual distance to improve the selection of dirty paper trellis codes for watermarking. 147-150 - Nedeljko Cvejic, Tapio Seppänen:

A novel scheme for merging digital audio watermarking and authentication. 151-154 - Mingyou Hu, Stewart Worrall, Abdul Hamid Sadka, Ahmet M. Kondoz:

An efficient scalable object contour tracking scheme and its application for video segmentation. 155-158 - Guoping Qiu, Jeremy Morris, Xunli Fan:

From sensory coding to scene classification. 159-162 - François Coldefy, Patrick Bouthemy, Michael Betser, Guillaume Gravier:

Tennis video abstraction from audio and visual cues. 163-166 - Alfred Dielmann, Steve Renals

:
Multi-stream segmentation of meetings. 167-170 - Ibrahim Karliga, Jenq-Neng Hwang, Hwa-Jong Kim:

A framework for fully automatic moving video-object segmentation based on graph partitioning and object tracking. 171-174 - Marco La Cascia, Lorenzo Valenti, Stan Sclaroff:

Fully automatic, real-time detection of facial gestures from generic video. 175-178 - Eric Setton, Bernd Girod:

Congestion-distortion optimized scheduling of video over a bottleneck link. 179-182 - Michael Kropfberger, Klaus Leopold, Hermann Hellwagner

:
Quality variations of different priority-based temporal video adaptation algorithms. 183-186 - Cristina Emilia Costa

, Fabrizio Granelli, Francesco G. B. De Natale:
Optimal energy distribution in embedded packet video transmission over wireless channels. 187-190 - Hrvoje Jenkac, Thomas Stockhammer, Günther Liebl:

H.264/AVC video transmission over MBMS in GERAN. 191-194 - Dekun Zou, Yun Q. Shi, Zhicheng Ni:

A semi-fragile lossless digital watermarking scheme based on integer wavelet transform. 195-198 - Alessandro Piva, Roberto Caldelli

, Francesco Filippini:
Data hiding for error concealment in H.264/AVC. 199-202 - Luis Pérez-Freire, Fernando Pérez-González, Sviatoslav Voloshynovskiy:

Revealing the true achievable rates of scalar Costa scheme. 203-206 - Emmanuel Garcia, Jean-Luc Dugelay, Vanessa Lopez Eslava:

Progressive hiding of a 3D object into its texture image. 207-210 - Guorong Xuan, Chengyun Yang, Yizhan Zhen, Yun Q. Shi, Zhicheng Ni:

Reversible data hiding based on wavelet spread spectrum. 211-214 - Alberto González, Miguel Ferrer, Maria de Diego, Laura Fuster:

Efficient implementation of matrix recursions in the multichannel affine projection algorithm for multichannel sound. 215-218 - Giorgio Zoia, Ruohua Zhou, Daniel Mlynek:

A multi-timbre chord/harmony analyzer based on signal processing and neural networks. 219-222 - Matti Karjalainen, Teemu Mäki-Patola:

Physics-based modeling of musical instruments for interactive virtual reality. 223-226 - Renato S. Pellegrini, Matthias Rosenthal:

Wave field synthesis with synchronous distributed signal processing. 227-230 - Florian Keiler, Udo Zölzer:

Parametric second- and fourth-order shelving filters for audio applications. 231-234 - Toshiyuki Uto, Masahiro Okuda, Masaaki Ikehara, Shinichi Takahashi:

Range image registration using multiresolution decomposition. 235-238 - Nikhil Gupta, M. N. S. Swamy, Eugene I. Plotkin:

Low-complexity video noise reduction in wavelet domain. 239-242 - Alessandro Capra, Alfio Castorina, Paolo Vivirito, Sebastiano Battiato:

Adaptive image data fusion for consumer devices application. 243-246 - Rastislav Lukac, Konstantinos N. Plataniotis:

Camera image processing system for Bayer CFA based imaging devices. 247-250 - Miroslav Vrankic

, Damir Sersic:
Image denoising based on adaptive quincunx wavelets. 251-254 - Bernardo Esteves Pires, Pedro M. Q. Aguiar:

Registration of images with small overlap. 255-258 - Zahir Y. Alpaslan, Alexander A. Sawchuk:

Multiple camera image acquisition models for multi-view 3D display interaction. 256-262 - Nikolaos V. Boulgouris, Konstantinos N. Plataniotis, Dimitrios Hatzinakos:

Gait recognition using dynamic time warping. 263-266 - Maurizio de Pascale, Gianluca de Pascale, Domenico Prattichizzo:

Haptic and graphic rendering of deformable objects based on GPUs. 267-270 - Fabio Antonacci, Marco Foco, Augusto Sarti, Stefano Tubaro:

Accurate and fast audio-realistic rendering of sounds in virtual environments. 271-274 - Heng Zhang, Luca Lucchese:

A fast tone reproduction algorithm for high dynamic range image display. 275-278 - Pierpaolo Baccichet, Antonio Chimienti:

A low complexity concealment algorithm for the whole-frame loss in H.264/AVC. 279-282 - Luigi Atzori, Giaime Ginesu, Alessio Raccis, Daniele D. Giusto:

A novel iterative approach for JPEG2000 error concealment. 283-286 - Jan-Ru Chen, Chun-Shien Lu, Kuo-Chin Fan:

A significant motion vector protection-based error-resilient scheme in H.264. 287-290 - Nicola Franchi, Marco Fumagalli, Rosa Lancini:

A sequence-based error-concealment algorithm applied to a balanced MD video coder. 291-294 - Tammam Tillo, Marco Grangetto, Gabriella Olmo:

A flexible error resilient scheme for JPEG 2000. 295-298 - Claudio Weidmann, Petr Kadlec, Olivia Nemethova, Ameen Al Moghrabi:

Combined sequential decoding and error concealment of H.264 video. 299-302 - Georgia Feideropoulou, Béatrice Pesquet-Popescu, Jean-Claude Belfiore:

Joint source-channel coding of scalable video on a Rayleigh fading channel. 303-306 - Khaled Lajnef, Christine Guillemot, Pierre Siohan:

Distributed coding of three sources using punctured turbo codes. 307-310 - Nicolas Gehrig, Pier Luigi Dragotti:

Distributed compression in camera sensor networks. 311-314 - Markus Flierl, Pierre Vandergheynst:

Distributed coding of dynamic scenes with motion-compensated wavelets. 315-318 - Marco F. Duarte, Yu Hen Hu:

Optimal decision fusion with applications to target detection in wireless ad hoc sensor networks. 319-322 - Anne Aaron, Prashant Ramanathan, Bernd Girod:

Wyner-Ziv coding of light fields for random access. 323-326 - Frédéric Payan, Marc Antonini:

Model-based bit allocation for normal mesh compression. 327-330 - Hsueh-Yi Sean Lin, Hong-Yuan Mark Liao, Ja-Chen Lin:

Visual salience-guided mesh decomposition. 331-334 - Petros Daras, Dimitrios Zarpalas, Dimitrios Tzovaras, Michael G. Strintzis:

3D model search and retrieval based on the spherical trace transform. 335-338 - Nicola Orio, Pietro Zanuttigh, Guido Maria Cortelazzo:

Content-based retrieval of 3D models based on multiple aspects. 339-342 - Jana Dittmann, Danny Hesse:

Network based intrusion detection to detect steganographic communication channels: on the example of audio data. 343-346 - Marco Grangetto, Alberto Grosso, Enrico Magli:

Selective encryption of JPEG 2000 images by means of randomized arithmetic coding. 347-350 - Rastislav Lukac, Konstantinos N. Plataniotis:

A color image secret sharing scheme satisfying the perfect reconstruction property. 351-354 - Ashwin Swaminathan, Yinian Mao, Min Wu:

Image hashing resilient to geometric and filtering operations. 355-358 - Christine Sénac, Eliathamby Ambikairajah:

Audio indexing using feature warping and fusion techniques. 359-362 - Zicheng Liu, Zhengyou Zhang, Alejandro Acero, Jasha Droppo, Xuedong Huang:

Direct filtering for air- and bone-conductive microphones. 363-366 - Ernst Warsitz, Reinhold Haeb-Umbach:

Robust speaker direction estimation with particle filtering. 367-370 - S. H. Srinivasan:

Speech quality measure based on auditory scene analysis. 371-374 - David Labarre, Éric Grivel, Mohamed Najim, Ezio Todini:

Two-Kalman filters based instrumental variable techniques for speech enhancement. 375-378 - Aljoscha Smolic, Karsten Müller, Philipp Merkle, Tobias Rein, Matthias Kautzner, Peter Eisert, Thomas Wiegand:

Representation, coding, and rendering of 3D video objects with MPEG-4 and H.264/AVC. 379-382 - Prashant Ramanathan, Bernd Girod:

Random access for compressed light fields using multiple representations. 383-386 - Daniel Grest, Reinhard Koch:

Realtime multi-camera person tracking for immersive environments. 387-390 - Oliver Grau:

3D sequence generation from multiple cameras. 391-394 - Fabio Cuzzolin, Augusto Sarti, Stefano Tubaro:

Invariant action classification with volumetric data. 395-398 - Jianfen Zeng, Liang Fan:

A simple and efficient way to prune motion vector trees in MC-EZBC. 399-402 - Mulugeta Libsie, Harald Kosch:

Video adaptation using the variation factory. 403-406 - Eugenio Di Sciascio, Luigi Alfredo Grieco, Giuseppe Grassi:

A CNN-based object-oriented coding system for real-time video compression. 407-410 - Alejandro Jaimes, Naofumi Yoshida, Kazumasa Murai, Kazutaka Hirata, Jun Miyazaki:

Interactive visualization of multi-stream meeting videos based on automatic visual content analysis. 411-414 - Son Minh Tran, Marius Preda

, Françoise J. Prêteux, Kalman Fazekas:
New proposal for enhancing the interactive capability in MPEG-4. 415-418 - Amaryllis Raouzaiou, Kostas Karpouzis, Stefanos D. Kollias:

Emotion synthesis in the MPEG-4 framework. 419-422 - Nemanja Petrovic, Nebojsa Jojic, Thomas S. Huang:

Hierarchical video clustering. 423-426 - Po Hsiang Tsai, Seun Jan, Hatice Gunes:

Video object encoder using selective local-space support vector machines. 427-429 - Cristina Ribeiro

, Gabriel David
, Catalin Calistru:
A multimedia database workbench for content and context retrieval. 430-433 - Manolis Falelakis, Christos Diou, Anastasios Delopoulos:

Identification of semantics: balancing between complexity and validity. 434-437 - Akihiko Nakagawa, Shoji Arai, Andrea Kutics, Hiroyuki Tanaka:

Mapping image segments to words for image retrieval. 438-441 - Atsuo Yoshitaka, Kentaro Ueda:

Retrieving the movement of multiple objects based on spatiotemporal abstraction. 442-445 - Kivanc M. Ozonat, Robert M. Gray:

Fast Gauss mixture image classification based on the central limit theorem. 446-449 - Imre Varga:

Audio codec for mobile multimedia applications. 450-453 - Susanna Spinsante, Ennio Gambi, Alessio Perotti, Aldo Vespasiani:

A Java based VoIP application: extending LAN telephony facilities in a MGCP framework. 454-457 - David García, Jorge Martínez-Bauset, Vicent Pla:

Optimality and sensitivity study of admission control policies for multimedia wireless networks. 458-461 - J. Aa. Serensen, Kåre J. Kristoffersen, Andres Cervera, M. Schiortz, Thomas Lynge, Zoltan Safar, Lars Birkedal

:
An infrastructure for context dependent mobile multimedia communication. 462-465 - Gennaro Boggia, Pietro Camarda, Domenico Striccoli:

Bandwidth estimation in prerecorded VBR-video distribution systems exploiting stream correlation. 466-469 - Pierluigi Salvo Rossi, Athina P. Petropulu, Jie Yu, Francesco Palmieri, Giulio Iannello:

Internet loss-delay modeling by use of input/output hidden Markov models. 470-473 - Taekhyun Kim, YongJune Shin, Edward J. Powers, Min Sik Kim, Simon S. Lam:

Application of wavelet denoising to the detection of shared congestion in overlay multimedia networks. 474-477 - Woo-Jin Han:

Low-delay unconstrained motion compensated temporal filtering technique for wavelet-based fully scalable video coding. 478-481 - Rozenn Dahyot

, Niall Rea, Anil C. Kokaram, Nick G. Kingsbury:
Inlier modeling for multimedia data analysis. 482-485 - Niklas Wernersson, Tomas Skollermo, Mikael Skoglund:

Improved quantization in multiple description coding by correlating transforms. 486-489 - Òscar Divorra Escoda, Pierre Vandergheynst:

A Bayesian approach to video expansions on parametric over-complete 2-D dictionaries. 490-493 - Lorenzo Granai, Pierre Vandergheynst:

Sparse decomposition over multi-component redundant dictionaries. 494-497 - Andrea Vitali, Fabrizio S. Rovati, Roberto Rinaldo, Riccardo Bernardini, Marco Durigon:

Video streaming over lossy/variable networks by means of multiple description. 498-501 - Tanir Özçelebi, A. Murat Tekalp, M. Reha Civanlar:

Optimal rate and input format control for content and context adaptive streaming of sports videos. 502-505 - Andreas Schorr, Andreas Kassler, Goran Petrovic:

Adaptive media streaming in heterogeneous wireless networks. 506-509 - Chuo-Ling Chang, Sangeun Han, Bernd Girod:

Sender-based rate-distortion optimized streaming of 3-D wavelet video with low latency. 510-513 - Mark Kalman, Bernd Girod:

Rate-distortion optimized video streaming using conditional packet delay distributions. 514-517

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














