Multimedia ComputingHumans are the best functioning example of multimedia communication and computing - that is, we understand information and experiences through the unified perspective offered by our five senses. This innovative textbook presents emerging techniques in multimedia computing from an experiential perspective in which each medium - audio, images, text, and so on - is a strong component of the complete, integrated exchange of information or experience. The authors' goal is to present current techniques in computing and communication that will lead to the development of a unified and holistic approach to computing using heterogeneous data sources. Gerald Friedland and Ramesh Jain introduce the fundamentals of multimedia computing, describing the properties of perceptually encoded information, presenting common algorithms and concepts for handling it, and outlining the typical requirements for emerging applications that use multifarious information sources. Designed for advanced undergraduate and beginning graduate courses, the book will also serve as an introduction for engineers and researchers interested in understanding the elements of multimedia and their role in building specific applications. |
Contents
A Definition | 6 |
Elements of Multimedia Computing | 15 |
Introduction to Sensors | 28 |
Sound | 36 |
Light | 51 |
Multimedia Documents | 67 |
Multimodal Integration and Synchronization | 82 |
Multimedia Systems | 95 |
Advanced Perceptual Compression | 156 |
Speech Compression | 174 |
Multimedia Information Retrieval | 188 |
Signal Processing Primer | 209 |
Multimedia Content Analysis | 233 |
Content Analysis Systems | 272 |
Content and Context | 290 |
Future Topics | 305 |
Other editions - View all
Common terms and phrases
acoustic ADPCM algorithm applications approach arithmetic coding audio bits camera capture chapter color combination communication components compression computer vision concepts consider content analysis database decoder defined described detection devices different media Discrete Cosine Transform discuss documents encoding environment example Exif experience Figure filter frame frequency function Gaussian human IEEE important information retrieval input interaction Internet JPEG linear machine learning metadata microphone Moore’s Law multimedia computing multimedia content analysis multimedia data multimedia systems multiple node objects operations output parameters perceived perception Perceptron phones pixels prediction problem pseudo-code quantization query Ramesh Jain represent require samples scene segment semantic semantic gap sensors sequence shows smartphones sound sources space spatial speaker speaker recognition specific speech recognition stream streaming media symbols synchronization tags techniques tion types typical usually values vectors visual Wiener Filter