Speech and Audio Processing

SERIES EDITOR: B.H. Juang, Georgia Tech

Series ISSN: 1932-121X (print) 1932-1678 (electronic)

Editor Bios


Acoustical Impulse Response Functions of Music Performance Halls Acoustical Impulse Response Functions of Music Performance Halls
Douglas Frey, Rangaraj Rangayyan, Victor Coelho
Digital measurement of the analog acoustical parameters of a music performance hall is difficult. The aim of such work is to create a digital acoustical derivation that is an accurate numerical representation of the complex analog characteristics of ...
Publication Date: 04/01/2013

Read More


Speech Recognition Algorithms Based on Weighted Finite-State Transducers Speech Recognition Algorithms Based on Weighted Finite-State Transducers
Takaaki Hori, Atsushi Nakamura
This book introduces the theory, algorithms, and implementation techniques for efficient decoding in speech recognition mainly focusing on the Weighted Finite-State Transducer (WFST) approach. The decoding process for speech recognition is viewed as ...
Publication Date: 01/01/2013

Read More


DFT-Domain Based Single-Microphone Noise Reduction for Speech Enhancement DFT-Domain Based Single-Microphone Noise Reduction for Speech Enhancement
Richard C. Hendriks, Timo Gerkmann, Jesper Jensen
As speech processing devices like mobile phones, voice controlled devices, and hearing aids have increased in popularity, people expect them to work anywhere and at any time without user intervention. However, the presence of acoustical disturbances ...
Publication Date: 01/01/2013

Read More


Articulatory Speech Synthesis from the Fluid Dynamics of the Vocal Apparatus Articulatory Speech Synthesis from the Fluid Dynamics of the Vocal Apparatus
Stephen Levinson, Donald W. Davis, Jr., Scott Slimon, Jun Huang
This book addresses the problem of articulatory speech synthesis based on computed vocal tract geometries and the basic physics of sound production in it. Unlike conventional methods based on analysis/synthesis using the well-known source filter mode...
Publication Date: 01/01/2012

Read More


Speech Enhancement in the Karhunen-Loeve Expansion Domain Speech Enhancement in the Karhunen-Loeve Expansion Domain
Jacob Benesty, Jingdong Chen, Yiteng Huang
This book is devoted to the study of the problem of speech enhancement whose objective is the recovery of a signal of interest (i.e., speech) from noisy observations. Typically, the recovery process is accomplished by passing the noisy observations t...
Publication Date: 01/01/2011

Read More


Sparse Adaptive Filters for Echo Cancellation Sparse Adaptive Filters for Echo Cancellation
Constantin Paleologu, Jacob Benesty, Silviu Ciochina
Adaptive filters with a large number of coefficients are usually involved in both network and acoustic echo cancellation. Consequently, it is important to improve the convergence rate and tracking of the conventional algorithms used for these applica...
Publication Date: 01/01/2010

Read More


A Perspective on Single-Channel Frequency-Domain Speech Enhancement A Perspective on Single-Channel Frequency-Domain Speech Enhancement
Jacob Benesty, Yiteng Huang
This book focuses on a class of single-channel noise reduction methods that are performed in the frequency domain via the short-time Fourier transform (STFT). The simplicity and relative effectiveness of this class of approaches make them the dominan...
Publication Date: 01/01/2011

Read More


Multi-Pitch Estimation Multi-Pitch Estimation
Mads Christensen, Andreas Jakobsson
Periodic signals can be decomposed into sets of sinusoids having frequencies that are integer multiples of a fundamental frequency. The problem of finding such fundamental frequencies from noisy observations is important in many speech and audio appl...
Publication Date: 01/01/2009

Read More


Discriminative Learning for Speech Recognition Discriminative Learning for Speech Recognition
Xiadong He, Li Deng
In this book, we introduce the background and mainstream methods of probabilistic modeling and discriminative parameter optimization for speech recognition. The specific models treated in depth include the widely used exponential-family distributions...
Publication Date: 01/01/2008

Read More


Latent Semantic Mapping Latent Semantic Mapping
Jerome R. Bellegarda
Latent semantic mapping (LSM) is a generalization of latent semantic analysis (LSA), a paradigm originally developed to capture hidden word patterns in a text document corpus. In information retrieval, LSA enables retrieval on the basis of conceptual...
Publication Date: 01/01/2007

Read More


Dynamic Speech Models Dynamic Speech Models
Li Deng
Speech dynamics refer to the temporal characteristics in all stages of the human speech communication process. This speech “chain” starts with the formation of a linguistic message in a speaker's brain and ends with the arrival of the message in ...
Publication Date: 01/01/2006

Read More


Articulation and Intelligibility Articulation and Intelligibility
Jont B. Allen
Immediately following the Second World War, between 1947 and 1955, several classic papers quantified the fundamentals of human speech information processing and recognition. In 1947 French and Steinberg published their classic study on the articulati...
Publication Date: 01/01/2005

Read More



Result Pages:  1  Displaying 1 to 12 (of 12 products)
Browse by Subject
ACM Books
IOP Concise Physics
0 items
LATEST NEWS

Newsletter
Note: Registered customers go to: Your Account to subscribe.

E-Mail Address:

Your Name: