Volume 15, Number 1, January 2007
- Paris Smaragdis:
Convolutive Speech Bases and Their Application to Supervised Speech Separation.
1-12
- Li Deng, Leo J. Lee, Hagai Attias, Alex Acero:
Adaptive Kalman Filtering and Smoothing for Tracking Vocal Tract Resonances Using a Continuous-Valued Hidden Dynamic Model.
13-23
- Ben Milner, Xu Shao:
Prediction of Fundamental Frequency and Voicing From Mel-Frequency Cepstral Coefficients for Unconstrained Speech Reconstruction.
24-33
- Patrick A. Naylor, Anastasis Kounoudes, Jon Gudnason, Mike Brookes:
Estimation of Glottal Closure Instants in Voiced Speech Using the DYPSA Algorithm.
34-43
- Farshad Lahouti, Amir K. Khandani:
Soft Reconstruction of Speech in the Presence of Noise and Packet Loss.
44-56
- Sean A. Ramprashad:
Sparse Bit-Allocations Based on Partial Ordering Schemes With Application to Speech and Audio Coding.
57-69
- Taesu Kim, Hagai Thomas Attias, Soo-Young Lee, Te-Won Lee:
Blind Source Separation Exploiting Higher-Order Frequency Dependencies.
70-79
- Tomohiro Nakatani, Keisuke Kinoshita, Masato Miyoshi:
Harmonicity-Based Blind Dereverberation for Single-Channel Speech Signals.
80-95
- Bertrand Rivet, Laurent Girin, Christian Jutten:
Mixing Audiovisual Speech Processing and Blind Source Separation for the Extraction of Speech Signals From Convolutive Mixtures.
96-108
- Guangji Shi, Parham Aarabi, Hui Jiang:
Phase-Based Dual-Microphone Speech Enhancement Using A Prior Speech Model.
109-118
- Gwo-Hwa Ju, Lin-Shan Lee:
A Perceptually Constrained GSVD-Based Approach for Enhancing Speech Corrupted by Colored Noise.
119-134
- Steven J. Rennie, Parham Aarabi, Brendan J. Frey:
Variational Probabilistic Speech Separation Using Microphone Arrays.
135-149
- Ian R. Lane, Tatsuya Kawahara, Tomoko Matsui, Satoshi Nakamura:
Out-of-Domain Utterance Detection Using Classification Confidences of Multiple Topics.
150-161
- Christian Raymond, Frédéric Béchet, Nathalie Camelin, Renato de Mori, Géraldine Damnati:
Sequential Decision Strategies for Machine Interpretation of Speech.
162-171
- Scott Axelrod, Vaibhava Goel, Ramesh A. Gopinath, Peder Olsen, Karthik Visweswariah:
Discriminative Estimation of Subspace Constrained Gaussian Mixture Models for Speech Recognition.
172-189
- Rajesh M. Hegde, Hema A. Murthy, Venkata Ramana Rao Gadde:
Significance of the Modified Group Delay Feature in Speech Recognition.
190-202
- Erik McDermott, Timothy J. Hazen, Jonathan Le Roux, Atsushi Nakamura, Shigeru Katagiri:
Discriminative Training for Large-Vocabulary Speech Recognition Using Minimum Classification Error.
203-223
- Satya Dharanipragada, Umit H. Yapanel, Bhaskar D. Rao:
Robust Feature Extraction for Continuous Speech Recognition Using the MVDR Spectrum Estimation Method.
224-234
- Michael L. Seltzer, Alex Acero:
Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition.
235-245
- Joe Frankel, Simon King:
Speech Recognition Using Linear Dynamic Models.
246-256
- Chia-Ping Chen, Jeff A. Bilmes:
MVA Processing of Speech Features.
257-270
- Haizhou Li, Bin Ma, Chin-Hui Lee:
A Vector Space Modeling Approach to Spoken Language Identification.
271-284
- Peter Day, Asoke K. Nandi:
Robust Text-Independent Speaker Verification Using Genetic Programming.
285-295
- Youngim Jung, Ae-sun Yoon, Hyuk-Chul Kwon:
Grapheme-to-Phoneme Conversion of Arabic Numeral Expressions for Embedded TTS Systems.
296-309
- Jan H. Plasberg, W. Bastiaan Kleijn:
The Sensitivity Matrix: Using Advanced Auditory Models in Speech and Audio Processing.
310-319
- Ixone Arroabarren, Alfonso Carlosena:
Voice Production Mechanisms of Vocal Vibrato in Male Singers.
320-332
- Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno:
Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With Harmonic Structure Suppression.
333-345
- Kishan Thambiratnam, Sridha Sridharan:
Rapid Yet Accurate Speech Indexing Using Dynamic Match Lattice Spotting.
346-357
- Paris Smaragdis, Petros Boufounos:
Position and Trajectory Learning for Microphone Arrays.
358-368
Volume 15, Number 2, February 2007
- Y. Agiomyrgiannakis, Yannis Stylianou:
Conditional Vector Quantization for Speech Coding.
377-386
- Sorin Dusan, James L. Flanagan, A. Karve, M. Balaraman:
Speech Compression by Polynomial Approximation.
387-395
- G. Hu, D. Wang:
Auditory Segmentation Based on Onset and Offset Analysis.
396-405
- Richard C. Hendriks, Richard Heusdens, Jesper Jensen:
An MMSE Estimator for Speech Enhancement Under a Combined Stochastic-Deterministic Speech Model.
406-415
- Y. Nagata, T. Fujioka, M. Abe:
Two-Dimensional DOA Estimation of Sound Sources Based on Weighted Wiener Gain Exploiting Two-Directional Microphones.
416-429
- Marc Delcroix, Takafumi Hikichi, Masato Miyoshi:
Precise Dereverberation Using Multichannel Linear Prediction.
430-440
- S. Srinivasan, J. Samuelsson, W. Bastiaan Kleijn:
Codebook-Based Bayesian Speech Enhancement for Nonstationary Environments.
441-452
- R. Huang, John H. L. Hansen, P. Angkititrakul:
Dialect/Accent Classification Using Unrestricted Audio.
453-464
- M. Akbacak, John H. L. Hansen:
Environmental Sniffing: Noise Knowledge Estimation for Robust Speech Systems.
465-477
- J. Wu, Q. Huo:
A Study of Minimum Classification Error (MCE) Linear Regression for Supervised Adaptation of MCE-Trained Continuous-Density Hidden Markov Models.
478-488
- P. D. Teal:
Tracking Wide-Band Targets Having Significant Doppler Shift.
489-497
- P. Angkititrakul, J. H. L. Hansen:
Discriminative In-Set/Out-of-Set Speaker Recognition.
498-508
- Darko Kirovski, Zeph Landau:
Generalized Lempel-Ziv Compression for Audio.
509-518
- T. L. Nwe, H. Li:
Exploring Vibrato-Motivated Acoustic Features for Singer Identification.
519-530
- N. Laurenti, G. De Poli, D. Montagner:
A Nonlinear Method for Stochastic Spectrum Estimation in the Modeling of Musical Sounds.
531-541
- Sunil Bharitkar, Chris Kyriakakis:
Visualization of Multiple Listener Room Acoustic Equalization With the Sammon Map.
542-551
- D. T. Murphy, M. Beeson:
The KW-Boundary Hybrid Digital Waveguide Mesh for Room Acoustics Applications.
552-564
- Ramani Duraiswami, Dmitry N. Zotkin, Nail A. Gumerov:
Fast Evaluation of the Room Transfer Function Using Multipole Expansion.
565-576
- J. Mullen, D. M. Howard, D. T. Murphy:
Real-Time Dynamic Articulations in the 2-D Waveguide Mesh Vocal Tract Model.
577-585
- X. Sun, S. M. Kuo:
Active Narrowband Noise Control Systems Using Cascading Adaptive Filters.
586-592
- Muhammad Tahir Akhtar, Masahide Abe, Masayuki Kawamata:
On Active Noise Control Systems With Online Acoustic Feedback Path Modeling.
593-600
- Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez, Iain McCowan:
Audiovisual Probabilistic Tracking of Multiple Speakers in Meetings.
601-616
- Simon Doclo, Marc Moonen:
Superdirective Beamforming Robust Against Microphone Mismatch.
617-631
- C.-H. Lee, S.-K. Jung, H.-G. Kang:
Applying a Speaker-Dependent Speech Compression Technique to Concatenative TTS Synthesizers.
632-640
- K.-S. Lee:
Statistical Approach for Voice Personality Transformation.
641-651
- Xiaodong Cui, Abeer Alwan:
Robust Speaker Adaptation by Weighted Model Averaging Based on the Minimum Description Length Criterion.
652-660
- M.-Y. Tsai, F.-C. Chou, L.-S. Lee:
Pronunciation Modeling With Reduced Confusion for Mandarin Chinese Using a Three-Stage Framework.
661-675
- Qin Yan, Saeed Vaseghi, D. Rentzos, C.-H. Ho:
Analysis and Synthesis of Formant Spaces of British, Australian, and American Accents.
676-689
- D. Wang, S. Narayanan:
An Acoustic Measure for Word Prominence in Spontaneous Speech.
690-701
- Zhiyun Li, Ramani Duraiswami:
Flexible and Optimal Design of Spherical Microphone Arrays for Beamforming.
702-714
- M. Knaak, Shoko Araki, Shoji Makino:
Geometrically Constrained Independent Component Analysis.
715-726
- I. Balmages, Boaz Rafaely:
Open-Sphere Designs for Spherical Microphone Arrays.
727-732
- Peter Jancovic:
Fast Algorithm for Calculation of the Union-Based Probability.
732-734
- Y.-I. Kim, R. M. Kil:
Estimation of Interaural Time Differences Based on Zero-Crossings in Noisy Multisource Environments.
734-743
Volume 15, Number 3, March 2007
- Pradeepa Yahampath, Paul Rondeau:
Multiple-Description Predictive-Vector Quantization With Applications to Low Bit-Rate Speech Coding Over Networks.
749-755
- Ethan R. Duni, Bhaskar D. Rao:
High-Rate Optimized Recursive Vector Quantization Structures Using Hidden Markov Models.
756-769
- Ethan R. Duni, Bhaskar D. Rao:
A High-Rate Optimal Transform Coder With Gaussian Mixture Companders.
770-783
- Brian Kan-Wing Mak, Roger Wend-Huu Hsiao:
Kernel Eigenspace-Based MLLR Adaptation.
784-795
- Bertrand Rivet, Laurent Girin, Christian Jutten:
Log-Rayleigh Distribution: A Simple and Efficient Statistical Representation of Log-Spectral Coefficients.
796-802
- Patricia Scanlon, Daniel P. W. Ellis, Richard B. Reilly:
Using Broad Phonetic Group Experts for Improved Speech Recognition.
803-812
- Barbara Resch, Mattias Nilsson, Anders Ekman, W. Bastiaan Kleijn:
Estimation of the Instantaneous Pitch of Speech.
813-822
- Francesco Gianfelici, Giorgio Biagetti, Paolo Crippa, Claudio Turchetti:
Multicomponent AM-FM Representations: An Asymptotically Exact Approach.
823-837
- Dima Ruinskiy, Y. Lavner:
An Effective Algorithm for Automatic Detection and Exact Demarcation of Breath Sounds in Speech and Song Signals.
838-850
- Laurent Girin, Mohammad Firouzmand, Sylvain Marchand:
Perceptual Long-Term Variable-Rate Sinusoidal Modeling of Speech.
851-861
- Jesper Jensen, Richard Heusdens:
Improved Subspace-Based Single-Channel Speech Enhancement Using Generalized Super-Gaussian Priors.
862-872
- Juho Kontio, Laura Laaksonen, Paavo Alku:
Neural Network-Based Artificial Bandwidth Expansion of Speech.
873-881
- David Y. Zhao, W. Bastiaan Kleijn:
HMM-Based Gain Modeling for Enhancement of Speech in Noise.
882-892
- M. Khademul Islam Molla, Keikichi Hirose:
Single-Mixture Audio Source Separation by Subspace Decomposition of Hilbert Spectrum.
893-900
- Karsten Vandborg Sorensen, Søren Vang Andersen:
Rayleigh Mixture Model-Based Hidden Markov Modeling and Estimation of Noise in Noisy Speech Signals.
901-917
- Richard C. Hendriks, Rainer Martin:
MAP Estimators for Speech Enhancement Under Normal and Rayleigh Inverse Gaussian Distributions.
918-927
- Nikos Chatzichrisafis, Vassilios Diakoloukas, Vassilios Digalakis, Costas Harizakis:
Gaussian Mixture Clustering and Language Adaptation for the Development of a New Language Speech Recognition System.
928-938
- Ghinwa F. Choueiter, James R. Glass:
An Implementation of Rational Wavelets and Filter Design for Phonetic Classification.
939-948
- Esther Klabbers, Jan P. H. van Santen, Alexander Kain:
The Contribution of Various Sources of Spectral Mismatch to Audible Discontinuities in a Diphone Database.
949-956
- Jerome R. Bellegarda:
Globally Optimal Training of Unit Boundaries in Unit Selection Text-to-Speech Synthesis.
957-965
- Pim Korten, Jesper Jensen, Richard Heusdens:
High-Resolution Spherical Quantization of Sinusoidal Parameters.
966-981
- Hirokazu Kameoka, Takuya Nishimoto, Shigeki Sagayama:
A Multipitch Analyzer Based on Harmonic Temporal Structured Clustering.
982-994
- Johannes Nix, Volker Hohmann:
Combined Estimation of Spectral Envelopes and Sound Source Direction of Concurrent Voices by Multidimensional Statistical Filtering.
995-1008
- Matthew E. P. Davies, Mark D. Plumbley:
Context-Dependent Beat Tracking of Musical Audio.
1009-1020
- Leevi Peltola, Cumhur Erkut, Perry R. Cook, Vesa Välimäki:
Synthesis of Hand Clapping Sounds.
1021-1029
- Jean-Marc Valin:
On Adjusting the Learning Rate in Frequency Domain Echo Cancellation With Double-Talk.
1030-1034
- James D. Gordy, Rafik A. Goubran:
Statistical Analysis of Doubletalk Detection for Calibration and Performance Evaluation.
1035-1043
- Felix Albu, Martin Bouchard, Yuriy V. Zakharov:
Pseudo-Affine Projection Algorithms for Multichannel Active Noise Control.
1044-1052
- Jacob Benesty, Jingdong Chen, Yiteng Huang, Jacek Dmochowski:
On Microphone-Array Beamforming From a MIMO Acoustic Signal Processing Perspective.
1053-1065
- Tuomas Virtanen:
Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria.
1066-1074
- Carlos Busso, Zhigang Deng, Michael Grimm, Ulrich Neumann, Shrikanth Narayanan:
Rigid Head Motion in Expressive Speech Animation: Analysis and Synthesis.
1075-1086
- Chen Yang, Frank K. Soong, Tan Lee:
Static and Dynamic Spectral Features: Their Noise Robustness and Optimal Weights for ASR.
1087-1097
- Luis Buera, Eduardo Lleida, A. Miguel, A. Ortega, O. Saz:
Cepstral Vector Normalization Based on Stereo Data for Robust Speech Recognition.
1098-1113
- Xianyu Zhao, Zhijian Ou:
Closely Coupled Array Processing and Model-Based Compensation for Microphone Array Speech Recognition.
1114-1122
Volume 15, Number 4, May 2007
- Rasool Tahmasbi, Sadegh Rezaei:
A Soft Voice Activity Detection Using GARCH Filter and Variance Gamma Distribution.
1129-1134
- Jonathan Le Roux, Hirokazu Kameoka, Nobutaka Ono, Alain de Cheveigné, Shigeki Sagayama:
Single and Multiple F0 Contour Estimation Through Parametric Spectrogram Modeling of Speech in Noisy Environments.
1135-1145
- Thomas Eriksson, Frank Norden:
Memory-Based Vector Quantization of LSF Parameters by a Power Series Approximation.
1146-1155
- Bengt J. Borgstrom, Mihaela van der Schaar, A. Alwan:
Rate Allocation for Noncollaborative Multiuser Speech Communication Systems Based on Bargaining Theory.
1156-1166
- M. Jelinek, R. Salami:
Wideband Speech Coding Advances in VMR-WB Standard.
1167-1179
- Athanasios Mouchtaris, Jan Van der Spiegel, Paul Mueller, Panagiotis Tsakalides:
A Spectral Conversion Approach to Single-Channel Speech Enhancement.
1180-1193
- Esfandiar Zavarehei, Saeed Vaseghi, Qin Yan:
Noisy Speech Enhancement Using Harmonic-Noise Model and Codebook-Based Post-Processing.
1194-1203
- Xuechuan Wang, D. O'Shaughnessy:
Environmental Independent ASR Model Adaptation/Compensation by Bayesian Parametric Representation.
1204-1217
- Peter Birkholz, D. Jackel, Bernd J. Kröger:
Simulation of Losses Due to Turbulence in the Time-Varying Vocal System.
1218-1226
- Chung-Hsien Wu, Chi-Chun Hsia, Jiun-Fu Chen, Jhing-Fa Wang:
Variable-Length Unit Selection in TTS Using Structural Syntactic Cost.
1227-1235
- Karthikeyan Umapathy, Sridhar Krishnan, R. K. Rao:
Audio Signal Feature Extraction and Classification Using Local Discriminant Bases.
1236-1246
- Graham E. Poliner, Daniel P. W. Ellis, A. F. Ehmann, E. Gomez, S. Streich, Beesuan Ong:
Melody Transcription From Music Audio: Approaches and Evaluation.
1247-1256
- Harvey D. Thornburg, Randal J. Leistikow, J. Berger:
Melody Extraction and Musical Onset Detection via Probabilistic Models of Framewise STFT Peak Data.
1257-1272
- E. Vincent, M. D. Plumbley:
Low Bit-Rate Object Coding of Musical Audio Using Bayesian Harmonic Models.
1273-1282
- C. Dubois, M. Davy:
Joint Detection and Tracking of Time-Varying Harmonic Components: A Flexible Bayesian Approach.
1283-1295
- H. M. A. Malik, Rashid Ansari, Ashfaq A. Khokhar:
Robust Data Hiding in Audio Using Allpass Filters.
1296-1304
- Y. Avargel, I. Cohen:
System Identification in the Short-Time Fourier Transform Domain With Crossband Filtering.
1305-1319
- Fredric Lindström, Christian Schüldt, Ingvar Claesson:
An Improvement of the Two-Path Algorithm Transfer Logic for Acoustic Echo Cancellation.
1320-1326
- Jacek Dmochowski, Jacob Benesty, Sofiène Affes:
Direction of Arrival Estimation Using the Parameterized Spatial Correlation Matrix.
1327-1339
- Wolfgang Herbordt, Herbert Buchner, S. Nakamura, Walter Kellermann:
Multichannel Bin-Wise Robust Frequency-Domain Adaptive Filtering and Its Application to Adaptive Beamforming.
1340-1351
- Takaaki Hori, C. Hori, Yasuhiro Minami, Atsushi Nakamura:
Efficient WFST-Based One-Pass Decoding With On-The-Fly Hypothesis Rescoring in Extremely Large Vocabulary Continuous Speech Recognition.
1352-1365
- Xiaodong Cui, Yifan Gong:
A Study of Variable-Parameter Gaussian Mixture Hidden Markov Modeling for Noisy Speech Recognition.
1366-1376
- M. De Wachter, M. Matton, Kris Demuynck, Patrick Wambacq, R. Cools, Dirk Van Compernolle:
Template-Based Continuous Speech Recognition.
1377-1390
- Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Exploiting Temporal Correlation of Speech for Error Robust and Bandwidth Flexible Distributed Speech Recognition.
1391-1403
- Paris Smaragdis, Madhusudana V. S. Shashanka:
A Framework for Secure Speech Recognition.
1404-1413
- Xunying Liu, M. Gales:
Automatic Model Complexity Control Using Marginalized Discriminative Growth Functions.
1414-1424
- Y. Han, Johan de Veth, Lou Boves:
Trajectory Clustering for Solving the Trajectory Folding Problem in Automatic Speech Recognition.
1425-1434
- P. Kenny, Gilles Boulianne, P. Ouellet, Pierre Dumouchel:
Joint Factor Analysis Versus Eigenchannels in Speaker Recognition.
1435-1447
- P. Kenny, Gilles Boulianne, P. Ouellet, Pierre Dumouchel:
Speaker and Session Variability in GMM-Based Speaker Verification.
1448-1460
- Wei-Ho Tsai, Shih-Sian Cheng, Hsin-Min Wang:
Automatic Speaker Clustering Using a Voice Characteristic Reference Space and Maximum Purity Estimation.
1461-1474
- Yipeng Li, DeLiang Wang:
Separation of Singing Voice From Music Accompaniment for Monaural Recordings.
1475-1487
- S. Bilbao, L. Savioja, J. O. Smith:
Parameterized Finite Difference Schemes for Plates: Stability, the Reduction of Directional Dispersion and Frequency Warping.
1488-1495
- Angel M. Gomez, Antonio M. Peinado, V. Sanchez, Antonio J. Rubio:
On the Ramsey Class of Interleavers for Robust Speech Recognition in Burst-Like Packet Loss.
1496-1499
Volume 15, Number 5, July 2007
- Scott C. Douglas, M. Gupta, H. Sawada, Shoji Makino:
Spatio-Temporal FastICA Algorithms for the Blind Separation of Convolutive Mixtures.
1511-1520
- Intae Lee, Te-Won Lee:
On the Assumption of Spherical Symmetry and Sparseness for the Frequency-Domain Speech Model.
1521-1528
- E. Warsitz, M. R. Haeb-Umbach:
Blind Acoustic Beamforming Based on Generalized Eigenvalue Decomposition.
1529-1539
- Abdeldjalil Aïssa-El-Bey, Karim Abed-Meraim, Yves Grenier:
Blind Separation of Underdetermined Convolutive Mixtures Using Their Time-Frequency Representation.
1540-1550
- Zhaoshui He, Shengli Xie, Shuxue Ding, Andrzej Cichocki:
Convolutive Blind Source Separation in the Frequency Domain Based on Sparse Representation.
1551-1563
- Alexey Ozerov, P. Philippe, Frédéric Bimbot, Rémi Gribonval:
Adaptation of Bayesian Models for Single-Channel Source Separation and its Application to Voice/Music Separation in Popular Songs.
1564-1578
- Ken'ichi Furuya, Akitoshi Kataoka:
Robust Speech Dereverberation Using Multichannel Blind Deconvolution With Spectral Subtraction.
1579-1591
- Hiroshi Sawada, Shoko Araki, Ryo Mukai, Shoji Makino:
Grouping Separated Frequency Components by Estimating Propagation Model Parameters in Frequency-Domain Blind Source Separation.
1592-1604
- Oscal T.-C. Chen, Chia-Hsiung Liu:
Content-Dependent Watermarking Scheme in Compressed Speech With Identifying Manner and Location of Attacks.
1605-1616
- Vesa Siivola, Teemu Hirsimäki, Sami Virpioja:
On Growing and Pruning Kneser-Ney Smoothed N-Gram Models.
1617-1624
- M. Lagrange, S. Marchand, J.-B. Rault:
Enhancing the Tracking of Partials for the Sinusoidal Modeling of Polyphonic Sounds.
1625-1634
- Mads Græsbøll Christensen, Andreas Jakobsson, Søren Holdt Jensen:
Joint High-Resolution Fundamental Frequency and Order Estimation.
1635-1644
- Xinglei Zhu, Gerald Beauregard, Lonce L. Wyse:
Real-Time Signal Estimation From Modified Short-Time Fourier Transform Magnitude Spectra.
1645-1653
- Anders Meng, P. Ahrendt, Jan Larsen, Lars Kai Hansen:
Temporal Feature Integration for Music Genre Classification.
1654-1664
- Masahiro Yukawa, Konstantinos Slavakis, Isao Yamada:
Adaptive Parallel Quadratic-Metric Projection Algorithms.
1665-1680
- A. W. H. Khong, Patrick A. Naylor:
Selective-Tap Adaptive Filtering With Performance Analysis for Identification of Time-Varying Systems.
1681-1695
- Guillaume Lathoud, Jean-Marc Odobez:
Short-Term Spatio-Temporal Clustering Applied to Multiple Moving Speakers.
1696-1710
- Ji Ming, Timothy J. Hazen, James R. Glass, Douglas A. Reynolds:
Robust Speaker Recognition in Noisy Conditions.
1711-1723
- Mark D. Skowronski, John G. Harris:
Noise-Robust Automatic Speech Recognition Using a Predictive Echo State Network.
1724-1730
- M. Afify, O. Siohan:
Comments on Vocal Tract Length Normalization Equals Linear Transformation in Cepstral Space.
1731-1732
Volume 15, Number 6, August 2007
- Jan S. Erkelens, Richard C. Hendriks, Richard Heusdens, Jesper Jensen:
Minimum Mean-Square Error Estimation of Discrete Fourier Coefficients With Generalized Gamma Priors.
1741-1752
- Chang Huai You, Susanto Rahardja, Soo Ngee Koh:
Audible Noise Reduction in Eigendomain for Speech Enhancement.
1753-1765
- A. M. Reddy, B. Raj:
Soft Mask Methods for Single-Channel Speaker Separation.
1766-1776
- Ann Spriet, Geert Rombouts, Marc Moonen, Jan Wouters:
Combined Feedback and Noise Suppression in Hearing Aids.
1777-1790
- Marc Delcroix, Takafumi Hikichi, Masato Miyoshi:
Dereverberation and Denoising Using Multichannel Linear Prediction.
1791-1801
- Woojay Jeon, B.-H. Juang:
Speech Analysis in a Model of the Central Auditory System.
1802-1817
- Nikolaos Mitianoudis, Tania Stathaki:
Batch and Online Underdetermined Source Separation Using Laplacian Mixture Models.
1818-1832
- Maurizio Mancini, Roberto Bresin, Catherine Pelachaud:
A Virtual Head Driven by Music Expressivity.
1833-1841
- Shantanu Chakrabartty, Yunbin Deng, Gert Cauwenberghs:
Robust Speech Feature Extraction by Growth Transformation in Reproducing Kernel Hilbert Space.
1842-1849
- Bertrand Mesot, David Barber:
Switching Linear Dynamical Systems for Noise Robust Speech Recognition.
1850-1858
- Amit S. Malegaonkar, Aladdin M. Ariyaeeinia, P. Sivakumaran:
Efficient Speaker Change Detection Using Adapted Gaussian Mixture Models.
1859-1869
- Yuan-Fu Liao, Zi-He Chen, Yau-Tarng Juang:
Latent Prosody Analysis for Robust Speaker Identification.
1870-1883
- Wai Nang Chan, Nengheng Zheng, Tan Lee:
Discrimination Power of Vocal Source and Vocal Tract Related Features for Speaker Segmentation.
1884-1892
- Wei Wu, Thomas Fang Zheng, Ming-Xing Xu, Frank K. Soong:
A Cohort-Based Speaker Model Synthesis for Mismatched Channels in Speaker Verification.
1893-1903
- Jean-Luc Rouas:
Automatic Prosodic Variations Modeling for Language and Dialect Discrimination.
1904-1911
- P. Taraba:
Kneser-Ney Smoothing With a Correcting Transformation for Small Data Sets.
1912-1921
- Darko Kirovski, Fabien A. P. Petitcolas, Zeph Landau:
The Replacement Attack.
1922-1931
- Kai Yu, M. J. F. Gales:
Bayesian Adaptive Inference and Adaptive Training.
1932-1943
Volume 15, Number 7, September 2007
- Mark A. Przybocki, Alvin F. Martin, A. N. Le:
NIST Speaker Recognition Evaluations Utilizing the Mixer Corpora - 2004, 2005, 2006.
1951-1959
- B. G. B. Fauve, D. Matrouf, N. Scheffer, Jean-François Bonastre, John S. D. Mason:
State-of-the-Art Performance in Text-Independent Speaker Verification Through Open-Source Software.
1960-1968
- F. Castaldo, D. Colibro, E. Dalmasso, Pietro Laface, C. Vair:
Compensation of Nuisance Factors for Speaker and Language Recognition.
1969-1978
- Lukas Burget, Pavel Matejka, Petr Schwarz, Ondrej Glembek, Jan Cernocký:
Analysis of Feature Extraction and Channel Compensation in a GMM Speaker Recognition System.
1979-1986
- Andreas Stolcke, Sachin S. Kajarekar, Luciana Ferrer, E. Shriberg:
Speaker Recognition With Session Variability Normalization Based on MLLR Adaptation Transforms.
1987-1998
- Shou-Chun Yin, R. Rose, P. Kenny:
A Joint Factor Analysis Approach to Progressive Model Adaptation in Text-Independent Speaker Verification.
1999-2010
- Xavier Anguera, Chuck Wooters, Javier Hernando:
Acoustic Beamforming for Speaker Diarization of Meetings.
2011-2022
- Qin Jin, Tanja Schultz, Alex Waibel:
Far-Field Speaker Recognition.
2023-2032
- Hagai Aronowitz, David Burshtein:
Efficient Speaker Recognition Using Approximated Cross Entropy (ACE).
2033-2043
- V. Prakash, John H. L. Hansen:
In-Set/Out-of-Set Speaker Recognition Under Sparse Enrollment.
2044-2052
- Bin Ma, Haizhou Li, Rong Tong:
Spoken Language Recognition Using Ensemble Classifiers.
2053-2062
- Yosef A. Solewicz, Moshe Koppel:
Using Post-Classifiers to Enhance Fusion of Low- and High-Level Speaker Recognition.
2063-2071
- N. Brummer, Lukas Burget, Jan Cernocký, Ondrej Glembek, Frantisek Grézl, Martin Karafiát, D. A. van Leeuwen, Pavel Matejka, Petr Schwarz, A. Strasheim:
Fusion of Heterogeneous Speaker Recognition Systems in the STBU Submission for the NIST Speaker Recognition Evaluation 2006.
2072-2084
- William M. Campbell, Joseph P. Campbell, T. P. Gleason, Douglas A. Reynolds, Wade Shen:
Speaker Verification Using Support Vector Machines and High-Level Features.
2085-2094
- N. Dehak, Pierre Dumouchel, P. Kenny:
Modeling Prosodic Features With Joint Factor Analysis for Speaker Verification.
2095-2103
- Joaquin Gonzalez-Rodriguez, P. Rose, Daniel Ramos, Doroteo Torre Toledano, Javier Ortega-Garcia:
Emulating DNA: Rigorous Quantification of Evidential Weight in Transparent and Testable Forensic Speaker Recognition.
2104-2115
- J. D. Williams, S. Young:
Scaling POMDPs for Spoken Dialog Management.
2116-2129
- S. Srinivasan, DeLiang Wang:
Transforming Binary Uncertainties for Robust Speech Recognition.
2130-2140
- J. Usher, Jacob Benesty:
Enhancement of Spatial Sound Quality: A New Reverberation-Extraction Audio Upmixer.
2141-2150
- Cheng-Yuan Lin, Jyh-Shing Roger Jang:
Automatic Phonetic Segmentation by Score Predictive Model for the Corpora of Mandarin Singing Voices.
2151-2159
- Rusheng Hu, Yunxin Zhao:
Knowledge-Based Adaptive Decision Tree State Tying for Conversational Speech Recognition.
2160-2168
Volume 15,
Number 8,
November 2007
- Javier Ramírez, José C. Segura, Juan Manuel Górriz, L. Garcia:
Improved Voice Activity Detection Using Contextual Multiple Hypothesis Testing for Robust Speech Recognition.
2177-2189
- Dagen Wang, S. S. Narayanan:
Robust Speech Rate Estimation for Spontaneous Speech.
2190-2201
- Seung Seop Park, Nam Soo Kim:
On Using Multiple Models for Automatic Speech Segmentation.
2202-2212
- Robert I. Damper, Tasanawan Soonklang:
Subjective Evaluation of Techniques for Proper Name Pronunciation.
2213-2221
- Tomoki Toda, Alan W. Black, Keiichi Tokuda:
Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory.
2222-2235
- Te Li, Susanto Rahardja, Rongshan Yu, Soo Ngee Koh:
On Integer MDCT for Perceptual Audio Coding.
2236-2248
- Enrique Alexandre, Lucas Cuadra, M. Rosa, Francisco López-Ferreras:
Feature Selection for Sound Classification in Hearing Aids Through Restricted Search Driven by Genetic Algorithms.
2249-2256
- Hari Krishna Maganti, Daniel Gatica-Perez, Iain McCowan:
Speech Enhancement and Recognition in Meetings With an Audio-Visual Sensor Array.
2257-2269
- Xiangyang Wang, Wei Qi, Panpan Niu:
A New Adaptive Digital Audio Watermarking Based on Support Vector Regression.
2270-2277
- L. S. Smith, S. Collins:
Determining ITDs Using Two Microphones on a Flat Panel During Onset Intervals With a Biologically Inspired Spike-Based Technique.
2278-2286
- H. I. K. Rao, V. J. Mathews, Young-Cheol Park:
A Minimax Approach for the Joint Design of Acoustic Crosstalk Cancellation Filters.
2287-2298
- Mohammad H. Radfar, Richard M. Dansereau:
Single-Channel Speech Separation Using Soft Mask Filtering.
2299-2310
- Jingyi Zhang, Wai Lok Woo, Satnam Singh Dlay:
Blind Source Separation of Postnonlinear Convolutive Mixture.
2311-2330
- C. Busso, S. S. Narayanan:
Interrelation Between Speech and Facial Gestures in Emotional Utterances: A Single Subject Study.
2331-2347
- A. Abramson, I. Cohen:
Simultaneous Detection and Estimation Approach for Speech Enhancement.
2348-2359
- Zohra Yermeche, Nedelko Grbic, Ingvar Claesson:
Blind Subband Beamforming With Time-Delay Constraints for Moving Source Speech Enhancement.
2360-2372
- Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer, D. Chazan:
A Large Margin Algorithm for Speech-to-Phoneme and Music-to-Score Alignment.
2373-2382
- Xinwei Li, Hui Jiang:
Solving Large-Margin Hidden Markov Model Estimation via Semidefinite Programming.
2383-2392
- Jinyu Li, Ming Yuan, Chin-Hui Lee:
Approximate Test Risk Bound Minimization Through Soft Margin Estimation.
2393-2404
- M. Afify, Xinwei Li, Hui Jiang:
Statistical Analysis of Minimum Classification Error Learning for Gaussian and Hidden Markov Model Classifiers.
2405-2417
- S. Umesh, Rohit Sinha:
A Study of Filter Bank Smoothing in MFCC Features for Recognition of Children's Speech.
2418-2430
- Haitian Xu, Paul Dalsgaard, Zheng-Hua Tan, Børge Lindberg:
Noise Condition-Dependent Training Based on Noise Classification and SNR Estimation.
2431-2443
- Rongqing Huang, John H. L. Hansen:
Unsupervised Discriminative Training With Application to Dialect Classification.
2444-2453
- Shizhen Wang, Xiaodong Cui, Abeer Alwan:
Speaker Adaptation With Limited Data Using Regression-Tree-Based Spectral Peak Alignment.
2454-2464
- J. Louradour, K. Daoudi, F. Bach:
Feature Space Mahalanobis Sequence Kernels: Application to SVM Speaker Verification.
2465-2475
- Minho Jin, F. K. Soong, Chang D. Yoo:
A Syllable Lattice Approach to Speaker Verification.
2476-2484
- M. Chibani, R. Lefebvre, P. Gournay:
Fast Recovery for a CELP-Like Speech Codec After a Frame Erasure.
2485-2495
- B. Geiser, Peter Jax, Peter Vary, H. Taddei, S. Schandl, M. Gartner, C. Guillaume, S. Ragot:
Bandwidth Extension for Hierarchical Speech and Audio Coding in ITU-T Rec. G.729.1.
2496-2509
- Jacek Dmochowski, Jacob Benesty, Sofiène Affes:
A Generalized Steered Response Power Method for Computationally Viable Source Localization.
2510-2526
- Ken'ichi Kumatani, Tobias Gehrig, Uwe Mayer, Emilian Stoimenov, John W. McDonough, Matthias Wölfel:
Adaptive Beamforming With a Minimum Mutual Information Criterion.
2527-2541
- K. C. Ho, Ming Sun:
An Accurate Algebraic Closed-Form Solution for Energy-Based Source Localization.
2542-2550
- Chien-Lin Huang, Chung-Hsien Wu:
Spoken Document Retrieval Using Multilevel Knowledge and Semantic Verification.
2551-2560
- Toon van Waterschoot, Marc Moonen:
A Pole-Zero Placement Technique for Designing Second-Order IIR Parametric Equalizer Filters.
2561-2565
Copyright © Sun May 17 00:22:55 2009
by Michael Ley (ley@uni-trier.de)