Volume 14,
Number 1,
January 2006
- Lie Lu, D. Liu, HongJiang Zhang:
Automatic mood detection and tracking of music audio signals.
5-18
Electronic Edition (link) BibTeX
- Ning Ma, Martin Bouchard, Rafik A. Goubran:
Speech enhancement using a masking threshold constrained Kalman filter and its heuristic implementations.
19-32
Electronic Edition (link) BibTeX
- James D. Gordy, Rafik A. Goubran:
On the perceptual performance limitations of echo cancellers in wideband telephony.
33-42
Electronic Edition (link) BibTeX
- Marcus Holmberg, David Gelbart, Werner Hemmert:
Automatic speech recognition with an adaptation model motivated by auditory processing.
43-49
Electronic Edition (link) BibTeX
- Thomas Blumensath, Mike E. Davies:
Sparse and shift-Invariant representations of music.
50-57
Electronic Edition (link) BibTeX
- Sue Harding, Jon P. Barker, Guy J. Brown:
Mask estimation for missing data speech recognition based on statistics of binaural interaction.
58-67
Electronic Edition (link) BibTeX
- Slim Essid, Gaël Richard, Bertrand David:
Instrument recognition in polyphonic music based on automatic taxonomies.
68-80
Electronic Edition (link) BibTeX
- Fabian Mörchen, Alfred Ultsch, Michael Thies, Ingo Lohken:
Modeling timbre distance with temporal statistics from polyphonic music.
81-90
Electronic Edition (link) BibTeX
- Emmanuel Vincent:
Musical source separation using time-frequency source priors.
91-98
Electronic Edition (link) BibTeX
- Mads Græsbøll Christensen, Søren Holdt Jensen:
On perceptual distortion minimization and nonlinear least-squares frequency estimation.
99-109
Electronic Edition (link) BibTeX
- Alberto Gonzalez, Maria de Diego, Miguel Ferrer, Gema Pinero:
Multichannel active noise equalization of interior noise.
110-122
Electronic Edition (link) BibTeX
- Y. Hinamoto, H. Sakai:
Analysis of the filtered-X LMS algorithm and a related new algorithm for active control of multitonal noise.
123-130
Electronic Edition (link) BibTeX
- Norman H. Adams, Mark A. Bartsch, Gregory H. Wakefield:
Note segmentation and quantization for music information retrieval.
131-141
Electronic Edition (link) BibTeX
- N. D. Cook, T. X. Fujisawa, K. Takami:
Evaluation of the affective valence of speech using pitch substructure.
142-151
Electronic Edition (link) BibTeX
- A. D. Subramaniam, W. R. Gardner, B. D. Rao:
Iterative joint source-channel decoding of speech spectrum parameters over an additive white Gaussian noise channel.
152-162
Electronic Edition (link) BibTeX
- S. Srinivasan, J. Samuelsson, W. Bastiaan Kleijn:
Codebook driven short-term predictor parameter estimation for speech enhancement.
163-176
Electronic Edition (link) BibTeX
- Y. Nagata, T. Fujioka, M. Abe:
Speech enhancement based on auto gain control.
177-190
Electronic Edition (link) BibTeX
- Laurent Benaroya, Frédéric Bimbot, Rémi Gribonval:
Audio source separation with a single sensor.
191-199
Electronic Edition (link) BibTeX
- Kostas Kokkinakis, Asoke K. Nandi:
Multichannel blind deconvolution for source separation in convolutive mixtures of speech.
200-212
Electronic Edition (link) BibTeX
- N. Gupta, Gökhan Tür, Dilek Hakkani-Tür, Srinivas Bangalore, Giuseppe Riccardi, Mazin Gilbert:
The AT&T spoken language understanding system.
213-222
Electronic Edition (link) BibTeX
- B. Milner, A. James:
Robust speech recognition over mobile and IP networks in burst-like packet loss.
223-231
Electronic Edition (link) BibTeX
- Ken Chen, Mark Hasegawa-Johnson, Aaron Cohen, Sarah Borys, Sung-Suk Kim, Jennifer Cole, Jeung-Yoon Choi:
Prosody dependent speech recognition on radio news corpus of American English.
232-245
Electronic Edition (link) BibTeX
- Néstor Becerra Yoma, Carlos Molina, J. Silva, Carlos Busso:
Modeling, estimating, and compensating low-bit rate coding distortion in speech recognition.
246-255
Electronic Edition (link) BibTeX
- Li Deng, Dong Yu, Alex Acero:
A bidirectional target-filtering model of speech coarticulation and reduction: two-stage implementation for phonetic recognition.
256-265
Electronic Edition (link) BibTeX
- Chung-Hsien Wu, Yu-Hsien Chiu, Chi-Jiun Shia, Chun-Yu Lin:
Automatic segmentation and identification of mixed-language speech using delta-BIC and LSA-based GMMs.
266-276
Electronic Edition (link) BibTeX
- Tomi Kinnunen, Evgeny Karpov, Pasi Fränti:
Real-time speaker identification and verification.
277-288
Electronic Edition (link) BibTeX
- Yang Shao, DeLiang Wang:
Model-based sequential organization in cochannel speech.
289-298
Electronic Edition (link) BibTeX
- C. Faller:
Parametric multichannel audio coding: synthesis of coherence cues.
299-310
Electronic Edition (link) BibTeX
- Renat Vafin, W. Bastiaan Kleijn:
Rate-distortion optimized quantization in multistage audio coding.
311-320
Electronic Edition (link) BibTeX
- Antti J. Eronen, V. T. Peltonen, J. T. Tuomi, Anssi Klapuri, S. Fagerlund, T. Sorsa, G. Lorho, Jyri Huopaniemi:
Audio-based context recognition.
321-329
Electronic Edition (link) BibTeX
- Wei-Ho Tsai, Hsin-Min Wang:
Automatic singer recognition of popular music recordings via estimation and modeling of solo vocal signals.
330-341
Electronic Edition (link) BibTeX
- Anssi Klapuri, Antti J. Eronen, Jaakko Astola:
Analysis of the meter of acoustic musical signals.
342-355
Electronic Edition (link) BibTeX
- V. Goel, S. Kumar, W. Byrne:
Corrections to "Segmental minimum Bayes-risk decoding for automatic speech recognition".
356-357
Electronic Edition (link) BibTeX
Volume 14,
Number 2,
March 2006
- Satoshi Nakamura, Konstantin Markov, Hiromi Nakaiwa, Gen-ichiro Kikui, H. Kawai, Takatoshi Jitsuhiro, Jin-Song Zhang, H. Yamamoto, Eiichiro Sumita, Seiichi Yamamoto:
The ATR multilingual speech-to-speech translation system.
365-376
Electronic Edition (link) BibTeX
- Liang Gu, Yuqing Gao, Fu-Hua Liu, Michael Picheny:
Concept-based speech-to-speech translation using maximum entropy models for statistical natural concept generation.
377-392
Electronic Edition (link) BibTeX
- Yasuhiro Akiba, Kenji Imamura, Eiichiro Sumita, Hiromi Nakaiwa, Shun'ichi Yamamoto, Hiroshi G. Okuno:
Using multiple edit distances to automatically grade outputs from Machine translation systems.
393-402
Electronic Edition (link) BibTeX
- Tanja Schultz, Alan W. Black, Stephan Vogel, Monika Woszczyna:
Flexible speech translation systems.
403-411
Electronic Edition (link) BibTeX
- A. Davis, Sven Nordholm, Roberto Togneri:
Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold.
412-424
Electronic Edition (link) BibTeX
- Li Deng, Alex Acero, I. Bazzi:
Tracking vocal tract resonances using a quantized nonlinear function embeddedin a temporal constraint.
425-434
Electronic Edition (link) BibTeX
- K. Mustafa, Ian C. Bruce:
Robust formant tracking for continuous speech with speaker variability.
435-444
Electronic Edition (link) BibTeX
- Huiqun Deng, R. K. Ward, M. P. Beddoes, M. Hodgson:
A new method for obtaining accurate estimates of vocal-tract filters and glottal waves from vowel sounds.
445-455
Electronic Edition (link) BibTeX
- Mike Brookes, Patrick A. Naylor, Jon Gudnason:
A quantitative assessment of group delay methods for identifying glottal closures in voiced speech.
456-466
Electronic Edition (link) BibTeX
- R. D. Zilca, Brian Kingsbury, Jiri Navratil, G. N. Ramaswamy:
Pseudo pitch synchronous analysis of speech with applications to speaker recognition.
467-478
Electronic Edition (link) BibTeX
- S. Gazor, R. R. Far:
Adaptive maximum windowed likelihood multicomponent AM-FM signal decomposition.
479-491
Electronic Edition (link) BibTeX
- Qiang Fu, P. Murphy:
Robust glottal source estimation based on joint source-filter model optimization.
492-501
Electronic Edition (link) BibTeX
- E. Fisher, J. Tabrikian, S. Dubnov:
Generalized likelihood ratio test for voiced-unvoiced decision in noisy speech using the harmonic model.
502-510
Electronic Edition (link) BibTeX
- Doroteo Torre Toledano, J. Gó Villardebo, Luis Hernández Gómez:
Initialization, training, and context-dependency in HMM-based formant tracking.
511-523
Electronic Edition (link) BibTeX
- A. D. Subramaniam, W. R. Gardner, B. D. Rao:
Low-complexity source coding using Gaussian mixture models, lattice vector quantization, and recursive coding with application to speech spectrum quantization.
524-532
Electronic Edition (link) BibTeX
- T. F. Quatieri, K. Brady, D. Messing, J. P. Campbell, W. M. Campbell, M. S. Brandstein, C. J. Weinstein, J. D. Tardelli, P. D. Gatewood:
Exploiting nonacoustic sensors for speech encoding.
533-544
Electronic Edition (link) BibTeX
- Hui Dong, Jerry D. Gibson:
Structures for SNR scalable speech coding.
545-557
Electronic Edition (link) BibTeX
- U. Bhaskar, K. Swaminathan:
Low bit-rate voice compression based on frequency domain interpolative techniques.
558-576
Electronic Edition (link) BibTeX
- H. Gustafsson, U. A. Lindgren, I. Claesson:
Low-complexity feature-mapped speech bandwidth extension.
577-588
Electronic Edition (link) BibTeX
- Olivier Pietquin, T. Dutoit:
A probabilistic framework for dialog simulation and optimal strategy learning.
589-599
Electronic Edition (link) BibTeX
- B. Gajic, K. K. Paliwal:
Robust speech recognition in noisy environments based on subband spectral centroid histograms.
600-608
Electronic Edition (link) BibTeX
- H. Najaf-Zadeh, P. Kabal:
Perceptual coding of narrow-band audio signals at low rates.
609-622
Electronic Edition (link) BibTeX
- Ashish Aggarwal, Shankar L. Regunathan, Kenneth Rose:
A trellis-based optimal parameter value selection for audio coding.
623-633
Electronic Edition (link) BibTeX
- P. Angkititrakul, John H. L. Hansen:
Advances in phone-based modeling for automatic accent classification.
634-646
Electronic Edition (link) BibTeX
- Chung-Hsien Wu, Chia-Hsin Hsieh:
Multiple change-point audio segmentation and classification using an MDL-based Gaussian model.
647-657
Electronic Edition (link) BibTeX
- Ngwa A. Shusina, Boaz Rafaely:
Unbiased adaptive feedback cancellation in hearing aids by closed-loop identification.
658-665
Electronic Edition (link) BibTeX
- Hiroshi Saruwatari, T. Kawamura, Tsuyoki Nishikawa, Akinobu Lee, Kiyohiro Shikano:
Blind source separation based on a fast-convergence algorithm combining ICA and beamforming.
666-678
Electronic Edition (link) BibTeX
- A. T. Cemgil, H. J. Kappen, D. Barber:
A generative model for music transcription.
679-694
Electronic Edition (link) BibTeX
- Mitsuko Aramaki, Richard Kronland-Martinet:
Analysis-synthesis of impact sounds by real-time dynamic filtering.
695-705
Electronic Edition (link) BibTeX
- Kelvin Chee-Mun Lee, Woon-Seng Gan:
Bandwidth-efficient recursive pth-order equalization for correcting baseband distortion in parametric loudspeakers.
706-710
Electronic Edition (link) BibTeX
- L. E. Rees, S. J. Elliott:
Adaptive algorithms for active sound-profiling.
711-719
Electronic Edition (link) BibTeX
- Muhammad Tahir Akhtar, Masahide Abe, Masayuki Kawamata:
A new variable step size LMS algorithm-based method for improved online secondary path modeling in active noise control systems.
720-726
Electronic Edition (link) BibTeX
- Thomas Hain, Philip C. Woodland, Gunnar Evermann, M. J. F. Gales, X. Liu, G. L. Moore, Daniel Povey, Lan Wang:
Corrections to "Automatic Transcription of Conversational Telephone Speech".
727-727
Electronic Edition (link) BibTeX
Volume 14,
Number 3,
May 2006
- S. Ramamohan, S. Dandapat:
Sinusoidal model-based analysis and classification of stressed speech.
737-746
Electronic Edition (link) BibTeX
- Joon-Hyuk Chang, Nam Soo Kim:
A new structural approach in system identification with generalized analysis-by-synthesis for robust speech coding.
747-751
Electronic Edition (link) BibTeX
- C. A. Rodbro, Jesper Jensen, Richard Heusdens:
Rate-distortion optimal time-segmentation and redundancy selection for VoIP.
752-763
Electronic Edition (link) BibTeX
- V. Grancharov, J. Samuelsson, W. Bastiaan Kleijn:
On causal algorithms for speech enhancement.
764-773
Electronic Edition (link) BibTeX
- Mingyang Wu, DeLiang Wang:
A two-stage algorithm for one-microphone reverberant speech enhancement.
774-784
Electronic Edition (link) BibTeX
- A. W. H. Khong, Patrick A. Naylor:
Stereophonic acoustic echo cancellation employing selective-tap adaptive algorithms.
785-796
Electronic Edition (link) BibTeX
- Jen-Tzung Chien, Chih-Hsien Huang:
Aggregate a posteriori linear regression adaptation.
797-807
Electronic Edition (link) BibTeX
- Jeih-Weih Hung, Lin-Shan Lee:
Optimization of temporal filters for constructing robust features in speech recognition.
808-832
Electronic Edition (link) BibTeX
- Ji Ming:
Noise compensation for speech recognition with arbitrary additive noise.
833-844
Electronic Edition (link) BibTeX
- F. Hilger, Hermann Ney:
Quantile based histogram equalization for noise robust large vocabulary speech recognition.
845-854
Electronic Edition (link) BibTeX
- S. Watanabe, A. Sako, A. Nakamura:
Automatic determination of acoustic model topology using variational Bayesian estimation and clustering for large vocabulary continuous speech recognition.
855-872
Electronic Edition (link) BibTeX
- Hong-Kwang Jeff Kuo, Yuqing Gao:
Maximum entropy direct models for speech recognition.
873-881
Electronic Edition (link) BibTeX
- K. C. Sim, M. J. F. Gales:
Minimum phone error training of precision matrix models.
882-889
Electronic Edition (link) BibTeX
- J. Silva, S. Narayanan:
Average divergence distance as a statistical discrimination measure for hidden Markov models.
890-906
Electronic Edition (link) BibTeX
- Rongqing Huang, John H. L. Hansen:
Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora.
907-919
Electronic Edition (link) BibTeX
- N. Mesgarani, M. Slaney, Shihab A. Shamma:
Discrimination of speech from nonspeech based on multiscale spectro-temporal Modulations.
920-930
Electronic Edition (link) BibTeX
- R. Sant'Ana, Rosangela Coelho, Abraham Alcaim:
Text-independent speaker recognition based on the Hurst parameter and the multidimensional fractional Brownian motion model.
931-940
Electronic Edition (link) BibTeX
- Enrique Vidal, Francisco Casacuberta, Luis Rodríguez, Jorge Civera, Carlos D. Martínez-Hinarejos:
Computer-assisted translation using speech recognition.
941-951
Electronic Edition (link) BibTeX
- Athanasios Mouchtaris, Jan Van der Spiegel, Paul Mueller:
Nonparallel training for voice conversion based on a parameter adaptation approach.
952-963
Electronic Edition (link) BibTeX
- J. Mullen, D. M. Howard, D. T. Murphy:
Waveguide physical modeling of vocal tract acoustics: flexible formant bandwidth control from increased model dimensionality.
964-971
Electronic Edition (link) BibTeX
- K. Sreenivasa Rao, B. Yegnanarayana:
Prosody modification using instants of significant excitation.
972-980
Electronic Edition (link) BibTeX
- Ki-Seung Lee:
MLP-based phone boundary refining for a TTS database.
981-989
Electronic Edition (link) BibTeX
- Jerome R. Bellegarda:
A global, boundary-centric framework for unit selection text-to-speech synthesis.
990-997
Electronic Edition (link) BibTeX
- Cheng-Han Yang, Hsueh-Ming Hang:
Cascaded trellis-based rate-distortion control algorithm for MPEG-4 advanced audio coding.
998-1007
Electronic Edition (link) BibTeX
- B. Supper, T. Brookes, F. Rumsey:
An auditory onset detection algorithm for improved automatic source localization.
1008-1017
Electronic Edition (link) BibTeX
- Woon-Seng Gan, Jun Yang, K.-S. Tan, Meng Hwa Er:
A digital beamsteerer for difference frequency in a parametric array.
1018-1025
Electronic Edition (link) BibTeX
- Rui Cai, Lie Lu, Alan Hanjalic, HongJiang Zhang, Lian-Hong Cai:
A flexible framework for key audio effects detection and auditory context inference.
1026-1039
Electronic Edition (link) BibTeX
- Dimitrios K. Fragoulis, Constantin Papaodysseus, Mihalis Exarhos, G. Roussopoulos, Thanasis Panagopoulos, D. Kamarotos:
Automated classification of piano-guitar notes.
1040-1050
Electronic Edition (link) BibTeX
- H. Viste, G. Evangelista:
A method for separation of overlapping partials based on similarity of temporal envelopes in multichannel mixtures.
1051-1061
Electronic Edition (link) BibTeX
- Serkan Kiranyaz, Ahmad Farooq Qureshi, Moncef Gabbouj:
A generic audio classification and segmentation approach for multimedia indexing and retrieval.
1062-1081
Electronic Edition (link) BibTeX
- Timothy J. Hazen:
Visual model structures and synchrony constraints for audio-visual speech recognition.
1082-1089
Electronic Edition (link) BibTeX
Volume 14,
Number 4,
July 2006
- John F. Pitrelli, R. Bakis, E. M. Eide, R. Fernandez, W. Hamza, M. A. Picheny:
The IBM expressive text-to-speech synthesis system for American English.
1099-1108
Electronic Edition (link) BibTeX
- Chung-Hsien Wu, Chi-Chun Hsia, Te-Hsien Liu, Jhing-Fa Wang:
Voice conversion using duration-embedded bi-HMMs for expressive speech synthesis.
1109-1116
Electronic Edition (link) BibTeX
- Eva Navas, Inma Hernáez, Iker Luengo:
An objective and subjective study of the role of semantics and prosodic features in building corpora for emotional TTS.
1117-1127
Electronic Edition (link) BibTeX
- M. Schroder:
Expressing degree of activation in synthetic speech.
1128-1136
Electronic Edition (link) BibTeX
- Mariët Theune, K. Meijs, Dirk Heylen, R. Ordelman:
Generating expressive speech for storytelling applications.
1137-1144
Electronic Edition (link) BibTeX
- Jianhua Tao, Yongguo Kang, Aijun Li:
Prosody conversion from neutral speech to emotional speech.
1145-1154
Electronic Edition (link) BibTeX
- Wentao Gu, Keikichi Hirose, Hiroya Fujisaki:
Modeling the effects of emphasis and question on fundamental frequency contours of Cantonese utterances.
1155-1170
Electronic Edition (link) BibTeX
- N. Campbell:
Conversational speech synthesis and the need for some laughter.
1171-1178
Electronic Edition (link) BibTeX
- Taishih Chi, Shihab A. Shamma:
Spectrum restoration from multiscale auditory phase singularities by generalized projections.
1179-1192
Electronic Edition (link) BibTeX
- A. Watanabe, T. Sakata:
Reliable methods for estimating relative vocal tract lengths from formant trajectories of common words.
1193-1204
Electronic Edition (link) BibTeX
- W. C. Chu:
Embedded quantization of line spectral frequencies using a multistage tree-structured vector quantizer.
1205-1217
Electronic Edition (link) BibTeX
- Jingdong Chen, Jacob Benesty, Yiteng Arden Huang, S. Doclo:
New insights into the noise reduction Wiener filter.
1218-1234
Electronic Edition (link) BibTeX
- Yunxin Zhao, Rong Hu, Xiaolong Li:
Speedup convergence and reduce noise for enhanced speech separation and recognition.
1235-1244
Electronic Edition (link) BibTeX
- Jen-Tzung Chien, Bo-Cheng Chen:
A new independent component analysis for speech recognition and separation.
1245-1254
Electronic Edition (link) BibTeX
- S. Dharanipragada, K. Visweswariah:
Gaussian mixture models with covariances or precisions in shared multiple subspaces.
1255-1266
Electronic Edition (link) BibTeX
- Brian Kan-Wing Mak, Roger Wend-Huu Hsiao, Simon Ka-Lung Ho, James T. Kwok:
Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting.
1267-1280
Electronic Edition (link) BibTeX
- Diamantino Caseiro, Isabel Trancoso:
A specialized on-the-fly algorithm for lexicon and language model composition.
1281-1291
Electronic Edition (link) BibTeX
- Toshihiko Abe, Masaaki Honda:
Sinusoidal model based on instantaneous frequency attractors.
1292-1300
Electronic Edition (link) BibTeX
- Hui Ye, S. Young:
Quality-enhanced voice morphing using maximum likelihood transformations.
1301-1312
Electronic Edition (link) BibTeX
- Ashish Aggarwal, Shankar L. Regunathan, Kenneth Rose:
Efficient bit-rate scalability for weighted squared error optimization in audio coding.
1313-1327
Electronic Edition (link) BibTeX
- Olivier Derrien, Pierre Duhamel, Maurice Charbit, G. Richard:
A new quantization optimization algorithm for the MPEG advanced audio coder using a statistical subband model of the quantization noise.
1328-1339
Electronic Edition (link) BibTeX
- M. G. Christensen, S. van de Par:
Efficient parametric coding of transients.
1340-1351
Electronic Edition (link) BibTeX
- Rongshan Yu, Susanto Rahardja, Lin Xiao, Chi Chung Ko:
A fine granular scalable to lossless audio coder.
1352-1363
Electronic Edition (link) BibTeX
- T. Umayahara, H. Hokari, S. Shimada:
Stereo width control using interpolation and extrapolation of time-frequency representation.
1364-1377
Electronic Edition (link) BibTeX
- F. Talantzis, D. B. Ward, Patrick A. Naylor:
Performance analysis of dynamic acoustic source separation in reverberant rooms.
1378-1390
Electronic Edition (link) BibTeX
- P. A. A. Esquef, Luiz W. P. Biscainho:
An efficient model-based multirate method for reconstruction of audio signals across long gaps.
1391-1400
Electronic Edition (link) BibTeX
- Slim Essid, Gaël Richard, Bertrand David:
Musical instrument recognition by pairwise classification strategies.
1401-1412
Electronic Edition (link) BibTeX
- Ixone Arroabarren, X. Rodet, Alfonso Carlosena:
On the measurement of the instantaneous frequency and amplitude of partials in vocal vibrato.
1413-1421
Electronic Edition (link) BibTeX
- Ixone Arroabarren, Alfonso Carlosena:
Inverse filtering in singing voice: a critical analysis.
1422-1431
Electronic Edition (link) BibTeX
- Saman S. Abeysekera, Kabi Prakash Padhi:
An investigation of window effects on the frequency estimation using the phase vocoder.
1432-1439
Electronic Edition (link) BibTeX
- A. Robel:
Adaptive additive modeling with continuous parameter trajectories.
1440-1453
Electronic Edition (link) BibTeX
- Crispin Cooper, Damian Murphy, David Howard, Alexander Tyrrell:
Singing synthesis with an evolved physical model.
1454-1461
Electronic Edition (link) BibTeX
- Emmanuel Vincent, Rémi Gribonval, Cédric Févotte:
Performance measurement in blind audio source separation.
1462-1469
Electronic Edition (link) BibTeX
- Panayiotis G. Georgiou, Chris Kyriakakis:
Robust maximum likelihood source localization: the case for sub-Gaussian versus Gaussian.
1470-1480
Electronic Edition (link) BibTeX
Volume 14,
Number 5,
September 2006
- Li Deng, Dong Yu, Alex Acero:
Structured speech modeling.
1492-1504
Electronic Edition (link) BibTeX
- Claude Barras, Xuan Zhu, S. Meignier, Jean-Luc Gauvain:
Multistage speaker diarization of broadcast news.
1505-1512
Electronic Edition (link) BibTeX
- M. J. F. Gales, Do Yeong Kim, Philip C. Woodland, Ho Yin Chan, D. Mrva, R. Sinha, S. E. Tranter:
Progress in the CU-HTK broadcast news transcription system.
1513-1525
Electronic Edition (link) BibTeX
- Yang Liu, Elizabeth Shriberg, Andreas Stolcke, Dustin Hillard, Mari Ostendorf, Mary P. Harper:
Enriching speech recognition with automatic detection of sentence boundaries and disfluencies.
1526-1540
Electronic Edition (link) BibTeX
- Spyridon Matsoukas, Jean-Luc Gauvain, Gilles Adda, T. Colthurst, Chia-Lin Kao, Owen Kimball, Lori Lamel, Fabrice Lefevre, J. Z. Ma, John Makhoul, Long Nguyen, Rohit Prasad, Richard M. Schwartz, Holger Schwenk, Bing Xiang:
Advances in transcription of broadcast news and conversational telephone speech within the combined EARS BBN/LIMSI system.
1541-1556
Electronic Edition (link) BibTeX
- S. E. Tranter, Douglas A. Reynolds:
An overview of automatic speaker diarization systems.
1557-1565
Electronic Edition (link) BibTeX
- Matthew Lease, Mark Johnson, Eugene Charniak:
Recognizing disfluencies in conversational speech.
1566-1573
Electronic Edition (link) BibTeX
- Jui-Feng Yeh, Chung-Hsien Wu:
Edit disfluency detection and correction using a cleanup language model and an alignment model.
1574-1583
Electronic Edition (link) BibTeX
- Hui Jiang, Xinwei Li, Chaojun Liu:
Large margin hidden Markov models for speech recognition.
1584-1595
Electronic Edition (link) BibTeX
- Stanley F. Chen, Brian Kingsbury, Lidia Mangu, Daniel Povey, George Saon, Hagen Soltau, Geoffrey Zweig:
Advances in speech transcription at IBM under the DARPA EARS program.
1596-1608
Electronic Edition (link) BibTeX
- C. A. Rodbro, M. N. Murthi, Søren Vang Andersen, Søren Holdt Jensen:
Hidden Markov model-based packet loss concealment for voice over IP.
1609-1623
Electronic Edition (link) BibTeX
- Farshad Lahouti, A. R. Fazel, A. H. Safavi-Naeini, Amir K. Khandani:
Single and double frame coding of speech LPC parameters using a lattice-based quantization scheme.
1624-1632
Electronic Edition (link) BibTeX
- Herbert Buchner, Jacob Benesty, Tomas Gänsler, Walter Kellermann:
Robust extended multidelay filter and double-talk detector for acoustic echo cancellation.
1633-1644
Electronic Edition (link) BibTeX
- M. Kuropatwinski, W. Bastiaan Kleijn:
Estimation of the short-term predictor parameters of speech under noisy conditions.
1645-1655
Electronic Edition (link) BibTeX
- Rile Hu, Chengqing Zong, Bo Xu:
An approach to automatic acquisition of translation templates based on phrase structure extraction and alignment.
1656-1663
Electronic Edition (link) BibTeX
- Alon Lavie, Fabio Pianesi, Lori S. Levin:
The NESPOLE! System for multilingual speech communication over the Internet.
1664-1673
Electronic Edition (link) BibTeX
- Gen-ichiro Kikui, Seiichi Yamamoto, Toshiyuki Takezawa, Eiichiro Sumita:
Comparative study on corpora for speech translation.
1674-1682
Electronic Edition (link) BibTeX
- Xiao Li, Jonathan Malkin, Jeff A. Bilmes:
A high-speed, low-resource ASR back-end based on custom arithmetic.
1683-1693
Electronic Edition (link) BibTeX
- Kai Yu, M. J. F. Gales:
Discriminative cluster adaptive training.
1694-1703
Electronic Edition (link) BibTeX
- Chak-Fai Li, Man-Hung Siu, Jeff Siu-Kei Au-Yeung:
Recursive likelihood evaluation and fast search algorithm for polynomial segment model with application to speech recognition.
1704-1718
Electronic Edition (link) BibTeX
- Jen-Tzung Chien:
Association pattern language modeling.
1719-1728
Electronic Edition (link) BibTeX
- Andreas Stolcke, Barry Chen, Horacio Franco, Venkata Ramana Rao Gadde, Martin Graciarena, Mei-Yuh Hwang, Katrin Kirchhoff, Arindam Mandal, Nelson Morgan, Xin Lei, Tim Ng, Mari Ostendorf, K. Sonmez, Anand Venkataraman, Dimitra Vergyri, Wen Wang, Jing Zheng, Qifeng Zhu:
Recent innovations in speech-to-text transcription at SRI-ICSI-UW.
1729-1744
Electronic Edition (link) BibTeX
- N. Duta, Richard M. Schwartz, John Makhoul:
Analysis of the errors produced by the 2004 BBN speech recognition system in the DARPA EARS evaluations.
1745-1753
Electronic Edition (link) BibTeX
- S. Mathur, B. H. Story, J. J. Rodriguez:
Vocal-tract modeling: fractional elongation of segment lengths in a waveguide model with half-sample delays.
1754-1762
Electronic Edition (link) BibTeX
- J. Vepa, S. King:
Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis.
1763-1771
Electronic Edition (link) BibTeX
- C. Baras, N. Moreau, P. Dymarski:
Controlling the inaudibility and maximizing the robustness in an audio annotation watermarking system.
1772-1782
Electronic Edition (link) BibTeX
- M. Goto:
A chorus section detection method for musical audio signals and its application to a music listening station.
1783-1794
Electronic Edition (link) BibTeX
- Aggelos Pikrakis, Sergios Theodoridis, Dimitris Kamarotos:
Classification of musical patterns using variable duration hidden Markov models.
1795-1807
Electronic Edition (link) BibTeX
- Laurent Daudet:
Sparse and structured decompositions of signals with the molecular matching pursuit.
1808-1816
Electronic Edition (link) BibTeX
- V. Verfaille, Udo Zölzer, Dabiel Arfib:
Adaptive digital audio effects (a-DAFx): a new class of sound transformations.
1817-1831
Electronic Edition (link) BibTeX
- Fabien Gouyon, Anssi Klapuri, Simon Dixon, M. Alonso, G. Tzanetakis, C. Uhle, Pedro Cano:
An experimental comparison of audio tempo induction algorithms.
1832-1844
Electronic Edition (link) BibTeX
- M. R. Every, J. E. Szymanski:
Separation of synchronous pitched notes by spectral filtering of harmonics.
1845-1856
Electronic Edition (link) BibTeX
- S. M. Kuo, A. B. Puvvala:
Effects of frequency separation in periodic active noise control systems.
1857-1866
Electronic Edition (link) BibTeX
- Guangji Shi, M. M. Shanechi, Parham Aarabi:
On the importance of phase in human speech recognition.
1867-1874
Electronic Edition (link) BibTeX
- D. P. Das, S. R. Mohapatra, A. Routray, T. K. Basu:
Filtered-s LMS algorithm for multichannel active control of nonlinear noise processes.
1875-1880
Electronic Edition (link) BibTeX
Volume 14,
Number 6,
November 2006
- A. W. Rix, J. G. Beerends, D.-S. Kim, P. Kroon, O. Ghitza:
Objective Assessment of Speech and Audio Quality - Technology and Applications.
1890-1901
Electronic Edition (link) BibTeX
- Rainer Huber, Birger Kollmeier:
PEMO-Q - A New Method for Objective Audio Quality Assessment Using a Model of Auditory Perception.
1902-1911
Electronic Edition (link) BibTeX
- A. Karmakar, A. Kumar, R. K. Patney:
A Multiresolution Model of Auditory Excitation Pattern and Its Application to Objective Evaluation of Perceived Speech Quality.
1912-1923
Electronic Edition (link) BibTeX
- L. Malfait, J. Berger, M. Kastner:
P.563 - The ITU-T Standard for Single-Ended Speech Quality Assessment.
1924-1934
Electronic Edition (link) BibTeX
- Tiago H. Falk, Wai-Yip Chan:
Single-Ended Speech Quality Measurement Using Machine Learning Methods.
1935-1947
Electronic Edition (link) BibTeX
- V. Grancharov, David Y. Zhao, J. Lindblom, W. Bastiaan Kleijn:
Low-Complexity, Nonintrusive Speech Quality Assessment.
1948-1956
Electronic Edition (link) BibTeX
- Alexander Raake:
Short- and Long-Term Packet Loss Behavior: Towards Speech Quality Prediction for Arbitrary Loss Distributions.
1957-1968
Electronic Edition (link) BibTeX
- Sebastian Möller, Alexander Raake, N. Kitawaki, A. Takahashi, Marcel Wältermann:
Impairment Factor Framework for Wide-Band Speech Codecs.
1969-1976
Electronic Edition (link) BibTeX
- S. R. Broom:
VoIP Quality Assessment: Taking Account of the Edge-Device.
1977-1983
Electronic Edition (link) BibTeX
- A. Takahashi, A. Kurashima, H. Yoshino:
Objective Assessment Methodology for Estimating Conversational Quality in VoIP.
1984-1993
Electronic Edition (link) BibTeX
- S. George, S. Zielinski, F. Rumsey:
Feature Extraction for the Prediction of Multichannel Spatial Audio Fidelity.
1994-2005
Electronic Edition (link) BibTeX
- T. Yamada, M. Kumakura, N. Kitawaki:
Performance Estimation of Speech Recognition System Under Noise Conditions Using Objective Quality Measures and Artificial Voice.
2006-2013
Electronic Edition (link) BibTeX
- P. Li, Y. Guan, B. Xu, W. Liu:
Monaural Speech Separation Based on Computational Auditory Scene Analysis and Objective Quality Assessment of Speech.
2014-2023
Electronic Edition (link) BibTeX
- Georgios Evangelopoulos, Petros Maragos:
Multiband Modulation Energy Tracking for Noisy Speech Detection.
2024-2038
Electronic Edition (link) BibTeX
- Mohamed Deriche, Daryl Ning:
A Novel Audio Coding Scheme Using Warped Linear Prediction Model and the Discrete Wavelet Transform.
2039-2048
Electronic Edition (link) BibTeX
- John H. L. Hansen, V. Radhakrishnan, Kathryn Hoberg Arehart:
Speech Enhancement Based on Generalized Minimum Mean Square Error Estimators and Masking Properties of the Auditory System.
2049-2063
Electronic Edition (link) BibTeX
- Richard C. Hendriks, Richard Heusdens, Jesper Jensen:
Adaptive Time Segmentation for Improved Speech Enhancement.
2064-2074
Electronic Edition (link) BibTeX
- Tiemin Mei, Jiangtao Xi, Fuliang Yin, Alfred Mertins, Joe F. Chicharo:
Blind Source Separation Based on Time-Domain Optimization of a Frequency-Domain Independence Criterion.
2075-2085
Electronic Edition (link) BibTeX
- Y. Nagata, K. Mitsubori, T. Kagi, T. Fujioka, M. Abe:
Fast Implementation of KLT-Based Speech Enhancement Using Vector Quantization.
2086-2097
Electronic Edition (link) BibTeX
- Cyril Plapous, Claude Marro, Pascal Scalart:
Improved Signal-to-Noise Ratio Estimation for Speech Enhancement.
2098-2108
Electronic Edition (link) BibTeX
- Michael L. Seltzer, Richard M. Stern:
Subband Likelihood-Maximizing Beamforming for Speech Recognition in Reverberant Environments.
2109-2121
Electronic Edition (link) BibTeX
- Man-Hung Siu, Arthur Chan:
A Robust Viterbi Algorithm Against Impulsive Noise With Application to Speech Recognition.
2122-2133
Electronic Edition (link) BibTeX
- Y. Tian, J.-L. Zhou, H. Lin, H. Jiang:
Tree-Based Covariance Modeling of Hidden Markov Models.
2134-2146
Electronic Edition (link) BibTeX
- Jian Wu, Qiang Huo:
An Environment-Compensated Minimum Classification Error Training Approach Based on Stochastic Vector Mapping.
2147-2155
Electronic Edition (link) BibTeX
- K. W. Wilson, Trevor Darrell:
Learning a Precedence Effect-Like Weighting Function for the Generalized Cross-Correlation Framework.
2156-2164
Electronic Edition (link) BibTeX
- H. Sawada, Shoko Araki, Ryo Mukai, Shoji Makino:
Blind Extraction of Dominant Target Sources Using ICA and Time-Frequency Masking.
2165-2173
Electronic Edition (link) BibTeX
- Cédric Févotte, Simon J. Godsill:
A Bayesian Approach for Blind Separation of Sparse Sources.
2174-2188
Electronic Edition (link) BibTeX
- Yegui Xiao, Liying Ma, Khashayar Khorasani, Akira Ikuta:
A New Robust Narrowband Active Noise Control System in the Presence of Frequency Mismatch.
2189-2200
Electronic Edition (link) BibTeX
- Y. Yokotani, R. Geiger, G. D. T. Schuller, Soontorn Oraintara, K. R. Rao:
Lossless Audio Coding Using the IntMDCT and Rounding Error Shaping.
2201-2211
Electronic Edition (link) BibTeX
- Toshio Irino, Roy D. Patterson, H. Kawahara:
Speech Segregation Using an Auditory Vocoder With Event-Synchronous Enhancements.
2212-2221
Electronic Edition (link) BibTeX
- Toshio Irino, Roy D. Patterson:
A Dynamic Compressive Gammachirp Auditory Filterbank.
2222-2232
Electronic Edition (link) BibTeX
- Wei-Chen Chang, Alvin Wen-Yu Su:
A Multichannel Recurrent Network Analysis/Synthesis Model for Coupled-String Instruments.
2233-2241
Electronic Edition (link) BibTeX
- Juan Pablo Bello, Laurent Daudet, Mark B. Sandler:
Automatic Piano Transcription Using Frequency and Time-Domain Information.
2242-2251
Electronic Edition (link) BibTeX
- Panu Somervuo, A. Harma, S. Fagerlund:
Parametric Representations of Bird Sounds for Automatic Species Recognition.
2252-2263
Electronic Edition (link) BibTeX
- Ying Li, Chitra Dorai:
Instructional Video Content Analysis Using Audio Information.
2264-2274
Electronic Edition (link) BibTeX
Copyright © Sun May 17 00:22:54 2009
by Michael Ley (ley@uni-trier.de)