Chinese
English
Research Fellow (Professor)  |  Tsao, Yu  
 
contact
vita
education
experience
interests
descriptions
activities
invited_talk
honors
publications
patents
software
others
supervised
lab (New window)
 
 
 
 
 
Publications
 
Journal Articles
 
1. E. H.-H. Huang, R. Chao, Y. Tsao, and C.-M. Wu, "ElectrodeNet - A Deep-Learning-Based Sound Coding Strategy for Cochlear Implants," IEEE Transactions on Cognitive and Developmental Systems, volume 16, number 1, pages 346-357, February 2024. :::icon
2. S.-Y. Peng, I-C. Liu, Y.-H. Wu, T.-J. Lin, C.-J. Chen, X.-Z. Li, Y.-Q. Cheng, P.-H. Lin, K.-H. Hung, and Y. Tsao, "An SRAM-Based Reconfigurable Cognitive Computation Matrix for Sensor Edge Applications," IEEE Journal of Solid-State Circuits, volume 59, number 2, pages 636-648, February 2024. :::icon
3. K.-C. Ting, Y.-C. Lin, C.-T. Chan, T.-Y. Tu, Y. Tsao, K.-C. Liu, and C.-C. Shih, "Inertial Measurement Unit-based Romberg Test in Assessing Adults with Vestibular Hypofunction," IEEE Journal of Translational Engineering in Health and Medicine, volume 12, pages 245-255, December 2023. :::icon
4. H.-C. Kuo, Y.-P. Hsieh, H.-H. Tseng, C.-T. Wang, S.-H. Fang, and Y, Tsao, "Toward Real-World Voice Disorder Classification," IEEE Transactions on Biomedical Engineering, volume 70, number 10, pages 2922-2932, October 2023. :::icon
5. K.-C. Ting, S.-S. Wang, Y.-J. Li, C.-Y. Huang, T.-Y. Tu, C.-C. Shih, K.-C. Liu and Y. Tsao, "Detection of Otitis Media with Effusion Using In-Ear Microphones and Machine Learning," IEEE Sensors Journal, volume 23, pages 28411-28420, October 2023. :::icon
6. L.-C. Chen, K.-H. Hung, Y.-J. Tseng, H.-Y. Wang, T.-M. Lu, W.-C. Huang, and Y. Tsao, "Self-supervised Learning Based General Laboratory Progress Pretrained Model for Cardiovascular Event Detection," IEEE Journal of Translational Engineering in Health and Medicine, volume 12, pages 43-55, August 2023. :::icon
7. Y.-J. Lu, C.-Y. Chang, C. Yu, C.-F. Liu, J.-w. Hung, S. Watanabe, and Y. Tsao, "Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 31, pages 2738-2750, June 2023. :::icon
8. C.-Y. Cheng, H.-S. Lee, Y. Tsao, and H.-M. Wang, "Multi-target Filter and Detector for Unknown-number Speaker Diarization," IEEE Signal Processing Letters, volume 30, pages 638-642, May 2023. :::icon
9. Y.-W. Chen, H.-M. Wang, and Y. Tsao, "BASPRO: A Balanced Script Producer for Speech Corpus Collection Based on the Genetic Algorithm," APSIPA Transactions on Signal and Information Processing, volume 12, number 3, pages e15, April 2023, Themed Series: Advanced Acoustic, Sound and Audio Processing Techniques and Their Applications :::icon
10. T.-M. Chen, Y.-H. Tsai, H.-H. Tseng, K.-C. Liu, J.-Y. Chen, C.-H. Huang, G.-Y. Li, C.-Y. Shen, and Y. Tsao, "SRECG: ECG Signal Super-resolution Framework for Portable/Wearable Devices in Cardiac Arrhythmias Classification," IEEE Transactions on Consumer Electronics, volume 1, pages 1, January 2023. :::icon
11. S.-Y. Niu, L.-Z. Guo, Y. Li, Z. Zhang, T.-D. Wang, K.-C. Liu, Y. Tsao, T.-M. Liu, "Boundary-Preserved Deep Denoising of the Stochastic Resonance Enhanced Multiphoton Images," IEEE Journal of Translational Engineering in Health and Medicine, volume 10, pages 1-12, September 2022.
12. K.-C. Liu, K.-H. Hung, C.-Y. Hsieh, H.-Y. Huang, C.-T. Chan, and Y. Tsao, "Deep Learning Based Signal Enhancement of Low-Resolution Accelerometer for Fall Detection Systems," IEEE Transactions on Cognitive and Developmental Systems, volume 14, number 3, pages 1270-1281, September 2022. :::icon
13. R. E. Zezario, S.-W. Fu, F. Chen, C.-S. Fuh, H.-M. Wang, and Y. Tsao, "Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 31, pages 54-70, September 2022. :::icon
14. L.-C. Chen, P.-H. Chen, R. T.-H. Tsai, and Y. Tsao,, "EPG2S: Speech Generation and Speech Enhancement based on Electropalatography and Audio Signals using Multimodal Learning," IEEE Signal Processing Letters, volume 29, pages 2582-2586, June 2022. :::icon
15. Y. Lin, Y.Tsao, and P.-J. Hsieh, "Neural Correlates of Individual Differences in Predicting Ambiguous Sounds Comprehension Level," NeuroImage, volume 251, pages 1-12, May 2022. :::icon
16. T. Hussain, W.-C. Wang, M. Gogate, K. Dashtipour, Y. Tsao, X. Lu, A. Ahsan, and A. Hussain, "A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement," IEEE Transactions on Artificial Intelligence, volume 1, number 1, pages 1-12, April 2022. :::icon
17. C.-T. Wang, Z.-Y. Chuang, C.-H. Hung, Y. Tsao, S.-H. Fang, "Detection of Glottic Neoplasm Based on Voice Signals Using Deep Neural Networks," IEEE Sensors Journal, volume 6, pages 1-4, March 2022, (Letters)
18. Y.-W. Chen, K.-H. Hung, Y.-J. Li, A. C.-F. Kang, Y.-S. Lai, K.-C. Liu, S.-W. Fu, S.-S. Wang, Y. Tsao, "CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application," IEEE Access, volume 10, pages 46082-46099, February 2022. :::icon
19. L.-C. Chen, J.-T. Sheu, Y.-J. Chuang, and Y. Tsao, "Predicting the Travel Distance of Patients while Accessing Healthcare using Deep Neural Network," IEEE Journal of Translational Engineering in Health and Medicine, volume 10, pages 1-11, February 2022. :::icon
20. S.-Y. Chuang, H.-M. Wang, and Y. Tsao, "Improved Lite Audio-Visual Speech Enhancement," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 30, pages 1345-1359, February 2022. :::icon
21. C.-H. Hu, Y.-H. Peng, J.Yamagishi, Y. Tsao, and H.-M. Wang, "SVSNet: An End-to-end Speaker Voice Similarity Assessment Model," IEEE Signal Processing Letters, volume 29, pages 767-771, February 2022. :::icon
22. S.-S. Wang, C.-C. Lai, C.-T. Wang, Y. Tsao, S.-H. Fang, "Continuous Speech for Improved Learning Pathological Voice Disorders," IEEE Open Journal of Engineering in Medicine and Biology, volume 3, pages 2644-1276, February 2022. :::icon
23. Y.-C. Lin, C. Yu, Y.-T. Hsu, S.-W. Fu, Y. Tsao, T.-W. Kuo, "SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 30, pages 1016-1031, December 2021. :::icon
24. R.-Y. Tseng, T.-W. Wang, S.-W. Fu, C.-Y. Lee, and Y. Tsao, "A Study of Joint Effect on Denoising Techniques and Visual Cues to Improve Speech Intelligibility in Cochlear Implant Simulation," IEEE Transactions on Cognitive and Developmental Systems, volume 13, pages 984-994, December 2021. :::icon
25. X. Lu, P. Shen, Y. Tsao, and H. Kawai, "Coupling A Generative Model With A Discriminative Learning Framework for Speaker Verification," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 29, pages 3631-3641, November 2021. :::icon
26. F. S. Abousaleh, W.-H. Cheng, N.-H. Yu, and Y. Tsao, "Multimodal Deep Learning Framework for Image Popularity Prediction on Social Media," IEEE Transactions on Cognitive and Developmental Systems, volume 13, number 3, pages 679-692, September 2021. :::icon
27. W. Ariyanti, T. Hussain, J.-C. Wang, C.-T. Wang, S.-H. Fang, and Y. Tsao, "Ensemble and Multimodal Learning for Pathological Voice Classification," IEEE Sensors Journal, volume 5, number 7, pages 1-4, July 2021, (Letters) :::icon
28. K.-C. Liu, M. Chan, C.-Y. Hsieh, H.-Y. Huang, C.-T. Chan, Y. Tsao, "Domain-adaptive Fall Detection Using Deep Adversarial Training," IEEE Transactions on Neural Systems & Rehabilitation Engineering, volume 29, pages 1243-1251, June 2021. :::icon
29. T.-H. Lin ,T. Akamatsu,Y. Tsao, "Sensing ecosystem dynamics via audio source separation: A case study of marine soundscapes off northeastern Taiwan,," PLOS Computational Biology, volume 1, number 1, pages 1-23, February 2021. :::icon
30. J.-K. Wang, Y.-F. Chang, K.-H. Tsai, W.-C. Wang, C.-Y. Tsai, C.-H. Cheng, and Y. Tsao, "Automatic recognition of murmurs of ventricular septal defect using convolutional recurrent neural networks with temporal attentive pooling," Scientific Reports, volume 10, number 21797, pages 1-10, December 2020. :::icon
31. N. Y.-H. Wang, H.-L. S. Wang, T.-W. Wang, S.-W. Fu, X. Lu, H.-M. Wang, and Y. Tsao, "Improving the Intelligibility of Speech for Simulated Electric and Acoustic Stimulation Using Fully Convolutional Neural Networks," IEEE Transactions on Neural Systems & Rehabilitation Engineering, volume 29, pages 184-195, December 2020. :::icon
32. T. Hussain, S. M. Siniscalchi, H.-L. S. Wang, Y. Tsao, S. V. Mario, and W.-H. Liao, "Ensemble Hierarchical Extreme Learning Machine for Speech Dereverberation," IEEE Transactions on Cognitive and Developmental Systems, volume 12, number 4, pages 744-758, December 2020. :::icon
33. X. Wang et al.,, "ASVspoof 2019: A Large-scale Public Database of Synthetized, Converted and Replayed Speech," Computer Speech and Language, volume 64, pages 1-27, November 2020. :::icon
34. H.-S. Lee, Y. Tsao, S.-K. Jeng, and H.-M. Wang, "Subspace-based Representation and Learning for Phonotactic Spoken Language Recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 28, pages 3065-3079, November 2020. :::icon
35. K.-H. Tsai, W.-C. Wang, C.-H. Cheng, C.-Y. Tsai, J.-K. Wang, T.-H. Lin, S.-H. Fang, L.-C. Chen, and Y. Tsao, "Blind Monaural Source Separation on Heart and Lung Sounds Based on Periodic-Coded Deep Autoencoder," IEEE Journal of Biomedical and Health Informatics, volume 24, number 11, pages 3203-3214, November 2020. :::icon
36. T.-A. Hsieh, H.-M. Wang, X. Lu, and Y. Tsao, "WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech Enhancement," IEEE Signal Processing Letters, volume 27, pages 2149-2153, November 2020. :::icon
37. C. Yu*, R. E. Zezario*, S.-S. Wang, J. Sherman, Y.-Y. Hsieh, X. Lu, H.-M. Wang, and Y. Tsao, "Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 28, pages 2756-2769, October 2020, (*equal contributions) :::icon
38. W.-C. Huang, H. Luo, H.-T. Hwang, C.-C. Lo, Y.-H. Peng, Y. Tsao, and H.-M. Wang, "Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion," IEEE Transactions on Emerging Topics in Computational Intelligence, volume 4, number 4, pages 468-479, August 2020. :::icon
39. N. Y.-H. Wang, C.-H. Chiang, H.-L. S. Wang and Y. Tsao, "Atypical Frequency Sweep Processing in Chinese Children With Reading Difficulties: Evidence From Magnetoencephalography," Frontiers in Psychology, volume 99, pages 99, July 2020. :::icon
40. C. Yu, K.-H. Hung, S.-S. Wang, Y. Tsao, and J.-w. Hung, "Time-Domain Multi-modal Bone/air Conducted Speech Enhancement," IEEE Signal Processing Letters, volume 27, pages 1035-1039, June 2020. :::icon
41. M. Lee, L. Lin, C.-Y. Chen, Y. Tsao, T.-H. Yao, M.-H. Fei and S.-H. Fang, "Forecasting Air Quality in Taiwan by Using Machine Learning," Scientific Reports, number 4153, pages 1-13, March 2020. :::icon
42. Y.-H. Lai, W.-N. Chen, T.-C. Hsu, C. Lin, Y. Tsao and S. Wu, "Overall Survival Prediction of Non-small Cell Lung Cancer by Integrating Microarray and Clinical Data with Deep Learning," Scientific Reports, number 4679, pages 1-11, March 2020. :::icon
43. S. C. Hidayati, T. W. Goh, Ji.-S. G. Chan, C.-C. Hsu, J. See, L.-K. Wong, K.-L. Hua, Y. Tsao, and W.-H. Cheng, "Dress With Style: Learning Style from Joint Deep Embedding of Clothing Styles and Body Shapes," IEEE Transactions on Multimedia, volume 23, pages 365-377, March 2020.
44. C.-L. Liu, S.-W. Fu, Y.-J. Li, J.-W. Huang, H.-M. Wang, and Y. Tsao, "Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 28, pages 1888-1900, February 2020. :::icon
45. S.-W. Fu, C.-F. Liao, Y. Tsao, "Learning with Learned Loss Function: Speech Enhancement with Quality-Net to Improve Perceptual Evaluation of Speech Quality," IEEE Signal Processing Letters, volume 27, pages 26-30, December 2019. :::icon
46. J.-Y. Wu, C. Yu, S.-W. Fu, C.-T. Liu, S.-Y. Chien, Y. Tsao, "Increasing Compactness of Deep Learning based Speech Enhancement Models with Parameter Pruning and Quantization Techniques," IEEE Signal Processing Letters, volume 26, number 12, pages 1887-1891, December 2019. :::icon
47. T.-H. Lin amd Y. Tsao, "Source Separation in Ecoacoustics: A Roadmap towards Versatile Soundscape Information Retrieval," Remote Sensing in Ecology and Conservation, volume online, pages 1-12, December 2019. :::icon
48. C.-T. Wang, F.-C. Lin, J.-Y. Chen, M.-J. Hsiao, S.-H. Fang, Y.-H. Lai, Y. Tsao, "Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach," Journal of Voice, volume 33, number 5, pages pp. 634-641, September 2019. :::icon
49. S.-H. Fang, C.-T. Wang, J.-Y. Chen, Y. Tsao and F.-C. Lin, "Combining Acoustic Signals and Medical Records to Improve Pathological Voice Classification," APSIPA Transactions on Signal and Information Processing, volume 8, pages 1-11, June 2019. :::icon
50. C.-W. Lee et al.,, "Bioimaging: New Templated Ostwald Ripening Process of Mesostructured FeOOH for Third‐Harmonic Generation Bioimaging," Small, volume 15, number 20, pages 1-11, May 2019. :::icon
51. H.-T. Chiang, Y.-Y. Hsieh, S.-W. Fu, K.-H. Hung, Y. Tsao, S.-Y. Chien, "Noise Reduction in ECG Signals Using Fully Convolutional Denoising Autoencoders," IEEE Access, volume 7, pages 60806-60813, April 2019. :::icon
52. Y.-C. Chu, Y.-F. Cheng, Y.-H. Lai, Y. Tsao, T.-Y. Tu, S. T. Young, T.-S. Chen, Y.-F. Chung, F. Lai, W.-H. Liao, "A Mobile Phone–Based Approach for Hearing Screening of School-Age Children: Cross-Sectional Validation Study," JMIR Mhealth Uhealth, volume 1, pages 1-13, April 2019. :::icon
53. Y. Tsao, T.-H. Lin, F. Chen, Y.-F. Chang, C.-H. Cheng, and K.-H. Tsai, "Robust S1 and S2 heart sound recognition based on spectral restoration and multi-style training," Biomedical Signal Processing and Control, volume 49, pages 173-180, March 2019. :::icon
54. H.-L. S. Wanga , N. Y.-H. Wang , I-C. Chen, and Y. Tsao, "Auditory Identification of Frequency-Modulated Sweeps and Reading Difficulties in Chinese," Research in Developmental Disabilities, volume 86, pages 53-61, January 2019. :::icon
55. C.-T. Liu, T.-W. Lin, Y.-H. Wu, Y.-S. Lin, H. Lee, Y. Tsao, and S.-Y. Chien, "Computation-Performance Optimization of Convolutional Neural Networks with Redundant Filter Removal," IEEE Transactions on Circuits and Systems I, volume 66, pages 1908-1921, December 2018. :::icon
56. H.-P. Liu, Y. Tsao, and C.-S. Fuh, "Bone Conducted Speech Enhancement Using Deep Denoising Autoencoder," Speech Communication, volume 104, pages 106-112, November 2018. :::icon :::icon
57. H.-T. Hwang, Y.-C. Wu, S.-S. Wang, C.-C. Hsu, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen, "Locally linear Embedding Based Post-filtering for Speech Enhancement," Journal of Information Science and Engineering, volume 34, number 6, pages 1469-1491, October 2018. :::icon
58. S.-Y. Tsui, Y. Tsao, C.-W. Lin, S.-H. Fang, and C.-T. Wang, "Demographic and Symptomatic Features of Voice Disorders and Their Potential Application in Classification using Machine Learning Algorithms," Folia Phoniatrica et Logopaedica, volume 70, pages 174-182, September 2018.
59. S.-W. Fu, T.-W. Wang, Y. Tsao, X. Lu, and H. Kawai, "End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 26, number 9, pages 1570-1584, September 2018. :::icon
60. Y.-H. Lai, Y. Tsao, X. Lu, F. Chen, Y.-T. Su, K.-C. Chen, Y.-H. Chen, L.-C. Chen, P.-H. Li, and C.-H. Lee, "Deep Learning based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients," Ear and Hearing, volume 39(4), number 4, pages 795-809, July 2018, This work receives the National Innovation Award 2018 (2018年國家新創獎) :::icon :::icon
61. J.-C. Hou, S.-S. Wang, Y.-H. Lai, Y. Tsao, H.-W. Chang, and H.-M. Wang, "Audio-visual Speech Enhancement using Multimodal Deep Convolutional Neural Networks," IEEE Transactions on Emerging Topics in Computational Intelligence, volume 2, number 2, pages 117-128, April 2018. :::icon
62. Y. Tsao, H.-C. Chu, S.-H. Fang, J. Lee, and C.-M. Lin, "Adaptive Noise Cancellation using Deep Cerebellar Model Articulation Controller," IEEE Access, volume 6, pages 37395-37402, April 2018. :::icon :::icon
63. T.-H. Lin, T. Akamatsu, and Y, Tsao, "Comparison of passive acoustic soniferous fish monitoring with supervised and unsupervised approaches," Journal of the Acoustical Society of America (JASA), volume 143, number 4, pages published onlione, April 2018. :::icon
64. S.-S. Wang, P. Lin, Y. Tsao, J.-W. Hung, and B. Su, "Suppression by Selecting Wavelets for Feature Compression in Distributed Speech Recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 26, number 3, pages 564-579, March 2018. :::icon
65. J. Torres-Sospedra et al., "Off-Line Evaluation of Mobile-Centric Indoor Positioning Systems: The Experiences from the 2017 IPIN Competition," Sensors, volume 18, number 2, pages 487, February 2018. :::icon
66. H.-T. Hwang, Y.-C. Wu, Y.-H. Peng, C.-C. Hsu, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen, "Voice Conversion based on Locally Linear Embedding," Journal of Information Science and Engineering, volume 34, number 6, pages 1493-1516, January 2018. :::icon
67. P. Lin, D. Lyu, F. Chen, S.-S. Wang, and Y. Tsao, "Multi-style Learning with Denoising Autoencoders for Acoustic Modeling in the Internet of Things (IoT)," Computer Speech and Language, volume 46, pages 481-495, November 2017. :::icon
68. S.-W. Fu, P.-C. Li, Y.-H. Lai, C.-C. Yang, L.-C. Hsieh, and Y. Tsao, "Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery," IEEE Transactions on Biomedical Engineering, volume 64, number 11, pages 2584 - 2594, November 2017. :::icon
69. T. Hussain, S. M. Siniscalchi, C.-C. Lee, S.-S. Wang, Y. Tsao and W.-H. Liao, "Experimental Study on Extreme Learning Machine Applications for Speech Enhancement," IEEE Access, volume 99, number 99, pages 1-1, October 2017. :::icon
70. S.-H. Fang, Y.-X. Fei, Z. Xu, and Y. Tsao, "Learning Transportation Modes from Smartphone Sensors Based on Deep Neural Network," IEEE Sensors Journal, volume 17, pages 6111 - 6118, September 2017. :::icon
71. F. Chen, D. Zheng, Y. Tsao, "Effects of Noise Suppression and Envelope Dynamic Range Compression on the Intelligibility of Vocoded Sentences for a Tonal Language," Journal of the Acoustical Society of America (JASA), volume 142, number 3, pages 1157-1166, September 2017. :::icon
72. S.-W. Hsiao, H.-C. Sun, M.-C. Hsieh, M.-H. Tsai, Y. Tsao, and C.-C. Lee, "Toward Automating Oral Presentation Scoring during Principal Certification Program using Audio-Video Low-level Behavior Profiles," IEEE Transactions on Affective Computing, volume PP, number PP, pages PP, September 2017. :::icon
73. X. Lu, P. Shen, Y. Tsao, and H. Kawai, "Regularization of Neural Network Model with Distance Metric Learning for I-vector based Spoken Language Identification," Computer Speech and Language, volume 44, pages 48-60, July 2017. :::icon
74. T.-H. Lin, S.-H. Fang, and Y, Tsao, "Improving Biodiversity Assessment via Unsupervised Separation of Biological Sounds from Long-duration Recordings," Scientific Reports, volume 7, number 4547, pages 1, July 2017. :::icon :::icon
75. Y.-H. Lai, F. Chen, S.-S. Wang, X. Lu, Y. Tsao, and C.-H. Lee, "A Deep Denoising Autoencoder Approach to Improving the Intelligibility of Vocoded Speech in Cochlear Implant Simulation," IEEE Transactions on Biomedical Engineering, volume 64, number 7, pages 1568 - 1578, July 2017. :::icon
76. A. Chern, Y.-H. Lai, Y.-p. Chang, Y. Tsao, R. Y. Chang, and H.-W. Chang, "A Smartphone-Based Multi-Functional Hearing Assistive System to Facilitate Speech Recognition in the Classroom," IEEE Access, volume 5, pages 10339 - 10351, June 2017, This paper has been selected as a Featured Article (http://ieeeaccess.ieee.org/special-sections/featured-articles/smartphone-based-multi-functional-hearing-assistive-system-facilitate-speech-recognition-classroom/) :::icon
77. T.-E. Chen, S.-I Yang, L.-T. Ho, K.-H. Tsai, Y.-H. Chen, Y.-F. Chang, Y.-H. Lai, S.-S. Wang, Y. Tsao*, and C.-C. Wu, "S1 and S2 Heart Sound Recognition using Deep Neural Networks," IEEE Transactions on Biomedical Engineering, volume 64, number 2, pages 372 - 380, February 2017. :::icon
78. H.-y. Lee, B.-H. Tseng, T.-H. Wen, and Y. Tsao, "Personalizing Recurrent Neural Network based Language Model by Social Network," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 25, number 3, pages 519 - 530, December 2016. :::icon
79. T. Guan, G.-x. Chu, Y. Tsao, F. Chen, "Assessing the Perceptual Contributions of Level-dependent Segments to Sentence Intelligibility," Journal of the Acoustical Society of America (JASA), volume 140, number 5, pages 3745-3754, November 2016. :::icon
80. S.-H. Fang, W.-H. Chang, Y. Tsao, H.-C. Shih, and C. Wang, "Channel State Reconstruction Using Multilevel Discrete Wavelet Transform for Improved Fingerprinting-Based Indoor Localization," IEEE Sensors Journal, volume 16, number 21, pages 7784 - 7791, November 2016. :::icon
81. H.-L. S. Wang, I-C. Chen, C.-H. Chiang, Y.-H. Lai, and Y. Tsao, "Auditory Perception, Suprasegmental Speech Processing, and Vocabulary Development in Chinese Preschoolers," Perceptual and Motor Skills, volume 123, number 2, pages 365-382, October 2016. :::icon
82. S.-H. Fang , H.-H. Liao , Y.-X. Fei , K.-H. Chen , J.- W. Huang , Y.-D. Lu and Y. Tsao, "Transportation Modes Classification Using Sensors on Smartphones," Sensors, volume 19;16, number 8, pages 1324, August 2016. :::icon
83. S.-S. Wang, A. Chern, Y. Tsao, J.-w. Hung, X. Lu, Y.-H. Lai, B. Su, "Wavelet Speech Enhancement based on Nonnegative Matrix Factorization," IEEE Signal Processing Letters, volume 23, number 8, pages 1101-1105, August 2016. :::icon
84. P. Lin, S.-W. Fu, S.-S.Wang, Y.-H. Lai, and Y. Tsao, "Maximum Entropy Learning with Deep Belief Networks," Entropy, volume 18, number 7, pages 251, July 2016. :::icon
85. F. Chen, Y. Tsao, and Y.-H. Lai, "Modeling Speech Intelligibility with Recovered Envelope from Temporal Fine Structure Stimulus," Speech Communication, volume 81, pages 120–128, July 2016. :::icon
86. Y. Tsao and Y.-H. Lai, "Generalized Maximum a Posteriori Spectral Amplitude Estimation for Speech Enhancement," Speech Communication, volume 76, pages 112–126, February 2016. :::icon :::icon
87. S.-H. Fang, C.-H. Wang, and Y. Tsao, "Compensating for Orientation Mismatch in Robust WiFi Localization Using Histogram Equalization," IEEE Transactions on Vehicular Technology, volume 64, number 11, pages 5210-5220, November 2015. :::icon
88. Y.-C. Lin, Y.-H. Lai, H.-W. Chang, Y. Tsao, Y.-p. Chang, and R. Y. Chang,, "A Smartphone-Based Remote Microphone Hearing Assistive System Using Wireless Technologies," IEEE Systems Journal, volume PP, pages 1-10, October 2015, Smarthear Demo: https://www.youtube.com/watch?v=e9HqIj09QJs :::icon
89. C.-C. Hsu, K.-M. Cheong, T.-S. Chi, and Y. Tsao, "Robust Voice Activity Detection Algorithm Based on Feature of Frequency Modulation of Harmonics and Its DSP Implementation," IEICE Transactions on Information and Systems, volume E98-D, number 10, pages 1808-1817, October 2015. :::icon
90. Y.-J, Lee, Y.-R. Chien, and Y. Tsao, "Rapid Converging M-max Partial Update Least Mean Square Algorithms with New Variable Step-size Methods," IEICE Transaction on Communications, volume Vol.E98-A, number No.12, pages 2650-2657, August 2015.
91. Y.-H. Lai, Y. Tsao, F. Chen, "Effects of Adaptation Rate and Noise Suppression on the Intelligibility of Compressed-Envelope Based Speech," PLoS ONE, volume 10.1371, pages journal.pone.0133519, July 2015. :::icon
92. Y. Tsao, P. Lin, T.-y. Hu, and X. Lu, "Ensemble Environment Modeling using Affine Transform Group," Speech Communication, volume 68, pages 55–68, April 2015. :::icon
93. Y. Tsao, S.-H. Fang, and Y. Hsiao, "Acoustic Echo Cancellation Using a Vector-Space-Based Adaptive Filtering Algorithm," IEEE Signal Processing Letters, volume 22, pages 351-355, March 2015. :::icon :::icon
94. Y. Tsao, T.-y. Hu, S. Sakti, S. Nakamura, and L.-s. Lee, "Variable Selection Linear Regression for Robust Speech Recognition," IEICE Transactions on Information and Systems, volume E97-D, number 6, pages 1477-1487, June 2014. :::icon
95. Y. Tsao, X. Lu, P. Dixon, T.-y. Hu, S. Matsuda, and C. Hori, "Incorporating Local Information of the Acoustic Environments to MAP-based Feature Compensation and Acoustic Model Adaptation," Computer Speech and Language, volume 28, number 3, pages 709-726, May 2014. :::icon
96. Y. Tsao, S. Matsuda, C. Hori, H. Kashioka, and C.-H. Lee, "A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 22, number 2, pages 403-416, February 2014. :::icon
97. Y.-H. Lai, Y. Tsao, and F. Chen, "A Study of Adaptive WDRC in Hearing Aids under Noisy Conditions,," International Journal of Speech & Language Pathology and Audiology, volume 1, number 2, pages 43-51, December 2013, (invited paper) :::icon
98. Y. Tsao and C.-H. Lee, "An Ensemble Speaker and Speaking Environment Modeling Approach to Robust Speech Recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 17, pages 1025 - 1037, June 2009. :::icon
99. Y. Tsao, S.-M. Lee, and L.-S. Lee, "Segmental Eigenvoice with Delicate Eigenspace for Improved Speaker Adaptation," IEEE Transactions on Speech and Audio Processing, volume 13, pages 399 - 411, April 2005. :::icon
 
 
Conference Papers
 
1. S.-W. Fu, K.-H. Hung, Y. Tsao, and Y.-C. F. Wang, "Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech," to appear in ICLR 2024,. :::icon
2. Y.-T. Liu, K.-C. Wang, K.-C. Liu, S.-Y. Peng, and Y. Tsao, "SDEMG: Score-based Diffusion Model For Surface Electromyographic Signal Denoising," to appear in IEEE ICASSP 2024,. :::icon
3. X. Lu, P. Shen, Y. Tsao, and H. Kawai, "Hierarchical Cross-modality Knowledge Transfer With Sinkhorn Attention For Ctc-based ASR," to appear in IEEE ICASSP 2024,. :::icon
4. H. Wu, H.-C. Kuo, Y. Tsao, H.-y. Lee, "Scalable Ensemble-based Detection Method Against Adversarial Attacks For Speaker Verification," to appear in IEEE ICASSP 2024,. :::icon
5. Y. Tseng, L. Berry, and Y.-T. Chen et al.,, "A Multi-task Evaluation Benchmark For Audio-visual Representation Models," to appear in IEEE ICASSP 2024,. :::icon
6. R. E. Zezario, B.-R. B. Bai, C.-S. Fuh, H.-M. Wang, and Y. Tsao, "Multi-task Pseudo-label Learning For Non-intrusive Speech Quality Assessment Model," to appear in IEEE ICASSP 2024,. :::icon
7. X. Lu, P. Shen, Y. Tsao, and H. Kawa, "Cross-modal alignment with optimal transport for CTC-based ASR," IEEE ASRU 2023, December 2023. :::icon
8. C.-C. Lee, H.-W. Chen, C.-S. Chen, H.-M. Wang, T.-T. Liu, and Y. Tsao, "LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models," IEEE ASRU 2023, December 2023. :::icon
9. H.-T. Chiang, K.-H. Hung, S.-W. Fu, H.-C. Kuo, M.-H. Tsai, and Y. Tsao, "Study on the Correlation between Objective Evaluations and Subjective Speech Quality and Intelligibility," IEEE ASRU 2023, December 2023. :::icon
10. E. Cooper, W.-C. Huang, Y.Tsao, H.-M. Wang, T. Toda, and J. Yamagishi, "The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains," IEEE ASRU 2023, December 2023. :::icon
11. T.-A. Hsieh, C.-H. Huck Y., P.-Y. Chen, S. M. Siniscalchi, Y. Tsao, "Inference and Denoise: Causal Inference-based Neural Speech Enhancement," IEEE MLSP 2023, September 2023. :::icon
12. W.-Y. Ting, S.-S. Wang, Y. Tsao, and B. Su, "IANS: Intelligibility-aware Null-steering Beamforming for Dual-Microphone Arrays," IEEE MLSP 2023, September 2023. :::icon
13. I-C. Chern, S. Chern, H.-C. Kuo, H.-H. Tseng, K.-H. Hung, and Y. Tsao, "Voice Direction-of-Arrival Conversion," IEEE MLSP 2023, September 2023. :::icon
14. H. Yen, P.-J. Ku, C.-H. H. Yang, H. Hu, S. M. Siniscalchi, P.-Y. Chen, and Y. Tsao, "Neural Model Reprogramming with Similarity Based Mapping for Low-Resource Spoken Command Recognition," Interspeech 2023, August 2023. :::icon
15. Y.-L. Chien, H.-H. Chen, M.-C. Yen, S.-W. Tsai, H.-M. Wang, Y. Tsao, T.-S. Chi, "Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion," Interspeech 2023, August 2023. :::icon
16. L.-W. Chen, Y.-F. Cheng, H.-S. Lee, Y. Tsao, and H.-M. Wang, "A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech," Interspeech 2023, August 2023. :::icon
17. H.-H. Chen, Y.-L. Chien, M.-C. Yen, S.-W. Tsai, T.-S. Chi, Y. Tsao, and H.-M. Wang, "Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features," Interspeech 2023, August 2023. :::icon
18. E.-P. Chu, K.-C. Liu, C.-Y. Hsieh, C.-Y. Chang, Y. Tsao, and C.-T. Chan, "Multi-Task Learning U-Net for Functional Shoulder Sub-Task Segmentation," IEEE EMBC 2023, July 2023. :::icon
19. C.-P. Liu, J.-H. Li, E.-P. Chu, C.-Y. Hsieh, K.-C. Liu, C.-T. Chan, and Y. Tsao:, "Deep Learning-based Fall Detection Algorithm Using Ensemble Model of Coarse-fine CNN and GRU Networks," IEEE MeMeA 2023, June 2023. :::icon
20. J. Kirton-Wingate, S. Ahmed, M. Gogate, Y. Tsao, A. Hussain, "Towards Individualised Speech Enhancement: An SNR Preference learning System For Multi-modal Hearing Aids," IEEE ICASSP 2023 (AMHAT 2023 Workshop), June 2023. :::icon
21. H.-Y. Lin, H.-H. Tseng, and Y. Tsao, "On the Robustness of Non-intrusive Speech Quality Model by Adversarial Examples," IEEE ICASSP 2023, June 2023. :::icon
22. I-C. Chern, K.-H. Hung, Y.-T. Chen, T. Hussain, M. Gogate, A. Hussain, Y. Tsao, and J.-C. Hou, "Audio-visual Speech Enhancement And Separation By Utilizing Multi-modal Self-supervised Embeddings," IEEE ICASSP 2023 (AMHAT 2023 Workshop), June 2023. :::icon
23. K.-C. Wang, K.-C. Liu, S.-Y. Peng, Y. Tsao, "ECG Artifact Removal from Single-Channel Surface EMG Using Fully Convolutional Networks," IEEE ICASSP 2023, June 2023. :::icon
24. T.-H. Chi, K.-C. Liu, C.-Y. Hsieh, Y. Tsao, and C.-T. Chan, "Pre-Impact Fall Detection via CNN-ViT Knowledge Distillation," IEEE ICASSP 2023, June 2023. :::icon
25. C.-J. Hsu, H.-L. Chung, H.-y. Lee, amd Y. Tsao, "T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5," IEEE ICASSP 2023, June 2023. :::icon
26. C.-C. Lee, Y. Tsao, H.-M. Wang, C.-S. Chen, "D4AM: A General Denoising Framework for Downstream Acoustic Models," ICLR 2023, May 2023. :::icon
27. H.-H. Tseng, H.-Y. Lin, H.-K. Hsuan, and Y. Tsao, "Interpretations of Domain Adaptations via Layer Variational Analysis," ICLR 2023, May 2023. :::icon
28. C.-H. Chen, K.-C. Liu, T.-Y. Lu, C.-Y. Chang, C.-T. Chan, and Y. Tsao, "Wearable-based Pain Assessment in Patients with Adhesive Capsulitis Using Machine Learning," IEEE NER 2023, April 2023.
29. Y.-J. Lu et al.,, "ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding," Interspeech 2022, September 2022.
30. Y.-W. Chen and Y. Tsao, "InQSS: a speech intelligibility and quality assessment model using a multi-task learning network," Interspeech 2022, September 2022. :::icon
31. K.-H. Hung, S.-W. Fu, H.-H. Tseng, H.-T. Chiang, Y. Tsao, C.-W. Lin, "Boosting Self-Supervised Embeddings for Speech Enhancement," Interspeech 2022, September 2022. :::icon
32. F.-L. Wang, H.-S. Lee, Y. Tsao and H.-M. Wang, "Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks Authors," Interspeech 2022, September 2022. :::icon
33. C.-C. Lee, C.-H. Hu, Y.-C. Lin, C.-S. Chen, H.-M. Wang and Y. Tsao, "NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling," Interspeech 2022, September 2022. :::icon
34. C.-J. Peng, Y.-J. Chan, Y.-L.Shen, C. Yu, Y. Tsao and T.-S. Chi, "Perceptual Characteristics Based Multi-objective Model for Speech Enhancement," Interspeech 2022, September 2022. :::icon
35. R. Chao, C. Yu, S.-W. Fu, X. Lu and Y. Tsao, "Perceptual Contrast Stretching on Target Feature for Speech Enhancement," Interspeech 2022, September 2022. :::icon
36. W.-C. Huang, E. C., Y. Tsao, H.-M. Wang, T. Toda and J. Yamagishi, "The VoiceMOS Challenge 2022," Interspeech 2022, September 2022. :::icon
37. C. Yu, S.-W. Fu, T.-An Hsieh, Y. Tsao and M. Ravanelli, "OSSEM: one-shot speaker adaptive speech enhancement using meta learning," Interspeech 2022, September 2022. :::icon
38. R. E. Zezario, S.-W. Fu, F. Chen, C.-S. Fuh, H.-M. Wang and Y. Tsao, "MTI-Net: A Multi-Target Speech Intelligibility Prediction Model," Interspeech 2022, pages 5463-5467, September 2022. :::icon
39. R. E. Zezario, F. Chen, C.-S. Fuh, H.-M. Wang and Y. Tsao, "MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids," Interspeech 2022, pages 3944-3948, September 2022, 1st Place, Machine Learning Challenges for Hearing Aids Challenge; 1st Place, The Hearing Industry Research Consortium Student Prize :::icon
40. T. Hussain, M. Diyan, M. Gogate, K. Dashtipour, A. Adeel, Y. Tsao, A. Hussain, "A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning," IEEE EMBC 2022, July 2022. :::icon
41. S.–S. Wang, Y. Tsao, W.–Z. Zheng, H.–W. Yeh, P.–C. Li, S.–H. Fang, Y.–H. Lai, "Dysarthric Speech Enhancement Based on Convolution Neural Network," IEEE EMBC 2022, July 2022. :::icon
42. C.-H. H. Yang, J. Qi, S. Y.-C. Chen, Y. Tsao, P.-Y. Chen, "When Bert Meets Quantum Temporal Convolution Learning for Text Classification In Heterogeneous Computing," ICASSP 2022, May 2022. :::icon
43. G.-T. Lin, C.-J. Hsu, D.-R. Liu, H.-Y. Lee, and Y. Tsao, "Analyzing The Robustness Of Unsupervised Speech Recognition," ICASSP 2022, May 2022. :::icon
44. C.-J. Hsu, H.-y. Lee, Y. Tsao, "XDBERT: Distilling Visual Information to BERT via Cross-Modal Encoders to Improve Language Understanding," ACL 2022, May 2022, (Short Paper)
45. Y.-J. Lu, Z.-Q. Wang, S. Watanabe, A. Richard, C. Yu, and Y. Tsao, "Conditional Diffusion Probabilistic Model For Speech Enhancement," ICASSP 2022, May 2022. :::icon
46. Y.-C. Lin,T.-A. Hsieh, K.-H. Hung, C. Yu, H. Garudadri, Y. Tsao, and T.-W. Kuo, "Speech Recovery For Real-world Self-powered Intermittent Devices," ICASSP 2022, May 2022. :::icon
47. K.-C. Wang, K.-C. Liu, H.-M. Wang, and Y. Tsao, "EMGSE: Acoustic/emg Fusion For Multimodal Speech Enhancement," ICASSP 2022, May 2022. :::icon
48. S.-W. Fu, C. Yu, K.-H. Hung, M. Ravanelli, and Y. Tsao, "MetricGAN-U: Unsupervised Speech Enhancement/ Dereverberation based Only On Noisy/ Reverberated Speech," ICASSP 2022, May 2022. :::icon
49. H. Wu, H.-C. Kuo, N. Zheng, K.-H. Hung, H.-Y. Lee, Y. Tsao, H.-M. Wang, H. Meng, "Partially Fake Audio Detection by Self-attention-based Fake Span Discovery," ICASSP 2022, May 2022. :::icon
50. H.-Y. Lin, H.-H. Tseng, X. Lu, Yu Tsao, "Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport," NeurIPS 2021, December 2021. :::icon
51. Y.-J. Li, S.-S. Wang, Y. Tsao, and B. Su, "MIMO Speech Compression and Enhancement Based on Convolutional Denoising Autoencoder," APSIPA ASC 2021, December 2021. :::icon
52. X. Chang, T. Maekaku, P. Guo, J. Shi,Y.-J. Lu, A. S. Subramanian, T. Wang, S.-w. Yang, Y. Tsao, H.-y. Lee, S. Watanabe, "An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition," ASRU 2021, December 2021. :::icon
53. Z. Feng, Yu Tsao, and F. Chen, "Estimation and Correction of Relative Transfer Function for Binaural Speech Separation Networks to Preserve Spatial Cues," APSIPA ASC 2021, December 2021. :::icon
54. Y.-J. Lu, Y. Tsao, and S. Watanabe, "A Study on Speech Enhancement Based on Diffusion Probabilistic Model," APSIPA ASC 2021, December 2021. :::icon
55. M.-C. Yen, W.-C. Huang, K. Kobayashi, Y.-H. Peng, S.-W. Tsai, Y. Tsao, T. Toda, J.-S. Jang, and H.-M. Wang, "Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Model-ing," ASRU 2021, December 2021. :::icon
56. H.-T. Chiang, Y.-C. Wu, C. Yu, T. Toda, H.-M. Wang, Y.-C. Hu, and Y. Tsao, "HASA-NET: A Non-Intrusive Hearing-Aid Speech Assessment Network," ASRU 2021, December 2021. :::icon
57. X. Lu, P. Shen, Y. Tsao, and H. Kawai, "Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification," APSIPA ASC 2021, December 2021. :::icon
58. Y.-S. Liou, W.-C. Huang, M.-C. Yen, S.-W. Tsai, Y.-H. Peng, T. Toda, Y. Tsao, and H.-M. Wang,, "Time Alignment Using Lip Images for Frame-Based Electrolaryngeal Voice Conversion," APSIPA ASC 2021, December 2021. :::icon
59. M. E Noor, Y.-J. Lu, S.-Si. Wang, S. Ghose, C.-Y. Chang, R. E. Zezario, S. Ahmed, W.-H. Chung, Y. Tsao and H.-M. Wang, "Investigation of A Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-To-End Bengali Automatic Speech Recogni-tion Under Unseen Noisy Conditions," Oriental COCOSDA 2021, November 2021. :::icon
60. T.-A. Hsieh, C. Yu, S.-W. Fu, X. Lu, and Y. Tsao, "Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement," Interspeech 2021, September 2021. :::icon
61. Y,-C. Wu, C.-H. Hu, H.-S. Lee, Y.-H. Peng, W.-C. Huang, Y. Tsao, H.-M. Wang and T. Toda, "Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder," Interspeech 2021, September 2021. :::icon
62. G.-X. Lin, S.-W. Hu, Y.-J. Lu, Y. Tsao, and C.-S. Lu, "QISTA-Net-Audio: Audio Super-resolution via Non-Convex Lq-normMinimization," Interspeech 2021, September 2021.
63. S.-W. Fu, C. Yu, T.-A. Hsieh, P. Plantinga, M. Ravanelli, X. Lu, Y. Tsao, "MetricGAN +: An Improved Version of MetricGAN for Speech Enhancement," Interspeech 2021, September 2021. :::icon
64. W.-C. Huang, K. Kobayashi, Y.-H. Peng, C.-F. Liu, Y. Tsao, H.-M. Wang, T. Toda, "A Preliminary Study of a Two-Stage Paradigm for Preserving SpeakerIdentity in Dysarthric Voice Conversion," Interspeech 2021, September 2021. :::icon
65. R. E Zezario, C.-S. Fuh, H.-M. Wang, Y. Tsao, "Speech Enhancement with Zero-Shot Model Selection," EUSIPCO 2021, August 2021. :::icon
66. Y.-W. Chen, K.-H. Hung, S.-Y. Chuang, J. Sherman, X. Lu, Y. Tsao, "A Study of Incorporating Articulatory Movement Information in Speech Enhancement," EUSIPCO 2021, August 2021. :::icon
67. T.-Y. Lu, K.-C. Liu, C.-Y. Hsieh, C.-Y. Chang, Y. Tsao, C.-T. Chan, "Instrumented Shoulder Functional Assessment using Inertial Measurement Units for Frozen Shoulder," IEEE BHI 2021, pages 1-4, July 2021. :::icon
68. Y.-K. Wu, K.-P. Huang, Y. Tsao, H.-y. Lee, "One shot learning for speech separation," ICASSP 2021, June 2021. :::icon
69. X. Lu, P. Shen, Y. Tsao, H. Kawai, "Unsupervised neural adaptation model based on optimal transport for spoken language identification," ICASSP 2021, June 2021. :::icon
70. Y.-W. Chen, K.-H. Hung, S.-Y. Chuang, J. Sherman, W.-C. Huang, X. Lu, Y. Tsao, "EMA2S: An End-to-End Multimodal Articulatory-to-Speech System," ISCAS 2021, May 2021. :::icon
71. C.-J. Peng, Y.-J. Chan, C. Yu, S.-S. Wang, Y. Tsao, T.-S. Chi, "Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario," ISCAS 2021, May 2021. :::icon
72. Y.-T. Chang, Y.-H. Yang, Y.-H. Peng, S.-S. Wang, T.-S. Chi, Y. Tsao and H.-M. Wang, "MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation Acceleration," ISCSLP 2021, January 2021. :::icon
73. S.-W. Fu et al., "Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing," APSIPA 2020, December 2020. :::icon
74. R. E. Zezario, S.-W. Fu, C.-S. Fuh, Y. Tsao, and H.-M. Wang, "STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model," APSIPA 2020, December 2020. :::icon
75. S.-Y. Chuang, Y. Tsao, C.-C. Lo, H.-M. Wang, "Lite Audio-Visual Speech Enhancement," Interspeech 2020, October 2020. :::icon
76. C.-Y. Chen, W.-Z. Zheng, S.-S. Wang, Y. Tsao, P.-C. Li and Y.-H. Lai, "Enhancing Intelligibility of Dysarthric Speech Using Gated Convolutional-based Voice Conversion System," Interspeech 2020, October 2020. :::icon
77. Y.-J. Lu, C.-F. Liao, X. Lu, J.-w. Hung, Y. Tsao, "Incorporating Broad Phonetic Information for Speech Enhancement," Interspeech 2020, October 2020. :::icon
78. H. Li, S.-W. Fu, Y. Tsao, J. Yamagishi, "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning," Interspeech 2020, October 2020. :::icon
79. C.-C. Lee, Y.-C. Lin, H.-T. Lin, H.-M. Wang, Y. Tsao, "SERIL: Noise Adaptive Speech Enhancement using Regularization-based Incremental Learning," Interspeech 2020, October 2020. :::icon
80. R. E. Zezario, T. Hussain, X. Lu, H.-M. Wang, and Y. Tsao, "Self-supervised Denoising Autoencoder with Linear Regression Decoder for Speech Enhancement," ICASSP 2020, May 2020. :::icon
81. T. Hussaink, Y. Tsao, H.-M. Wang, J.-C. Wang, S. M. Siniscalchi, and W.-H. Liao, "Compressed Multimodal Hierarchical Extreme Learning Machine for Speech Enhancement," APSIPA 2019, November 2019. :::icon
82. W.-C. Lin, Y. Tsao, F. Chen, and H.-M. Wang, "Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature Enhancement," APSIPA 2019, pages 1179-1184, November 2019. :::icon
83. F. Ye, Y. Tsao, and F. Chen, "Subjective Feedback-based Neural Network Pruning for Speech Enhancement," APSIPA 2019, November 2019. :::icon
84. K.-Y. Liu, S.-S. Wang, Y. Tsao, J.-w. Hung, "Speech Enhancement Based on the Integration of Fully Convolutional Network, Temporal Lowpass Filtering and Spectrogram Masking," ROCLING 2019, October 2019. :::icon
85. Y.-C. Lin, Y.-T. Hsu, S.-W. Fu, Y. Tsao, and T.-W. Kuo, "IA-NET: Acceleration and Compression of Speech Enhancement using Integer-adder Deep Neural Network," Interspeech 2019, September 2019. :::icon
86. F.-K. Chuang, S.-S. Wang, J.-w. Hung, Y. Tsao, and S.-H. Fang, "Speaker-aware Deep Denoising Autoencoder with Embedded Speaker Identity for Speech Enhancement," Interspeech 2019, September 2019. :::icon
87. T. Hussain, Y. Tsao, H.-M. Wang, J.-C. Wang, S. M. Siniscalchi, W.-H. Liao, "Audio-Visual Speech Enhancement Using Hierarchical Extreme Learning Machine," EUSIPCO 2019, September 2019. :::icon
88. X. Lu, P. Shen, S. Li, Y. Tsao, and H. Kawai, "Class-wise Centroid Distance Metric Learning for Acoustic Event Detection," Interspeech 2019, September 2019. :::icon
89. C.-F. Liao, Y. Tsao, H.-y. Lee and H.-M. Wang, "Noise Adaptive Speech Enhancement using Domain Adversarial Training," Interspeech 2019, September 2019, (with ISCA Travel Grant) :::icon
90. C.-F. Liao, Y. Tsao, X. Lu and H. Kawai, "Incorporating Symbolic Sequential Modeling for Speech Enhancement," Interspeech 2019, September 2019, (with ISCA Travel Grant) :::icon
91. R. E. Zezario, S.-W. Fu, X. Lu, H.-M. Wang, and Y. Tsao, "Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric," Interspeech 2019, September 2019. :::icon
92. W.-C. Huang, Y.-C. Wu, C.-C. Lo, P. L. Tobing, T. Hayashi, K. Kobayashi, T. Toda, Y. Tsao and H.-M. Wang, "Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion," Interspeech 2019, September 2019, (with ISCA Travel Grant) :::icon
93. W.-C. Huang et al.,, "Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion," ISCA SSW 10, September 2019. :::icon
94. C.-C. Lo, S.-w. Fu, W. C. Huang, X. Wang, J. Yamagishi, Y. Tsao and H.-M. Wang, "MOSNet: Deep Learning based Objective Assessment for Voice Conversion," Interspeech 2019, September 2019. :::icon
95. P.-T. Huang, H.-S. Lee, S.-S. Wang, K.-Y. Chen, Y. Tsao and H.-M. Wang, "Exploring the Encoder Layers of Discriminative Autoencoders for LVCSR," Interspeech 2019, September 2019, (with ISCA Travel Grant) :::icon
96. W.-C. Huang, Y.-C. Wu, H.-T. Hwang, P. L. Tobing, T. Hayashiy, K. Kobayashi, T. Toda, Y. Tsao, H.-M. Wang, "Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion," EUSIPCO 2019, September 2019. :::icon
97. L.-W. Chen, H.-Y. Lee, and Y. Tsao, "Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech," Interspeech 2019, September 2019. :::icon
98. S.-W. Fu, C.-F. Liao, Y. Tsao, S.-D. Lin, "MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement," ICML 2019, June 2019, Long Oral with ICML (top 3%) Travel Grant; Codes: https://github.com/JasonSWFu/MetricGAN :::icon
99. Y.-L. Shen, C.-Y. Huang, S.-S. Wang, Y. Tsao, H.-M. Wang, and T.-S. Chi, "Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition," ICASSP 2019, May 2019. :::icon
100. K.-Y. Liu, S.-k. Lee, S.-S. Wang, Y. Tsao, J.-w. Hung, "Reducing noise and reverberation in speech signals via the integration of denoising autoencoder and temporal lowpass filtering," ICASI 2019, April 2019. :::icon
101. T. Hussain, Y. Tsao, S. M. Sinicalchi, J.-C. Wang, H.-M. Wang, and W.-H. Liao, "Bone-conducted Speech Enhancement using Hierarchical Extreme Learning Machine," IWSDS 2019, April 2019. :::icon
102. Shang-Chih Lin*, Yu Tsao, Shun-Feng Su, Yennun Huang, and Zi-Qing Zhong, "An Abnormal Detection Strategy of Rotating Electric Machine based on Frequency Distribution," The 39th Symposium on Electrical Power Engineering, December 2018.
103. R. E. Zezario, J.-W. Huang, X. Lu, Y. Tsao, H.-T. Hwang, H.-M. Wang, "Deep Denoising Autoencoder Based Post Filtering for Speech Enhancement," APSIPA 2018, December 2018. :::icon
104. Y.-T. Hsu, Y.-C. Lin, S.-W. Fu, Y. Tsao, T.-W. Kuo, "A study on speech enhancement using exponent-only floating point quantized neural network (EOFP-QNN)," SLT 2018, November 2018. :::icon
105. Shang-Chih Lin*, Yu Tsao, Shun-Feng Su, and Yennun Huang, "An Industrial IoT Analysis System Based on Machining Data of Metal Materials," International Conference on Fuzzy Theory and Its Applications, November 2018.
106. S.-k. Lee, S.-S. Wang, Y. Tsao, J.-w. Hung, "Speech Enhancement based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via Discrete Wavelet Transform," ISCSLP 2018, November 2018. :::icon
107. Y.-T. Hsu, Z. Zhu, C.-T. Wang, S.-H. Fang, F. Rudzicz, and Y. Tsao, "Robustness against the channel effect in pathological voice detection," NeurIPS 2018, Machine Learning for Health (ML4H) Workshop, November 2018. :::icon
108. Shang-Chih Lin*, Chuan-Hsiang Su, Yu Tsao, Shun-Feng Su, Hong-Yuan Mark Liao, and Yennun Huang, "FIS-based Domestic Milling Machine PHM System Considering Multi-speed Frequency Variation," IEEE International Conference on Advanced Manufacturing, November 2018, (Best Paper Award) (獲推薦轉投SCI期刊, 擴充研究修改中)
109. W.-C. Huang, H.-T. Hwang, Y.-H. Peng, Y. Tsao, H.-M. Wang, "Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders," ISCSLP 2018, November 2018, Best Student Paper Award :::icon
110. Hung-Chung Li, Shang-Chih Lin, Yu Tsao, Shun-Feng Su, Pei-Li Sun and Yennun Huang, "A Supervised Learning Algorithm Considering Light Conditions for Visual Inspection of Metal Objects," The 54th Annual Conference of Chinese Society for Quality 2018 International Symposium of Quality Management, November 2018, (Makalot Industry-Academic Collaboration Award) (獲推薦轉投EI期刊, 擴充研究修改中)
111. Y.-Y. Kao, H.-P. Hsu, C.-F. Liao, Y. Tsao, H.-C. Yang, J.-L. Li, C.-C. Lee, H.-S. Lee, and H.-M. Wang, "Automatic Detection of Speech Under Cold Using Discriminative Autoencoders and Strength Modeling with Multiple Sub-Dictionary Generation," IEEE IWAENC, September 2018. :::icon
112. X. Lu, P. Shen, S. Li, Y. Tsao, H. Kawai, "Temporal Attentive Pooling for Acoustic Event Detection," Interspeech 2018, September 2018. :::icon
113. S.-W. Fu, Y. Tsao, H.-T. Hwang, H.-M. Wang, "Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM," Interspeech 2018, September 2018. :::icon
114. Y.-H. Peng, H.-T. Hwang, Y.-C. Wu, Y. Tsao, H.-M. Wang, "Exemplar-Based Spectral Detail Compensation for Voice Conversion," Interspeech 2018, September 2018. :::icon
115. B.-S. Yu, Y. Tsao, S.-W. Yang, Y.-K. Chen, and S.Y. Chien, "Architecture Design of Convolutional Neural Networks for Face Detection on an FPGA Platform," IEEE SiPS 2018, September 2018.
116. Y.-H. Lai, W.-Z. Zheng, S.-T. Tang, S.-H. Fang, W.-H. Liao, and Y. Tsao, "Improving the Performance of Hearing Aids in Noisy Environments based on Deep Learning Technology," EMBC 2018, April 2018. :::icon
117. N. Ryant et al., "Enhancement and Analysis of Conversational Speech: JSALT 2017," ICASSP, April 2018. :::icon
118. W.-J. Lee, S.-S. Wang, F. Chen, X. Lu, S.-Y. Chien, and Y. Tsao,, "Speech Dereverberation Based on Integrated Deep and Ensemble Learning Algorithm," ICASSP, April 2018. :::icon
119. L. Sun, J. Du, T. Gao, Y.-D. Lu, Y. Tsao, C.-H. Lee, N. Ryant, "A Novel LSTM-based Speech Preprocessor For Speaker Diarization in Realistic Mismatch Conditions," ICASSP, April 2018. :::icon
120. S.-W. Fu, Y. Tsao, X. Lu, and H. Kawai, "Raw Waveform-based Speech Enhancement by Fully Convolutional Networks," APSIPA 2017, November 2017. :::icon
121. Y.-H. Peng, C.-C. Hsu, Y.-C. Wu, H.-T. Hwang, Y.-W. Liu, Y. Tsao, and H.-M. Wang, "Fast Locally Linear Embedding Algorithm for Exemplar-based Voice Conversion," APSIPA 2017, November 2017, (Poster Presentation Award) :::icon
122. S.-S. Wang, Y. Tsao, H.-L. S. Wang, Y.-H. Lai*, and L. P.-H. Li, "A Deep Learning based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients in the Presence of Competing Speech Noise," APSIPA 2017, November 2017. :::icon
123. T.-H. Lin, Y.-H. Wang, S.-S. Lu, H.-W. Yen, and Y. Tsao, "Computing Biodiversity Change via a Soundscape Monitoring Network," PNC 2017 Annual Conference and Joint Meetings, November 2017. :::icon
124. S.-W. Fu, T.-y. Hu, Y. Tsao, X. Lu, "Complex Spectrogram Enhancement by Convolutional Neural Network with Multi-metrics Learning," IEEE MLSP 2017, September 2017. :::icon
125. T.-H. Lin and Y. Tsao, "Deblending of Simultaneous-source Seismic Data via Periodicity-coded Nonnegative Matrix Factorization," IEEE Dataport, September 2017. :::icon
126. M.-H. Yang, H.-S. Lee, Y.-D. Lu, K.-Y. Chen, Y. Tsao, B. Chen, and H.-M. Wang, "Discriminative Autoencoders for Acoustic Modeling," Interspeech2017, August 2017. :::icon
127. C.-L. Wu, H.-P. Hsu, S.-S. Wang, J.-W. Hung, Y.-H. Lai, H.-M. Wang, and Y. Tsao, "Wavelet Speech Enhancement Based on Robust Principal Component Analysis," Interspeech2017, August 2017. :::icon
128. C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao, and H.-M. Wang, "Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks," Interspeech2017, August 2017. :::icon
129. Y.-C. Wu, H.-T. Hwang, S.-S. Wang, C.-C. Hsu, Y. Tsao, and H.-M. Wang, "A Post-filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech Enhancement," Interspeech2017, August 2017. :::icon
130. S.-T. Lin, Y.-H. Liao, Y. Tsao, and S.-Y. Chien,, "Object-based on-line video summarization for internet of video things," EEE ISCAS, May 2017. :::icon
131. H.-S. Lee, Y.-D. Lu, C.-C. Hsu, Y. Tsao, H.-M. Wang, and S.-K. Jeng, "Discriminative Autoencoders for Speaker Verification," IEEE ICASSP, March 2017. :::icon
132. Y.-C. Wu, H.-T. Hwang, S.-S. Wang, C.-C. Hsu, Y.-H. Lai, Y. Tsao, and H.-M. Wang, "A Locally Linear Embbeding Based Postfiltering Approach for Speech Enhancement," IEEE ICASSP, March 2017. :::icon
133. C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao and H.-M. Wang, "Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder," APSIPA ASC, December 2016. :::icon
134. J.-C. Hou, S.-S. Wang, Y.-H. Lai, J.-C. Lin, Y. Tsao, H.-W. Chang, and H.-M. Wang, "Audio-Visual Speech Enhancement using Deep Neural Networks," APSIPA 2016, December 2016. :::icon
135. C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao, and H.-M. Wang, "Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network," ISCSLP, November 2016. :::icon
136. Y.-Y. Hsieh, C.-D. Wu, Y. Tsao, and S.-S. Lu, "A Linear Regression Model with Dynamic Pulse Transit Time Features for Noninvasive Blood Pressure Prediction," BioCAS, October 2016. :::icon
137. Y.-H. Lai, S.-S. Wang, Y.-T. Su, H.-C. Cheng, F. K. Fu, and Y. Tsao, "Improving the Performance of Speech Perception in Noisy Environment based on a FAME Strategy," ISCSLP 2016, October 2016. :::icon
138. C.-Y. Hsu, R. E. Zezario, J.-C. Wang, X. Lu, and Y. Tsao, "Incorporating Local Environment Information with Ensemble Neural Networks to Robust Automatic Speech Recognition," ISCSLP 2016, October 2016. :::icon
139. H.-S. Lee, Y. Tsao, C.-C. Lee, H.-M. Wang, W.-C. Lin, W.-C. Chen, S.-W. Hsiao, S.-K. Jeng, "Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation," Interspeech, September 2016. :::icon
140. Y.-C. Wu, H.-T. Hwang, C.-C. Hsu, Y. Tsao, H.-M. Wang, "Locally Linear Embedding for Exemplar-Based Spectral Conversion," Interspeech, September 2016. :::icon
141. X. Lu, P. Shen, Y. Tsao, H. Kawai, "Pair-wise Distance Metric Learning of Neural Network Model for Spoken Language Identification," Interspeech, September 2016. :::icon
142. S.-W. Fu, Y. Tsao, X. Lu, "SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement," Interspeech, September 2016. :::icon
143. Y.-H. Lai, C.-H. Wang, S.-Y. Hou, B.-Y. Chen, Y. Tsao, and Y.-W. Liu, "DCASE Report for Task 3: Sound Event Detection in Real Life Audio," DCASE 2016 workshop, September 2016. :::icon
144. C.-W. Wu, M.-T. Zhong, Y. Tsao, S.-W. Yang, Y.-K. Chen, and S.-Y. Chien, "Track-clustering Error Evaluation for Track-based Multi-camera Tracking System Employing Human Re-identification," CVPR workshop, August 2016, Codes: https://github.com/cw1204772/ClustTMCT :::icon
145. Y.-T. Liu, Y. Tsao, R. Y. Chang:, "Nonnegative Matrix Factorization-based Frequency Lowering Technology for Mandarin-speaking Hearing Aid Users," IEEE ICASSP2016, pages 5905-5909, May 2016. :::icon
146. Jeremy Chiaming Yang, Syu-Siang Wang, Yu Tsao, and Jeih-weih Hung, "Speech Enhancement via Ensemble Modeling NMF Adaptation," IEEE ICCE-Taiwan 2016, May 2016. :::icon
147. Syu-Siang Wang, Jeremy Chiaming Yang, Yu Tsao, and Jeih-weih Hung, "Leveraging Nonnegative Matrix Factorization in Processing the Temporal Modulation Spectrum for Speech Enhancement," IEEE ICCE-Taiwan 2016, May 2016. :::icon
148. S.-S. Wang and Y. Tsao, "Temporal Modulation Spectral Restoration for Robust Speech Recognition," IEEE International Conference on Multimedia Big Data, April 2016. :::icon
149. Ying-Hui Lai, Chien-Hsun Chen, Shih-Tsang Tang, Zong-Mu Yeh, and Yu Tsao, "Improving the Performance of Noise Reduction in Hearing Aids Based on the Genetic Algorithm," IFMBE Proceedings 57, March 2016.
150. P. Lin, D.-C. Lyu, Y.-F. Chang, and Y. Tsao, "Temporal Alignment for Deep Neural Networks," GlobalSIP 2015, December 2015. :::icon
151. H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen, "A Probabilistic Interpretation for Artificial Neural Network-based Voice Conversion," APSIPA 2015, December 2015. :::icon
152. S.-S. Wang, H.-T. Hwang, Y.-H. Lai, Y. Tsao, X. Lu, H.-M. Wang, and B. Su, "Improving Denoising Auto-encoder Based Speech Enhancement With the Speech Parameter Generation Algorithm," APSIPA 2015, December 2015. :::icon
153. Y.-T. Liu, R. Y. Chang, Y. Tsao, and Y.-p. Chang, "A New Frequency Lowering Technique for Mandarin-Speaking Hearing Aid Users," GlobalSIP 2015, December 2015. :::icon
154. X. Lu, P. Shen, Y. Tsao, C. Hori, H. Kawai, "Sparse Representation with Temporal Max-Smoothing for Acoustic Event Detection," Interspeech 2015, ISCA, editor, pages 1176-1180, September 2015. :::icon
155. P. Lin, D.-C. Lyu, Y.-F. Chang, and Y. Tsao, "Speech Recognition with Temporal Neural Networks," Interspeech 2015, ISCA, editor, pages 21–25, September 2015. :::icon
156. P. Lin, S.-S. Wang, and Y. Tsao, "Temporal Information in Tone Recognition," IEEE ICCE 2015, June 2015. :::icon
157. W.-C. Chen, P.-T. Lai, Y. Tsao, and C.-C. Lee, "Multimodal Arousal Rating using Unsupervised Fusion Technique," ICASSP 2015, April 2015. :::icon
158. Y.-H. Lai, S.-S. Wang, P.-C. Li, and Yu Tsao, "A Discriminative Post-filter for Speech Enhancement in Hearing Aids," ICASSP 2015, April 2015. :::icon
159. Y.-H. Lai, F. Chen, and Y. Tsao, "Effect of Adaptive Envelope Compression in Simulated Electric Hearing in Reverberation," ISIC 2014, December 2014. :::icon
160. Y.-F. Chang, P. Lin, S.-H. Cheng, K.-H. Chan, Y.-C. Zeng, C.-W. Liao, W.-T. Chang, Y.-C. Wang, Y. Tsao, "Robust Anchorperson Detection Based on Audio Streams using a Hybrid I-vector and DNN System," APSIPA 2014, December 2014. :::icon
161. H. Jing, A.-C. Liang, S.-D. Lin, and Y. Tsao, "A Transfer Probabilistic Collective Factorization Model to Handle Sparse Data in Collaborative Filtering," ICDM 2014, December 2014, accepted as a regular paper (acceptance rate=9.5%) :::icon
162. X. Lu, Y. Tsao, S. Matsuda, and C. Hori, "Ensemble Modeling of Denoising Autoencoder for Speech Spectrum Restoration," Interspeech 2014, September 2014. :::icon
163. P. Lin, F. Chen, S.-S. Wang, Y. Tsao and Y. H. Lai, "Automatic Speech Recognition with Primarily Temporal Envelope Information," Interspeech 2014, September 2014. :::icon
164. X. Lu, Y. Tsao, P. Shen, and C. Hori, "Spectral Patch Based Sparse Coding for Acoustic Event Detection," ISCSLP 2014, September 2014. :::icon
165. H.-S. Lee, Y. Tsao, H.-M. Wang and S.-K. Jen, "Clustering-Based I-Vector Formulation for Speaker Recognition," Interspeech 2014, September 2014. :::icon
166. S.-S. Wang, P. Lin, D.-C. Lyu, Y. Tsao, H.-T. Hwang, B. Su and H.-M. Wang, "Acoustic Feature Conversion using a Polynomial based Feature Transferring Algorithm," ISCSLP 2014, September 2014. :::icon
167. Y. H. Lai, F. Chen, and Y. Tsao, "An Adaptive Envelope Compression Strategy for Speech Processing in Cochlear Implants," Interspeech 2014, September 2014. :::icon
168. H. Jing, T.-Y. Hu, H.-S. Lee, W.-C. Chen, C.-C. Lee, Y. Tsao and H.-M. Wang, "Ensemble of Machine Learning Algorithms for Cognitive and Physical Speaker Load Detection," Interspeech 2014, September 2014. :::icon
169. H.-t. Fan, J.-w. Hung, X. Lu, S.-S. Wang, Yu Tsao, "Speech Enhancement using Segmental Nonnegative Matrix Factorization," ICASSP 2014, May 2014. :::icon
170. H.-S. Lee, Y. Tsao, Y.-F. Chang, H.-M. Wang, and S.-K. Jeng, "Speaker Verification Using Kernel-Based Binary Classifiers with Binary Operation Derived Features," ICASSP 2014, May 2014. :::icon
171. X. Lu, Yu Tsao, S. Matsuda, and C. Hori, "Sparse Representation Based on a Bag of Spectral Exemplars for Acoustic Event Detection," ICASSP 2014, May 2014. :::icon
172. H. Jing, Y. Tsao, K.-Y. Chen and H.-M. Wang, "Semantic Naïve Bayes Classifier for Document Classification," IJCNLP, December 2013. :::icon
173. H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, S.-H. Chen, "Incorporating Global Variance in the Training Phase of GMM-based Voice Conversion," APSIPA 2013, October 2013. :::icon
174. C.-H. Wang, T.-W. Kao, S.-H. Fang, Y. Tsao, L.-C. Kuo, S.-W. Kao, and N.-C. Lin, "Robust Wi-Fi Location Fingerprinting Against Device Diversity based on Spatial Mean Normalization," APSIPA 2013, October 2013. :::icon
175. Hung-yi Lee, Ting-yao Hu, How Jing, Yun-Fan Chang, Yu Tsao, Yu-Cheng Kao and Tsang-Long Pao, "Ensemble of Machine Learning and Acoustic Segment Model Techniques for Speech Emotion and Autism Spectrum Disorders Recognition," Interspeech 2013, August 2013, (Second Place In the Autism Sub-Challenge) :::icon
176. Tsung-Hsien Wen, Aaron Heidel, Hung-yi Lee, Yu Tsao and Lin-Shan Lee, "Recurrent Neural Network Based Language Model Personalization by Social Network Crowdsourcing," Interspeech 2013, August 2013, (Best Student Paper Award Nomination) :::icon
177. Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang and Sin-Horng Chen, "Alleviating the Over-Smoothing Problem in GMM-Based Voice Conversion with Discriminative Training," Interspeech 2013, August 2013. :::icon
178. Bo Li, Yu Tsao and Khe Chai Sim, "An Investigation of Spectral Restoration Algorithms for Deep Neural Networks based Noise Robust Speech Recognition," Interspeech 2013, August 2013. :::icon
179. Xugang Lu, Yu Tsao, Shigeki Matsuda and Chiori Hori, "Speech Enhancement Based on Deep Denoising Autoencoder," Interspeech 2013, August 2013, Codes: Tensor Flow: https://github.com/jonlu0602/DeepDenoisingAutoencoder; Keras: https://github.com/jerrygood0703/DDAE; Matlab: https://drive.google.com/open?id=0B8ZEsMh6ITIlNVZ1VmROdTdQNUU :::icon :::icon
180. Ying-Hui Lai, Yu-Cheng Su, Yu Tsao, Shuenn-Tsong Young, "Evaluation of Generalized Maximum a Posteriori Spectral Amplitude (GMAPA) Speech Enhancement Algorithm in Hearing Aids," ISCE 2013, June 2013. :::icon
181. Syu-Siang Wang, Yu Tsao, Jeih-weih Hung, "Filtering on the Temporal Probability Sequence in Histogram Equalization for Robust Speech Recognition," ICASSP 2013, IEEE, May 2013. :::icon
182. Yu-Cheng Su, Yu Tsao, Jung-En Wu, Fu-Rong Jean, "Speech Enhancement using Generalized Maximum a Posteriori Spectral Amplitude Estimator," ICASSP 2013, IEEE, May 2013. :::icon
183. How Jing and Yu Tsao, "Sparse Maximum Entropy Deep Belief Nets," IJCNN 2013, IEEE, April 2013. :::icon
184. H.-T. Hwang, Yu Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen, "Exploring Mutual Information for GMM-Based Spectral Conversion," ISCSLP 2012, IEEE, December 2012. :::icon
185. S.-S. Wang, J.-W. Hung, and Yu Tsao, "A Study on Cepstral Subband Normalization for Robust ASR," ISCSLP 2012, IEEE, December 2012. :::icon
186. X. Lu, Yu Tsao, S. Matsuda, C. Hori, and H. Kashioka, "Acoustic Space Partition based on Broad Phonetic Class for Ensemble Acoustic Modeling," ISCSLP 2012, IEEE, December 2012. :::icon
187. T.-Y. Hu, Yu Tsao, and L.-S. Lee, "Discriminative Fuzzy Clustering Maximum a Posterior Linear Regression for Speaker Adaptation," Interspeech 2012, ISCA, September 2012. :::icon
188. H.-T. Hwang, Yu Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen, "A Study of Mutual Information for GMM-Based Spectral Conversion," Interspeech 2012, ISCA, September 2012. :::icon
189. Yu Tsao, C.-L. Huang, S. Matsuda, C. Hori, and H. Kashioka, "A Linear Projection Approach to Environment Modeling for Robust Speech Recognition," ICASSP 2012, IEEE, April 2012. :::icon
190. C.-L. Huang, Yu Tsao, and C. Hori, "Feature Normalization and Selection for Robust Speaker State Recognition," COCOSDA 2011, IEEE, October 2011. :::icon
191. Yu Tsao, P. R. Dixon, C. Hori, and H. Kawai, "Incorporating Regional Information to Enhance MAP-based Stochastic Feature Compensation for Robust Speech Recognition," Interspeech, ISCA, August 2011. :::icon
192. Yu Tsao, R. Isotani, H. Kawai, and S. Nakamura, "Increasing Discriminative Capability on Map-based Mapping Function Estimation for Acoustic Model Adaptation," ICASSP, IEEE, May 2011. :::icon
193. Y. Tsao, S. Matsuda, S. Sakai, R. Isotani, H. Kawai, and S. Nakamura, "A Sampling-based Environment Population Projection Approach for Rapid Acoustic Model Adaptation," ICASSP, IEEE, May 2011. :::icon
194. J. Li, Y. Tsao, and C.-H. Lee, "Shrinkage Model Adaptation in Automatic Speech Recognition," Interspeech, ISCA, September 2010. :::icon
195. A. Mushtaq, Y. Tsao, and C.-H. Lee, "A Particle Filter Feature Compensation Approach to Robust Speech Recognition," Interspeech, ISCA, September 2010. :::icon
196. Yu Tsao, H. Sun, H. Li, and C.-H. Lee, "An Acoustic Segment Model Approach to Incorporating Temporal Information into Speaker Modeling for Text-Independent Speaker Recognition," ICASSP, IEEE, May 2010. :::icon
197. Y. Tsao, S. Matsuda, S. Nakamura, and C.-H. Lee, "MAP Estimation of Online Mapping Parameters in Ensemble Speaker and Speaking Environment Modeling," ASRU, IEEE, December 2009. :::icon
198. Y. Tsao, J. Li, C.-H. Lee, and S. Nakamura, "Soft Margin Estimation on Improving Environment Structures for Ensemble Speaker and Speaking Environment Modeling," IUCS, ACM, December 2009. :::icon
199. S. Matsuda, Y. Tsao, J. Li, S. Nakamura, and C.-H. Lee, "A Study on Soft Margin Estimation of Linear Regression Parameters for Speaker Adaptation," Interspeech, ISCA, December 2009. :::icon
200. Y. Tsao, J. Li, and C.-H. Lee, "Ensemble Speaker and Speaking Environment Modeling Approach with Advanced Online Estimation Process," ICASSP, IEEE, May 2009. :::icon
201. S.-Y. Peng, Y. Tsao, P. E. Hasler, and D. V. Anderson, "A Programmable Analog Radial-Basis-Function Based Classifier," ICASSP, IEEE, December 2008. :::icon
202. Y. Tsao and C.-H. Lee, "Improving the Ensemble Speaker and Speaking Environment Modeling Approach by Enhancing the Precision of the Online Estimation Process," Interspeech, ISCA, September 2008. :::icon
203. Y. Tsao and C.-H. Lee, "Two Extensions to Ensemble Speaker and Speaking Environment Modeling for Robust Automatic Speech Recognition," ASRU, IEEE, December 2007. :::icon
204. I. Bromberg, Q. Fu, J. Hou, J. Li, C. Ma, B. Mattews, A. Moreno-Daniel, J. Morris, S. M. Siniscalchi, Y. Tsao, and Y. Wang, "Detection-based ASR In the Automatic Speech Attribute Transcription Project," Interspeech, ISCA, September 2007. :::icon
205. Y. Tsao and C.-H. Lee, "An Ensemble Modeling Approach to Joint Characterization of Speaker and Speaking Environments," Interspeech, ISCA, September 2007. :::icon
206. C. Ma, Y. Tsao, and C.-H. Lee, "A Study on Detection Based Automatic Speech Recognition," Interspeech, ISCA, September 2006. :::icon
207. Y. Tsao and C.-H. Lee, "A Vector Space Approach to Environment Modeling for Robust Speech Recognition," Interspeech, ISCA, September 2006. :::icon
208. Y. Tsao, J. Li, and C.-H. Lee, "A Study on Separation between Acoustic Models and Its Applications," Eurospeech, ISCA, September 2005. :::icon
209. J. Li, Y. Tsao, and C.-H. Lee, "A Study on Knowledge Source Integration for Candidate Rescoring in Automatic Speech Recognition," ICASSP, IEEE, April 2005. :::icon
210. Y. Tsao, S.-M. Lee, and L.-S. Lee, "Segmental Eigenvoice for Rapid Speaker Adaptation," Eurospeech, ISCA, September 2001. :::icon
 
 
Technical Reports
 
1. 王豫煌、林誠謙、嚴漢偉、林子皓、陸聲山、曹昱、端木茂甯、黃俊嘉、莊庭瑞, "亞洲聲景長期監測網," number 3, 臺灣生態學會、中央研究院、日本國立研究開發法人海洋研究開發機構、林業試驗所森林保護組, August 2019. :::icon
2. 曹昱, "基於人工智慧之語音溝通輔具," 中研院 | 數理科學, 漫步科研, 科普專欄 2019-06-20, 2019. :::icon
3. 張佑榕、曹昱, "研之有物(智慧聽)," 中央研究院, 2019. :::icon
4. 端木茂甯, "研之有物(蝙蝠的超音波,藏了什麼訊息?)," 中央研究院, 2018. :::icon
 
 
Book & Book Chapters
 
1. P. Lin, Y. Tsao, and L.-W. Kuo,, chapter "Controlling the Biocompatibility and Mechanical Effects of Implantable Microelectrodes to Improve Chronic Neural Recordings in the Auditory Nervous System," "An Excursus into Hearing Loss," S. Hatzopoulos and A. Ciorba, editor, pages 173-195, IntechOpen, May 2018. :::icon
2. Y.-H. Lai, Fe. Chen, and Y. Tsao,, chapter "Adaptive Dynamic Range Compression for Improving Envelope-Based Speech Perception: Implications for Cochlear Implants," "Emerging Technology and Architecture for Big-data Analytics," A. Chattopadhyay and Y. Hao, editor, pages 191-214, Springer, April 2017. :::icon
 
 
Others
 
1. Yu Tsao, "基於深度學習之語音增強技術及其應用,", 2020大數據人工智能. :::icon :::icon
2. "Yu Tsao's CV," 2024. :::icon
3. Yu Tsao, "Utilizing Deep Learning for Speech Enhancement in Assistive Oral Communication Technologies,", Keynote Speech in M3Oriental Workshop, ACM Multimedia Asia 2023 December 2023. :::icon
4. Yu Tsao, "Wearable Devices and Machine Learning Algorithms for Augmented Oral Communication Assistance,", CTSoc Technical Talk November 2023. :::icon
5. Fei Chen and Yu Tsao, "Advances in Psychoacoustics and Machine Learning towards Objective Speech Intelligibility Evaluation," October 2023. :::icon :::icon
6. Fei Chen and Yu Tsao, "Speech Assessment Metrics: From Psychoacoustics to Machine Learning,", Tutorial in Interspeech 2023 August 2023. :::icon :::icon
7. Yu Tsao, "聽說 AI," November 2022, 國科會工程處記者會 :::icon :::icon
8. Fei Chen and Yu Tsao, "Speech enhancement for cochlear implants: From psychoacoustics to machine learning,", Tutorial in Interspeech 2022 September 2022. :::icon
9. Fei Chen and Yu Tsao, "Advances in Cochlear Implants: From Speech Perception, Enhancement to Evaluation,", Tutorial in EUSIPCO 2022 September 2022. :::icon
10. Fei Chen and Yu Tsao, "Speech Perception and Enhancement in Cochlear Implants," December 2021, Tutorial in APSIPA 2021 :::icon :::icon
11. Berrak Sisman, Yu Tsao, Haizhou Li, "Theory and Practice of Voice Conversion,", Tutorial in APSIPA 2020 December 2020. :::icon
12. Fei Chen and Yu Tsao, "Intelligibility Evaluation and Speech Enhancement based on Deep Learning,", Tutorial in Interspeech 2020 October 2020, Video: https://www.youtube.com/watch?v=89S4CgfPWG0 :::icon :::icon
13. Yu Tsao, "Speech Enhancement based on Deep Learning and Intelligibility Evaluation,", Tutorial in APSIPA 2019 November 2019. :::icon
14. H.-Y. Lee and Y. Tsao, "Generative Adversarial Network and its Applications to Speech Signal Processing and Natural Language Processing,", Tutorial in Interspeech 2019 September 2019. :::icon
15. "Improving biodiversity monitoring through soundscape information retrieval," May 2018. :::icon
16. Hung-iy Lee and Yu Tsao, "Generative Adversarial Network and its Applications to Speech Signal Processing and Natural Language Processing,", Tutorial in ICASSP 2018 April 2018. :::icon
17. Y.-C. Lin, Y.-H. Lai, H.-W. Chang, Y. Tsao, Y.-p. Chang, and R. Y. Chang, "PAD-MMRT," August 2014, Original corpus is prepared by K.-S. Tsai, L.-H. Tseng, C.-J.Wu, and S.-T. Young: “Development of a Mandarin monosyllable recognition test,” Ear and Hearing, vol. 30, no. 1, pp. 90–99, 2009. :::icon :::icon
18. 曹昱,蘇煜程,王緒翔, "線性映射轉換函數於聲學模型調適之強健式語音辨識,", 計算語言學學會通訊 第 23 卷第 2 期 (2012 年 6 月 ) June 2012. :::icon
 
 
bg