|
Research Fellow (Professor) | Tsao, Yu |
|
|
|
|
|
Publications |
|
Journal Articles | |
1. |
E. H.-H. Huang, R. Chao, Y. Tsao, and C.-M. Wu, "ElectrodeNet - A Deep-Learning-Based Sound Coding Strategy for Cochlear Implants," IEEE Transactions on Cognitive and Developmental Systems, volume 16, number 1, pages 346-357, February 2024. ::: |
2. |
S.-Y. Peng, I-C. Liu, Y.-H. Wu, T.-J. Lin, C.-J. Chen, X.-Z. Li, Y.-Q. Cheng, P.-H. Lin, K.-H. Hung, and Y. Tsao, "An SRAM-Based Reconfigurable Cognitive Computation Matrix for Sensor Edge Applications," IEEE Journal of Solid-State Circuits, volume 59, number 2, pages 636-648, February 2024. ::: |
3. |
K.-C. Ting, Y.-C. Lin, C.-T. Chan, T.-Y. Tu, Y. Tsao, K.-C. Liu, and C.-C. Shih, "Inertial Measurement Unit-based Romberg Test in Assessing Adults with Vestibular Hypofunction," IEEE Journal of Translational Engineering in Health and Medicine, volume 12, pages 245-255, December 2023. ::: |
4. |
H.-C. Kuo, Y.-P. Hsieh, H.-H. Tseng, C.-T. Wang, S.-H. Fang, and Y, Tsao, "Toward Real-World Voice Disorder Classification," IEEE Transactions on Biomedical Engineering, volume 70, number 10, pages 2922-2932, October 2023. ::: |
5. |
K.-C. Ting, S.-S. Wang, Y.-J. Li, C.-Y. Huang, T.-Y. Tu, C.-C. Shih, K.-C. Liu and Y. Tsao, "Detection of Otitis Media with Effusion Using In-Ear Microphones and Machine Learning," IEEE Sensors Journal, volume 23, pages 28411-28420, October 2023. ::: |
6. |
L.-C. Chen, K.-H. Hung, Y.-J. Tseng, H.-Y. Wang, T.-M. Lu, W.-C. Huang, and Y. Tsao, "Self-supervised Learning Based General Laboratory Progress Pretrained Model for Cardiovascular Event Detection," IEEE Journal of Translational Engineering in Health and Medicine, volume 12, pages 43-55, August 2023. ::: |
7. |
Y.-J. Lu, C.-Y. Chang, C. Yu, C.-F. Liu, J.-w. Hung, S. Watanabe, and Y. Tsao, "Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 31, pages 2738-2750, June 2023. ::: |
8. |
C.-Y. Cheng, H.-S. Lee, Y. Tsao, and H.-M. Wang, "Multi-target Filter and Detector for Unknown-number Speaker Diarization," IEEE Signal Processing Letters, volume 30, pages 638-642, May 2023. ::: |
9. |
Y.-W. Chen, H.-M. Wang, and Y. Tsao, "BASPRO: A Balanced Script Producer for Speech Corpus Collection Based on the Genetic Algorithm," APSIPA Transactions on Signal and Information Processing, volume 12, number 3, pages e15, April 2023, Themed Series: Advanced Acoustic, Sound and Audio Processing Techniques and Their Applications ::: |
10. |
T.-M. Chen, Y.-H. Tsai, H.-H. Tseng, K.-C. Liu, J.-Y. Chen, C.-H. Huang, G.-Y. Li, C.-Y. Shen, and Y. Tsao, "SRECG: ECG Signal Super-resolution Framework for Portable/Wearable Devices in Cardiac Arrhythmias Classification," IEEE Transactions on Consumer Electronics, volume 1, pages 1, January 2023. ::: |
11. |
S.-Y. Niu, L.-Z. Guo, Y. Li, Z. Zhang, T.-D. Wang, K.-C. Liu, Y. Tsao, T.-M. Liu, "Boundary-Preserved Deep Denoising of the Stochastic Resonance Enhanced Multiphoton Images," IEEE Journal of Translational Engineering in Health and Medicine, volume 10, pages 1-12, September 2022. |
12. |
K.-C. Liu, K.-H. Hung, C.-Y. Hsieh, H.-Y. Huang, C.-T. Chan, and Y. Tsao, "Deep Learning Based Signal Enhancement of Low-Resolution Accelerometer for Fall Detection Systems," IEEE Transactions on Cognitive and Developmental Systems, volume 14, number 3, pages 1270-1281, September 2022. ::: |
13. |
R. E. Zezario, S.-W. Fu, F. Chen, C.-S. Fuh, H.-M. Wang, and Y. Tsao, "Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 31, pages 54-70, September 2022. ::: |
14. |
L.-C. Chen, P.-H. Chen, R. T.-H. Tsai, and Y. Tsao,, "EPG2S: Speech Generation and Speech Enhancement based on Electropalatography and Audio Signals using Multimodal Learning," IEEE Signal Processing Letters, volume 29, pages 2582-2586, June 2022. ::: |
15. |
Y. Lin, Y.Tsao, and P.-J. Hsieh, "Neural Correlates of Individual Differences in Predicting Ambiguous Sounds Comprehension Level," NeuroImage, volume 251, pages 1-12, May 2022. ::: |
16. |
T. Hussain, W.-C. Wang, M. Gogate, K. Dashtipour, Y. Tsao, X. Lu, A. Ahsan, and A. Hussain, "A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement," IEEE Transactions on Artificial Intelligence, volume 1, number 1, pages 1-12, April 2022. ::: |
17. |
C.-T. Wang, Z.-Y. Chuang, C.-H. Hung, Y. Tsao, S.-H. Fang, "Detection of Glottic Neoplasm Based on Voice Signals Using Deep Neural Networks," IEEE Sensors Journal, volume 6, pages 1-4, March 2022, (Letters) |
18. |
Y.-W. Chen, K.-H. Hung, Y.-J. Li, A. C.-F. Kang, Y.-S. Lai, K.-C. Liu, S.-W. Fu, S.-S. Wang, Y. Tsao, "CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application," IEEE Access, volume 10, pages 46082-46099, February 2022. ::: |
19. |
L.-C. Chen, J.-T. Sheu, Y.-J. Chuang, and Y. Tsao, "Predicting the Travel Distance of Patients while Accessing Healthcare using Deep Neural Network," IEEE Journal of Translational Engineering in Health and Medicine, volume 10, pages 1-11, February 2022. ::: |
20. |
S.-Y. Chuang, H.-M. Wang, and Y. Tsao, "Improved Lite Audio-Visual Speech Enhancement," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 30, pages 1345-1359, February 2022. ::: |
21. |
C.-H. Hu, Y.-H. Peng, J.Yamagishi, Y. Tsao, and H.-M. Wang, "SVSNet: An End-to-end Speaker Voice Similarity Assessment Model," IEEE Signal Processing Letters, volume 29, pages 767-771, February 2022. ::: |
22. |
S.-S. Wang, C.-C. Lai, C.-T. Wang, Y. Tsao, S.-H. Fang, "Continuous Speech for Improved Learning Pathological Voice Disorders," IEEE Open Journal of Engineering in Medicine and Biology, volume 3, pages 2644-1276, February 2022. ::: |
23. |
Y.-C. Lin, C. Yu, Y.-T. Hsu, S.-W. Fu, Y. Tsao, T.-W. Kuo, "SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 30, pages 1016-1031, December 2021. ::: |
24. |
R.-Y. Tseng, T.-W. Wang, S.-W. Fu, C.-Y. Lee, and Y. Tsao, "A Study of Joint Effect on Denoising Techniques and Visual Cues to Improve Speech Intelligibility in Cochlear Implant Simulation," IEEE Transactions on Cognitive and Developmental Systems, volume 13, pages 984-994, December 2021. ::: |
25. |
X. Lu, P. Shen, Y. Tsao, and H. Kawai, "Coupling A Generative Model With A Discriminative Learning Framework for Speaker Verification," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 29, pages 3631-3641, November 2021. ::: |
26. |
F. S. Abousaleh, W.-H. Cheng, N.-H. Yu, and Y. Tsao, "Multimodal Deep Learning Framework for Image Popularity Prediction on Social Media," IEEE Transactions on Cognitive and Developmental Systems, volume 13, number 3, pages 679-692, September 2021. ::: |
27. |
W. Ariyanti, T. Hussain, J.-C. Wang, C.-T. Wang, S.-H. Fang, and Y. Tsao, "Ensemble and Multimodal Learning for Pathological Voice Classification," IEEE Sensors Journal, volume 5, number 7, pages 1-4, July 2021, (Letters) ::: |
28. |
K.-C. Liu, M. Chan, C.-Y. Hsieh, H.-Y. Huang, C.-T. Chan, Y. Tsao, "Domain-adaptive Fall Detection Using Deep Adversarial Training," IEEE Transactions on Neural Systems & Rehabilitation Engineering, volume 29, pages 1243-1251, June 2021. ::: |
29. |
T.-H. Lin ,T. Akamatsu,Y. Tsao, "Sensing ecosystem dynamics via audio source separation: A case study of marine soundscapes off northeastern Taiwan,," PLOS Computational Biology, volume 1, number 1, pages 1-23, February 2021. ::: |
30. |
J.-K. Wang, Y.-F. Chang, K.-H. Tsai, W.-C. Wang, C.-Y. Tsai, C.-H. Cheng, and Y. Tsao, "Automatic recognition of murmurs of ventricular septal defect using convolutional recurrent neural networks with temporal attentive pooling," Scientific Reports, volume 10, number 21797, pages 1-10, December 2020. ::: |
31. |
N. Y.-H. Wang, H.-L. S. Wang, T.-W. Wang, S.-W. Fu, X. Lu, H.-M. Wang, and Y. Tsao, "Improving the Intelligibility of Speech for Simulated Electric and Acoustic Stimulation Using Fully Convolutional Neural Networks," IEEE Transactions on Neural Systems & Rehabilitation Engineering, volume 29, pages 184-195, December 2020. ::: |
32. |
T. Hussain, S. M. Siniscalchi, H.-L. S. Wang, Y. Tsao, S. V. Mario, and W.-H. Liao, "Ensemble Hierarchical Extreme Learning Machine for Speech Dereverberation," IEEE Transactions on Cognitive and Developmental Systems, volume 12, number 4, pages 744-758, December 2020. ::: |
33. |
X. Wang et al.,, "ASVspoof 2019: A Large-scale Public Database of Synthetized, Converted and Replayed Speech," Computer Speech and Language, volume 64, pages 1-27, November 2020. ::: |
34. |
H.-S. Lee, Y. Tsao, S.-K. Jeng, and H.-M. Wang, "Subspace-based Representation and Learning for Phonotactic Spoken Language Recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 28, pages 3065-3079, November 2020. ::: |
35. |
K.-H. Tsai, W.-C. Wang, C.-H. Cheng, C.-Y. Tsai, J.-K. Wang, T.-H. Lin, S.-H. Fang, L.-C. Chen, and Y. Tsao, "Blind Monaural Source Separation on Heart and Lung Sounds Based on Periodic-Coded Deep Autoencoder," IEEE Journal of Biomedical and Health Informatics, volume 24, number 11, pages 3203-3214, November 2020. ::: |
36. |
T.-A. Hsieh, H.-M. Wang, X. Lu, and Y. Tsao, "WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech Enhancement," IEEE Signal Processing Letters, volume 27, pages 2149-2153, November 2020. ::: |
37. |
C. Yu*, R. E. Zezario*, S.-S. Wang, J. Sherman, Y.-Y. Hsieh, X. Lu, H.-M. Wang, and Y. Tsao, "Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 28, pages 2756-2769, October 2020, (*equal contributions) ::: |
38. |
W.-C. Huang, H. Luo, H.-T. Hwang, C.-C. Lo, Y.-H. Peng, Y. Tsao, and H.-M. Wang, "Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion," IEEE Transactions on Emerging Topics in Computational Intelligence, volume 4, number 4, pages 468-479, August 2020. ::: |
39. |
N. Y.-H. Wang, C.-H. Chiang, H.-L. S. Wang and Y. Tsao, "Atypical Frequency Sweep Processing in Chinese Children With Reading Difficulties: Evidence From Magnetoencephalography," Frontiers in Psychology, volume 99, pages 99, July 2020. ::: |
40. |
C. Yu, K.-H. Hung, S.-S. Wang, Y. Tsao, and J.-w. Hung, "Time-Domain Multi-modal Bone/air Conducted Speech Enhancement," IEEE Signal Processing Letters, volume 27, pages 1035-1039, June 2020. ::: |
41. |
M. Lee, L. Lin, C.-Y. Chen, Y. Tsao, T.-H. Yao, M.-H. Fei and S.-H. Fang, "Forecasting Air Quality in Taiwan by Using Machine Learning," Scientific Reports, number 4153, pages 1-13, March 2020. ::: |
42. |
Y.-H. Lai, W.-N. Chen, T.-C. Hsu, C. Lin, Y. Tsao and S. Wu, "Overall Survival Prediction of Non-small Cell Lung Cancer by Integrating Microarray and Clinical Data with Deep Learning," Scientific Reports, number 4679, pages 1-11, March 2020. ::: |
43. |
S. C. Hidayati, T. W. Goh, Ji.-S. G. Chan, C.-C. Hsu, J. See, L.-K. Wong, K.-L. Hua, Y. Tsao, and W.-H. Cheng, "Dress With Style: Learning Style from Joint Deep Embedding of Clothing Styles and Body Shapes," IEEE Transactions on Multimedia, volume 23, pages 365-377, March 2020. |
44. |
C.-L. Liu, S.-W. Fu, Y.-J. Li, J.-W. Huang, H.-M. Wang, and Y. Tsao, "Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 28, pages 1888-1900, February 2020. ::: |
45. |
S.-W. Fu, C.-F. Liao, Y. Tsao, "Learning with Learned Loss Function: Speech Enhancement with Quality-Net to Improve Perceptual Evaluation of Speech Quality," IEEE Signal Processing Letters, volume 27, pages 26-30, December 2019. ::: |
46. |
J.-Y. Wu, C. Yu, S.-W. Fu, C.-T. Liu, S.-Y. Chien, Y. Tsao, "Increasing Compactness of Deep Learning based Speech Enhancement Models with Parameter Pruning and Quantization Techniques," IEEE Signal Processing Letters, volume 26, number 12, pages 1887-1891, December 2019. ::: |
47. |
T.-H. Lin amd Y. Tsao, "Source Separation in Ecoacoustics: A Roadmap towards Versatile Soundscape Information Retrieval," Remote Sensing in Ecology and Conservation, volume online, pages 1-12, December 2019. ::: |
48. |
C.-T. Wang, F.-C. Lin, J.-Y. Chen, M.-J. Hsiao, S.-H. Fang, Y.-H. Lai, Y. Tsao, "Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach," Journal of Voice, volume 33, number 5, pages pp. 634-641, September 2019. ::: |
49. |
S.-H. Fang, C.-T. Wang, J.-Y. Chen, Y. Tsao and F.-C. Lin, "Combining Acoustic Signals and Medical Records to Improve Pathological Voice Classification," APSIPA Transactions on Signal and Information Processing, volume 8, pages 1-11, June 2019. ::: |
50. |
C.-W. Lee et al.,, "Bioimaging: New Templated Ostwald Ripening Process of Mesostructured FeOOH for Third‐Harmonic Generation Bioimaging," Small, volume 15, number 20, pages 1-11, May 2019. ::: |
51. |
H.-T. Chiang, Y.-Y. Hsieh, S.-W. Fu, K.-H. Hung, Y. Tsao, S.-Y. Chien, "Noise Reduction in ECG Signals Using Fully Convolutional Denoising Autoencoders," IEEE Access, volume 7, pages 60806-60813, April 2019. ::: |
52. |
Y.-C. Chu, Y.-F. Cheng, Y.-H. Lai, Y. Tsao, T.-Y. Tu, S. T. Young, T.-S. Chen, Y.-F. Chung, F. Lai, W.-H. Liao, "A Mobile Phone–Based Approach for Hearing Screening of School-Age Children: Cross-Sectional Validation Study," JMIR Mhealth Uhealth, volume 1, pages 1-13, April 2019. ::: |
53. |
Y. Tsao, T.-H. Lin, F. Chen, Y.-F. Chang, C.-H. Cheng, and K.-H. Tsai, "Robust S1 and S2 heart sound recognition based on spectral restoration and multi-style training," Biomedical Signal Processing and Control, volume 49, pages 173-180, March 2019. ::: |
54. |
H.-L. S. Wanga , N. Y.-H. Wang , I-C. Chen, and Y. Tsao, "Auditory Identification of Frequency-Modulated Sweeps and Reading Difficulties in Chinese," Research in Developmental Disabilities, volume 86, pages 53-61, January 2019. ::: |
55. |
C.-T. Liu, T.-W. Lin, Y.-H. Wu, Y.-S. Lin, H. Lee, Y. Tsao, and S.-Y. Chien, "Computation-Performance Optimization of Convolutional Neural Networks with Redundant Filter Removal," IEEE Transactions on Circuits and Systems I, volume 66, pages 1908-1921, December 2018. ::: |
56. |
H.-P. Liu, Y. Tsao, and C.-S. Fuh, "Bone Conducted Speech Enhancement Using Deep Denoising Autoencoder," Speech Communication, volume 104, pages 106-112, November 2018. ::: ::: |
57. |
H.-T. Hwang, Y.-C. Wu, S.-S. Wang, C.-C. Hsu, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen, "Locally linear Embedding Based Post-filtering for Speech Enhancement," Journal of Information Science and Engineering, volume 34, number 6, pages 1469-1491, October 2018. ::: |
58. |
S.-Y. Tsui, Y. Tsao, C.-W. Lin, S.-H. Fang, and C.-T. Wang, "Demographic and Symptomatic Features of Voice Disorders and Their Potential Application in Classification using Machine Learning Algorithms," Folia Phoniatrica et Logopaedica, volume 70, pages 174-182, September 2018. |
59. |
S.-W. Fu, T.-W. Wang, Y. Tsao, X. Lu, and H. Kawai, "End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 26, number 9, pages 1570-1584, September 2018. ::: |
60. |
Y.-H. Lai, Y. Tsao, X. Lu, F. Chen, Y.-T. Su, K.-C. Chen, Y.-H. Chen, L.-C. Chen, P.-H. Li, and C.-H. Lee, "Deep Learning based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients," Ear and Hearing, volume 39(4), number 4, pages 795-809, July 2018, This work receives the National Innovation Award 2018 (2018年國家新創獎) ::: ::: |
61. |
J.-C. Hou, S.-S. Wang, Y.-H. Lai, Y. Tsao, H.-W. Chang, and H.-M. Wang, "Audio-visual Speech Enhancement using Multimodal Deep Convolutional Neural Networks," IEEE Transactions on Emerging Topics in Computational Intelligence, volume 2, number 2, pages 117-128, April 2018. ::: |
62. |
Y. Tsao, H.-C. Chu, S.-H. Fang, J. Lee, and C.-M. Lin, "Adaptive Noise Cancellation using Deep Cerebellar Model Articulation Controller," IEEE Access, volume 6, pages 37395-37402, April 2018. ::: ::: |
63. |
T.-H. Lin, T. Akamatsu, and Y, Tsao, "Comparison of passive acoustic soniferous fish monitoring with supervised and unsupervised approaches," Journal of the Acoustical Society of America (JASA), volume 143, number 4, pages published onlione, April 2018. ::: |
64. |
S.-S. Wang, P. Lin, Y. Tsao, J.-W. Hung, and B. Su, "Suppression by Selecting Wavelets for Feature Compression in Distributed Speech Recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 26, number 3, pages 564-579, March 2018. ::: |
65. |
J. Torres-Sospedra et al., "Off-Line Evaluation of Mobile-Centric Indoor Positioning Systems: The Experiences from the 2017 IPIN Competition," Sensors, volume 18, number 2, pages 487, February 2018. ::: |
66. |
H.-T. Hwang, Y.-C. Wu, Y.-H. Peng, C.-C. Hsu, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen, "Voice Conversion based on Locally Linear Embedding," Journal of Information Science and Engineering, volume 34, number 6, pages 1493-1516, January 2018. ::: |
67. |
P. Lin, D. Lyu, F. Chen, S.-S. Wang, and Y. Tsao, "Multi-style Learning with Denoising Autoencoders for Acoustic Modeling in the Internet of Things (IoT)," Computer Speech and Language, volume 46, pages 481-495, November 2017. ::: |
68. |
S.-W. Fu, P.-C. Li, Y.-H. Lai, C.-C. Yang, L.-C. Hsieh, and Y. Tsao, "Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery," IEEE Transactions on Biomedical Engineering, volume 64, number 11, pages 2584 - 2594, November 2017. ::: |
69. |
T. Hussain, S. M. Siniscalchi, C.-C. Lee, S.-S. Wang, Y. Tsao and W.-H. Liao, "Experimental Study on Extreme Learning Machine Applications for Speech Enhancement," IEEE Access, volume 99, number 99, pages 1-1, October 2017. ::: |
70. |
S.-H. Fang, Y.-X. Fei, Z. Xu, and Y. Tsao, "Learning Transportation Modes from Smartphone Sensors Based on Deep Neural Network," IEEE Sensors Journal, volume 17, pages 6111 - 6118, September 2017. ::: |
71. |
F. Chen, D. Zheng, Y. Tsao, "Effects of Noise Suppression and Envelope Dynamic Range Compression on the Intelligibility of Vocoded Sentences for a Tonal Language," Journal of the Acoustical Society of America (JASA), volume 142, number 3, pages 1157-1166, September 2017. ::: |
72. |
S.-W. Hsiao, H.-C. Sun, M.-C. Hsieh, M.-H. Tsai, Y. Tsao, and C.-C. Lee, "Toward Automating Oral Presentation Scoring during Principal Certification Program using Audio-Video Low-level Behavior Profiles," IEEE Transactions on Affective Computing, volume PP, number PP, pages PP, September 2017. ::: |
73. |
X. Lu, P. Shen, Y. Tsao, and H. Kawai, "Regularization of Neural Network Model with Distance Metric Learning for I-vector based Spoken Language Identification," Computer Speech and Language, volume 44, pages 48-60, July 2017. ::: |
74. |
T.-H. Lin, S.-H. Fang, and Y, Tsao, "Improving Biodiversity Assessment via Unsupervised Separation of Biological Sounds from Long-duration Recordings," Scientific Reports, volume 7, number 4547, pages 1, July 2017. ::: ::: |
75. |
Y.-H. Lai, F. Chen, S.-S. Wang, X. Lu, Y. Tsao, and C.-H. Lee, "A Deep Denoising Autoencoder Approach to Improving the Intelligibility of Vocoded Speech in Cochlear Implant Simulation," IEEE Transactions on Biomedical Engineering, volume 64, number 7, pages 1568 - 1578, July 2017. ::: |
76. |
A. Chern, Y.-H. Lai, Y.-p. Chang, Y. Tsao, R. Y. Chang, and H.-W. Chang, "A Smartphone-Based Multi-Functional Hearing Assistive System to Facilitate Speech Recognition in the Classroom," IEEE Access, volume 5, pages 10339 - 10351, June 2017, This paper has been selected as a Featured Article (http://ieeeaccess.ieee.org/special-sections/featured-articles/smartphone-based-multi-functional-hearing-assistive-system-facilitate-speech-recognition-classroom/) ::: |
77. |
T.-E. Chen, S.-I Yang, L.-T. Ho, K.-H. Tsai, Y.-H. Chen, Y.-F. Chang, Y.-H. Lai, S.-S. Wang, Y. Tsao*, and C.-C. Wu, "S1 and S2 Heart Sound Recognition using Deep Neural Networks," IEEE Transactions on Biomedical Engineering, volume 64, number 2, pages 372 - 380, February 2017. ::: |
78. |
H.-y. Lee, B.-H. Tseng, T.-H. Wen, and Y. Tsao, "Personalizing Recurrent Neural Network based Language Model by Social Network," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 25, number 3, pages 519 - 530, December 2016. ::: |
79. |
T. Guan, G.-x. Chu, Y. Tsao, F. Chen, "Assessing the Perceptual Contributions of Level-dependent Segments to Sentence Intelligibility," Journal of the Acoustical Society of America (JASA), volume 140, number 5, pages 3745-3754, November 2016. ::: |
80. |
S.-H. Fang, W.-H. Chang, Y. Tsao, H.-C. Shih, and C. Wang, "Channel State Reconstruction Using Multilevel Discrete Wavelet Transform for Improved Fingerprinting-Based Indoor Localization," IEEE Sensors Journal, volume 16, number 21, pages 7784 - 7791, November 2016. ::: |
81. |
H.-L. S. Wang, I-C. Chen, C.-H. Chiang, Y.-H. Lai, and Y. Tsao, "Auditory Perception, Suprasegmental Speech Processing, and Vocabulary Development in Chinese Preschoolers," Perceptual and Motor Skills, volume 123, number 2, pages 365-382, October 2016. ::: |
82. |
S.-H. Fang , H.-H. Liao , Y.-X. Fei , K.-H. Chen , J.- W. Huang , Y.-D. Lu and Y. Tsao, "Transportation Modes Classification Using Sensors on Smartphones," Sensors, volume 19;16, number 8, pages 1324, August 2016. ::: |
83. |
S.-S. Wang, A. Chern, Y. Tsao, J.-w. Hung, X. Lu, Y.-H. Lai, B. Su, "Wavelet Speech Enhancement based on Nonnegative Matrix Factorization," IEEE Signal Processing Letters, volume 23, number 8, pages 1101-1105, August 2016. ::: |
84. |
P. Lin, S.-W. Fu, S.-S.Wang, Y.-H. Lai, and Y. Tsao, "Maximum Entropy Learning with Deep Belief Networks," Entropy, volume 18, number 7, pages 251, July 2016. ::: |
85. |
F. Chen, Y. Tsao, and Y.-H. Lai, "Modeling Speech Intelligibility with Recovered Envelope from Temporal Fine Structure Stimulus," Speech Communication, volume 81, pages 120–128, July 2016. ::: |
86. |
Y. Tsao and Y.-H. Lai, "Generalized Maximum a Posteriori Spectral Amplitude Estimation for Speech Enhancement," Speech Communication, volume 76, pages 112–126, February 2016. ::: ::: |
87. |
S.-H. Fang, C.-H. Wang, and Y. Tsao, "Compensating for Orientation Mismatch in Robust WiFi Localization Using Histogram Equalization," IEEE Transactions on Vehicular Technology, volume 64, number 11, pages 5210-5220, November 2015. ::: |
88. |
Y.-C. Lin, Y.-H. Lai, H.-W. Chang, Y. Tsao, Y.-p. Chang, and R. Y. Chang,, "A Smartphone-Based Remote Microphone Hearing Assistive System Using Wireless Technologies," IEEE Systems Journal, volume PP, pages 1-10, October 2015, Smarthear Demo: https://www.youtube.com/watch?v=e9HqIj09QJs ::: |
89. |
C.-C. Hsu, K.-M. Cheong, T.-S. Chi, and Y. Tsao, "Robust Voice Activity Detection Algorithm Based on Feature of Frequency Modulation of Harmonics and Its DSP Implementation," IEICE Transactions on Information and Systems, volume E98-D, number 10, pages 1808-1817, October 2015. ::: |
90. |
Y.-J, Lee, Y.-R. Chien, and Y. Tsao, "Rapid Converging M-max Partial Update Least Mean Square Algorithms with New Variable Step-size Methods," IEICE Transaction on Communications, volume Vol.E98-A, number No.12, pages 2650-2657, August 2015. |
91. |
Y.-H. Lai, Y. Tsao, F. Chen, "Effects of Adaptation Rate and Noise Suppression on the Intelligibility of Compressed-Envelope Based Speech," PLoS ONE, volume 10.1371, pages journal.pone.0133519, July 2015. ::: |
92. |
Y. Tsao, P. Lin, T.-y. Hu, and X. Lu, "Ensemble Environment Modeling using Affine Transform Group," Speech Communication, volume 68, pages 55–68, April 2015. ::: |
93. |
Y. Tsao, S.-H. Fang, and Y. Hsiao, "Acoustic Echo Cancellation Using a Vector-Space-Based Adaptive Filtering Algorithm," IEEE Signal Processing Letters, volume 22, pages 351-355, March 2015. ::: ::: |
94. |
Y. Tsao, T.-y. Hu, S. Sakti, S. Nakamura, and L.-s. Lee, "Variable Selection Linear Regression for Robust Speech Recognition," IEICE Transactions on Information and Systems, volume E97-D, number 6, pages 1477-1487, June 2014. ::: |
95. |
Y. Tsao, X. Lu, P. Dixon, T.-y. Hu, S. Matsuda, and C. Hori, "Incorporating Local Information of the Acoustic Environments to MAP-based Feature Compensation and Acoustic Model Adaptation," Computer Speech and Language, volume 28, number 3, pages 709-726, May 2014. ::: |
96. |
Y. Tsao, S. Matsuda, C. Hori, H. Kashioka, and C.-H. Lee, "A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 22, number 2, pages 403-416, February 2014. ::: |
97. |
Y.-H. Lai, Y. Tsao, and F. Chen, "A Study of Adaptive WDRC in Hearing Aids under Noisy Conditions,," International Journal of Speech & Language Pathology and Audiology, volume 1, number 2, pages 43-51, December 2013, (invited paper) ::: |
98. |
Y. Tsao and C.-H. Lee, "An Ensemble Speaker and Speaking Environment Modeling Approach to Robust Speech Recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 17, pages 1025 - 1037, June 2009. ::: |
99. |
Y. Tsao, S.-M. Lee, and L.-S. Lee, "Segmental Eigenvoice with Delicate Eigenspace for Improved Speaker Adaptation," IEEE Transactions on Speech and Audio Processing, volume 13, pages 399 - 411, April 2005. ::: |
|
|
Conference Papers | |
1. |
S.-W. Fu, K.-H. Hung, Y. Tsao, and Y.-C. F. Wang, "Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech," to appear in ICLR 2024,. ::: |
2. |
Y.-T. Liu, K.-C. Wang, K.-C. Liu, S.-Y. Peng, and Y. Tsao, "SDEMG: Score-based Diffusion Model For Surface Electromyographic Signal Denoising," to appear in IEEE ICASSP 2024,. ::: |
3. |
X. Lu, P. Shen, Y. Tsao, and H. Kawai, "Hierarchical Cross-modality Knowledge Transfer With Sinkhorn Attention For Ctc-based ASR," to appear in IEEE ICASSP 2024,. ::: |
4. |
H. Wu, H.-C. Kuo, Y. Tsao, H.-y. Lee, "Scalable Ensemble-based Detection Method Against Adversarial Attacks For Speaker Verification," to appear in IEEE ICASSP 2024,. ::: |
5. |
Y. Tseng, L. Berry, and Y.-T. Chen et al.,, "A Multi-task Evaluation Benchmark For Audio-visual Representation Models," to appear in IEEE ICASSP 2024,. ::: |
6. |
R. E. Zezario, B.-R. B. Bai, C.-S. Fuh, H.-M. Wang, and Y. Tsao, "Multi-task Pseudo-label Learning For Non-intrusive Speech Quality Assessment Model," to appear in IEEE ICASSP 2024,. ::: |
7. |
X. Lu, P. Shen, Y. Tsao, and H. Kawa, "Cross-modal alignment with optimal transport for CTC-based ASR," IEEE ASRU 2023, December 2023. ::: |
8. |
C.-C. Lee, H.-W. Chen, C.-S. Chen, H.-M. Wang, T.-T. Liu, and Y. Tsao, "LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models," IEEE ASRU 2023, December 2023. ::: |
9. |
H.-T. Chiang, K.-H. Hung, S.-W. Fu, H.-C. Kuo, M.-H. Tsai, and Y. Tsao, "Study on the Correlation between Objective Evaluations and Subjective Speech Quality and Intelligibility," IEEE ASRU 2023, December 2023. ::: |
10. |
E. Cooper, W.-C. Huang, Y.Tsao, H.-M. Wang, T. Toda, and J. Yamagishi, "The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains," IEEE ASRU 2023, December 2023. ::: |
11. |
T.-A. Hsieh, C.-H. Huck Y., P.-Y. Chen, S. M. Siniscalchi, Y. Tsao, "Inference and Denoise: Causal Inference-based Neural Speech Enhancement," IEEE MLSP 2023, September 2023. ::: |
12. |
W.-Y. Ting, S.-S. Wang, Y. Tsao, and B. Su, "IANS: Intelligibility-aware Null-steering Beamforming for Dual-Microphone Arrays," IEEE MLSP 2023, September 2023. ::: |
13. |
I-C. Chern, S. Chern, H.-C. Kuo, H.-H. Tseng, K.-H. Hung, and Y. Tsao, "Voice Direction-of-Arrival Conversion," IEEE MLSP 2023, September 2023. ::: |
14. |
H. Yen, P.-J. Ku, C.-H. H. Yang, H. Hu, S. M. Siniscalchi, P.-Y. Chen, and Y. Tsao, "Neural Model Reprogramming with Similarity Based Mapping for Low-Resource Spoken Command Recognition," Interspeech 2023, August 2023. ::: |
15. |
Y.-L. Chien, H.-H. Chen, M.-C. Yen, S.-W. Tsai, H.-M. Wang, Y. Tsao, T.-S. Chi, "Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion," Interspeech 2023, August 2023. ::: |
16. |
L.-W. Chen, Y.-F. Cheng, H.-S. Lee, Y. Tsao, and H.-M. Wang, "A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech," Interspeech 2023, August 2023. ::: |
17. |
H.-H. Chen, Y.-L. Chien, M.-C. Yen, S.-W. Tsai, T.-S. Chi, Y. Tsao, and H.-M. Wang, "Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features," Interspeech 2023, August 2023. ::: |
18. |
E.-P. Chu, K.-C. Liu, C.-Y. Hsieh, C.-Y. Chang, Y. Tsao, and C.-T. Chan, "Multi-Task Learning U-Net for Functional Shoulder Sub-Task Segmentation," IEEE EMBC 2023, July 2023. ::: |
19. |
C.-P. Liu, J.-H. Li, E.-P. Chu, C.-Y. Hsieh, K.-C. Liu, C.-T. Chan, and Y. Tsao:, "Deep Learning-based Fall Detection Algorithm Using Ensemble Model of Coarse-fine CNN and GRU Networks," IEEE MeMeA 2023, June 2023. ::: |
20. |
J. Kirton-Wingate, S. Ahmed, M. Gogate, Y. Tsao, A. Hussain, "Towards Individualised Speech Enhancement: An SNR Preference learning System For Multi-modal Hearing Aids," IEEE ICASSP 2023 (AMHAT 2023 Workshop), June 2023. ::: |
21. |
H.-Y. Lin, H.-H. Tseng, and Y. Tsao, "On the Robustness of Non-intrusive Speech Quality Model by Adversarial Examples," IEEE ICASSP 2023, June 2023. ::: |
22. |
I-C. Chern, K.-H. Hung, Y.-T. Chen, T. Hussain, M. Gogate, A. Hussain, Y. Tsao, and J.-C. Hou, "Audio-visual Speech Enhancement And Separation By Utilizing Multi-modal Self-supervised Embeddings," IEEE ICASSP 2023 (AMHAT 2023 Workshop), June 2023. ::: |
23. |
K.-C. Wang, K.-C. Liu, S.-Y. Peng, Y. Tsao, "ECG Artifact Removal from Single-Channel Surface EMG Using Fully Convolutional Networks," IEEE ICASSP 2023, June 2023. ::: |
24. |
T.-H. Chi, K.-C. Liu, C.-Y. Hsieh, Y. Tsao, and C.-T. Chan, "Pre-Impact Fall Detection via CNN-ViT Knowledge Distillation," IEEE ICASSP 2023, June 2023. ::: |
25. |
C.-J. Hsu, H.-L. Chung, H.-y. Lee, amd Y. Tsao, "T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5," IEEE ICASSP 2023, June 2023. ::: |
26. |
C.-C. Lee, Y. Tsao, H.-M. Wang, C.-S. Chen, "D4AM: A General Denoising Framework for Downstream Acoustic Models," ICLR 2023, May 2023. ::: |
27. |
H.-H. Tseng, H.-Y. Lin, H.-K. Hsuan, and Y. Tsao, "Interpretations of Domain Adaptations via Layer Variational Analysis," ICLR 2023, May 2023. ::: |
28. |
C.-H. Chen, K.-C. Liu, T.-Y. Lu, C.-Y. Chang, C.-T. Chan, and Y. Tsao, "Wearable-based Pain Assessment in Patients with Adhesive Capsulitis Using Machine Learning," IEEE NER 2023, April 2023. |
29. |
Y.-J. Lu et al.,, "ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding," Interspeech 2022, September 2022. |
30. |
Y.-W. Chen and Y. Tsao, "InQSS: a speech intelligibility and quality assessment model using a multi-task learning network," Interspeech 2022, September 2022. ::: |
31. |
K.-H. Hung, S.-W. Fu, H.-H. Tseng, H.-T. Chiang, Y. Tsao, C.-W. Lin, "Boosting Self-Supervised Embeddings for Speech Enhancement," Interspeech 2022, September 2022. ::: |
32. |
F.-L. Wang, H.-S. Lee, Y. Tsao and H.-M. Wang, "Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks Authors," Interspeech 2022, September 2022. ::: |
33. |
C.-C. Lee, C.-H. Hu, Y.-C. Lin, C.-S. Chen, H.-M. Wang and Y. Tsao, "NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling," Interspeech 2022, September 2022. ::: |
34. |
C.-J. Peng, Y.-J. Chan, Y.-L.Shen, C. Yu, Y. Tsao and T.-S. Chi, "Perceptual Characteristics Based Multi-objective Model for Speech Enhancement," Interspeech 2022, September 2022. ::: |
35. |
R. Chao, C. Yu, S.-W. Fu, X. Lu and Y. Tsao, "Perceptual Contrast Stretching on Target Feature for Speech Enhancement," Interspeech 2022, September 2022. ::: |
36. |
W.-C. Huang, E. C., Y. Tsao, H.-M. Wang, T. Toda and J. Yamagishi, "The VoiceMOS Challenge 2022," Interspeech 2022, September 2022. ::: |
37. |
C. Yu, S.-W. Fu, T.-An Hsieh, Y. Tsao and M. Ravanelli, "OSSEM: one-shot speaker adaptive speech enhancement using meta learning," Interspeech 2022, September 2022. ::: |
38. |
R. E. Zezario, S.-W. Fu, F. Chen, C.-S. Fuh, H.-M. Wang and Y. Tsao, "MTI-Net: A Multi-Target Speech Intelligibility Prediction Model," Interspeech 2022, pages 5463-5467, September 2022. ::: |
39. |
R. E. Zezario, F. Chen, C.-S. Fuh, H.-M. Wang and Y. Tsao, "MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids," Interspeech 2022, pages 3944-3948, September 2022, 1st Place, Machine Learning Challenges for Hearing Aids Challenge; 1st Place, The Hearing Industry Research Consortium Student Prize ::: |
40. |
T. Hussain, M. Diyan, M. Gogate, K. Dashtipour, A. Adeel, Y. Tsao, A. Hussain, "A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning," IEEE EMBC 2022, July 2022. ::: |
41. |
S.–S. Wang, Y. Tsao, W.–Z. Zheng, H.–W. Yeh, P.–C. Li, S.–H. Fang, Y.–H. Lai, "Dysarthric Speech Enhancement Based on Convolution Neural Network," IEEE EMBC 2022, July 2022. ::: |
42. |
C.-H. H. Yang, J. Qi, S. Y.-C. Chen, Y. Tsao, P.-Y. Chen, "When Bert Meets Quantum Temporal Convolution Learning for Text Classification In Heterogeneous Computing," ICASSP 2022, May 2022. ::: |
43. |
G.-T. Lin, C.-J. Hsu, D.-R. Liu, H.-Y. Lee, and Y. Tsao, "Analyzing The Robustness Of Unsupervised Speech Recognition," ICASSP 2022, May 2022. ::: |
44. |
C.-J. Hsu, H.-y. Lee, Y. Tsao, "XDBERT: Distilling Visual Information to BERT via Cross-Modal Encoders to Improve Language Understanding," ACL 2022, May 2022, (Short Paper) |
45. |
Y.-J. Lu, Z.-Q. Wang, S. Watanabe, A. Richard, C. Yu, and Y. Tsao, "Conditional Diffusion Probabilistic Model For Speech Enhancement," ICASSP 2022, May 2022. ::: |
46. |
Y.-C. Lin,T.-A. Hsieh, K.-H. Hung, C. Yu, H. Garudadri, Y. Tsao, and T.-W. Kuo, "Speech Recovery For Real-world Self-powered Intermittent Devices," ICASSP 2022, May 2022. ::: |
47. |
K.-C. Wang, K.-C. Liu, H.-M. Wang, and Y. Tsao, "EMGSE: Acoustic/emg Fusion For Multimodal Speech Enhancement," ICASSP 2022, May 2022. ::: |
48. |
S.-W. Fu, C. Yu, K.-H. Hung, M. Ravanelli, and Y. Tsao, "MetricGAN-U: Unsupervised Speech Enhancement/ Dereverberation based Only On Noisy/ Reverberated Speech," ICASSP 2022, May 2022. ::: |
49. |
H. Wu, H.-C. Kuo, N. Zheng, K.-H. Hung, H.-Y. Lee, Y. Tsao, H.-M. Wang, H. Meng, "Partially Fake Audio Detection by Self-attention-based Fake Span Discovery," ICASSP 2022, May 2022. ::: |
50. |
H.-Y. Lin, H.-H. Tseng, X. Lu, Yu Tsao, "Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport," NeurIPS 2021, December 2021. ::: |
51. |
Y.-J. Li, S.-S. Wang, Y. Tsao, and B. Su, "MIMO Speech Compression and Enhancement Based on Convolutional Denoising Autoencoder," APSIPA ASC 2021, December 2021. ::: |
52. |
X. Chang, T. Maekaku, P. Guo, J. Shi,Y.-J. Lu, A. S. Subramanian, T. Wang, S.-w. Yang, Y. Tsao, H.-y. Lee, S. Watanabe, "An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition," ASRU 2021, December 2021. ::: |
53. |
Z. Feng, Yu Tsao, and F. Chen, "Estimation and Correction of Relative Transfer Function for Binaural Speech Separation Networks to Preserve Spatial Cues," APSIPA ASC 2021, December 2021. ::: |
54. |
Y.-J. Lu, Y. Tsao, and S. Watanabe, "A Study on Speech Enhancement Based on Diffusion Probabilistic Model," APSIPA ASC 2021, December 2021. ::: |
55. |
M.-C. Yen, W.-C. Huang, K. Kobayashi, Y.-H. Peng, S.-W. Tsai, Y. Tsao, T. Toda, J.-S. Jang, and H.-M. Wang, "Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Model-ing," ASRU 2021, December 2021. ::: |
56. |
H.-T. Chiang, Y.-C. Wu, C. Yu, T. Toda, H.-M. Wang, Y.-C. Hu, and Y. Tsao, "HASA-NET: A Non-Intrusive Hearing-Aid Speech Assessment Network," ASRU 2021, December 2021. ::: |
57. |
X. Lu, P. Shen, Y. Tsao, and H. Kawai, "Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification," APSIPA ASC 2021, December 2021. ::: |
58. |
Y.-S. Liou, W.-C. Huang, M.-C. Yen, S.-W. Tsai, Y.-H. Peng, T. Toda, Y. Tsao, and H.-M. Wang,, "Time Alignment Using Lip Images for Frame-Based Electrolaryngeal Voice Conversion," APSIPA ASC 2021, December 2021. ::: |
59. |
M. E Noor, Y.-J. Lu, S.-Si. Wang, S. Ghose, C.-Y. Chang, R. E. Zezario, S. Ahmed, W.-H. Chung, Y. Tsao and H.-M. Wang, "Investigation of A Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-To-End Bengali Automatic Speech Recogni-tion Under Unseen Noisy Conditions," Oriental COCOSDA 2021, November 2021. ::: |
60. |
T.-A. Hsieh, C. Yu, S.-W. Fu, X. Lu, and Y. Tsao, "Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement," Interspeech 2021, September 2021. ::: |
61. |
Y,-C. Wu, C.-H. Hu, H.-S. Lee, Y.-H. Peng, W.-C. Huang, Y. Tsao, H.-M. Wang and T. Toda, "Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder," Interspeech 2021, September 2021. ::: |
62. |
G.-X. Lin, S.-W. Hu, Y.-J. Lu, Y. Tsao, and C.-S. Lu, "QISTA-Net-Audio: Audio Super-resolution via Non-Convex Lq-normMinimization," Interspeech 2021, September 2021. |
63. |
S.-W. Fu, C. Yu, T.-A. Hsieh, P. Plantinga, M. Ravanelli, X. Lu, Y. Tsao, "MetricGAN +: An Improved Version of MetricGAN for Speech Enhancement," Interspeech 2021, September 2021. ::: |
64. |
W.-C. Huang, K. Kobayashi, Y.-H. Peng, C.-F. Liu, Y. Tsao, H.-M. Wang, T. Toda, "A Preliminary Study of a Two-Stage Paradigm for Preserving SpeakerIdentity in Dysarthric Voice Conversion," Interspeech 2021, September 2021. ::: |
65. |
R. E Zezario, C.-S. Fuh, H.-M. Wang, Y. Tsao, "Speech Enhancement with Zero-Shot Model Selection," EUSIPCO 2021, August 2021. ::: |
66. |
Y.-W. Chen, K.-H. Hung, S.-Y. Chuang, J. Sherman, X. Lu, Y. Tsao, "A Study of Incorporating Articulatory Movement Information in Speech Enhancement," EUSIPCO 2021, August 2021. ::: |
67. |
T.-Y. Lu, K.-C. Liu, C.-Y. Hsieh, C.-Y. Chang, Y. Tsao, C.-T. Chan, "Instrumented Shoulder Functional Assessment using Inertial Measurement Units for Frozen Shoulder," IEEE BHI 2021, pages 1-4, July 2021. ::: |
68. |
Y.-K. Wu, K.-P. Huang, Y. Tsao, H.-y. Lee, "One shot learning for speech separation," ICASSP 2021, June 2021. ::: |
69. |
X. Lu, P. Shen, Y. Tsao, H. Kawai, "Unsupervised neural adaptation model based on optimal transport for spoken language identification," ICASSP 2021, June 2021. ::: |
70. |
Y.-W. Chen, K.-H. Hung, S.-Y. Chuang, J. Sherman, W.-C. Huang, X. Lu, Y. Tsao, "EMA2S: An End-to-End Multimodal Articulatory-to-Speech System," ISCAS 2021, May 2021. ::: |
71. |
C.-J. Peng, Y.-J. Chan, C. Yu, S.-S. Wang, Y. Tsao, T.-S. Chi, "Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario," ISCAS 2021, May 2021. ::: |
72. |
Y.-T. Chang, Y.-H. Yang, Y.-H. Peng, S.-S. Wang, T.-S. Chi, Y. Tsao and H.-M. Wang, "MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation Acceleration," ISCSLP 2021, January 2021. ::: |
73. |
S.-W. Fu et al., "Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing," APSIPA 2020, December 2020. ::: |
74. |
R. E. Zezario, S.-W. Fu, C.-S. Fuh, Y. Tsao, and H.-M. Wang, "STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model," APSIPA 2020, December 2020. ::: |
75. |
S.-Y. Chuang, Y. Tsao, C.-C. Lo, H.-M. Wang, "Lite Audio-Visual Speech Enhancement," Interspeech 2020, October 2020. ::: |
76. |
C.-Y. Chen, W.-Z. Zheng, S.-S. Wang, Y. Tsao, P.-C. Li and Y.-H. Lai, "Enhancing Intelligibility of Dysarthric Speech Using Gated Convolutional-based Voice Conversion System," Interspeech 2020, October 2020. ::: |
77. |
Y.-J. Lu, C.-F. Liao, X. Lu, J.-w. Hung, Y. Tsao, "Incorporating Broad Phonetic Information for Speech Enhancement," Interspeech 2020, October 2020. ::: |
78. |
H. Li, S.-W. Fu, Y. Tsao, J. Yamagishi, "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning," Interspeech 2020, October 2020. ::: |
79. |
C.-C. Lee, Y.-C. Lin, H.-T. Lin, H.-M. Wang, Y. Tsao, "SERIL: Noise Adaptive Speech Enhancement using Regularization-based Incremental Learning," Interspeech 2020, October 2020. ::: |
80. |
R. E. Zezario, T. Hussain, X. Lu, H.-M. Wang, and Y. Tsao, "Self-supervised Denoising Autoencoder with Linear Regression Decoder for Speech Enhancement," ICASSP 2020, May 2020. ::: |
81. |
T. Hussaink, Y. Tsao, H.-M. Wang, J.-C. Wang, S. M. Siniscalchi, and W.-H. Liao, "Compressed Multimodal Hierarchical Extreme Learning Machine for Speech Enhancement," APSIPA 2019, November 2019. ::: |
82. |
W.-C. Lin, Y. Tsao, F. Chen, and H.-M. Wang, "Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature Enhancement," APSIPA 2019, pages 1179-1184, November 2019. ::: |
83. |
F. Ye, Y. Tsao, and F. Chen, "Subjective Feedback-based Neural Network Pruning for Speech Enhancement," APSIPA 2019, November 2019. ::: |
84. |
K.-Y. Liu, S.-S. Wang, Y. Tsao, J.-w. Hung, "Speech Enhancement Based on the Integration of Fully Convolutional Network, Temporal Lowpass Filtering and Spectrogram Masking," ROCLING 2019, October 2019. ::: |
85. |
Y.-C. Lin, Y.-T. Hsu, S.-W. Fu, Y. Tsao, and T.-W. Kuo, "IA-NET: Acceleration and Compression of Speech Enhancement using Integer-adder Deep Neural Network," Interspeech 2019, September 2019. ::: |
86. |
F.-K. Chuang, S.-S. Wang, J.-w. Hung, Y. Tsao, and S.-H. Fang, "Speaker-aware Deep Denoising Autoencoder with Embedded Speaker Identity for Speech Enhancement," Interspeech 2019, September 2019. ::: |
87. |
T. Hussain, Y. Tsao, H.-M. Wang, J.-C. Wang, S. M. Siniscalchi, W.-H. Liao, "Audio-Visual Speech Enhancement Using Hierarchical Extreme Learning Machine," EUSIPCO 2019, September 2019. ::: |
88. |
X. Lu, P. Shen, S. Li, Y. Tsao, and H. Kawai, "Class-wise Centroid Distance Metric Learning for Acoustic Event Detection," Interspeech 2019, September 2019. ::: |
89. |
C.-F. Liao, Y. Tsao, H.-y. Lee and H.-M. Wang, "Noise Adaptive Speech Enhancement using Domain Adversarial Training," Interspeech 2019, September 2019, (with ISCA Travel Grant) ::: |
90. |
C.-F. Liao, Y. Tsao, X. Lu and H. Kawai, "Incorporating Symbolic Sequential Modeling for Speech Enhancement," Interspeech 2019, September 2019, (with ISCA Travel Grant) ::: |
91. |
R. E. Zezario, S.-W. Fu, X. Lu, H.-M. Wang, and Y. Tsao, "Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric," Interspeech 2019, September 2019. ::: |
92. |
W.-C. Huang, Y.-C. Wu, C.-C. Lo, P. L. Tobing, T. Hayashi, K. Kobayashi, T. Toda, Y. Tsao and H.-M. Wang, "Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion," Interspeech 2019, September 2019, (with ISCA Travel Grant) ::: |
93. |
W.-C. Huang et al.,, "Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion," ISCA SSW 10, September 2019. ::: |
94. |
C.-C. Lo, S.-w. Fu, W. C. Huang, X. Wang, J. Yamagishi, Y. Tsao and H.-M. Wang, "MOSNet: Deep Learning based Objective Assessment for Voice Conversion," Interspeech 2019, September 2019. ::: |
95. |
P.-T. Huang, H.-S. Lee, S.-S. Wang, K.-Y. Chen, Y. Tsao and H.-M. Wang, "Exploring the Encoder Layers of Discriminative Autoencoders for LVCSR," Interspeech 2019, September 2019, (with ISCA Travel Grant) ::: |
96. |
W.-C. Huang, Y.-C. Wu, H.-T. Hwang, P. L. Tobing, T. Hayashiy, K. Kobayashi, T. Toda, Y. Tsao, H.-M. Wang, "Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion," EUSIPCO 2019, September 2019. ::: |
97. |
L.-W. Chen, H.-Y. Lee, and Y. Tsao, "Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech," Interspeech 2019, September 2019. ::: |
98. |
S.-W. Fu, C.-F. Liao, Y. Tsao, S.-D. Lin, "MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement," ICML 2019, June 2019, Long Oral with ICML (top 3%) Travel Grant; Codes: https://github.com/JasonSWFu/MetricGAN ::: |
99. |
Y.-L. Shen, C.-Y. Huang, S.-S. Wang, Y. Tsao, H.-M. Wang, and T.-S. Chi, "Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition," ICASSP 2019, May 2019. ::: |
100. |
K.-Y. Liu, S.-k. Lee, S.-S. Wang, Y. Tsao, J.-w. Hung, "Reducing noise and reverberation in speech signals via the integration of denoising autoencoder and temporal lowpass filtering," ICASI 2019, April 2019. ::: |
101. |
T. Hussain, Y. Tsao, S. M. Sinicalchi, J.-C. Wang, H.-M. Wang, and W.-H. Liao, "Bone-conducted Speech Enhancement using Hierarchical Extreme Learning Machine," IWSDS 2019, April 2019. ::: |
102. |
Shang-Chih Lin*, Yu Tsao, Shun-Feng Su, Yennun Huang, and Zi-Qing Zhong, "An Abnormal Detection Strategy of Rotating Electric Machine based on Frequency Distribution," The 39th Symposium on Electrical Power Engineering, December 2018. |
103. |
R. E. Zezario, J.-W. Huang, X. Lu, Y. Tsao, H.-T. Hwang, H.-M. Wang, "Deep Denoising Autoencoder Based Post Filtering for Speech Enhancement," APSIPA 2018, December 2018. ::: |
104. |
Y.-T. Hsu, Y.-C. Lin, S.-W. Fu, Y. Tsao, T.-W. Kuo, "A study on speech enhancement using exponent-only floating point quantized neural network (EOFP-QNN)," SLT 2018, November 2018. ::: |
105. |
Shang-Chih Lin*, Yu Tsao, Shun-Feng Su, and Yennun Huang, "An Industrial IoT Analysis System Based on Machining Data of Metal Materials," International Conference on Fuzzy Theory and Its Applications, November 2018. |
106. |
S.-k. Lee, S.-S. Wang, Y. Tsao, J.-w. Hung, "Speech Enhancement based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via Discrete Wavelet Transform," ISCSLP 2018, November 2018. ::: |
107. |
Y.-T. Hsu, Z. Zhu, C.-T. Wang, S.-H. Fang, F. Rudzicz, and Y. Tsao, "Robustness against the channel effect in pathological voice detection," NeurIPS 2018, Machine Learning for Health (ML4H) Workshop, November 2018. ::: |
108. |
Shang-Chih Lin*, Chuan-Hsiang Su, Yu Tsao, Shun-Feng Su, Hong-Yuan Mark Liao, and Yennun Huang, "FIS-based Domestic Milling Machine PHM System Considering Multi-speed Frequency Variation," IEEE International Conference on Advanced Manufacturing, November 2018, (Best Paper Award) (獲推薦轉投SCI期刊, 擴充研究修改中) |
109. |
W.-C. Huang, H.-T. Hwang, Y.-H. Peng, Y. Tsao, H.-M. Wang, "Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders," ISCSLP 2018, November 2018, Best Student Paper Award ::: |
110. |
Hung-Chung Li, Shang-Chih Lin, Yu Tsao, Shun-Feng Su, Pei-Li Sun and Yennun Huang, "A Supervised Learning Algorithm Considering Light Conditions for Visual Inspection of Metal Objects," The 54th Annual Conference of Chinese Society for Quality 2018 International Symposium of Quality Management, November 2018, (Makalot Industry-Academic Collaboration Award) (獲推薦轉投EI期刊, 擴充研究修改中) |
111. |
Y.-Y. Kao, H.-P. Hsu, C.-F. Liao, Y. Tsao, H.-C. Yang, J.-L. Li, C.-C. Lee, H.-S. Lee, and H.-M. Wang, "Automatic Detection of Speech Under Cold Using Discriminative Autoencoders and Strength Modeling with Multiple Sub-Dictionary Generation," IEEE IWAENC, September 2018. ::: |
112. |
X. Lu, P. Shen, S. Li, Y. Tsao, H. Kawai, "Temporal Attentive Pooling for Acoustic Event Detection," Interspeech 2018, September 2018. ::: |
113. |
S.-W. Fu, Y. Tsao, H.-T. Hwang, H.-M. Wang, "Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM," Interspeech 2018, September 2018. ::: |
114. |
Y.-H. Peng, H.-T. Hwang, Y.-C. Wu, Y. Tsao, H.-M. Wang, "Exemplar-Based Spectral Detail Compensation for Voice Conversion," Interspeech 2018, September 2018. ::: |
115. |
B.-S. Yu, Y. Tsao, S.-W. Yang, Y.-K. Chen, and S.Y. Chien, "Architecture Design of Convolutional Neural Networks for Face Detection on an FPGA Platform," IEEE SiPS 2018, September 2018. |
116. |
Y.-H. Lai, W.-Z. Zheng, S.-T. Tang, S.-H. Fang, W.-H. Liao, and Y. Tsao, "Improving the Performance of Hearing Aids in Noisy Environments based on Deep Learning Technology," EMBC 2018, April 2018. ::: |
117. |
N. Ryant et al., "Enhancement and Analysis of Conversational Speech: JSALT 2017," ICASSP, April 2018. ::: |
118. |
W.-J. Lee, S.-S. Wang, F. Chen, X. Lu, S.-Y. Chien, and Y. Tsao,, "Speech Dereverberation Based on Integrated Deep and Ensemble Learning Algorithm," ICASSP, April 2018. ::: |
119. |
L. Sun, J. Du, T. Gao, Y.-D. Lu, Y. Tsao, C.-H. Lee, N. Ryant, "A Novel LSTM-based Speech Preprocessor For Speaker Diarization in Realistic Mismatch Conditions," ICASSP, April 2018. ::: |
120. |
S.-W. Fu, Y. Tsao, X. Lu, and H. Kawai, "Raw Waveform-based Speech Enhancement by Fully Convolutional Networks," APSIPA 2017, November 2017. ::: |
121. |
Y.-H. Peng, C.-C. Hsu, Y.-C. Wu, H.-T. Hwang, Y.-W. Liu, Y. Tsao, and H.-M. Wang, "Fast Locally Linear Embedding Algorithm for Exemplar-based Voice Conversion," APSIPA 2017, November 2017, (Poster Presentation Award) ::: |
122. |
S.-S. Wang, Y. Tsao, H.-L. S. Wang, Y.-H. Lai*, and L. P.-H. Li, "A Deep Learning based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients in the Presence of Competing Speech Noise," APSIPA 2017, November 2017. ::: |
123. |
T.-H. Lin, Y.-H. Wang, S.-S. Lu, H.-W. Yen, and Y. Tsao, "Computing Biodiversity Change via a Soundscape Monitoring Network," PNC 2017 Annual Conference and Joint Meetings, November 2017. ::: |
124. |
S.-W. Fu, T.-y. Hu, Y. Tsao, X. Lu, "Complex Spectrogram Enhancement by Convolutional Neural Network with Multi-metrics Learning," IEEE MLSP 2017, September 2017. ::: |
125. |
T.-H. Lin and Y. Tsao, "Deblending of Simultaneous-source Seismic Data via Periodicity-coded Nonnegative Matrix Factorization," IEEE Dataport, September 2017. ::: |
126. |
M.-H. Yang, H.-S. Lee, Y.-D. Lu, K.-Y. Chen, Y. Tsao, B. Chen, and H.-M. Wang, "Discriminative Autoencoders for Acoustic Modeling," Interspeech2017, August 2017. ::: |
127. |
C.-L. Wu, H.-P. Hsu, S.-S. Wang, J.-W. Hung, Y.-H. Lai, H.-M. Wang, and Y. Tsao, "Wavelet Speech Enhancement Based on Robust Principal Component Analysis," Interspeech2017, August 2017. ::: |
128. |
C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao, and H.-M. Wang, "Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks," Interspeech2017, August 2017. ::: |
129. |
Y.-C. Wu, H.-T. Hwang, S.-S. Wang, C.-C. Hsu, Y. Tsao, and H.-M. Wang, "A Post-filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech Enhancement," Interspeech2017, August 2017. ::: |
130. |
S.-T. Lin, Y.-H. Liao, Y. Tsao, and S.-Y. Chien,, "Object-based on-line video summarization for internet of video things," EEE ISCAS, May 2017. ::: |
131. |
H.-S. Lee, Y.-D. Lu, C.-C. Hsu, Y. Tsao, H.-M. Wang, and S.-K. Jeng, "Discriminative Autoencoders for Speaker Verification," IEEE ICASSP, March 2017. ::: |
132. |
Y.-C. Wu, H.-T. Hwang, S.-S. Wang, C.-C. Hsu, Y.-H. Lai, Y. Tsao, and H.-M. Wang, "A Locally Linear Embbeding Based Postfiltering Approach for Speech Enhancement," IEEE ICASSP, March 2017. ::: |
133. |
C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao and H.-M. Wang, "Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder," APSIPA ASC, December 2016. ::: |
134. |
J.-C. Hou, S.-S. Wang, Y.-H. Lai, J.-C. Lin, Y. Tsao, H.-W. Chang, and H.-M. Wang, "Audio-Visual Speech Enhancement using Deep Neural Networks," APSIPA 2016, December 2016. ::: |
135. |
C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao, and H.-M. Wang, "Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network," ISCSLP, November 2016. ::: |
136. |
Y.-Y. Hsieh, C.-D. Wu, Y. Tsao, and S.-S. Lu, "A Linear Regression Model with Dynamic Pulse Transit Time Features for Noninvasive Blood Pressure Prediction," BioCAS, October 2016. ::: |
137. |
Y.-H. Lai, S.-S. Wang, Y.-T. Su, H.-C. Cheng, F. K. Fu, and Y. Tsao, "Improving the Performance of Speech Perception in Noisy Environment based on a FAME Strategy," ISCSLP 2016, October 2016. ::: |
138. |
C.-Y. Hsu, R. E. Zezario, J.-C. Wang, X. Lu, and Y. Tsao, "Incorporating Local Environment Information with Ensemble Neural Networks to Robust Automatic Speech Recognition," ISCSLP 2016, October 2016. ::: |
139. |
H.-S. Lee, Y. Tsao, C.-C. Lee, H.-M. Wang, W.-C. Lin, W.-C. Chen, S.-W. Hsiao, S.-K. Jeng, "Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation," Interspeech, September 2016. ::: |
140. |
Y.-C. Wu, H.-T. Hwang, C.-C. Hsu, Y. Tsao, H.-M. Wang, "Locally Linear Embedding for Exemplar-Based Spectral Conversion," Interspeech, September 2016. ::: |
141. |
X. Lu, P. Shen, Y. Tsao, H. Kawai, "Pair-wise Distance Metric Learning of Neural Network Model for Spoken Language Identification," Interspeech, September 2016. ::: |
142. |
S.-W. Fu, Y. Tsao, X. Lu, "SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement," Interspeech, September 2016. ::: |
143. |
Y.-H. Lai, C.-H. Wang, S.-Y. Hou, B.-Y. Chen, Y. Tsao, and Y.-W. Liu, "DCASE Report for Task 3: Sound Event Detection in Real Life Audio," DCASE 2016 workshop, September 2016. ::: |
144. |
C.-W. Wu, M.-T. Zhong, Y. Tsao, S.-W. Yang, Y.-K. Chen, and S.-Y. Chien, "Track-clustering Error Evaluation for Track-based Multi-camera Tracking System Employing Human Re-identification," CVPR workshop, August 2016, Codes: https://github.com/cw1204772/ClustTMCT ::: |
145. |
Y.-T. Liu, Y. Tsao, R. Y. Chang:, "Nonnegative Matrix Factorization-based Frequency Lowering Technology for Mandarin-speaking Hearing Aid Users," IEEE ICASSP2016, pages 5905-5909, May 2016. ::: |
146. |
Jeremy Chiaming Yang, Syu-Siang Wang, Yu Tsao, and Jeih-weih Hung, "Speech Enhancement via Ensemble Modeling NMF Adaptation," IEEE ICCE-Taiwan 2016, May 2016. ::: |
147. |
Syu-Siang Wang, Jeremy Chiaming Yang, Yu Tsao, and Jeih-weih Hung, "Leveraging Nonnegative Matrix Factorization in Processing the Temporal Modulation Spectrum for Speech Enhancement," IEEE ICCE-Taiwan 2016, May 2016. ::: |
148. |
S.-S. Wang and Y. Tsao, "Temporal Modulation Spectral Restoration for Robust Speech Recognition," IEEE International Conference on Multimedia Big Data, April 2016. ::: |
149. |
Ying-Hui Lai, Chien-Hsun Chen, Shih-Tsang Tang, Zong-Mu Yeh, and Yu Tsao, "Improving the Performance of Noise Reduction in Hearing Aids Based on the Genetic Algorithm," IFMBE Proceedings 57, March 2016. |
150. |
P. Lin, D.-C. Lyu, Y.-F. Chang, and Y. Tsao, "Temporal Alignment for Deep Neural Networks," GlobalSIP 2015, December 2015. ::: |
151. |
H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen, "A Probabilistic Interpretation for Artificial Neural Network-based Voice Conversion," APSIPA 2015, December 2015. ::: |
152. |
S.-S. Wang, H.-T. Hwang, Y.-H. Lai, Y. Tsao, X. Lu, H.-M. Wang, and B. Su, "Improving Denoising Auto-encoder Based Speech Enhancement With the Speech Parameter Generation Algorithm," APSIPA 2015, December 2015. ::: |
153. |
Y.-T. Liu, R. Y. Chang, Y. Tsao, and Y.-p. Chang, "A New Frequency Lowering Technique for Mandarin-Speaking Hearing Aid Users," GlobalSIP 2015, December 2015. ::: |
154. |
X. Lu, P. Shen, Y. Tsao, C. Hori, H. Kawai, "Sparse Representation with Temporal Max-Smoothing for Acoustic Event Detection," Interspeech 2015, ISCA, editor, pages 1176-1180, September 2015. ::: |
155. |
P. Lin, D.-C. Lyu, Y.-F. Chang, and Y. Tsao, "Speech Recognition with Temporal Neural Networks," Interspeech 2015, ISCA, editor, pages 21–25, September 2015. ::: |
156. |
P. Lin, S.-S. Wang, and Y. Tsao, "Temporal Information in Tone Recognition," IEEE ICCE 2015, June 2015. ::: |
157. |
W.-C. Chen, P.-T. Lai, Y. Tsao, and C.-C. Lee, "Multimodal Arousal Rating using Unsupervised Fusion Technique," ICASSP 2015, April 2015. ::: |
158. |
Y.-H. Lai, S.-S. Wang, P.-C. Li, and Yu Tsao, "A Discriminative Post-filter for Speech Enhancement in Hearing Aids," ICASSP 2015, April 2015. ::: |
159. |
Y.-H. Lai, F. Chen, and Y. Tsao, "Effect of Adaptive Envelope Compression in Simulated Electric Hearing in Reverberation," ISIC 2014, December 2014. ::: |
160. |
Y.-F. Chang, P. Lin, S.-H. Cheng, K.-H. Chan, Y.-C. Zeng, C.-W. Liao, W.-T. Chang, Y.-C. Wang, Y. Tsao, "Robust Anchorperson Detection Based on Audio Streams using a Hybrid I-vector and DNN System," APSIPA 2014, December 2014. ::: |
161. |
H. Jing, A.-C. Liang, S.-D. Lin, and Y. Tsao, "A Transfer Probabilistic Collective Factorization Model to Handle Sparse Data in Collaborative Filtering," ICDM 2014, December 2014, accepted as a regular paper (acceptance rate=9.5%) ::: |
162. |
X. Lu, Y. Tsao, S. Matsuda, and C. Hori, "Ensemble Modeling of Denoising Autoencoder for Speech Spectrum Restoration," Interspeech 2014, September 2014. ::: |
163. |
P. Lin, F. Chen, S.-S. Wang, Y. Tsao and Y. H. Lai, "Automatic Speech Recognition with Primarily Temporal Envelope Information," Interspeech 2014, September 2014. ::: |
164. |
X. Lu, Y. Tsao, P. Shen, and C. Hori, "Spectral Patch Based Sparse Coding for Acoustic Event Detection," ISCSLP 2014, September 2014. ::: |
165. |
H.-S. Lee, Y. Tsao, H.-M. Wang and S.-K. Jen, "Clustering-Based I-Vector Formulation for Speaker Recognition," Interspeech 2014, September 2014. ::: |
166. |
S.-S. Wang, P. Lin, D.-C. Lyu, Y. Tsao, H.-T. Hwang, B. Su and H.-M. Wang, "Acoustic Feature Conversion using a Polynomial based Feature Transferring Algorithm," ISCSLP 2014, September 2014. ::: |
167. |
Y. H. Lai, F. Chen, and Y. Tsao, "An Adaptive Envelope Compression Strategy for Speech Processing in Cochlear Implants," Interspeech 2014, September 2014. ::: |
168. |
H. Jing, T.-Y. Hu, H.-S. Lee, W.-C. Chen, C.-C. Lee, Y. Tsao and H.-M. Wang, "Ensemble of Machine Learning Algorithms for Cognitive and Physical Speaker Load Detection," Interspeech 2014, September 2014. ::: |
169. |
H.-t. Fan, J.-w. Hung, X. Lu, S.-S. Wang, Yu Tsao, "Speech Enhancement using Segmental Nonnegative Matrix Factorization," ICASSP 2014, May 2014. ::: |
170. |
H.-S. Lee, Y. Tsao, Y.-F. Chang, H.-M. Wang, and S.-K. Jeng, "Speaker Verification Using Kernel-Based Binary Classifiers with Binary Operation Derived Features," ICASSP 2014, May 2014. ::: |
171. |
X. Lu, Yu Tsao, S. Matsuda, and C. Hori, "Sparse Representation Based on a Bag of Spectral Exemplars for Acoustic Event Detection," ICASSP 2014, May 2014. ::: |
172. |
H. Jing, Y. Tsao, K.-Y. Chen and H.-M. Wang, "Semantic Naïve Bayes Classifier for Document Classification," IJCNLP, December 2013. ::: |
173. |
H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, S.-H. Chen, "Incorporating Global Variance in the Training Phase of GMM-based Voice Conversion," APSIPA 2013, October 2013. ::: |
174. |
C.-H. Wang, T.-W. Kao, S.-H. Fang, Y. Tsao, L.-C. Kuo, S.-W. Kao, and N.-C. Lin, "Robust Wi-Fi Location Fingerprinting Against Device Diversity based on Spatial Mean Normalization," APSIPA 2013, October 2013. ::: |
175. |
Hung-yi Lee, Ting-yao Hu, How Jing, Yun-Fan Chang, Yu Tsao, Yu-Cheng Kao and Tsang-Long Pao, "Ensemble of Machine Learning and Acoustic Segment Model Techniques for Speech Emotion and Autism Spectrum Disorders Recognition," Interspeech 2013, August 2013, (Second Place In the Autism Sub-Challenge) ::: |
176. |
Tsung-Hsien Wen, Aaron Heidel, Hung-yi Lee, Yu Tsao and Lin-Shan Lee, "Recurrent Neural Network Based Language Model Personalization by Social Network Crowdsourcing," Interspeech 2013, August 2013, (Best Student Paper Award Nomination) ::: |
177. |
Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang and Sin-Horng Chen, "Alleviating the Over-Smoothing Problem in GMM-Based Voice Conversion with Discriminative Training," Interspeech 2013, August 2013. ::: |
178. |
Bo Li, Yu Tsao and Khe Chai Sim, "An Investigation of Spectral Restoration Algorithms for Deep Neural Networks based Noise Robust Speech Recognition," Interspeech 2013, August 2013. ::: |
179. |
Xugang Lu, Yu Tsao, Shigeki Matsuda and Chiori Hori, "Speech Enhancement Based on Deep Denoising Autoencoder," Interspeech 2013, August 2013, Codes: Tensor Flow: https://github.com/jonlu0602/DeepDenoisingAutoencoder; Keras: https://github.com/jerrygood0703/DDAE; Matlab: https://drive.google.com/open?id=0B8ZEsMh6ITIlNVZ1VmROdTdQNUU ::: ::: |
180. |
Ying-Hui Lai, Yu-Cheng Su, Yu Tsao, Shuenn-Tsong Young, "Evaluation of Generalized Maximum a Posteriori Spectral Amplitude (GMAPA) Speech Enhancement Algorithm in Hearing Aids," ISCE 2013, June 2013. ::: |
181. |
Syu-Siang Wang, Yu Tsao, Jeih-weih Hung, "Filtering on the Temporal Probability Sequence in Histogram Equalization for Robust Speech Recognition," ICASSP 2013, IEEE, May 2013. ::: |
182. |
Yu-Cheng Su, Yu Tsao, Jung-En Wu, Fu-Rong Jean, "Speech Enhancement using Generalized Maximum a Posteriori Spectral Amplitude Estimator," ICASSP 2013, IEEE, May 2013. ::: |
183. |
How Jing and Yu Tsao, "Sparse Maximum Entropy Deep Belief Nets," IJCNN 2013, IEEE, April 2013. ::: |
184. |
H.-T. Hwang, Yu Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen, "Exploring Mutual Information for GMM-Based Spectral Conversion," ISCSLP 2012, IEEE, December 2012. ::: |
185. |
S.-S. Wang, J.-W. Hung, and Yu Tsao, "A Study on Cepstral Subband Normalization for Robust ASR," ISCSLP 2012, IEEE, December 2012. ::: |
186. |
X. Lu, Yu Tsao, S. Matsuda, C. Hori, and H. Kashioka, "Acoustic Space Partition based on Broad Phonetic Class for Ensemble Acoustic Modeling," ISCSLP 2012, IEEE, December 2012. ::: |
187. |
T.-Y. Hu, Yu Tsao, and L.-S. Lee, "Discriminative Fuzzy Clustering Maximum a Posterior Linear Regression for Speaker Adaptation," Interspeech 2012, ISCA, September 2012. ::: |
188. |
H.-T. Hwang, Yu Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen, "A Study of Mutual Information for GMM-Based Spectral Conversion," Interspeech 2012, ISCA, September 2012. ::: |
189. |
Yu Tsao, C.-L. Huang, S. Matsuda, C. Hori, and H. Kashioka, "A Linear Projection Approach to Environment Modeling for Robust Speech Recognition," ICASSP 2012, IEEE, April 2012. ::: |
190. |
C.-L. Huang, Yu Tsao, and C. Hori, "Feature Normalization and Selection for Robust Speaker State Recognition," COCOSDA 2011, IEEE, October 2011. ::: |
191. |
Yu Tsao, P. R. Dixon, C. Hori, and H. Kawai, "Incorporating Regional Information to Enhance MAP-based Stochastic Feature Compensation for Robust Speech Recognition," Interspeech, ISCA, August 2011. ::: |
192. |
Yu Tsao, R. Isotani, H. Kawai, and S. Nakamura, "Increasing Discriminative Capability on Map-based Mapping Function Estimation for Acoustic Model Adaptation," ICASSP, IEEE, May 2011. ::: |
193. |
Y. Tsao, S. Matsuda, S. Sakai, R. Isotani, H. Kawai, and S. Nakamura, "A Sampling-based Environment Population Projection Approach for Rapid Acoustic Model Adaptation," ICASSP, IEEE, May 2011. ::: |
194. |
J. Li, Y. Tsao, and C.-H. Lee, "Shrinkage Model Adaptation in Automatic Speech Recognition," Interspeech, ISCA, September 2010. ::: |
195. |
A. Mushtaq, Y. Tsao, and C.-H. Lee, "A Particle Filter Feature Compensation Approach to Robust Speech Recognition," Interspeech, ISCA, September 2010. ::: |
196. |
Yu Tsao, H. Sun, H. Li, and C.-H. Lee, "An Acoustic Segment Model Approach to Incorporating Temporal Information into Speaker Modeling for Text-Independent Speaker Recognition," ICASSP, IEEE, May 2010. ::: |
197. |
Y. Tsao, S. Matsuda, S. Nakamura, and C.-H. Lee, "MAP Estimation of Online Mapping Parameters in Ensemble Speaker and Speaking Environment Modeling," ASRU, IEEE, December 2009. ::: |
198. |
Y. Tsao, J. Li, C.-H. Lee, and S. Nakamura, "Soft Margin Estimation on Improving Environment Structures for Ensemble Speaker and Speaking Environment Modeling," IUCS, ACM, December 2009. ::: |
199. |
S. Matsuda, Y. Tsao, J. Li, S. Nakamura, and C.-H. Lee, "A Study on Soft Margin Estimation of Linear Regression Parameters for Speaker Adaptation," Interspeech, ISCA, December 2009. ::: |
200. |
Y. Tsao, J. Li, and C.-H. Lee, "Ensemble Speaker and Speaking Environment Modeling Approach with Advanced Online Estimation Process," ICASSP, IEEE, May 2009. ::: |
201. |
S.-Y. Peng, Y. Tsao, P. E. Hasler, and D. V. Anderson, "A Programmable Analog Radial-Basis-Function Based Classifier," ICASSP, IEEE, December 2008. ::: |
202. |
Y. Tsao and C.-H. Lee, "Improving the Ensemble Speaker and Speaking Environment Modeling Approach by Enhancing the Precision of the Online Estimation Process," Interspeech, ISCA, September 2008. ::: |
203. |
Y. Tsao and C.-H. Lee, "Two Extensions to Ensemble Speaker and Speaking Environment Modeling for Robust Automatic Speech Recognition," ASRU, IEEE, December 2007. ::: |
204. |
I. Bromberg, Q. Fu, J. Hou, J. Li, C. Ma, B. Mattews, A. Moreno-Daniel, J. Morris, S. M. Siniscalchi, Y. Tsao, and Y. Wang, "Detection-based ASR In the Automatic Speech Attribute Transcription Project," Interspeech, ISCA, September 2007. ::: |
205. |
Y. Tsao and C.-H. Lee, "An Ensemble Modeling Approach to Joint Characterization of Speaker and Speaking Environments," Interspeech, ISCA, September 2007. ::: |
206. |
C. Ma, Y. Tsao, and C.-H. Lee, "A Study on Detection Based Automatic Speech Recognition," Interspeech, ISCA, September 2006. ::: |
207. |
Y. Tsao and C.-H. Lee, "A Vector Space Approach to Environment Modeling for Robust Speech Recognition," Interspeech, ISCA, September 2006. ::: |
208. |
Y. Tsao, J. Li, and C.-H. Lee, "A Study on Separation between Acoustic Models and Its Applications," Eurospeech, ISCA, September 2005. ::: |
209. |
J. Li, Y. Tsao, and C.-H. Lee, "A Study on Knowledge Source Integration for Candidate Rescoring in Automatic Speech Recognition," ICASSP, IEEE, April 2005. ::: |
210. |
Y. Tsao, S.-M. Lee, and L.-S. Lee, "Segmental Eigenvoice for Rapid Speaker Adaptation," Eurospeech, ISCA, September 2001. ::: |
|
|
Technical Reports | |
1. |
王豫煌、林誠謙、嚴漢偉、林子皓、陸聲山、曹昱、端木茂甯、黃俊嘉、莊庭瑞, "亞洲聲景長期監測網," number 3, 臺灣生態學會、中央研究院、日本國立研究開發法人海洋研究開發機構、林業試驗所森林保護組, August 2019. ::: |
2. |
曹昱, "基於人工智慧之語音溝通輔具," 中研院 | 數理科學, 漫步科研, 科普專欄 2019-06-20, 2019. ::: |
3. |
張佑榕、曹昱, "研之有物(智慧聽)," 中央研究院, 2019. ::: |
4. |
端木茂甯, "研之有物(蝙蝠的超音波,藏了什麼訊息?)," 中央研究院, 2018. ::: |
|
|
Book & Book Chapters | |
1. |
P. Lin, Y. Tsao, and L.-W. Kuo,, chapter "Controlling the Biocompatibility and Mechanical Effects of Implantable Microelectrodes to Improve Chronic Neural Recordings in the Auditory Nervous System," "An Excursus into Hearing Loss," S. Hatzopoulos and A. Ciorba, editor, pages 173-195, IntechOpen, May 2018. ::: |
2. |
Y.-H. Lai, Fe. Chen, and Y. Tsao,, chapter "Adaptive Dynamic Range Compression for Improving Envelope-Based Speech Perception: Implications for Cochlear Implants," "Emerging Technology and Architecture for Big-data Analytics," A. Chattopadhyay and Y. Hao, editor, pages 191-214, Springer, April 2017. ::: |
|
|
Others | |
1. |
Yu Tsao, "基於深度學習之語音增強技術及其應用,", 2020大數據人工智能. ::: ::: |
2. |
"Yu Tsao's CV," 2024. ::: |
3. |
Yu Tsao, "Utilizing Deep Learning for Speech Enhancement in Assistive Oral Communication Technologies,", Keynote Speech in M3Oriental Workshop, ACM Multimedia Asia 2023 December 2023. ::: |
4. |
Yu Tsao, "Wearable Devices and Machine Learning Algorithms for Augmented Oral Communication Assistance,", CTSoc Technical Talk November 2023. ::: |
5. |
Fei Chen and Yu Tsao, "Advances in Psychoacoustics and Machine Learning towards Objective Speech Intelligibility Evaluation," October 2023. ::: ::: |
6. |
Fei Chen and Yu Tsao, "Speech Assessment Metrics: From Psychoacoustics to Machine Learning,", Tutorial in Interspeech 2023 August 2023. ::: ::: |
7. |
Yu Tsao, "聽說 AI," November 2022, 國科會工程處記者會 ::: ::: |
8. |
Fei Chen and Yu Tsao, "Speech enhancement for cochlear implants: From psychoacoustics to machine learning,", Tutorial in Interspeech 2022 September 2022. ::: |
9. |
Fei Chen and Yu Tsao, "Advances in Cochlear Implants: From Speech Perception, Enhancement to Evaluation,", Tutorial in EUSIPCO 2022 September 2022. ::: |
10. |
Fei Chen and Yu Tsao, "Speech Perception and Enhancement in Cochlear Implants," December 2021, Tutorial in APSIPA 2021 ::: ::: |
11. |
Berrak Sisman, Yu Tsao, Haizhou Li, "Theory and Practice of Voice Conversion,", Tutorial in APSIPA 2020 December 2020. ::: |
12. |
Fei Chen and Yu Tsao, "Intelligibility Evaluation and Speech Enhancement based on Deep Learning,", Tutorial in Interspeech 2020 October 2020, Video: https://www.youtube.com/watch?v=89S4CgfPWG0 ::: ::: |
13. |
Yu Tsao, "Speech Enhancement based on Deep Learning and Intelligibility Evaluation,", Tutorial in APSIPA 2019 November 2019. ::: |
14. |
H.-Y. Lee and Y. Tsao, "Generative Adversarial Network and its Applications to Speech Signal Processing and Natural Language Processing,", Tutorial in Interspeech 2019 September 2019. ::: |
15. |
"Improving biodiversity monitoring through soundscape information retrieval," May 2018. ::: |
16. |
Hung-iy Lee and Yu Tsao, "Generative Adversarial Network and its Applications to Speech Signal Processing and Natural Language Processing,", Tutorial in ICASSP 2018 April 2018. ::: |
17. |
Y.-C. Lin, Y.-H. Lai, H.-W. Chang, Y. Tsao, Y.-p. Chang, and R. Y. Chang, "PAD-MMRT," August 2014, Original corpus is prepared by K.-S. Tsai, L.-H. Tseng, C.-J.Wu, and S.-T. Young: “Development of a Mandarin monosyllable recognition test,” Ear and Hearing, vol. 30, no. 1, pp. 90–99, 2009. ::: ::: |
18. |
曹昱,蘇煜程,王緒翔, "線性映射轉換函數於聲學模型調適之強健式語音辨識,", 計算語言學學會通訊 第 23 卷第 2 期 (2012 年 6 月 ) June 2012. ::: |
|
|
|
|
|
|
|
|
|
|
|
|