Speak in the Scene: Diffusion-based Acoustic Scene Transfer toward Immersive Speech Generation INTERSPEECH, 2024
MF-PAM: Accurate Pitch Estimation through Periodicity Analysis and Multi-level Feature Fusion INTERSPEECH, 2023
Imaginary Voice: Face-styled Diffusion Model for Text-to-Speech International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023
MoLE: Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023
An Empirical Study on Speech Restoration Guided by Self-supervised Speech Representation International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023
Diffusion-based Generative Speech Source Separation International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023
Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion Odyssey, 2022
Phase Continuity: Learning Derivatives of Phase Spectrum for Speech Enhancement International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022
Looking Into Your Speech: Learning Cross-Modal Affinity for Audio-Visual Speech Separation IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021
End-To-End Lip Synchronisation Based on Pattern Classification IEEE Spoken Language Technology Workshop (SLT), 2021
Intra-Class Variation Reduction of Speaker Representation in Disentanglement Framework INTERSPEECH, 2020
Seeing Voices and Hearing Voices: Learning Discriminative Embeddings Using Cross-Modal Self-Supervision INTERSPEECH, 2020
Gradient-based Active Learning Query Strategy for End-to-end Speech Recognition International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019
Perfect Match: Improved Cross-modal Embeddings for Audio-visual Synchronisation International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019
A Study on Search Grid Points for Data-Driven 3-D Beamsteering Hands-free Speech Communications and Microphone Arrays (HSCMA), 2017
Perfect Match: Self-Supervised Embeddings for Cross-Modal Retrieval Journal of Selected Topics in Signal Processing, 2020
A Study on Speech Disentanglement Framework based on Adversarial Learning for Speaker Recognition The Journal of the Acoustical Society of Korea, 2020
Generic Uniform Search Grid Generation Algorithm for Far-field Source Localization The Journal of the Acoustical Society of America, 2018