Hi there 🙌
I am a second year Ph.D. student in Computer Science at The University of Hong Kong (HKU) and a Research Assistant at Hong Kong Baptist University (HKBU), jointly supervised by Prof. Francis C.M. Lau, Prof. Reynold C.K. Cheng, and Prof. Yupeng Li. Previously, I worked as a Research Assistant under Prof. Zhizheng Wu at the Chinese University of Hong Kong, Shenzhen and the Shanghai AI Laboratory, and as a Mitacs Research Intern under Prof. Zhen Ming (Jack) Jiang at York University (YorkU). I hold a B.Eng. in Software Engineering from Nanjing University of Posts and Telecommunications (NJUPT), where I received the Outstanding Bachelor's Thesis Award.
I am the creator of Emilia, a leading dataset for expressive and spontaneous text-to-speech (TTS) synthesis, and its preprocessing pipeline, Emilia-Pipe. As of May 2025, Emilia has surpassed 500,000 downloads by over 1,000 institutions and companies, including Stanford, CMU, OpenAI, Google, and NVIDIA. It is the "most liked dataset" in the audio category on HuggingFace and serves as a foundational training dataset for state-of-the-art TTS models like F5-TTS, MaskGCT, and SparkTTS, as well as audio language models such as Kimi-Audio, VITA-Audio, and Ming-Omni.
My current research interests revolve around Social Computing and Large Language Models (LLMs), where I aim to leverage LLMs to address critical societal challenges such as misinformation, fake news, and deepfakes.
Links 🔗