Skip to content
View HarryHe11's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Highlights

  • Pro

Block or report HarryHe11

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
HarryHe11/README.md

Hi there 🙌

I am a second year Ph.D. student in Computer Science at The University of Hong Kong (HKU) and a Research Assistant at Hong Kong Baptist University (HKBU), jointly supervised by Prof. Francis C.M. Lau, Prof. Reynold C.K. Cheng, and Prof. Yupeng Li. Previously, I worked as a Research Assistant under Prof. Zhizheng Wu at the Chinese University of Hong Kong, Shenzhen and the Shanghai AI Laboratory, and as a Mitacs Research Intern under Prof. Zhen Ming (Jack) Jiang at York University (YorkU). I hold a B.Eng. in Software Engineering from Nanjing University of Posts and Telecommunications (NJUPT), where I received the Outstanding Bachelor's Thesis Award.

I am the creator of Emilia, a leading dataset for expressive and spontaneous text-to-speech (TTS) synthesis, and its preprocessing pipeline, Emilia-Pipe. As of May 2025, Emilia has surpassed 500,000 downloads by over 1,000 institutions and companies, including Stanford, CMU, OpenAI, Google, and NVIDIA. It is the "most liked dataset" in the audio category on HuggingFace and serves as a foundational training dataset for state-of-the-art TTS models like F5-TTS, MaskGCT, and SparkTTS, as well as audio language models such as Kimi-Audio, VITA-Audio, and Ming-Omni.

My current research interests revolve around Social Computing and Large Language Models (LLMs), where I aim to leverage LLMs to address critical societal challenges such as misinformation, fake news, and deepfakes.

Links 🔗

Pinned Loading

  1. open-mmlab/Amphion open-mmlab/Amphion Public

    Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

    Python 9.3k 737