I am Xiaoxu Zhu (朱晓旭), a Speech AI researcher focusing on large speech models and speech generation. I have been working on speech algorithm research and development at SenseTime since 2021. I have also worked or interned at Siemens, Cheetah Mobile, and Shanghai AI Lab.
I am expected to receive my Master's degree in Information Management from Tsinghua University in 2025 while working. I received my Bachelor's degree from Harbin Institute of Technology in 2016, and my Master's degree in Informatics and Computing Technology from Peter the Great St. Petersburg Polytechnic University in 2019.
Performance Optimization: Implemented pre-computation of GRU B conditioning vectors to achieve approximately 10% speed improvement in LPCNet inference. This optimization reduces computational overhead by caching frequently used conditioning vectors, significantly improving real-time speech synthesis performance.
I am always eager to connect and exchange ideas with fellow researchers in large speech models and speech AI. Feel free to reach out if you'd like to discuss research collaborations or share insights!