Publications

(* denotes equal contribution)

  1. Voxlect: A Speech Foundation Model Benchmark for Modeling Dialects and Regional Languages Around the Globe

    Tiantian Feng, Kevin Huang, Anfeng Xu, Xuan Shi, Thanathai Lertpetchpun, Jihwan Lee, Yoonjeong Lee, Dani Byrd, Shrikanth Narayanan

    Submitted to KDD, 2026

  2. Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits

    Tiantian Feng, Jihwan Lee, Anfeng Xu, Yoonjeong Lee, Thanathai Lertpetchpun, Xuan Shi, Helin Wang, Thomas Thebaud, Laureano Moro-Velazquez, Dani Byrd, Najim Dehak, Shrikanth Narayanan

    Submitted to NeurIPS, 2025

  3. Developing a High-performance Framework for Speech Emotion Recognition in Naturalistic Conditions Challenge for Emotional Attribute Prediction

    Thanathai Lertpetchpun*, Tiantian Feng*, Dani Byrd, Shrikanth Narayanan

    Interspeech, 2025

  4. Developing a Top-tier Framework in Naturalistic Conditions Challenge for Categorized Emotion Prediction: From Speech Foundation Models and Learning Objective to Data Augmentation and Engineering Choices

    Tiantian Feng*, Thanathai Lertpetchpun*, Dani Byrd, Shrikanth Narayanan

    Interspeech, 2025

  5. Amplifying Artifacts with Speech Enhancement in Voice Anti-spoofing

    Thanapat Trachu*, Thanathai Lertpetchpun*, Ekapol Chuangsuwanich

    Interspeech, 2025

  6. Instance-based Temporal Normalization for Speaker Verification

    Thanathai Lertpetchpun, Ekapol Chuangsuwanich

    Interspeech, 2023