Publications

(* denotes equal contribution)

  1. Learning-free L2-Accented Speech Generation using Phonological Rules

    Thanathai Lertpetchpun, Yoonjeong Lee, Jihwan Lee, Tiantian Feng, Dani Byrd, Shrikanth Narayanan

    Submitted to Interspeech 2026

  2. Targeted Speaker Poisoning Framework in Zero-Shot Text-to-Speech

    Thanapat Trachu*, Thanathai Lertpetchpun*, Sai Praneeth Karimireddy, Shrikanth Narayanan

    Submitted to Interspeech 2026

  3. Accent Vector: Controllable Accent Manipulation for Multilingual TTS Without Accented Data

    Thanathai Lertpetchpun*, Thanapat Trachu*, Jihwan Lee, Tiantian Feng, Dani Byrd, Shrikanth Narayanan

    Submitted to Interspeech 2026

  4. Trade-offs Between Capacity and Robustness in Neural Audio Codecs for Adversarially Robust Speech Recognition

    Jordan Prescott, Thanathai Lertpetchpun, Shrikanth Narayanan

    Submitted to Interspeech 2026

  5. Quantifying Speaker Embedding Phonological Rule Interactions in Accented Speech Synthesis

    Thanathai Lertpetchpun*, Yoonjeong Lee*, Thanapat Trachu, Jihwan Lee, Tiantian Feng, Dani Byrd, Shrikanth Narayanan

    ICASSP, 2026

  6. ARTI-6: Towards Six-dimensional Articulatory Speech Encoding

    Jihwan Lee, Sean Foley, Thanathai Lertpetchpun, Kevin Huang, Yoonjeong Lee, Tiantian Feng, Louis Goldstein, Dani Byrd, Shrikanth Narayanan

    ICASSP, 2026

  7. VoxGuard: Evaluating User and Attribute Privacy in Speech via Membership Inference Attacks

    Efthymios Tsaprazlis, Thanathai Lertpetchpun, Tiantian Feng, Sai Praneeth Karimireddy, Shrikanth Narayanan

    ICASSP, 2026

  8. Voxlect: A Speech Foundation Model Benchmark for Modeling Dialects and Regional Languages Around the Globe

    Tiantian Feng, Kevin Huang, Anfeng Xu, Xuan Shi, Thanathai Lertpetchpun, Jihwan Lee, Yoonjeong Lee, Dani Byrd, Shrikanth Narayanan

    KDD, 2026

  9. Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits

    Tiantian Feng, Jihwan Lee, Anfeng Xu, Yoonjeong Lee, Thanathai Lertpetchpun, Xuan Shi, Helin Wang, Thomas Thebaud, Laureano Moro-Velazquez, Dani Byrd, Najim Dehak, Shrikanth Narayanan

    Submitted to DMLR

  10. Developing a High-performance Framework for Speech Emotion Recognition in Naturalistic Conditions Challenge for Emotional Attribute Prediction

    Thanathai Lertpetchpun*, Tiantian Feng*, Dani Byrd, Shrikanth Narayanan

    Interspeech, 2025

  11. Developing a Top-tier Framework in Naturalistic Conditions Challenge for Categorized Emotion Prediction: From Speech Foundation Models and Learning Objective to Data Augmentation and Engineering Choices

    Tiantian Feng*, Thanathai Lertpetchpun*, Dani Byrd, Shrikanth Narayanan

    Interspeech, 2025

  12. Amplifying Artifacts with Speech Enhancement in Voice Anti-spoofing

    Thanapat Trachu*, Thanathai Lertpetchpun*, Ekapol Chuangsuwanich

    Interspeech, 2025

  13. Instance-based Temporal Normalization for Speaker Verification

    Thanathai Lertpetchpun, Ekapol Chuangsuwanich

    Interspeech, 2023