We gratefully acknowledge support from
the Simons Foundation and member institutions.

Audio and Speech Processing

Authors and titles for recent submissions

[ total of 23 entries: 1-5 | 6-10 | 11-15 | 16-20 | 21-23 ]
[ showing 5 entries per page: fewer | more | all ]

Fri, 5 Dec 2025

[1]  arXiv:2512.04964 [pdf, ps, other]
Title: HiPPO: Exploring A Novel Hierarchical Pronunciation Assessment Approach for Spoken Languages
Comments: Accepted and to appear in AACL-IJCNLP2025
Subjects: Audio and Speech Processing (eess.AS)
[2]  arXiv:2512.04945 [pdf, ps, other]
Title: TripleC Learning and Lightweight Speech Enhancement for Multi-Condition Target Speech Extraction
Authors: Ziling Huang (Shanghai Normal University, China)
Comments: Submitted to ICASSP2026
Subjects: Audio and Speech Processing (eess.AS)
[3]  arXiv:2512.04792 [pdf, ps, other]
Title: Towards predicting binaural audio quality in listeners with normal and impaired hearing
Comments: accepted for publication in Forum Acusticum
Subjects: Audio and Speech Processing (eess.AS)
[4]  arXiv:2512.04552 (cross-list from cs.SD) [pdf, ps, other]
Title: RRPO: Robust Reward Policy Optimization for LLM-based Emotional TTS
Comments: Submitted to ICASSP 2026. Copyright 2026 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[5]  arXiv:2512.04551 (cross-list from cs.SD) [pdf, ps, other]
Title: Multi-Loss Learning for Speech Emotion Recognition with Energy-Adaptive Mixup and Frame-Level Attention
Comments: Submitted to ICASSP 2026. Copyright 2026 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[ total of 23 entries: 1-5 | 6-10 | 11-15 | 16-20 | 21-23 ]
[ showing 5 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, eess, new, 2512, contact, help  (Access key information)