Audio and Speech Processing

Authors and titles for recent submissions

[ total of 23 entries: 1-10 | 11-20 | 21-23 ]
[ showing 10 entries per page: fewer | more | all ]

Fri, 5 Dec 2025

[1] arXiv:2512.04964 [pdf, ps, other]: Title: HiPPO: Exploring A Novel Hierarchical Pronunciation Assessment Approach for Spoken Languages

Authors: Bi-Cheng Yan, Hsin-Wei Wang, Fu-An Chao, Tien-Hong Lo, Yung-Chang Hsu, Berlin Chen

Comments: Accepted and to appear in AACL-IJCNLP2025

Subjects: Audio and Speech Processing (eess.AS)
[2] arXiv:2512.04945 [pdf, ps, other]: Title: TripleC Learning and Lightweight Speech Enhancement for Multi-Condition Target Speech Extraction

Authors: Ziling Huang (Shanghai Normal University, China)

Comments: Submitted to ICASSP2026

Subjects: Audio and Speech Processing (eess.AS)
[3] arXiv:2512.04792 [pdf, ps, other]: Title: Towards predicting binaural audio quality in listeners with normal and impaired hearing

Authors: Thomas Biberger, Stephan D. Ewert

Comments: accepted for publication in Forum Acusticum

Subjects: Audio and Speech Processing (eess.AS)
[4] arXiv:2512.04552 (cross-list from cs.SD) [pdf, ps, other]: Title: RRPO: Robust Reward Policy Optimization for LLM-based Emotional TTS

Authors: Cong Wang, Changfeng Gao, Yang Xiang, Zhihao Du, Keyu An, Han Zhao, Qian Chen, Xiangang Li, Yingming Gao, Ya Li

Comments: Submitted to ICASSP 2026. Copyright 2026 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[5] arXiv:2512.04551 (cross-list from cs.SD) [pdf, ps, other]: Title: Multi-Loss Learning for Speech Emotion Recognition with Energy-Adaptive Mixup and Frame-Level Attention

Authors: Cong Wang, Yizhong Geng, Yuhua Wen, Qifei Li, Yingming Gao, Ruimin Wang, Chunfeng Wang, Hao Li, Ya Li, Wei Chen

Comments: Submitted to ICASSP 2026. Copyright 2026 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)

Thu, 4 Dec 2025

[6] arXiv:2512.03486 [pdf, ps, other]: Title: A Universal Harmonic Discriminator for High-quality GAN-based Vocoder

Authors: Nan Xu, Zhaolong Huang, Xiao Zeng

Comments: Accepted by ASRU2025

Subjects: Audio and Speech Processing (eess.AS)
[7] arXiv:2512.03301 [pdf, ps, other]: Title: Comparing Unsupervised and Supervised Semantic Speech Tokens: A Case Study of Child ASR

Authors: Mohan Shi, Natarajan Balaji Shankar, Kaiyuan Zhang, Zilai Wang, Abeer Alwan

Comments: ASRU-AI4CSL

Subjects: Audio and Speech Processing (eess.AS)
[8] arXiv:2512.03636 (cross-list from cs.HC) [pdf, ps, other]: Title: Head, posture, and full-body gestures in dyadic conversations

Authors: Ľuboš Hládek, Bernhard U. Seeber

Comments: 7 figures, 10 tables, 29 pages

Subjects: Human-Computer Interaction (cs.HC); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[9] arXiv:2512.03458 (cross-list from eess.SP) [pdf, ps, other]: Title: A Convolutional Framework for Mapping Imagined Auditory MEG into Listened Brain Responses

Authors: Maryam Maghsoudi, Mohsen Rezaeizadeh, Shihab Shamma

Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)

Wed, 3 Dec 2025 (showing first 1 of 5 entries)

[10] arXiv:2512.02891 [pdf, ps, other]: Title: Perceptual evaluation of Acoustic Level of Detail in Virtual Acoustic Environments

Authors: Stefan Fichna, Steven van de Par, Bernhard U. Seeber, Stephan D. Ewert

Comments: This work has been submitted to Acoustics for possible publication. Template provided by MDPI

Subjects: Audio and Speech Processing (eess.AS)

[ total of 23 entries: 1-10 | 11-20 | 21-23 ]
[ showing 10 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, eess, new, 2512, contact, help (Access key information)

> eess > eess.AS

Audio and Speech Processing

Authors and titles for recent submissions

Fri, 5 Dec 2025

Thu, 4 Dec 2025

Wed, 3 Dec 2025 (showing first 1 of 5 entries)