Audio and Speech Processing

Authors and titles for recent submissions, skipping first 19

[ total of 23 entries; showing entries 20-23 ]

Mon, 1 Dec 2025 (continued, showing last 4 of 5 entries)

[20]  arXiv:2511.22687 (cross-list from cs.SD)
Title: PURE Codec: Progressive Unfolding of Residual Entropy for Speech Codec Learning
Comments: Accepted by ASRU2025
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[21]  arXiv:2511.22503 (cross-list from cs.CL)
Title: Joint Speech and Text Training for LLM-Based End-to-End Spoken Dialogue State Tracking
Comments: submitted to ICASSP 2026
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[22]  arXiv:2511.22293 (cross-list from cs.SD)
Title: GLA-Grad++: An Improved Griffin-Lim Guided Diffusion Model for Speech Synthesis
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[23]  arXiv:2511.21872 (cross-list from cs.SD)
Title: Advancing Marine Bioacoustics with Deep Generative Models: A Hybrid Augmentation Strategy for Southern Resident Killer Whale Detection
Comments: 16 pages, 6 Figures, 2 Tables, submitted to Marine Mammal Science as part of a special issue on Machine Learning and Artificial Intelligence in Marine Mammal Research
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
