We gratefully acknowledge support from
the Simons Foundation and member institutions.

Sound

Authors and titles for recent submissions

[ total of 48 entries: 1-5 | 6-10 | 11-15 | 16-20 | ... | 46-48 ]
[ showing 5 entries per page: fewer | more | all ]

Fri, 5 Dec 2025 (showing first 5 of 10 entries)

[1]  arXiv:2512.04847 [pdf, ps, other]
Title: Language Models as Semantic Teachers: Post-Training Alignment for Medical Audio Understanding
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI)
[2]  arXiv:2512.04827 [pdf, ps, other]
Title: Contract-Driven QoE Auditing for Speech and Singing Services: From MOS Regression to Service Graphs
Authors: Wenzhang Du
Comments: 11 pages, 3 figures
Subjects: Sound (cs.SD); Machine Learning (cs.LG)
[3]  arXiv:2512.04814 [pdf, ps, other]
Title: Shared Multi-modal Embedding Space for Face-Voice Association
Comments: Ranked 1st in Fame 2026 Challenge, ICASSP
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV)
[4]  arXiv:2512.04793 [pdf, ps, other]
Title: YingMusic-SVC: Real-World Robust Zero-Shot Singing Voice Conversion with Flow-GRPO and Singing-Specific Inductive Biases
Comments: 17 pages, 5 figures
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI)
[5]  arXiv:2512.04779 [pdf, ps, other]
Title: YingMusic-Singer: Zero-shot Singing Voice Synthesis and Editing with Annotation-free Melody Guidance
Comments: 13 pages, 3 figures
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI)
[ total of 48 entries: 1-5 | 6-10 | 11-15 | 16-20 | ... | 46-48 ]
[ showing 5 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)