Sound

Authors and titles for recent submissions

Fri, 5 Dec 2025 (showing first 5 of 10 entries)

[1] arXiv:2512.04847

Title: Language Models as Semantic Teachers: Post-Training Alignment for Medical Audio Understanding

Authors: Tsai-Ning Wang, Lin-Lin Chen, Neil Zeghidour, Aaqib Saeed

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI)
[2] arXiv:2512.04827

Title: Contract-Driven QoE Auditing for Speech and Singing Services: From MOS Regression to Service Graphs

Authors: Wenzhang Du

Comments: 11 pages, 3 figures

Subjects: Sound (cs.SD); Machine Learning (cs.LG)
[3] arXiv:2512.04814

Title: Shared Multi-modal Embedding Space for Face-Voice Association

Authors: Christopher Simic, Korbinian Riedhammer, Tobias Bocklet

Comments: Ranked 1st in Fame 2026 Challenge, ICASSP

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV)
[4] arXiv:2512.04793

Title: YingMusic-SVC: Real-World Robust Zero-Shot Singing Voice Conversion with Flow-GRPO and Singing-Specific Inductive Biases

Authors: Gongyu Chen, Xiaoyu Zhang, Zhenqiang Weng, Junjie Zheng, Da Shen, Chaofan Ding, Wei-Qiang Zhang, Zihao Chen

Comments: 17 pages, 5 figures

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI)
[5] arXiv:2512.04779

Title: YingMusic-Singer: Zero-shot Singing Voice Synthesis and Editing with Annotation-free Melody Guidance

Authors: Junjie Zheng, Chunbo Hao, Guobin Ma, Xiaoyu Zhang, Gongyu Chen, Chaofan Ding, Zihao Chen, Lei Xie

Comments: 13 pages, 3 figures

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI)
