We gratefully acknowledge support from
the Simons Foundation and member institutions.

Sound

Authors and titles for recent submissions, skipping first 35

[ total of 44 entries: 1-5 | ... | 21-25 | 26-30 | 31-35 | 36-40 | 41-44 ]
[ showing 5 entries per page: fewer | more | all ]

Tue, 2 Dec 2025 (continued, showing 5 of 12 entries)

[36]  arXiv:2512.00621 [pdf, ps, other]
Title: Melody or Machine: Detecting Synthetic Music with Dual-Stream Contrastive Learning
Comments: Accepted at Transactions on Machine Learning Research (TMLR)
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[37]  arXiv:2512.00563 [pdf, ps, other]
Title: Explainable Multi-Modal Deep Learning for Automatic Detection of Lung Diseases from Respiratory Audio Signals
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[38]  arXiv:2512.00451 [pdf, ps, other]
Title: STCTS: Generative Semantic Compression for Ultra-Low Bitrate Speech via Explicit Text-Prosody-Timbre Decomposition
Comments: The complete source code and online speech reconstruction demo is publicly available at this https URL
Subjects: Sound (cs.SD); Multimedia (cs.MM)
[39]  arXiv:2512.00120 [pdf, ps, other]
Title: Art2Music: Generating Music for Art Images with Multi-modal Feeling Alignment
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[40]  arXiv:2512.00115 [pdf, ps, other]
Title: MoLT: Mixture of Layer-Wise Tokens for Efficient Audio-Visual Learning
Comments: 10 pages, 5 figures
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[ total of 44 entries: 1-5 | ... | 21-25 | 26-30 | 31-35 | 36-40 | 41-44 ]
[ showing 5 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)