We gratefully acknowledge support from
the Simons Foundation and member institutions.

Multimedia

Authors and titles for recent submissions, skipping first 17

[ total of 23 entries: 1-5 | 3-7 | 8-12 | 13-17 | 18-22 | 23 ]
[ showing 5 entries per page: fewer | more | all ]

Tue, 2 Dec 2025 (continued, showing 5 of 9 entries)

[18]  arXiv:2512.00883 [pdf, ps, other]
Title: Audio-Visual World Models: Towards Multisensory Imagination in Sight and Sound
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[19]  arXiv:2512.01603 (cross-list from cs.CL) [pdf, ps, other]
Title: MAC-SLU: Multi-Intent Automotive Cabin Spoken Language Understanding Benchmark
Subjects: Computation and Language (cs.CL); Multimedia (cs.MM)
[20]  arXiv:2512.00537 (cross-list from cs.HC) [pdf, ps, other]
Title: Speculating on the Role of Media Architecture in Post-disaster Rebuilding and Recovery: Insights from Architects and Interaction Designers
Subjects: Human-Computer Interaction (cs.HC); Computers and Society (cs.CY); Emerging Technologies (cs.ET); Multimedia (cs.MM)
[21]  arXiv:2512.00451 (cross-list from cs.SD) [pdf, ps, other]
Title: STCTS: Generative Semantic Compression for Ultra-Low Bitrate Speech via Explicit Text-Prosody-Timbre Decomposition
Comments: The complete source code and online speech reconstruction demo is publicly available at this https URL
Subjects: Sound (cs.SD); Multimedia (cs.MM)
[22]  arXiv:2512.00120 (cross-list from cs.SD) [pdf, ps, other]
Title: Art2Music: Generating Music for Art Images with Multi-modal Feeling Alignment
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[ total of 23 entries: 1-5 | 3-7 | 8-12 | 13-17 | 18-22 | 23 ]
[ showing 5 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)