Multimedia

Authors and titles for recent submissions, skipping first 12

Wed, 3 Dec 2025 (continued, showing last 2 of 6 entries)

[13]  arXiv:2512.02652 (cross-list from cs.SD)
Title: Pianist Transformer: Towards Expressive Piano Performance Rendering via Scalable Self-Supervised Pre-Training
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[14]  arXiv:2512.02650 (cross-list from cs.CV)
Title: Hear What Matters! Text-conditioned Selective Video-to-Audio Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)

Tue, 2 Dec 2025 (showing first 3 of 9 entries)

[15]  arXiv:2512.01442
Title: PSA-MF: Personality-Sentiment Aligned Multi-Level Fusion for Multimodal Sentiment Analysis
Comments: AAAI 2026 accepted
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI)
[16]  arXiv:2512.01267
Title: ZO-ASR: Zeroth-Order Fine-Tuning of Speech Foundation Models without Back-Propagation
Comments: 2025 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
Subjects: Multimedia (cs.MM); Sound (cs.SD)
[17]  arXiv:2512.00928
Title: Augmenting Intra-Modal Understanding in MLLMs for Robust Multimodal Keyphrase Generation
Subjects: Multimedia (cs.MM)
