Multimedia

Authors and titles for recent submissions

Thu, 11 Dec 2025
Wed, 10 Dec 2025
Tue, 9 Dec 2025
Mon, 8 Dec 2025
Fri, 5 Dec 2025

[ total of 16 entries: 1-16 ]
[ showing up to 19 entries per page: fewer | more ]

Thu, 11 Dec 2025

[1] arXiv:2512.09841 (cross-list from cs.CL) [pdf, ps, other]: Title: ChronusOmni: Improving Time Awareness of Omni Large Language Models

Authors: Yijing Chen, Yihan Wu, Kaisi Guan, Yuchen Ren, Yuyue Wang, Ruihua Song, Liyun Ru

Comments: Code available at this https URL

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[2] arXiv:2512.09824 (cross-list from cs.CV) [pdf, ps, other]: Title: Composing Concepts from Images and Videos via Concept-prompt Binding

Authors: Xianghao Kong, Zeyu Zhang, Yuwei Guo, Zhuoran Zhao, Songchun Zhang, Anyi Rao

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[3] arXiv:2512.09335 (cross-list from cs.CV) [pdf, ps, other]: Title: Relightable and Dynamic Gaussian Avatar Reconstruction from Monocular Video

Authors: Seonghwa Choi, Moonkyeong Choi, Mingyu Jang, Jaekyung Kim, Jianfei Cai, Wen-Huang Cheng, Sanghoon Lee

Comments: 8 pages, 9 figures, published in ACM MM 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)

Wed, 10 Dec 2025

[4] arXiv:2512.08551 (cross-list from cs.SE) [pdf, ps, other]: Title: Gamification with Purpose: What Learners Prefer to Motivate Their Learning

Authors: Kai Marquardt, Mona Schulz, Anne Koziolek, Lucia Happe

Comments: 31 pages, 10 figures, Springer EAIT in review

Subjects: Software Engineering (cs.SE); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[5] arXiv:2512.08282 (cross-list from cs.CV) [pdf, ps, other]: Title: PAVAS: Physics-Aware Video-to-Audio Synthesis

Authors: Oh Hyun-Bin, Yuhta Takida, Toshimitsu Uesaka, Tae-Hyun Oh, Yuki Mitsufuji

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD)
[6] arXiv:2512.07838 (cross-list from cs.CV) [pdf, ps, other]: Title: Detection of Cyberbullying in GIF using AI

Authors: Pal Dave, Xiaohong Yuan, Madhuri Siddula, Kaushik Roy

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)

Tue, 9 Dec 2025

[7] arXiv:2512.07209 [pdf, ps, other]: Title: Coherent Audio-Visual Editing via Conditional Audio Generation Following Video Edits

Authors: Masato Ishii, Akio Hayakawa, Takashi Shibuya, Yuki Mitsufuji

Subjects: Multimedia (cs.MM); Machine Learning (cs.LG); Sound (cs.SD)
[8] arXiv:2512.07571 (cross-list from cs.CL) [pdf, ps, other]: Title: A Simple Method to Enhance Pre-trained Language Models with Speech Tokens for Classification

Authors: Nicolas Calbucura, Valentin Barriere

Subjects: Computation and Language (cs.CL); Multimedia (cs.MM)
[9] arXiv:2512.06811 (cross-list from cs.CV) [pdf, ps, other]: Title: RMAdapter: Reconstruction-based Multi-Modal Adapter for Vision-Language Models

Authors: Xiang Lin, Weixin Li, Shu Guo, Lihong Wang, Di Huang

Comments: Accepted by AAAI 2026(Oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[10] arXiv:2512.06282 (cross-list from cs.CV) [pdf, ps, other]: Title: A Sleep Monitoring System Based on Audio, Video and Depth Information

Authors: Lyn Chao-ling Chen, Kuan-Wen Chen, Yi-Ping Hung

Comments: Accepted in the Computer Vision, Graphics and Image Processing (CVGIP 2013)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[11] arXiv:2512.06022 (cross-list from cs.SD) [pdf, ps, other]: Title: DreamFoley: Scalable VLMs for High-Fidelity Video-to-Audio Generation

Authors: Fu Li, Weichao Zhao, You Li, Zhichao Zhou, Dongliang He

Comments: 10 pages; Bytedance

Subjects: Sound (cs.SD); Multimedia (cs.MM)

Mon, 8 Dec 2025

[12] arXiv:2512.05745 (cross-list from cs.CR) [pdf, ps, other]: Title: ARGUS: Defending Against Multimodal Indirect Prompt Injection via Steering Instruction-Following Behavior

Authors: Weikai Lu, Ziqian Zeng, Kehua Zhang, Haoran Li, Huiping Zhuang, Ruidong Wang, Cen Chen, Hao Peng

Subjects: Cryptography and Security (cs.CR); Multimedia (cs.MM)
[13] arXiv:2512.05438 (cross-list from cs.HC) [pdf, ps, other]: Title: EXR: An Interactive Immersive EHR Visualization in Extended Reality

Authors: Benoit Marteau, Shaun Q. Y. Tan, Jieru Li, Andrew Hornback, Yishan Zhong, Shaunna Wang, Christian Lowson, Jason Woloff, Joshua M. Pahys, Steven W. Hwang, Coleman Hilton, May D. Wang

Comments: 11 pages, 6 figures. Preprint version. This paper has been accepted to IEEE ICIR 2025. This is the author-prepared version and not the final published version. The final version will appear in IEEE Xplo

Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[14] arXiv:2512.05126 (cross-list from eess.AS) [pdf, ps, other]: Title: SyncVoice: Towards Video Dubbing with Vision-Augmented Pretrained TTS Model

Authors: Kaidi Wang, Yi He, Wenhao Guan, Weijie Wu, Hongwu Ding, Xiong Zhang, Di Wu, Meng Meng, Jian Luan, Lin Li, Qingyang Hong

Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD)

Fri, 5 Dec 2025

[15] arXiv:2512.04112 [pdf, ps, other]: Title: MindFuse: Towards GenAI Explainability in Marketing Strategy Co-Creation

Authors: Aleksandr Farseev, Marlo Ongpin, Qi Yang, Ilia Gossoudarev, Yu-Yi Chu-Farseeva, Sergey Nikolenko

Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI)
[16] arXiv:2512.04398 (cross-list from cs.HC) [pdf, ps, other]: Title: What is Beyond Presence? Dimensionality, Control, and Information Spaces

Authors: E. Ch'ng

Comments: 38 pages, accepted for Presence: Virtual and Augmented Reality 2026(37)

Subjects: Human-Computer Interaction (cs.HC); Multimedia (cs.MM)

Thu, 11 Dec 2025
Wed, 10 Dec 2025
Tue, 9 Dec 2025
Mon, 8 Dec 2025
Fri, 5 Dec 2025

[ total of 16 entries: 1-16 ]
[ showing up to 19 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help (Access key information)

> cs > cs.MM

Multimedia

Authors and titles for recent submissions

Thu, 11 Dec 2025

Wed, 10 Dec 2025

Tue, 9 Dec 2025

Mon, 8 Dec 2025

Fri, 5 Dec 2025