We gratefully acknowledge support from
the Simons Foundation and member institutions.

Multimedia

Authors and titles for recent submissions, skipping first 25

[ total of 32 entries: 1-50 | 26-32 ]
[ showing up to 50 entries per page: fewer | more ]

Mon, 1 Dec 2025 (continued, showing last 7 of 12 entries)

[26]  arXiv:2511.21698 [pdf, ps, other]
Title: TIP and Polish: Text-Image-Prototype Guided Multi-Modal Generation via Commonality-Discrepancy Modeling and Refinement
Comments: Submitted to ICASSP2026
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI)
[27]  arXiv:2511.21694 [pdf, ps, other]
Title: A Survey of Information Disorder on Video-Sharing Platforms
Comments: Accepted by 2025 IEEE International Conference on Content-Based Multimedia Indexing
Subjects: Multimedia (cs.MM); Computers and Society (cs.CY)
[28]  arXiv:2511.21693 [pdf, ps, other]
Title: Designing a Multimodal Viewer for Piano Performance Analysis -- a Pedagogy-First Approach
Subjects: Multimedia (cs.MM)
[29]  arXiv:2511.22805 (cross-list from cs.CV) [pdf, ps, other]
Title: From Pixels to Feelings: Aligning MLLMs with Human Cognitive Perception of Images
Comments: Project page with codes/datasets/models: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[30]  arXiv:2511.22715 (cross-list from cs.CV) [pdf, ps, other]
Title: ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[31]  arXiv:2511.22055 (cross-list from cs.CV) [pdf, ps, other]
Title: OralGPT-Omni: A Versatile Dental Multimodal Large Language Model
Comments: 47 pages, 42 figures, 13 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[32]  arXiv:2511.22046 (cross-list from cs.NI) [pdf, ps, other]
Title: AutoRec: Accelerating Loss Recovery for Live Streaming in a Multi-Supplier Market
Subjects: Networking and Internet Architecture (cs.NI); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[ total of 32 entries: 1-50 | 26-32 ]
[ showing up to 50 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)