We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 355

[ total of 778 entries: 1-25 | ... | 281-305 | 306-330 | 331-355 | 356-380 | 381-405 | 406-430 | 431-455 | ... | 756-778 ]
[ showing 25 entries per page: fewer | more | all ]

Thu, 4 Dec 2025 (continued, showing last 4 of 130 entries)

[356]  arXiv:2512.03166 (cross-list from cs.RO) [pdf, ps, other]
Title: Multi-Agent Reinforcement Learning and Real-Time Decision-Making in Robotic Soccer for Virtual Environments
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[357]  arXiv:2512.03111 (cross-list from q-bio.GN) [pdf, ps, other]
Title: PanFoMa: A Lightweight Foundation Model and Benchmark for Pan-Cancer
Comments: Accepted by AAAI 2026
Subjects: Genomics (q-bio.GN); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[358]  arXiv:2512.03054 (cross-list from cs.LG) [pdf, ps, other]
Title: Energy-Efficient Federated Learning via Adaptive Encoder Freezing for MRI-to-CT Conversion: A Green AI-Guided Research
Comments: 22 pages, 13 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Medical Physics (physics.med-ph)
[359]  arXiv:2512.03052 (cross-list from cs.GR) [pdf, ps, other]
Title: LATTICE: Democratize High-Fidelity 3D Generation at Scale
Comments: Technical Report
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)

Wed, 3 Dec 2025 (showing first 21 of 141 entries)

[360]  arXiv:2512.03046 [pdf, ps, other]
Title: MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues
Comments: Code and demo available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[361]  arXiv:2512.03045 [pdf, ps, other]
Title: CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[362]  arXiv:2512.03043 [pdf, ps, other]
Title: OneThinker: All-in-one Reasoning Model for Image and Video
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[363]  arXiv:2512.03042 [pdf, ps, other]
Title: PPTArena: A Benchmark for Agentic PowerPoint Editing
Comments: 25 pages, 26 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[364]  arXiv:2512.03041 [pdf, ps, other]
Title: MultiShotMaster: A Controllable Multi-Shot Video Generation Framework
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[365]  arXiv:2512.03040 [pdf, ps, other]
Title: Video4Spatial: Towards Visuospatial Intelligence with Context-Guided Video Generation
Comments: Project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[366]  arXiv:2512.03036 [pdf, ps, other]
Title: ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[367]  arXiv:2512.03034 [pdf, ps, other]
Title: MAViD: A Multimodal Framework for Audio-Visual Dialogue Understanding and Generation
Comments: Our project website is this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[368]  arXiv:2512.03020 [pdf, ps, other]
Title: Unrolled Networks are Conditional Probability Flows in MRI Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[369]  arXiv:2512.03018 [pdf, ps, other]
Title: AutoBrep: Autoregressive B-Rep Generation with Unified Topology and Geometry
Comments: Accepted to Siggraph Asia 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[370]  arXiv:2512.03014 [pdf, ps, other]
Title: Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks
Comments: NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[371]  arXiv:2512.03013 [pdf, ps, other]
Title: In-Context Sync-LoRA for Portrait Video Editing
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[372]  arXiv:2512.03010 [pdf, ps, other]
Title: SurfFill: Completion of LiDAR Point Clouds via Gaussian Surfel Splatting
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Robotics (cs.RO)
[373]  arXiv:2512.03004 [pdf, ps, other]
Title: DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[374]  arXiv:2512.03000 [pdf, ps, other]
Title: DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[375]  arXiv:2512.02993 [pdf, ps, other]
Title: TEXTRIX: Latent Attribute Grid for Native Texture Generation and Beyond
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[376]  arXiv:2512.02991 [pdf, ps, other]
Title: GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[377]  arXiv:2512.02982 [pdf, ps, other]
Title: U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences
Comments: Preprint; 19 pages, 7 figures, 8 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[378]  arXiv:2512.02981 [pdf, ps, other]
Title: InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration
Comments: Published in AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[379]  arXiv:2512.02973 [pdf, ps, other]
Title: Contextual Image Attack: How Visual Context Exposes Multimodal Safety Vulnerabilities
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[380]  arXiv:2512.02972 [pdf, ps, other]
Title: BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection
Comments: Accept by AAAI26
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[ total of 778 entries: 1-25 | ... | 281-305 | 306-330 | 331-355 | 356-380 | 381-405 | 406-430 | 431-455 | ... | 756-778 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)