We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 359

[ total of 778 entries: 1-50 | ... | 210-259 | 260-309 | 310-359 | 360-409 | 410-459 | 460-509 | 510-559 | ... | 760-778 ]
[ showing 50 entries per page: fewer | more | all ]

Wed, 3 Dec 2025 (showing first 50 of 141 entries)

[360]  arXiv:2512.03046 [pdf, ps, other]
Title: MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues
Comments: Code and demo available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[361]  arXiv:2512.03045 [pdf, ps, other]
Title: CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[362]  arXiv:2512.03043 [pdf, ps, other]
Title: OneThinker: All-in-one Reasoning Model for Image and Video
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[363]  arXiv:2512.03042 [pdf, ps, other]
Title: PPTArena: A Benchmark for Agentic PowerPoint Editing
Comments: 25 pages, 26 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[364]  arXiv:2512.03041 [pdf, ps, other]
Title: MultiShotMaster: A Controllable Multi-Shot Video Generation Framework
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[365]  arXiv:2512.03040 [pdf, ps, other]
Title: Video4Spatial: Towards Visuospatial Intelligence with Context-Guided Video Generation
Comments: Project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[366]  arXiv:2512.03036 [pdf, ps, other]
Title: ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[367]  arXiv:2512.03034 [pdf, ps, other]
Title: MAViD: A Multimodal Framework for Audio-Visual Dialogue Understanding and Generation
Comments: Our project website is this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[368]  arXiv:2512.03020 [pdf, ps, other]
Title: Unrolled Networks are Conditional Probability Flows in MRI Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[369]  arXiv:2512.03018 [pdf, ps, other]
Title: AutoBrep: Autoregressive B-Rep Generation with Unified Topology and Geometry
Comments: Accepted to Siggraph Asia 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[370]  arXiv:2512.03014 [pdf, ps, other]
Title: Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks
Comments: NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[371]  arXiv:2512.03013 [pdf, ps, other]
Title: In-Context Sync-LoRA for Portrait Video Editing
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[372]  arXiv:2512.03010 [pdf, ps, other]
Title: SurfFill: Completion of LiDAR Point Clouds via Gaussian Surfel Splatting
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Robotics (cs.RO)
[373]  arXiv:2512.03004 [pdf, ps, other]
Title: DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[374]  arXiv:2512.03000 [pdf, ps, other]
Title: DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[375]  arXiv:2512.02993 [pdf, ps, other]
Title: TEXTRIX: Latent Attribute Grid for Native Texture Generation and Beyond
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[376]  arXiv:2512.02991 [pdf, ps, other]
Title: GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[377]  arXiv:2512.02982 [pdf, ps, other]
Title: U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences
Comments: Preprint; 19 pages, 7 figures, 8 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[378]  arXiv:2512.02981 [pdf, ps, other]
Title: InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration
Comments: Published in AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[379]  arXiv:2512.02973 [pdf, ps, other]
Title: Contextual Image Attack: How Visual Context Exposes Multimodal Safety Vulnerabilities
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[380]  arXiv:2512.02972 [pdf, ps, other]
Title: BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection
Comments: Accept by AAAI26
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[381]  arXiv:2512.02965 [pdf, ps, other]
Title: A Lightweight Real-Time Low-Light Enhancement Network for Embedded Automotive Vision Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[382]  arXiv:2512.02952 [pdf, ps, other]
Title: Layout Anything: One Transformer for Universal Room Layout Estimation
Comments: Published at WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[383]  arXiv:2512.02942 [pdf, ps, other]
Title: Benchmarking Scientific Understanding and Reasoning for Video Generation using VideoScience-Bench
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[384]  arXiv:2512.02933 [pdf, ps, other]
Title: LoVoRA: Text-guided and Mask-free Video Object Removal and Addition with Learnable Object-aware Localization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[385]  arXiv:2512.02932 [pdf, ps, other]
Title: EGGS: Exchangeable 2D/3D Gaussian Splatting for Geometry-Appearance Balanced Novel View Synthesis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[386]  arXiv:2512.02931 [pdf, ps, other]
Title: DiverseAR: Boosting Diversity in Bitwise Autoregressive Image Generation
Comments: 23 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[387]  arXiv:2512.02906 [pdf, ps, other]
Title: MRD: Multi-resolution Retrieval-Detection Fusion for High-Resolution Image Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[388]  arXiv:2512.02899 [pdf, ps, other]
Title: Glance: Accelerating Diffusion Models with 1 Sample
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[389]  arXiv:2512.02897 [pdf, ps, other]
Title: Polar Perspectives: Evaluating 2-D LiDAR Projections for Robust Place Recognition with Visual Foundation Models
Comments: 13 Pages, 5 Figures, 2 Tables Under Review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[390]  arXiv:2512.02895 [pdf, ps, other]
Title: MindGPT-4ov: An Enhanced MLLM via a Multi-Stage Post-Training Paradigm
Comments: 33 pages, 14 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[391]  arXiv:2512.02870 [pdf, ps, other]
Title: Taming Camera-Controlled Video Generation with Verifiable Geometry Reward
Comments: 11 pages, 4 figures, 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[392]  arXiv:2512.02867 [pdf, ps, other]
Title: MICCAI STSR 2025 Challenge: Semi-Supervised Teeth and Pulp Segmentation and CBCT-IOS Registration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[393]  arXiv:2512.02860 [pdf, ps, other]
Title: RFOP: Rethinking Fusion and Orthogonal Projection for Face-Voice Association
Comments: Ranked 3rd in Fame 2026 Challenge, ICASSP
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[394]  arXiv:2512.02850 [pdf, ps, other]
Title: Are Detectors Fair to Indian IP-AIGC? A Cross-Generator Study
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[395]  arXiv:2512.02846 [pdf, ps, other]
Title: Action Anticipation at a Glimpse: To What Extent Can Multimodal Cues Replace Video?
Comments: Accepted in WACV 2026 - Applications Track
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[396]  arXiv:2512.02835 [pdf, ps, other]
Title: ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[397]  arXiv:2512.02830 [pdf, ps, other]
Title: Defense That Attacks: How Robust Models Become Better Attackers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[398]  arXiv:2512.02794 [pdf, ps, other]
Title: PhyCustom: Towards Realistic Physical Customization in Text-to-Image Generation
Comments: codes:this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[399]  arXiv:2512.02793 [pdf, ps, other]
Title: IC-World: In-Context Generation for Shared World Modeling
Comments: codes:this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[400]  arXiv:2512.02792 [pdf, ps, other]
Title: HUD: Hierarchical Uncertainty-Aware Disambiguation Network for Composed Video Retrieval
Comments: Accepted by ACM MM 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[401]  arXiv:2512.02790 [pdf, ps, other]
Title: UnicEdit-10M: A Dataset and Benchmark Breaking the Scale-Quality Barrier via Unified Verification for Reasoning-Enriched Edits
Comments: 31 pages, 15 figures, 12 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[402]  arXiv:2512.02789 [pdf, ps, other]
Title: TrackNetV5: Residual-Driven Spatio-Temporal Refinement and Motion Direction Decoupling for Fast Object Tracking
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[403]  arXiv:2512.02781 [pdf, ps, other]
Title: LumiX: Structured and Coherent Text-to-Intrinsic Generation
Comments: The code will be available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[404]  arXiv:2512.02780 [pdf, ps, other]
Title: Rethinking Surgical Smoke: A Smoke-Type-Aware Laparoscopic Video Desmoking Method and Dataset
Comments: 12 pages, 15 figures. Accepted to AAAI-26 (Main Technical Track)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[405]  arXiv:2512.02751 [pdf, ps, other]
Title: AttMetNet: Attention-Enhanced Deep Neural Network for Methane Plume Detection in Sentinel-2 Satellite Imagery
Comments: 15 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[406]  arXiv:2512.02743 [pdf, ps, other]
Title: Reasoning-Aware Multimodal Fusion for Hateful Video Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[407]  arXiv:2512.02737 [pdf, ps, other]
Title: Beyond Paired Data: Self-Supervised UAV Geo-Localization from Reference Imagery Alone
Comments: Accepted at WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[408]  arXiv:2512.02727 [pdf, ps, other]
Title: DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions
Comments: Accepted to WACV 2026. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[409]  arXiv:2512.02715 [pdf, ps, other]
Title: GeoViS: Geospatially Rewarded Visual Search for Remote Sensing Visual Grounding
Comments: 11 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 778 entries: 1-50 | ... | 210-259 | 260-309 | 310-359 | 360-409 | 410-459 | 460-509 | 510-559 | ... | 760-778 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)