We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 84

[ total of 749 entries: 1-100 | 85-184 | 185-284 | 285-384 | 385-484 | ... | 685-749 ]
[ showing 100 entries per page: fewer | more | all ]

Wed, 10 Dec 2025 (continued, showing last 47 of 131 entries)

[85]  arXiv:2512.08253 [pdf, ps, other]
Title: Query-aware Hub Prototype Learning for Few-Shot 3D Point Cloud Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[86]  arXiv:2512.08247 [pdf, ps, other]
Title: Distilling Future Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection
Comments: AAAI-26
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[87]  arXiv:2512.08243 [pdf, ps, other]
Title: Residual-SwinCA-Net: A Channel-Aware Integrated Residual CNN-Swin Transformer for Malignant Lesion Segmentation in BUSI
Authors: Saeeda Naz, Saddam Hussain Khan (Artificial Intelligence Lab, Department of Computer Systems Engineering, University of Engineering and Applied Sciences (UEAS), Swat, Pakistan)
Comments: 26 Pages, 10 Figures, 4 Tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[88]  arXiv:2512.08240 [pdf, ps, other]
Title: HybridToken-VLM: Hybrid Token Compression for Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[89]  arXiv:2512.08237 [pdf, ps, other]
Title: FastBEV++: Fast by Algorithm, Deployable by Design
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[90]  arXiv:2512.08229 [pdf, ps, other]
Title: Geometry-Aware Sparse Depth Sampling for High-Fidelity RGB-D Depth Completion in Robotic Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[91]  arXiv:2512.08228 [pdf, ps, other]
Title: MM-CoT:A Benchmark for Probing Visual Chain-of-Thought Reasoning in Multimodal Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[92]  arXiv:2512.08227 [pdf, ps, other]
Title: New VVC profiles targeting Feature Coding for Machines
Comments: Accepted for presentation at ICIP 2025 workshop on Coding for Machines
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[93]  arXiv:2512.08223 [pdf, ps, other]
Title: SOP^2: Transfer Learning with Scene-Oriented Prompt Pool on 3D Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[94]  arXiv:2512.08221 [pdf, ps, other]
Title: VisKnow: Constructing Visual Knowledge Base for Object Understanding
Comments: 16 pages, 12 figures, 7 tables. Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[95]  arXiv:2512.08215 [pdf, ps, other]
Title: Blur2Sharp: Human Novel Pose and View Synthesis with Generative Prior Refinement
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[96]  arXiv:2512.08198 [pdf, ps, other]
Title: Animal Re-Identification on Microcontrollers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[97]  arXiv:2512.08180 [pdf, ps, other]
Title: GeoLoom: High-quality Geometric Diagram Generation from Textual Input
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[98]  arXiv:2512.08163 [pdf, ps, other]
Title: Accuracy Does Not Guarantee Human-Likeness in Monocular Depth Estimators
Comments: 22 pages, 12 figures, 1 table
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[99]  arXiv:2512.08161 [pdf, ps, other]
Title: Fourier-RWKV: A Multi-State Perception Network for Efficient Image Dehazing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[100]  arXiv:2512.08135 [pdf, ps, other]
Title: CVP: Central-Peripheral Vision-Inspired Multimodal Model for Spatial Reasoning
Comments: Accepted to WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[101]  arXiv:2512.08075 [pdf, ps, other]
Title: Identification of Deforestation Areas in the Amazon Rainforest Using Change Detection Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[102]  arXiv:2512.08048 [pdf, ps, other]
Title: Mask to Adapt: Simple Random Masking Enables Robust Continual Test-Time Learning
Comments: ongoing work
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[103]  arXiv:2512.08042 [pdf, ps, other]
Title: Towards Sustainable Universal Deepfake Detection with Frequency-Domain Masking
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[104]  arXiv:2512.08040 [pdf, ps, other]
Title: Lost in Translation, Found in Embeddings: Sign Language Translation and Alignment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[105]  arXiv:2512.08038 [pdf, ps, other]
Title: SSplain: Sparse and Smooth Explainer for Retinopathy of Prematurity Classification
Comments: 20 pages, 16 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[106]  arXiv:2512.08016 [pdf, ps, other]
Title: FRIEDA: Benchmarking Multi-Step Cartographic Reasoning in Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[107]  arXiv:2512.07984 [pdf, ps, other]
Title: Restrictive Hierarchical Semantic Segmentation for Stratified Tooth Layer Detection
Comments: 13 pages, 7 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[108]  arXiv:2512.07951 [pdf, ps, other]
Title: Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality
Comments: Project webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[109]  arXiv:2512.07925 [pdf, ps, other]
Title: Near-real time fires detection using satellite imagery in Sudan conflict
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[110]  arXiv:2512.07838 [pdf, ps, other]
Title: Detection of Cyberbullying in GIF using AI
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[111]  arXiv:2512.08715 (cross-list from cs.PF) [pdf, ps, other]
Title: Multi-domain performance analysis with scores tailored to user preferences
Subjects: Performance (cs.PF); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[112]  arXiv:2512.08629 (cross-list from cs.AI) [pdf, ps, other]
Title: See-Control: A Multimodal Agent Framework for Smartphone Interaction with a Robotic Arm
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[113]  arXiv:2512.08545 (cross-list from cs.CL) [pdf, ps, other]
Title: Curriculum Guided Massive Multi Agent System Solving For Robust Long Horizon Tasks
Comments: 22 pages, 2 tables, 9 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[114]  arXiv:2512.08500 (cross-list from cs.GR) [pdf, ps, other]
Title: Learning to Control Physically-simulated 3D Characters via Generating and Mimicking 2D Motions
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[115]  arXiv:2512.08360 (cross-list from cs.NE) [pdf, ps, other]
Title: Conditional Morphogenesis: Emergent Generation of Structural Digits via Neural Cellular Automata
Authors: Ali Sakour
Comments: 13 pages, 5 figures. Code available at: this https URL
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[116]  arXiv:2512.08284 (cross-list from physics.geo-ph) [pdf, ps, other]
Title: Self-Reinforced Deep Priors for Reparameterized Full Waveform Inversion
Comments: Submitted to GEOPHYSICS
Subjects: Geophysics (physics.geo-ph); Computer Vision and Pattern Recognition (cs.CV)
[117]  arXiv:2512.08271 (cross-list from cs.RO) [pdf, ps, other]
Title: Zero-Splat TeleAssist: A Zero-Shot Pose Estimation Framework for Semantic Teleoperation
Comments: Published and Presented at 3rd Workshop on Human-Centric Multilateral Teleoperation in ICRA 2025
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[118]  arXiv:2512.08216 (cross-list from eess.IV) [pdf, ps, other]
Title: Tumor-anchored deep feature random forests for out-of-distribution detection in lung cancer segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[119]  arXiv:2512.08188 (cross-list from cs.RO) [pdf, ps, other]
Title: Embodied Tree of Thoughts: Deliberate Manipulation Planning with Embodied World Model
Comments: Website at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[120]  arXiv:2512.08170 (cross-list from cs.RO) [pdf, ps, other]
Title: RAVES-Calib: Robust, Accurate and Versatile Extrinsic Self Calibration Using Optimal Geometric Features
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[121]  arXiv:2512.08153 (cross-list from cs.LG) [pdf, ps, other]
Title: TreeGRPO: Tree-Advantage GRPO for Online RL Post-Training of Diffusion Models
Authors: Zheng Ding, Weirui Ye
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[122]  arXiv:2512.08125 (cross-list from eess.IV) [pdf, ps, other]
Title: FlowSteer: Conditioning Flow Field for Consistent Image Restoration
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[123]  arXiv:2512.08099 (cross-list from math.NA) [pdf, ps, other]
Title: Generalizations of the Normalized Radon Cumulative Distribution Transform for Limited Data Recognition
Subjects: Numerical Analysis (math.NA); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[124]  arXiv:2512.08029 (cross-list from cs.LG) [pdf, ps, other]
Title: CLARITY: Medical World Model for Guiding Treatment Decisions by Modeling Context-Aware Disease Trajectories in Latent Space
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[125]  arXiv:2512.07998 (cross-list from cs.RO) [pdf, ps, other]
Title: DIJIT: A Robotic Head for an Active Observer
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[126]  arXiv:2512.07981 (cross-list from cs.LG) [pdf, ps, other]
Title: CIP-Net: Continual Interpretable Prototype-based Network
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[127]  arXiv:2512.07976 (cross-list from cs.RO) [pdf, ps, other]
Title: VLD: Visual Language Goal Distance for Reinforcement Learning Navigation
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[128]  arXiv:2512.07969 (cross-list from cs.RO) [pdf, ps, other]
Title: Sparse Variable Projection in Robotic Perception: Exploiting Separable Structure for Efficient Nonlinear Optimization
Comments: 8 pages, submitted for review
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[129]  arXiv:2512.07884 (cross-list from cs.LG) [pdf, ps, other]
Title: GSPN-2: Efficient Parallel Sequence Modeling
Comments: NeurIPS 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[130]  arXiv:2512.07855 (cross-list from cs.LG) [pdf, ps, other]
Title: LAPA: Log-Domain Prediction-Driven Dynamic Sparsity Accelerator for Transformer Model
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[131]  arXiv:2512.05791 (cross-list from physics.med-ph) [pdf, ps, other]
Title: Fast and Robust Diffusion Posterior Sampling for MR Image Reconstruction Using the Preconditioned Unadjusted Langevin Algorithm
Comments: Submitted to Magnetic Resonance in Medicine
Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Probability (math.PR)

Tue, 9 Dec 2025 (showing first 53 of 259 entries)

[132]  arXiv:2512.07834 [pdf, ps, other]
Title: Voxify3D: Pixel Art Meets Volumetric Rendering
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[133]  arXiv:2512.07833 [pdf, ps, other]
Title: Relational Visual Similarity
Comments: Project page, data, and code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[134]  arXiv:2512.07831 [pdf, ps, other]
Title: UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation
Comments: Project Website this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[135]  arXiv:2512.07829 [pdf, ps, other]
Title: One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[136]  arXiv:2512.07826 [pdf, ps, other]
Title: OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing
Comments: 38 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[137]  arXiv:2512.07821 [pdf, ps, other]
Title: WorldReel: 4D Video Generation with Consistent Geometry and Motion Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[138]  arXiv:2512.07807 [pdf, ps, other]
Title: Lang3D-XL: Language Embedded 3D Gaussians for Large-scale Scenes
Comments: Accepted to SIGGRAPH Asia 2025. Project webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[139]  arXiv:2512.07806 [pdf, ps, other]
Title: Multi-view Pyramid Transformer: Look Coarser to See Broader
Comments: Project page: see this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[140]  arXiv:2512.07802 [pdf, ps, other]
Title: OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[141]  arXiv:2512.07778 [pdf, ps, other]
Title: Distribution Matching Variational AutoEncoder
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[142]  arXiv:2512.07776 [pdf, ps, other]
Title: GorillaWatch: An Automated System for In-the-Wild Gorilla Re-Identification and Population Monitoring
Comments: Accepted at WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[143]  arXiv:2512.07760 [pdf, ps, other]
Title: Modality-Aware Bias Mitigation and Invariance Learning for Unsupervised Visible-Infrared Person Re-Identification
Comments: Accepted to AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[144]  arXiv:2512.07756 [pdf, ps, other]
Title: UltrasODM: A Dual Stream Optical Flow Mamba Network for 3D Freehand Ultrasound Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[145]  arXiv:2512.07747 [pdf, ps, other]
Title: Unison: A Fully Automatic, Task-Universal, and Low-Cost Framework for Unified Understanding and Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[146]  arXiv:2512.07745 [pdf, ps, other]
Title: DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[147]  arXiv:2512.07738 [pdf, ps, other]
Title: HLTCOE Evaluation Team at TREC 2025: VQA Track
Comments: 7 pages, 1 figure
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[148]  arXiv:2512.07733 [pdf, ps, other]
Title: SpatialDreamer: Incentivizing Spatial Reasoning via Active Mental Imagery
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[149]  arXiv:2512.07730 [pdf, ps, other]
Title: SAVE: Sparse Autoencoder-Driven Visual Information Enhancement for Mitigating Object Hallucination
Comments: WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[150]  arXiv:2512.07729 [pdf, ps, other]
Title: Improving action classification with brain-inspired deep networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[151]  arXiv:2512.07720 [pdf, ps, other]
Title: ViSA: 3D-Aware Video Shading for Real-Time Upper-Body Avatar Creation
Comments: Project page: \url{this https URL}
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[152]  arXiv:2512.07712 [pdf, ps, other]
Title: UnCageNet: Tracking and Pose Estimation of Caged Animal
Comments: 9 pages, 2 figures, 2 tables. Accepted to the Indian Conference on Computer Vision, Graphics, and Image Processing (ICVGIP 2025), Mandi, India
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[153]  arXiv:2512.07703 [pdf, ps, other]
Title: PVeRA: Probabilistic Vector-Based Random Matrix Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[154]  arXiv:2512.07702 [pdf, ps, other]
Title: Guiding What Not to Generate: Automated Negative Prompting for Text-Image Alignment
Comments: WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[155]  arXiv:2512.07698 [pdf, ps, other]
Title: sim2art: Accurate Articulated Object Modeling from a Single Video using Synthetic Training Data Only
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[156]  arXiv:2512.07674 [pdf, ps, other]
Title: DIST-CLIP: Arbitrary Metadata and Image Guided MRI Harmonization via Disentangled Anatomy-Contrast Representations
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[157]  arXiv:2512.07668 [pdf, ps, other]
Title: EgoCampus: Egocentric Pedestrian Eye Gaze Model and Dataset
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[158]  arXiv:2512.07661 [pdf, ps, other]
Title: Optimization-Guided Diffusion for Interactive Scene Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[159]  arXiv:2512.07652 [pdf, ps, other]
Title: An AI-Powered Autonomous Underwater System for Sea Exploration and Scientific Research
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[160]  arXiv:2512.07651 [pdf, ps, other]
Title: Liver Fibrosis Quantification and Analysis: The LiQA Dataset and Baseline Method
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[161]  arXiv:2512.07628 [pdf, ps, other]
Title: MoCA: Mixture-of-Components Attention for Scalable Compositional 3D Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[162]  arXiv:2512.07606 [pdf, ps, other]
Title: Decomposition Sampling for Efficient Region Annotations in Active Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[163]  arXiv:2512.07599 [pdf, ps, other]
Title: Online Segment Any 3D Thing as Instance Tracking
Comments: NeurIPS 2025, Code is at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164]  arXiv:2512.07596 [pdf, ps, other]
Title: More than Segmentation: Benchmarking SAM 3 for Segmentation, 3D Perception, and Reconstruction in Robotic Surgery
Comments: Technical Report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[165]  arXiv:2512.07590 [pdf, ps, other]
Title: Robust Variational Model Based Tailored UNet: Leveraging Edge Detector and Mean Curvature for Improved Image Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[166]  arXiv:2512.07584 [pdf, ps, other]
Title: LongCat-Image Technical Report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[167]  arXiv:2512.07580 [pdf, ps, other]
Title: All You Need Are Random Visual Tokens? Demystifying Token Pruning in VLLMs
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[168]  arXiv:2512.07568 [pdf, ps, other]
Title: Dual-Stream Cross-Modal Representation Learning via Residual Semantic Decorrelation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[169]  arXiv:2512.07564 [pdf, ps, other]
Title: Toward More Reliable Artificial Intelligence: Reducing Hallucinations in Vision-Language Models
Comments: 24 pages, 3 figures, 2 tables. Training-free self-correction framework for vision-language models. Code and implementation details will be released at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[170]  arXiv:2512.07527 [pdf, ps, other]
Title: From Orbit to Ground: Generative City Photogrammetry from Extreme Off-Nadir Satellite Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[171]  arXiv:2512.07514 [pdf, ps, other]
Title: MeshRipple: Structured Autoregressive Generation of Artist-Meshes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[172]  arXiv:2512.07504 [pdf, ps, other]
Title: ControlVP: Interactive Geometric Refinement of AI-Generated Images with Consistent Vanishing Points
Comments: Accepted to WACV 2026, 8 pages, supplementary included. Dataset and code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[173]  arXiv:2512.07503 [pdf, ps, other]
Title: SJD++: Improved Speculative Jacobi Decoding for Training-free Acceleration of Discrete Auto-regressive Text-to-Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[174]  arXiv:2512.07500 [pdf, ps, other]
Title: MultiMotion: Multi Subject Video Motion Transfer via Video Diffusion Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[175]  arXiv:2512.07498 [pdf, ps, other]
Title: Towards Robust DeepFake Detection under Unstable Face Sequences: Adaptive Sparse Graph Embedding with Order-Free Representation and Explicit Laplacian Spectral Prior
Comments: 16 pages (including appendix)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[176]  arXiv:2512.07480 [pdf, ps, other]
Title: Single-step Diffusion-based Video Coding with Semantic-Temporal Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[177]  arXiv:2512.07469 [pdf, ps, other]
Title: Unified Video Editing with Temporal Reasoner
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[178]  arXiv:2512.07426 [pdf, ps, other]
Title: When normalization hallucinates: unseen risks in AI-powered whole slide image processing
Comments: 4 pages, accepted for oral presentation at SPIE Medical Imaging, 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[179]  arXiv:2512.07415 [pdf, ps, other]
Title: Data-driven Exploration of Mobility Interaction Patterns
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[180]  arXiv:2512.07410 [pdf, ps, other]
Title: InterAgent: Physics-based Multi-agent Command Execution via Diffusion on Interaction Graphs
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[181]  arXiv:2512.07394 [pdf, ps, other]
Title: Reconstructing Objects along Hand Interaction Timelines in Egocentric Video
Comments: webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[182]  arXiv:2512.07391 [pdf, ps, other]
Title: GlimmerNet: A Lightweight Grouped Dilated Depthwise Convolutions for UAV-Based Emergency Monitoring
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[183]  arXiv:2512.07385 [pdf, ps, other]
Title: How Far are Modern Trackers from UAV-Anti-UAV? A Million-Scale Benchmark and New Baseline
Comments: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[184]  arXiv:2512.07383 [pdf, ps, other]
Title: LogicCBMs: Logic-Enhanced Concept-Based Learning
Comments: 18 pages, 19 figures, WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 749 entries: 1-100 | 85-184 | 185-284 | 285-384 | 385-484 | ... | 685-749 ]
[ showing 100 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)