We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 150

[ total of 778 entries: 1-100 | 51-150 | 151-250 | 251-350 | 351-450 | 451-550 | ... | 751-778 ]
[ showing 100 entries per page: fewer | more | all ]

Fri, 5 Dec 2025 (continued, showing last 79 of 135 entries)

[151]  arXiv:2512.04619 [pdf, ps, other]
Title: Denoise to Track: Harnessing Video Diffusion Priors for Robust Correspondence
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[152]  arXiv:2512.04599 [pdf, ps, other]
Title: Malicious Image Analysis via Vision-Language Segmentation Fusion: Detection, Element, and Location in One-shot
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[153]  arXiv:2512.04597 [pdf, ps, other]
Title: When Robots Should Say "I Don't Know": Benchmarking Abstention in Embodied Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[154]  arXiv:2512.04585 [pdf, ps, other]
Title: SAM3-I: Segment Anything with Instructions
Comments: Preliminary results; work in progress
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[155]  arXiv:2512.04581 [pdf, ps, other]
Title: Infrared UAV Target Tracking with Dynamic Feature Refinement and Global Contextual Attention Knowledge Distillation
Comments: Accepted by IEEE TMM
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[156]  arXiv:2512.04576 [pdf, ps, other]
Title: TARDis: Time Attenuated Representation Disentanglement for Incomplete Multi-Modal Tumor Segmentation and Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[157]  arXiv:2512.04568 [pdf, ps, other]
Title: Prompt2Craft: Generating Functional Craft Assemblies with LLMs
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[158]  arXiv:2512.04564 [pdf, ps, other]
Title: Dataset creation for supervised deep learning-based analysis of microscopic images -- review of important considerations and recommendations
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[159]  arXiv:2512.04563 [pdf, ps, other]
Title: COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[160]  arXiv:2512.04554 [pdf, ps, other]
Title: Counterfeit Answers: Adversarial Forgery against OCR-Free Document Visual Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[161]  arXiv:2512.04542 [pdf, ps, other]
Title: Gaussian Entropy Fields: Driving Adaptive Sparsity in 3D Gaussian Optimization
Comments: 28 pages,11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[162]  arXiv:2512.04540 [pdf, ps, other]
Title: VideoMem: Enhancing Ultra-Long Video Understanding via Adaptive Memory Management
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[163]  arXiv:2512.04537 [pdf, ps, other]
Title: X-Humanoid: Robotize Human Videos to Generate Humanoid Videos at Scale
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164]  arXiv:2512.04536 [pdf, ps, other]
Title: Detection of Intoxicated Individuals from Facial Video Sequences via a Recurrent Fusion Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[165]  arXiv:2512.04534 [pdf, ps, other]
Title: Refaçade: Editing Object with Given Reference Texture
Authors: Youze Huang (1), Penghui Ruan (2), Bojia Zi (3), Xianbiao Qi (4), Jianan Wang (5), Rong Xiao (4) ((1) University of Electronic Science and Technology of China, (2) The Hong Kong Polytechnic University, (3) The Chinese University of Hong Kong, (4) IntelliFusion Inc., (5) Astribot Inc.)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[166]  arXiv:2512.04532 [pdf, ps, other]
Title: PhyVLLM: Physics-Guided Video Language Model with Motion-Appearance Disentanglement
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[167]  arXiv:2512.04528 [pdf, ps, other]
Title: Auto3R: Automated 3D Reconstruction and Scanning via Data-driven Uncertainty Quantification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[168]  arXiv:2512.04522 [pdf, ps, other]
Title: Identity Clue Refinement and Enhancement for Visible-Infrared Person Re-Identification
Comments: 14 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[169]  arXiv:2512.04521 [pdf, ps, other]
Title: WiFi-based Cross-Domain Gesture Recognition Using Attention Mechanism
Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[170]  arXiv:2512.04520 [pdf, ps, other]
Title: Boundary-Aware Test-Time Adaptation for Zero-Shot Medical Image Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[171]  arXiv:2512.04519 [pdf, ps, other]
Title: VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[172]  arXiv:2512.04515 [pdf, ps, other]
Title: EgoLCD: Egocentric Video Generation with Long Context Diffusion
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[173]  arXiv:2512.04511 [pdf, ps, other]
Title: DuGI-MAE: Improving Infrared Mask Autoencoders via Dual-Domain Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[174]  arXiv:2512.04504 [pdf, ps, other]
Title: UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[175]  arXiv:2512.04499 [pdf, ps, other]
Title: Back to Basics: Motion Representation Matters for Human Motion Generation Using Diffusion Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[176]  arXiv:2512.04496 [pdf, ps, other]
Title: Shift-Window Meets Dual Attention: A Multi-Model Architecture for Specular Highlight Removal
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[177]  arXiv:2512.04487 [pdf, ps, other]
Title: Controllable Long-term Motion Generation with Extended Joint Targets
Comments: WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[178]  arXiv:2512.04485 [pdf, ps, other]
Title: Not All Birds Look The Same: Identity-Preserving Generation For Birds
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[179]  arXiv:2512.04483 [pdf, ps, other]
Title: DeRA: Decoupled Representation Alignment for Video Tokenization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[180]  arXiv:2512.04461 [pdf, ps, other]
Title: UniTS: Unified Time Series Generative Model for Remote Sensing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[181]  arXiv:2512.04459 [pdf, ps, other]
Title: dVLM-AD: Enhance Diffusion Vision-Language-Model for Driving via Controllable Reasoning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[182]  arXiv:2512.04456 [pdf, ps, other]
Title: GuidNoise: Single-Pair Guided Diffusion for Generalized Noise Synthesis
Comments: AAAI2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[183]  arXiv:2512.04451 [pdf, ps, other]
Title: StreamEQA: Towards Streaming Video Understanding for Embodied Scenarios
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[184]  arXiv:2512.04441 [pdf, ps, other]
Title: MindDrive: An All-in-One Framework Bridging World Models and Vision-Language Model for End-to-End Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[185]  arXiv:2512.04426 [pdf, ps, other]
Title: Self-Paced and Self-Corrective Masked Prediction for Movie Trailer Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[186]  arXiv:2512.04425 [pdf, ps, other]
Title: Explainable Parkinsons Disease Gait Recognition Using Multimodal RGB-D Fusion and Large Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[187]  arXiv:2512.04421 [pdf, ps, other]
Title: UTrice: Unifying Primitives in Differentiable Ray Tracing and Rasterization via Triangles for Particle-Based 3D Scenes
Comments: 13 pages, 10 figures, submitted to CVPR2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[188]  arXiv:2512.04413 [pdf, ps, other]
Title: Dual-Stream Spectral Decoupling Distillation for Remote Sensing Object Detection
Comments: 12 pages, 8 figures, 11 tables
Journal-ref: IEEE Transactions on Geoscience and Remote Sensing 63 (2025) 1-11
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[189]  arXiv:2512.04397 [pdf, ps, other]
Title: Performance Evaluation of Transfer Learning Based Medical Image Classification Techniques for Disease Detection
Journal-ref: 2025 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Copenhagen, Denmark, 2025, pp. 1-5
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[190]  arXiv:2512.04395 [pdf, ps, other]
Title: Fourier-Attentive Representation Learning: A Fourier-Guided Framework for Few-Shot Generalization in Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[191]  arXiv:2512.04390 [pdf, ps, other]
Title: FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring
Comments: 20 pages, 15 figures. Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[192]  arXiv:2512.04358 [pdf, ps, other]
Title: MAFNet:Multi-frequency Adaptive Fusion Network for Real-time Stereo Matching
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[193]  arXiv:2512.04356 [pdf, ps, other]
Title: Mitigating Object and Action Hallucinations in Multimodal LLMs via Self-Augmented Contrastive Alignment
Comments: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2026. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[194]  arXiv:2512.04331 [pdf, ps, other]
Title: Open Set Face Forgery Detection via Dual-Level Evidence Collection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[195]  arXiv:2512.04329 [pdf, ps, other]
Title: A Retrieval-Augmented Generation Approach to Extracting Algorithmic Logic from Neural Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Software Engineering (cs.SE)
[196]  arXiv:2512.04323 [pdf, ps, other]
Title: Bayes-DIC Net: Estimating Digital Image Correlation Uncertainty with Bayesian Neural Networks
Comments: 17 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
[197]  arXiv:2512.04315 [pdf, ps, other]
Title: SyncTrack4D: Cross-Video Motion Alignment and Video Synchronization for Multi-Video 4D Gaussian Splatting
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[198]  arXiv:2512.04314 [pdf, ps, other]
Title: DisentangleFormer: Spatial-Channel Decoupling for Multi-Channel Vision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[199]  arXiv:2512.04313 [pdf, ps, other]
Title: Mind-to-Face: Neural-Driven Photorealistic Avatar Synthesis via EEG Decoding
Comments: 16 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[200]  arXiv:2512.04311 [pdf, ps, other]
Title: Real-time Cricket Sorting By Sex
Comments: 13 pages, 14 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[201]  arXiv:2512.04309 [pdf, ps, other]
Title: Text-Only Training for Image Captioning with Retrieval Augmentation and Modality Gap Correction
Comments: Submitted to CVPR 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[202]  arXiv:2512.04305 [pdf, ps, other]
Title: How (Mis)calibrated is Your Federated CLIP and What To Do About It?
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[203]  arXiv:2512.04303 [pdf, ps, other]
Title: Gamma-from-Mono: Road-Relative, Metric, Self-Supervised Monocular Geometry for Vehicular Applications
Comments: Accepted in 3DV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[204]  arXiv:2512.04284 [pdf, ps, other]
Title: Learning Single-Image Super-Resolution in the JPEG Compressed Domain
Comments: 7 pages, 4 figures, 2 tables, SEEDS Workshop, ICIP 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[205]  arXiv:2512.04283 [pdf, ps, other]
Title: Plug-and-Play Image Restoration with Flow Matching: A Continuous Viewpoint
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[206]  arXiv:2512.04282 [pdf, ps, other]
Title: Inference-time Stochastic Refinement of GRU-Normalizing Flow for Real-time Video Motion Transfer
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[207]  arXiv:2512.04267 [pdf, ps, other]
Title: UniLight: A Unified Representation for Lighting
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[208]  arXiv:2512.04248 [pdf, ps, other]
Title: MVRoom: Controllable 3D Indoor Scene Generation with Multi-View Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[209]  arXiv:2512.04238 [pdf, ps, other]
Title: 6 Fingers, 1 Kidney: Natural Adversarial Medical Images Reveal Critical Weaknesses of Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[210]  arXiv:2512.04222 [pdf, ps, other]
Title: ReasonX: MLLM-Guided Intrinsic Image Decomposition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[211]  arXiv:2512.04221 [pdf, ps, other]
Title: MoReGen: Multi-Agent Motion-Reasoning Engine for Code-based Text-to-Video Synthesis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[212]  arXiv:2512.04219 [pdf, ps, other]
Title: Generalized Event Partonomy Inference with Structured Hierarchical Predictive Learning
Comments: 16 pages, 7 figures, 3 tables. Under Review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[213]  arXiv:2512.04187 [pdf, ps, other]
Title: OnSight Pathology: A real-time platform-agnostic computational pathology companion for histopathology
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[214]  arXiv:2512.04175 [pdf, ps, other]
Title: Beyond Flicker: Detecting Kinematic Inconsistencies for Generalizable Deepfake Video Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[215]  arXiv:2512.05117 (cross-list from cs.LG) [pdf, ps, other]
Title: The Universal Weight Subspace Hypothesis
Comments: 37 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[216]  arXiv:2512.05116 (cross-list from cs.LG) [pdf, ps, other]
Title: Value Gradient Guidance for Flow Matching Alignment
Comments: Accepted at NeurIPS 2025; 26 pages, 20 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[217]  arXiv:2512.05114 (cross-list from cs.LG) [pdf, ps, other]
Title: Deep infant brain segmentation from multi-contrast MRI
Comments: 8 pages, 8 figures, 1 table, website at this https URL, presented at the 2025 IEEE Asilomar Conference on Signals, Systems, and Computers
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[218]  arXiv:2512.05103 (cross-list from cs.LG) [pdf, ps, other]
Title: TV2TV: A Unified Framework for Interleaved Language and Video Generation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[219]  arXiv:2512.05094 (cross-list from cs.RO) [pdf, ps, other]
Title: From Generated Human Videos to Physically Plausible Robot Trajectories
Comments: For project website, see this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[220]  arXiv:2512.04814 (cross-list from cs.SD) [pdf, ps, other]
Title: Shared Multi-modal Embedding Space for Face-Voice Association
Comments: Ranked 1st in Fame 2026 Challenge, ICASSP
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV)
[221]  arXiv:2512.04763 (cross-list from cs.LG) [pdf, ps, other]
Title: MemLoRA: Distilling Expert Adapters for On-Device Memory Systems
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[222]  arXiv:2512.04705 (cross-list from cs.CC) [pdf, ps, other]
Title: Hardware-aware Neural Architecture Search of Early Exiting Networks on Edge Accelerators
Comments: Submitted to IEEE Transactions on Emerging Topics in Computing
Subjects: Computational Complexity (cs.CC); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV)
[223]  arXiv:2512.04625 (cross-list from cs.LG) [pdf, ps, other]
Title: Rethinking Decoupled Knowledge Distillation: A Predictive Distribution Perspective
Comments: Accepted to IEEE TNNLS
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[224]  arXiv:2512.04556 (cross-list from cs.GR) [pdf, ps, other]
Title: Efficient Spatially-Variant Convolution via Differentiable Sparse Kernel Complex
Comments: 10 pages, 7 figures
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[225]  arXiv:2512.04464 (cross-list from cs.LG) [pdf, ps, other]
Title: Feature Engineering vs. Deep Learning for Automated Coin Grading: A Comparative Study on Saint-Gaudens Double Eagles
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[226]  arXiv:2512.04385 (cross-list from cs.LG) [pdf, ps, other]
Title: STeP-Diff: Spatio-Temporal Physics-Informed Diffusion Models for Mobile Fine-Grained Pollution Forecasting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[227]  arXiv:2512.04264 (cross-list from cs.LG) [pdf, ps, other]
Title: Studying Various Activation Functions and Non-IID Data for Machine Learning Model Robustness
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[228]  arXiv:2512.04092 (cross-list from physics.soc-ph) [pdf, ps, other]
Title: The changing surface of the world's roads
Subjects: Physics and Society (physics.soc-ph); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[229]  arXiv:2512.04087 (cross-list from q-bio.NC) [pdf, ps, other]
Title: Human-Centred Evaluation of Text-to-Image Generation Models for Self-expression of Mental Distress: A Dataset Based on GPT-4o
Authors: Sui He, Shenbin Qian
Subjects: Neurons and Cognition (q-bio.NC); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC)

Thu, 4 Dec 2025 (showing first 21 of 130 entries)

[230]  arXiv:2512.04085 [pdf, ps, other]
Title: Unique Lives, Shared World: Learning from Single-Life Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[231]  arXiv:2512.04084 [pdf, ps, other]
Title: SimFlow: Simplified and End-to-End Training of Latent Normalizing Flows
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[232]  arXiv:2512.04082 [pdf, ps, other]
Title: PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[233]  arXiv:2512.04069 [pdf, ps, other]
Title: SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[234]  arXiv:2512.04048 [pdf, ps, other]
Title: Stable Signer: Hierarchical Sign Language Generative Model
Comments: 12 pages, 7 figures. More Demo at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Computers and Society (cs.CY)
[235]  arXiv:2512.04040 [pdf, ps, other]
Title: RELIC: Interactive Video World Model with Long-Horizon Memory
Comments: 22 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[236]  arXiv:2512.04039 [pdf, ps, other]
Title: Fast & Efficient Normalizing Flows and Applications of Image Generative Models
Authors: Sandeep Nagar
Comments: PhD Thesis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[237]  arXiv:2512.04025 [pdf, ps, other]
Title: PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation
Comments: Tech report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[238]  arXiv:2512.04021 [pdf, ps, other]
Title: C3G: Learning Compact 3D Representations with 2K Gaussians
Comments: Project Page : this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[239]  arXiv:2512.04019 [pdf, ps, other]
Title: Ultra-lightweight Neural Video Representation Compression
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[240]  arXiv:2512.04015 [pdf, ps, other]
Title: Learning Group Actions In Disentangled Latent Image Representations
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[241]  arXiv:2512.04012 [pdf, ps, other]
Title: Emergent Outlier View Rejection in Visual Geometry Grounded Transformers
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[242]  arXiv:2512.04007 [pdf, ps, other]
Title: On the Temporality for Sketch Representation Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[243]  arXiv:2512.04000 [pdf, ps, other]
Title: Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[244]  arXiv:2512.03996 [pdf, ps, other]
Title: Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[245]  arXiv:2512.03992 [pdf, ps, other]
Title: DIQ-H: Evaluating Hallucination Persistence in VLMs Under Temporal Visual Degradation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[246]  arXiv:2512.03981 [pdf, ps, other]
Title: DirectDrag: High-Fidelity, Mask-Free, Prompt-Free Drag-based Image Editing via Readout-Guided Feature Alignment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[247]  arXiv:2512.03979 [pdf, ps, other]
Title: BlurDM: A Blur Diffusion Model for Image Deblurring
Comments: NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[248]  arXiv:2512.03964 [pdf, ps, other]
Title: Training for Identity, Inference for Controllability: A Unified Approach to Tuning-Free Face Personalization
Comments: 17 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[249]  arXiv:2512.03963 [pdf, ps, other]
Title: TempR1: Improving Temporal Understanding of MLLMs via Temporal-Aware Multi-Task Reinforcement Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[250]  arXiv:2512.03939 [pdf, ps, other]
Title: MUT3R: Motion-aware Updating Transformer for Dynamic 3D Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[ total of 778 entries: 1-100 | 51-150 | 151-250 | 251-350 | 351-450 | 451-550 | ... | 751-778 ]
[ showing 100 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)