
Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 34

[ total of 749 entries; showing entries 35-134 ]

Wed, 10 Dec 2025 (continued, showing last 97 of 131 entries)

[35]  arXiv:2512.08589 [pdf, ps, other]
Title: Automated Pollen Recognition in Optical and Holographic Microscopy Images
Comments: 8 pages, 10 figures, 4 tables, 20 references. Date of Conference: 13-14 June 2025; Date Added to IEEE Xplore: 10 July 2025; Electronic ISBN: 979-8-3315-0969-9; Print on Demand (PoD) ISBN: 979-8-3315-0970-5; DOI: 10.1109/AICCONF64766.2025.11064260; Conference Location: Prague, Czech Republic; Online Access: this https URL
Journal-ref: 2025 3rd Cognitive Models and Artificial Intelligence Conference (AICCONF), vol. 1, no. 1, pp. 1-8, Prague, Czech Republic, IEEE, 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[36]  arXiv:2512.08577 [pdf, ps, other]
Title: Disturbance-Free Surgical Video Generation from Multi-Camera Shadowless Lamps for Open Surgery
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[37]  arXiv:2512.08572 [pdf, ps, other]
Title: From Cells to Survival: Hierarchical Analysis of Cell Inter-Relations in Multiplex Microscopy for Lung Cancer Prognosis
Comments: 5 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[38]  arXiv:2512.08569 [pdf, ps, other]
Title: Instance-Aware Test-Time Segmentation for Continual Domain Shifts
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[39]  arXiv:2512.08564 [pdf, ps, other]
Title: Modular Neural Image Signal Processing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[40]  arXiv:2512.08560 [pdf, ps, other]
Title: BrainExplore: Large-Scale Discovery of Interpretable Visual Representations in the Human Brain
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[41]  arXiv:2512.08557 [pdf, ps, other]
Title: SSCATeR: Sparse Scatter-Based Convolution Algorithm with Temporal Data Recycling for Real-Time 3D Object Detection in LiDAR Point Clouds
Comments: 22 Pages, 26 Figures, This work has been submitted to the IEEE Sensors Journal for possible publication
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[42]  arXiv:2512.08547 [pdf, ps, other]
Title: An Iteration-Free Fixed-Point Estimator for Diffusion Inversion
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[43]  arXiv:2512.08542 [pdf, ps, other]
Title: A Novel Wasserstein Quaternion Generative Adversarial Network for Color Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Numerical Analysis (math.NA)
[44]  arXiv:2512.08537 [pdf, ps, other]
Title: Fast-ARDiff: An Entropy-informed Acceleration Framework for Continuous Space Autoregressive Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[45]  arXiv:2512.08535 [pdf, ps, other]
Title: Photo3D: Advancing Photorealistic 3D Generation through Structure-Aligned Detail Enhancement
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[46]  arXiv:2512.08534 [pdf, ps, other]
Title: PaintFlow: A Unified Framework for Interactive Oil Paintings Editing and Generation
Comments: 14 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[47]  arXiv:2512.08529 [pdf, ps, other]
Title: MVP: Multiple View Prediction Improves GUI Grounding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[48]  arXiv:2512.08524 [pdf, ps, other]
Title: Beyond Real Weights: Hypercomplex Representations for Stable Quantization
Comments: Accepted in Winter Conference on Applications of Computer Vision (WACV) 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[49]  arXiv:2512.08511 [pdf, ps, other]
Title: Thinking with Images via Self-Calling Agent
Comments: Code would be released at this https URL soon
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[50]  arXiv:2512.08506 [pdf, ps, other]
Title: OCCDiff: Occupancy Diffusion Model for High-Fidelity 3D Building Reconstruction from Noisy Point Clouds
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[51]  arXiv:2512.08505 [pdf, ps, other]
Title: Beyond the Noise: Aligning Prompts with Latent Representations in Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[52]  arXiv:2512.08503 [pdf, ps, other]
Title: Disrupting Hierarchical Reasoning: Adversarial Protection for Geographic Privacy in Multimodal Reasoning Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[53]  arXiv:2512.08498 [pdf, ps, other]
Title: On-the-fly Large-scale 3D Reconstruction from Multi-Camera Rigs
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[54]  arXiv:2512.08486 [pdf, ps, other]
Title: Temporal Concept Dynamics in Diffusion Models via Prompt-Conditioned Interventions
Comments: Code is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[55]  arXiv:2512.08478 [pdf, ps, other]
Title: Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[56]  arXiv:2512.08477 [pdf, ps, other]
Title: ContextDrag: Precise Drag-Based Image Editing via Context-Preserving Token Injection and Position-Consistent Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[57]  arXiv:2512.08467 [pdf, ps, other]
Title: Team-Aware Football Player Tracking with SAM: An Appearance-Based Approach to Occlusion Recovery
Comments: 8 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[58]  arXiv:2512.08445 [pdf, ps, other]
Title: Uncertainty-Aware Subset Selection for Robust Visual Explainability under Distribution Shifts
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[59]  arXiv:2512.08441 [pdf, ps, other]
Title: Leveraging Multispectral Sensors for Color Correction in Mobile Cameras
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[60]  arXiv:2512.08439 [pdf, ps, other]
Title: LapFM: A Laparoscopic Segmentation Foundation Model via Hierarchical Concept Evolving Pre-training
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[61]  arXiv:2512.08430 [pdf, ps, other]
Title: SDT-6D: Fully Sparse Depth-Transformer for Staged End-to-End 6D Pose Estimation in Industrial Multi-View Bin Picking
Comments: Accepted to WACV 2026. Preprint version
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[62]  arXiv:2512.08410 [pdf, ps, other]
Title: Towards Effective and Efficient Long Video Understanding of Multimodal Large Language Models via One-shot Clip Retrieval
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[63]  arXiv:2512.08406 [pdf, ps, other]
Title: SAM-Body4D: Training-Free 4D Human Body Mesh Recovery from Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[64]  arXiv:2512.08400 [pdf, ps, other]
Title: Towards Visual Re-Identification of Fish using Fine-Grained Classification for Electronic Monitoring in Fisheries
Comments: The paper has been accepted for publication at Northern Lights Deep Learning (NLDL) Conference 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[65]  arXiv:2512.08397 [pdf, ps, other]
Title: Detection of Digital Facial Retouching utilizing Face Beauty Information
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[66]  arXiv:2512.08378 [pdf, ps, other]
Title: Simultaneous Enhancement and Noise Suppression under Complex Illumination Conditions
Comments: The paper has been accepted and officially published by IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[67]  arXiv:2512.08374 [pdf, ps, other]
Title: The Unseen Bias: How Norm Discrepancy in Pre-Norm MLLMs Leads to Visual Information Loss
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[68]  arXiv:2512.08362 [pdf, ps, other]
Title: SCU-CGAN: Enhancing Fire Detection through Synthetic Fire Image Generation and Dataset Augmentation
Comments: Accepted for main track at MobieSec 2024 (not published in the proceedings)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[69]  arXiv:2512.08358 [pdf, ps, other]
Title: TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels
Comments: Accepted by NeurIPS 2025. Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[70]  arXiv:2512.08337 [pdf, ps, other]
Title: DINO-BOLDNet: A DINOv3-Guided Multi-Slice Attention Network for T1-to-BOLD Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[71]  arXiv:2512.08334 [pdf, ps, other]
Title: HybridSplat: Fast Reflection-baked Gaussian Tracing using Hybrid Splatting
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72]  arXiv:2512.08331 [pdf, ps, other]
Title: Bi^2MAC: Bimodal Bi-Adaptive Mask-Aware Convolution for Remote Sensing Pansharpening
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[73]  arXiv:2512.08330 [pdf, ps, other]
Title: PointDico: Contrastive 3D Representation Learning Guided by Diffusion Models
Comments: Accepted by IJCNN 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[74]  arXiv:2512.08329 [pdf, ps, other]
Title: Interpreting Structured Perturbations in Image Protection Methods for Diffusion Models
Comments: 32 pages, 17 figures, 1 table, 5 algorithms, preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[75]  arXiv:2512.08327 [pdf, ps, other]
Title: Low Rank Support Quaternion Matrix Machine
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[76]  arXiv:2512.08325 [pdf, ps, other]
Title: GeoDiffMM: Geometry-Guided Conditional Diffusion for Motion Magnification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[77]  arXiv:2512.08323 [pdf, ps, other]
Title: Detecting Dental Landmarks from Intraoral 3D Scans: the 3DTeethLand challenge
Comments: MICCAI 2024, 3DTeethLand, Challenge report, under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[78]  arXiv:2512.08317 [pdf, ps, other]
Title: GeoDM: Geometry-aware Distribution Matching for Dataset Distillation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[79]  arXiv:2512.08309 [pdf, ps, other]
Title: Terrain Diffusion: A Diffusion-Based Successor to Perlin Noise in Infinite, Real-Time Terrain Generation
Authors: Alexander Goslin
Comments: Project website: this https URL. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[80]  arXiv:2512.08294 [pdf, ps, other]
Title: OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[81]  arXiv:2512.08282 [pdf, ps, other]
Title: PAVAS: Physics-Aware Video-to-Audio Synthesis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD)
[82]  arXiv:2512.08269 [pdf, ps, other]
Title: EgoX: Egocentric Video Generation from a Single Exocentric Video
Comments: 21 pages, project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[83]  arXiv:2512.08262 [pdf, ps, other]
Title: RLCNet: An end-to-end deep learning framework for simultaneous online calibration of LiDAR, RADAR, and Camera
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[84]  arXiv:2512.08254 [pdf, ps, other]
Title: SFP: Real-World Scene Recovery Using Spatial and Frequency Priors
Comments: 10 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[85]  arXiv:2512.08253 [pdf, ps, other]
Title: Query-aware Hub Prototype Learning for Few-Shot 3D Point Cloud Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[86]  arXiv:2512.08247 [pdf, ps, other]
Title: Distilling Future Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection
Comments: AAAI-26
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[87]  arXiv:2512.08243 [pdf, ps, other]
Title: Residual-SwinCA-Net: A Channel-Aware Integrated Residual CNN-Swin Transformer for Malignant Lesion Segmentation in BUSI
Authors: Saeeda Naz, Saddam Hussain Khan (Artificial Intelligence Lab, Department of Computer Systems Engineering, University of Engineering and Applied Sciences (UEAS), Swat, Pakistan)
Comments: 26 Pages, 10 Figures, 4 Tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[88]  arXiv:2512.08240 [pdf, ps, other]
Title: HybridToken-VLM: Hybrid Token Compression for Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[89]  arXiv:2512.08237 [pdf, ps, other]
Title: FastBEV++: Fast by Algorithm, Deployable by Design
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[90]  arXiv:2512.08229 [pdf, ps, other]
Title: Geometry-Aware Sparse Depth Sampling for High-Fidelity RGB-D Depth Completion in Robotic Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[91]  arXiv:2512.08228 [pdf, ps, other]
Title: MM-CoT: A Benchmark for Probing Visual Chain-of-Thought Reasoning in Multimodal Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[92]  arXiv:2512.08227 [pdf, ps, other]
Title: New VVC profiles targeting Feature Coding for Machines
Comments: Accepted for presentation at ICIP 2025 workshop on Coding for Machines
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[93]  arXiv:2512.08223 [pdf, ps, other]
Title: SOP^2: Transfer Learning with Scene-Oriented Prompt Pool on 3D Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[94]  arXiv:2512.08221 [pdf, ps, other]
Title: VisKnow: Constructing Visual Knowledge Base for Object Understanding
Comments: 16 pages, 12 figures, 7 tables. Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[95]  arXiv:2512.08215 [pdf, ps, other]
Title: Blur2Sharp: Human Novel Pose and View Synthesis with Generative Prior Refinement
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[96]  arXiv:2512.08198 [pdf, ps, other]
Title: Animal Re-Identification on Microcontrollers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[97]  arXiv:2512.08180 [pdf, ps, other]
Title: GeoLoom: High-quality Geometric Diagram Generation from Textual Input
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[98]  arXiv:2512.08163 [pdf, ps, other]
Title: Accuracy Does Not Guarantee Human-Likeness in Monocular Depth Estimators
Comments: 22 pages, 12 figures, 1 table
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[99]  arXiv:2512.08161 [pdf, ps, other]
Title: Fourier-RWKV: A Multi-State Perception Network for Efficient Image Dehazing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[100]  arXiv:2512.08135 [pdf, ps, other]
Title: CVP: Central-Peripheral Vision-Inspired Multimodal Model for Spatial Reasoning
Comments: Accepted to WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[101]  arXiv:2512.08075 [pdf, ps, other]
Title: Identification of Deforestation Areas in the Amazon Rainforest Using Change Detection Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[102]  arXiv:2512.08048 [pdf, ps, other]
Title: Mask to Adapt: Simple Random Masking Enables Robust Continual Test-Time Learning
Comments: ongoing work
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[103]  arXiv:2512.08042 [pdf, ps, other]
Title: Towards Sustainable Universal Deepfake Detection with Frequency-Domain Masking
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[104]  arXiv:2512.08040 [pdf, ps, other]
Title: Lost in Translation, Found in Embeddings: Sign Language Translation and Alignment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[105]  arXiv:2512.08038 [pdf, ps, other]
Title: SSplain: Sparse and Smooth Explainer for Retinopathy of Prematurity Classification
Comments: 20 pages, 16 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[106]  arXiv:2512.08016 [pdf, ps, other]
Title: FRIEDA: Benchmarking Multi-Step Cartographic Reasoning in Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[107]  arXiv:2512.07984 [pdf, ps, other]
Title: Restrictive Hierarchical Semantic Segmentation for Stratified Tooth Layer Detection
Comments: 13 pages, 7 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[108]  arXiv:2512.07951 [pdf, ps, other]
Title: Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality
Comments: Project webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[109]  arXiv:2512.07925 [pdf, ps, other]
Title: Near-real time fires detection using satellite imagery in Sudan conflict
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[110]  arXiv:2512.07838 [pdf, ps, other]
Title: Detection of Cyberbullying in GIF using AI
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[111]  arXiv:2512.08715 (cross-list from cs.PF) [pdf, ps, other]
Title: Multi-domain performance analysis with scores tailored to user preferences
Subjects: Performance (cs.PF); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[112]  arXiv:2512.08629 (cross-list from cs.AI) [pdf, ps, other]
Title: See-Control: A Multimodal Agent Framework for Smartphone Interaction with a Robotic Arm
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[113]  arXiv:2512.08545 (cross-list from cs.CL) [pdf, ps, other]
Title: Curriculum Guided Massive Multi Agent System Solving For Robust Long Horizon Tasks
Comments: 22 pages, 2 tables, 9 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[114]  arXiv:2512.08500 (cross-list from cs.GR) [pdf, ps, other]
Title: Learning to Control Physically-simulated 3D Characters via Generating and Mimicking 2D Motions
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[115]  arXiv:2512.08360 (cross-list from cs.NE) [pdf, ps, other]
Title: Conditional Morphogenesis: Emergent Generation of Structural Digits via Neural Cellular Automata
Authors: Ali Sakour
Comments: 13 pages, 5 figures. Code available at: this https URL
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[116]  arXiv:2512.08284 (cross-list from physics.geo-ph) [pdf, ps, other]
Title: Self-Reinforced Deep Priors for Reparameterized Full Waveform Inversion
Comments: Submitted to GEOPHYSICS
Subjects: Geophysics (physics.geo-ph); Computer Vision and Pattern Recognition (cs.CV)
[117]  arXiv:2512.08271 (cross-list from cs.RO) [pdf, ps, other]
Title: Zero-Splat TeleAssist: A Zero-Shot Pose Estimation Framework for Semantic Teleoperation
Comments: Published and Presented at 3rd Workshop on Human-Centric Multilateral Teleoperation in ICRA 2025
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[118]  arXiv:2512.08216 (cross-list from eess.IV) [pdf, ps, other]
Title: Tumor-anchored deep feature random forests for out-of-distribution detection in lung cancer segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[119]  arXiv:2512.08188 (cross-list from cs.RO) [pdf, ps, other]
Title: Embodied Tree of Thoughts: Deliberate Manipulation Planning with Embodied World Model
Comments: Website at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[120]  arXiv:2512.08170 (cross-list from cs.RO) [pdf, ps, other]
Title: RAVES-Calib: Robust, Accurate and Versatile Extrinsic Self Calibration Using Optimal Geometric Features
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[121]  arXiv:2512.08153 (cross-list from cs.LG) [pdf, ps, other]
Title: TreeGRPO: Tree-Advantage GRPO for Online RL Post-Training of Diffusion Models
Authors: Zheng Ding, Weirui Ye
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[122]  arXiv:2512.08125 (cross-list from eess.IV) [pdf, ps, other]
Title: FlowSteer: Conditioning Flow Field for Consistent Image Restoration
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[123]  arXiv:2512.08099 (cross-list from math.NA) [pdf, ps, other]
Title: Generalizations of the Normalized Radon Cumulative Distribution Transform for Limited Data Recognition
Subjects: Numerical Analysis (math.NA); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[124]  arXiv:2512.08029 (cross-list from cs.LG) [pdf, ps, other]
Title: CLARITY: Medical World Model for Guiding Treatment Decisions by Modeling Context-Aware Disease Trajectories in Latent Space
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[125]  arXiv:2512.07998 (cross-list from cs.RO) [pdf, ps, other]
Title: DIJIT: A Robotic Head for an Active Observer
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[126]  arXiv:2512.07981 (cross-list from cs.LG) [pdf, ps, other]
Title: CIP-Net: Continual Interpretable Prototype-based Network
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[127]  arXiv:2512.07976 (cross-list from cs.RO) [pdf, ps, other]
Title: VLD: Visual Language Goal Distance for Reinforcement Learning Navigation
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[128]  arXiv:2512.07969 (cross-list from cs.RO) [pdf, ps, other]
Title: Sparse Variable Projection in Robotic Perception: Exploiting Separable Structure for Efficient Nonlinear Optimization
Comments: 8 pages, submitted for review
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[129]  arXiv:2512.07884 (cross-list from cs.LG) [pdf, ps, other]
Title: GSPN-2: Efficient Parallel Sequence Modeling
Comments: NeurIPS 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[130]  arXiv:2512.07855 (cross-list from cs.LG) [pdf, ps, other]
Title: LAPA: Log-Domain Prediction-Driven Dynamic Sparsity Accelerator for Transformer Model
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[131]  arXiv:2512.05791 (cross-list from physics.med-ph) [pdf, ps, other]
Title: Fast and Robust Diffusion Posterior Sampling for MR Image Reconstruction Using the Preconditioned Unadjusted Langevin Algorithm
Comments: Submitted to Magnetic Resonance in Medicine
Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Probability (math.PR)

Tue, 9 Dec 2025 (showing first 3 of 259 entries)

[132]  arXiv:2512.07834 [pdf, ps, other]
Title: Voxify3D: Pixel Art Meets Volumetric Rendering
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[133]  arXiv:2512.07833 [pdf, ps, other]
Title: Relational Visual Similarity
Comments: Project page, data, and code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[134]  arXiv:2512.07831 [pdf, ps, other]
Title: UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation
Comments: Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)