We gratefully acknowledge support from
the Simons Foundation and member institutions.

Artificial Intelligence

Authors and titles for recent submissions, skipping first 421

[ total of 798 entries: 1-10 | ... | 392-401 | 402-411 | 412-421 | 422-431 | 432-441 | 442-451 | 452-461 | ... | 792-798 ]
[ showing 10 entries per page: fewer | more | all ]

Tue, 9 Dec 2025 (showing first 10 of 256 entries)

[422]  arXiv:2512.07810 [pdf, ps, other]
Title: Auditing Games for Sandbagging
Comments: 77 pages (28 non-appendix pages), 38 figures
Subjects: Artificial Intelligence (cs.AI)
[423]  arXiv:2512.07796 [pdf, ps, other]
Title: Large Causal Models from Large Language Models
Comments: 29 pages
Subjects: Artificial Intelligence (cs.AI)
[424]  arXiv:2512.07795 [pdf, ps, other]
Title: ReasonBENCH: Benchmarking the (In)Stability of LLM Reasoning
Comments: 11 pages, 3 tables, 4 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[425]  arXiv:2512.07761 [pdf, ps, other]
Title: RL-MTJail: Reinforcement Learning for Automated Black-Box Multi-Turn Jailbreaking of Large Language Models
Comments: 19 pages, 15 figures
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[426]  arXiv:2512.07710 [pdf, ps, other]
Title: Each Prompt Matters: Scaling Reinforcement Learning Without Wasting Rollouts on Hundred-Billion-Scale MoE
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[427]  arXiv:2512.07631 [pdf, ps, other]
Title: The Agent Capability Problem: Predicting Solvability Through Information-Theoretic Bounds
Authors: Shahar Lutati
Subjects: Artificial Intelligence (cs.AI); Computational Complexity (cs.CC); Information Theory (cs.IT); Machine Learning (cs.LG)
[428]  arXiv:2512.07611 [pdf, ps, other]
Title: Comparative Analysis and Parametric Tuning of PPO, GRPO, and DAPO for LLM Reasoning Enhancement
Authors: Yongsheng Lian
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[429]  arXiv:2512.07497 [pdf, ps, other]
Title: How Do LLMs Fail In Agentic Scenarios? A Qualitative Analysis of Success and Failure Scenarios of Various LLMs in Agentic Simulations
Authors: JV Roig
Comments: 48 pages, 3 tables, 2 listings
Subjects: Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[430]  arXiv:2512.07436 [pdf, ps, other]
Title: LocalSearchBench: Benchmarking Agentic Search in Real-World Local Life Services
Subjects: Artificial Intelligence (cs.AI)
[431]  arXiv:2512.07355 [pdf, ps, other]
Title: A Geometric Unification of Concept Learning with Concept Cones
Comments: 22 pages
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[ total of 798 entries: 1-10 | ... | 392-401 | 402-411 | 412-421 | 422-431 | 432-441 | 442-451 | 452-461 | ... | 792-798 ]
[ showing 10 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)