Artificial Intelligence

Authors and titles for recent submissions, skipping first 421

[ total of 798 entries: 1-10 | ... | 392-401 | 402-411 | 412-421 | 422-431 | 432-441 | 442-451 | 452-461 | ... | 792-798 ]
[ showing 10 entries per page: fewer | more | all ]

Tue, 9 Dec 2025 (showing first 10 of 256 entries)

[422] arXiv:2512.07810 [pdf, ps, other]: Title: Auditing Games for Sandbagging

Authors: Jordan Taylor, Sid Black, Dillon Bowen, Thomas Read, Satvik Golechha, Alex Zelenka-Martin, Oliver Makins, Connor Kissane, Kola Ayonrinde, Jacob Merizian, Samuel Marks, Chris Cundy, Joseph Bloom

Comments: 77 pages (28 non-appendix pages), 38 figures

Subjects: Artificial Intelligence (cs.AI)
[423] arXiv:2512.07796 [pdf, ps, other]: Title: Large Causal Models from Large Language Models

Authors: Sridhar Mahadevan

Comments: 29 pages

Subjects: Artificial Intelligence (cs.AI)
[424] arXiv:2512.07795 [pdf, ps, other]: Title: ReasonBENCH: Benchmarking the (In)Stability of LLM Reasoning

Authors: Nearchos Potamitis, Lars Klein, Akhil Arora

Comments: 11 pages, 3 tables, 4 figures

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[425] arXiv:2512.07761 [pdf, ps, other]: Title: RL-MTJail: Reinforcement Learning for Automated Black-Box Multi-Turn Jailbreaking of Large Language Models

Authors: Xiqiao Xiong, Ouxiang Li, Zhuo Liu, Moxin Li, Wentao Shi, Fuli Feng, Xiangnan He

Comments: 19 pages, 15 figures

Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[426] arXiv:2512.07710 [pdf, ps, other]: Title: Each Prompt Matters: Scaling Reinforcement Learning Without Wasting Rollouts on Hundred-Billion-Scale MoE

Authors: Anxiang Zeng, Haibo Zhang, Hailing Zhang, Kaixiang Mo, Liang Yao, Ling Hu, Long Zhang, Shuman Liu, Shuyi Xie, Yanshi Li, Yizhang Chen, Yuepeng Sheng, Yuwei Huang, Zhaochen Xu, Zhiqiang Zhou, Ziqin Liew

Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[427] arXiv:2512.07631 [pdf, ps, other]: Title: The Agent Capability Problem: Predicting Solvability Through Information-Theoretic Bounds

Authors: Shahar Lutati

Subjects: Artificial Intelligence (cs.AI); Computational Complexity (cs.CC); Information Theory (cs.IT); Machine Learning (cs.LG)
[428] arXiv:2512.07611 [pdf, ps, other]: Title: Comparative Analysis and Parametric Tuning of PPO, GRPO, and DAPO for LLM Reasoning Enhancement

Authors: Yongsheng Lian

Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[429] arXiv:2512.07497 [pdf, ps, other]: Title: How Do LLMs Fail In Agentic Scenarios? A Qualitative Analysis of Success and Failure Scenarios of Various LLMs in Agentic Simulations

Authors: JV Roig

Comments: 48 pages, 3 tables, 2 listings

Subjects: Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[430] arXiv:2512.07436 [pdf, ps, other]: Title: LocalSearchBench: Benchmarking Agentic Search in Real-World Local Life Services

Authors: Hang He, Chuhuai Yue, Chengqi Dong, Mingxue Tian, Zhenfeng Liu, Jiajun Chai, Xiaohan Wang, Yufei Zhang, Qun Liao, Guojun Yin, Wei Lin, Chengcheng Wan, Haiying Sun, Ting Su

Subjects: Artificial Intelligence (cs.AI)
[431] arXiv:2512.07355 [pdf, ps, other]: Title: A Geometric Unification of Concept Learning with Concept Cones

Authors: Alexandre Rocchi--Henry, Thomas Fel, Gianni Franchi

Comments: 22 pages

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)

[ total of 798 entries: 1-10 | ... | 392-401 | 402-411 | 412-421 | 422-431 | 432-441 | 442-451 | 452-461 | ... | 792-798 ]
[ showing 10 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help (Access key information)

> cs > cs.AI

Artificial Intelligence

Authors and titles for recent submissions, skipping first 421

Tue, 9 Dec 2025 (showing first 10 of 256 entries)