We gratefully acknowledge support from
the Simons Foundation and member institutions.

Performance

Authors and titles for recent submissions

[ total of 28 entries: 1-25 | 26-28 ]
[ showing 25 entries per page: fewer | more | all ]

Thu, 14 May 2026

[1]  arXiv:2605.13749 [pdf, ps, other]
Title: SPLIT: SymPathy for Large jobs Improves Tail latency
Subjects: Performance (cs.PF); Probability (math.PR)
[2]  arXiv:2605.13209 (cross-list from cs.DC) [pdf, ps, other]
Title: Comparing the Performance of Heterogeneous Conjugate Gradient and Cholesky Solvers on Various Hardware Using SYCL
Comments: 12 pages, to be published in International Workshop on OpenCL and SYCL (IWOCL '26)
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[3]  arXiv:2605.12080 (cross-list from cs.IT) [pdf, ps, other]
Title: On Capacity and Delay of Wireless Networks with Node Failures
Subjects: Information Theory (cs.IT); Distributed, Parallel, and Cluster Computing (cs.DC); Networking and Internet Architecture (cs.NI); Performance (cs.PF)

Wed, 13 May 2026

[4]  arXiv:2605.12445 [pdf, ps, other]
Title: Scalable Packed Layouts for Vector-Length-Agnostic ML Code Generation
Subjects: Performance (cs.PF)
[5]  arXiv:2605.12464 (cross-list from cs.LG) [pdf, ps, other]
Title: Search Your Block Floating Point Scales!
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Performance (cs.PF)
[6]  arXiv:2605.12433 (cross-list from cs.AR) [pdf, ps, other]
Title: Enhancing Instruction Prefetching via Cache and TLB Management
Comments: To appear at ISCA 2026
Subjects: Hardware Architecture (cs.AR); Performance (cs.PF)
[7]  arXiv:2605.11999 (cross-list from cs.DC) [pdf, ps, other]
Title: The Illusion of Power Capping in LLM Decode: A Phase-Aware Energy Characterisation Across Attention Architectures
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Performance (cs.PF)
[8]  arXiv:2605.11333 (cross-list from cs.DC) [pdf, ps, other]
Title: MLCommons Chakra: Advancing Performance Benchmarking and Co-design using Standardized Execution Traces
Comments: Accepted at the 9th Conference on Machine Learning and Systems (MLSys 2026)
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Performance (cs.PF)
[9]  arXiv:2605.11093 (cross-list from cs.LG) [pdf, ps, other]
Title: Enabling Performant and Flexible Model-Internal Observability for LLM Inference
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Performance (cs.PF); Software Engineering (cs.SE); Systems and Control (eess.SY)
[10]  arXiv:2605.11034 (cross-list from cs.CR) [pdf, ps, other]
Title: MambaNetBurst: Direct Byte-level Network Traffic Classification without Tokenization or Pretraining
Comments: 16 pages, 2 figures. Pareto-optimal frontier. Transformer vs Mamba vs Mamba-2 scaling performance. Code and data available on request
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Performance (cs.PF)

Tue, 12 May 2026

[11]  arXiv:2605.08792 [pdf, ps, other]
Title: A Controlled Study of Memory Hierarchy Transitions in Quantum Circuit Simulation on Apple M4 Pro Unified Memory Architecture
Authors: Gyan Pratipat
Comments: 9 Pages, 5 Figures, 7 Tables
Subjects: Performance (cs.PF); Quantum Physics (quant-ph)
[12]  arXiv:2605.08731 [pdf, ps, other]
Title: Single-Thread JPEG Decoder Benchmarks Mis-Evaluate ML Data Loaders
Comments: 9 pages, 4 figures. Code and data: this https URL
Subjects: Performance (cs.PF); Machine Learning (cs.LG)
[13]  arXiv:2605.10718 (cross-list from cs.DC) [pdf, ps, other]
Title: An Uncertainty-Aware Resilience Micro-Agent for Causal Observability in the Computing Continuum
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Performance (cs.PF); Systems and Control (eess.SY)
[14]  arXiv:2605.10457 (cross-list from cs.GR) [pdf, ps, other]
Title: Geometrically Approximated Modeling for Emitter-Centric Ray-Triangle Filtering in Arbitrarily Dynamic LiDAR Simulation
Comments: 21 pages, 20 figures
Subjects: Graphics (cs.GR); Performance (cs.PF); Robotics (cs.RO)
[15]  arXiv:2605.10175 (cross-list from cs.CR) [pdf, ps, other]
Title: Key Encapsulation Mechanism-Based Integrated Encryption Scheme (KEM-IES)
Authors: Abel C. H. Chen
Subjects: Cryptography and Security (cs.CR); Networking and Internet Architecture (cs.NI); Performance (cs.PF)
[16]  arXiv:2605.09999 (cross-list from cs.RO) [pdf, ps, other]
Title: Muninn: Your Trajectory Diffusion Model But Faster
Comments: Accepted to Robotics: Science and Systems 2026
Subjects: Robotics (cs.RO); Performance (cs.PF); Systems and Control (eess.SY)
[17]  arXiv:2605.09787 (cross-list from cs.DC) [pdf, ps, other]
Title: Cloud Performance Decomposition for Long-Term Performance Engineering: A Case Study
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[18]  arXiv:2605.09623 (cross-list from cs.DC) [pdf, ps, other]
Title: Adaptive DNN Partitioning and Offloading in Heterogeneous Edge-Cloud Continuum
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI); Performance (cs.PF)
[19]  arXiv:2605.08913 (cross-list from cs.LG) [pdf, ps, other]
Title: Non-Monotonic Latency in Apple MPS Decoding: KV Cache Interactions and Execution Regimes
Comments: 9 pages, 5 figures, 6 tables
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Computation and Language (cs.CL); Performance (cs.PF)
[20]  arXiv:2605.08725 (cross-list from cs.AR) [pdf, ps, other]
Title: Single 32-bit Sub-Channel DDR5 DIMMs: Architecture, Performance Bounds, and Standardisation
Authors: Chih-Hua Ke
Comments: 10 pages, 1 figure, 6 tables
Subjects: Hardware Architecture (cs.AR); Performance (cs.PF)
[21]  arXiv:2605.08333 (cross-list from cs.LG) [pdf, ps, other]
Title: CDS4RAG: Cyclic Dual-Sequential Hyperparameter Optimization for RAG
Comments: Accepted by main track at IJCAI 2026
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Performance (cs.PF); Software Engineering (cs.SE)
[22]  arXiv:2605.08314 (cross-list from cs.LG) [pdf, ps, other]
Title: FlashSVD v1.5: Making Low-Rank Transformers Inference Actually Fast
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Performance (cs.PF)
[23]  arXiv:2605.08305 (cross-list from cs.LG) [pdf, ps, other]
Title: LLMSYS-HPOBench: Hyperparameter Optimization Benchmark Suite for Real-World LLM Systems
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Performance (cs.PF); Software Engineering (cs.SE)

Mon, 11 May 2026

[24]  arXiv:2605.07719 (cross-list from cs.LG) [pdf, ps, other]
Title: An Efficient Hybrid Sparse Attention with CPU-GPU Parallelism for Long-Context Inference
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Performance (cs.PF)

Fri, 8 May 2026 (showing first 1 of 4 entries)

[25]  arXiv:2605.05699 [pdf, ps, other]
Title: When Quantization Is Free: An int4 KV Cache That Outruns fp16 on Apple Silicon
Subjects: Performance (cs.PF); Artificial Intelligence (cs.AI)
[ total of 28 entries: 1-25 | 26-28 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2605, contact, help  (Access key information)