We gratefully acknowledge support from
the Simons Foundation and member institutions.

Hardware Architecture

Authors and titles for recent submissions, skipping first 18

[ total of 68 entries: 1-25 | 19-43 | 44-68 ]
[ showing 25 entries per page: fewer | more | all ]

Wed, 3 Dec 2025 (continued, showing last 2 of 6 entries)

[19]  arXiv:2512.02189 [pdf, ps, other]
Title: Microbenchmarking NVIDIA's Blackwell Architecture: An in-depth Architectural Analysis
Subjects: Hardware Architecture (cs.AR)
[20]  arXiv:2512.02403 (cross-list from cs.LG) [pdf, ps, other]
Title: ESACT: An End-to-End Sparse Accelerator for Compute-Intensive Transformers via Local Similarity
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)

Tue, 2 Dec 2025 (showing first 23 of 38 entries)

[21]  arXiv:2512.01644 [pdf, ps, other]
Title: A Systematic Characterization of LLM Inference on GPUs
Subjects: Hardware Architecture (cs.AR)
[22]  arXiv:2512.01541 [pdf, ps, other]
Title: RoMe: Row Granularity Access Memory System for Large Language Models
Comments: 15 pages, 14 figures, accepted at HPCA 2026
Subjects: Hardware Architecture (cs.AR)
[23]  arXiv:2512.01463 [pdf, ps, other]
[24]  arXiv:2512.01193 [pdf, ps, other]
Title: Leveraging Recurrent Patterns in Graph Accelerators
Comments: Accepted at DATE 2026
Subjects: Hardware Architecture (cs.AR)
[25]  arXiv:2512.00974 [pdf, ps, other]
Title: A WASM-Subset Stack Architecture for Low-cost FPGAs using Open-Source EDA Flows
Authors: Aradhya Chakrabarti (1) ((1) School of Computer Engineering, KIIT Deemed to be University)
Comments: 6 pages, 5 figures. Source code available at this https URL
Subjects: Hardware Architecture (cs.AR); Emerging Technologies (cs.ET)
[26]  arXiv:2512.00487 [pdf, ps, other]
Title: Partial Cross-Compilation and Mixed Execution for Accelerating Dynamic Binary Translation
Subjects: Hardware Architecture (cs.AR); Programming Languages (cs.PL)
[27]  arXiv:2512.00441 [pdf, ps, other]
Title: A Novel 8T SRAM-Based In-Memory Computing Architecture for MAC-Derived Logical Functions
Authors: Amogh K M, Sunita M S
Comments: 6 pages, 6 figures, Accepted at 39th VLSID 2026 conference
Subjects: Hardware Architecture (cs.AR); Emerging Technologies (cs.ET)
[28]  arXiv:2512.00335 [pdf, ps, other]
Title: Efficient Kernel Mapping and Comprehensive System Evaluation of LLM Acceleration on a CGLA
Comments: This paper is published at IEEE Access
Journal-ref: IEEE Access, 2025
Subjects: Hardware Architecture (cs.AR)
[29]  arXiv:2512.00186 [pdf, ps, other]
Title: Variable Point: A Number Format for Area- and Energy-Efficient Multiplication of High-Dynamic-Range Numbers
Comments: Presented at the 59th Asilomar Conference on Signals, Systems, and Computers
Subjects: Hardware Architecture (cs.AR); Signal Processing (eess.SP)
[30]  arXiv:2512.00138 [pdf, ps, other]
Title: Ternary-Input Binary-Weight CNN Accelerator Design for Miniature Object Classification System with Query-Driven Spatial DVS
Comments: 6 pages.12 figures & 2 table
Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[31]  arXiv:2512.00113 [pdf, ps, other]
Title: From RISC-V Cores to Neuromorphic Arrays: A Tutorial on Building Scalable Digital Neuromorphic Processors
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Signal Processing (eess.SP)
[32]  arXiv:2512.00112 [pdf, ps, other]
Title: An Analytical and Empirical Investigation of Tag Partitioning for Energy-Efficient Reliable Cache
Subjects: Hardware Architecture (cs.AR)
[33]  arXiv:2512.00096 [pdf, ps, other]
Title: Modeling and Simulation Frameworks for Processing-in-Memory Architectures
Subjects: Hardware Architecture (cs.AR); Emerging Technologies (cs.ET)
[34]  arXiv:2512.00083 [pdf, ps, other]
Title: LLaMCAT: Optimizing Large Language Model Inference with Cache Arbitration and Throttling
Comments: Accepted to ICPP 2025
Journal-ref: In Proceedings of 54th International Conference on Parallel Processing (ICPP 2025)
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC)
[35]  arXiv:2512.00079 [pdf, ps, other]
Title: InF-ATPG: Intelligent FFR-Driven ATPG with Advanced Circuit Representation Guided Reinforcement Learning
Comments: 9 pages,6 figures
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[36]  arXiv:2512.00070 [pdf, ps, other]
Title: A CNN-Based Technique to Assist Layout-to-Generator Conversion for Analog Circuits
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[37]  arXiv:2512.00059 [pdf, ps, other]
Title: SafeCiM: Investigating Resilience of Hybrid Floating-Point Compute-in-Memory Deep Learning Accelerators
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[38]  arXiv:2512.00055 [pdf, other]
Title: KAN-SAs: Efficient Acceleration of Kolmogorov-Arnold Networks on Systolic Arrays
Authors: Sohaib Errabii (TARAN), Olivier Sentieys (TARAN), Marcello Traiola (TARAN)
Journal-ref: IEEE/ACM Design, Automation \& Test in Europe Conference (DATE) 2026, Apr 2026, Verona, Italy
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[39]  arXiv:2512.00053 [pdf, ps, other]
Title: A Configurable Mixed-Precision Fused Dot Product Unit for GPGPU Tensor Computation
Comments: 3 pages, 2 figures
Subjects: Hardware Architecture (cs.AR)
[40]  arXiv:2512.00045 [pdf, ps, other]
Title: Assessing Large Language Models in Generating RTL Design Specifications
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[41]  arXiv:2512.00044 [pdf, ps, other]
Title: SetupKit: Efficient Multi-Corner Setup/Hold Time Characterization Using Bias-Enhanced Interpolation and Active Learning
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[42]  arXiv:2512.00038 [pdf, ps, other]
Title: Critical Path Aware Timing-Driven Global Placement for Large-Scale Heterogeneous FPGAs
Subjects: Hardware Architecture (cs.AR)
[43]  arXiv:2512.00035 [pdf, ps, other]
Title: WebAssembly on Resource-Constrained IoT Devices: Performance, Efficiency, and Portability
Subjects: Hardware Architecture (cs.AR); Operating Systems (cs.OS)
[ total of 68 entries: 1-25 | 19-43 | 44-68 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)