Quantum-enhanced Large Language Models on Quantum Hardware via Cayley Unitary Adapters
Borja Aizpurua, Sukhbinder Singh, Augustine Kshetrimayum, Saeed S. Jahromi, Roman Orus
Abstract
Large language models (LLMs) have transformed artificial intelligence, yet classical architectures impose a fundamental constraint: every trainable parameter demands classical memory that scales unfavourably with model size. Quantum computing offers a qualitatively different pathway, but practical demonstrations on real hardware have remained elusive for models of practical relevance. Here we show that Cayley-parameterised unitary adapters -- quantum circuit blocks inserted into the frozen projection layers of pre-trained LLMs and executed on a 156-qubit IBM Quantum System Two superconducting processor -- improve the perplexity of Llama 3.1 8B, an 8-billion-parameter model in widespread use, by 1.4% with only 6,000 additional parameters and end-to-end inference validated on a real quantum processing unit (QPU). A systematic study on SmolLM2 (135M parameters), chosen for its tractability, reveals monotonically improving perplexity with unitary block dimension, 83% recovery of compression-induced degradation, and correct answers to questions on which both classical baselines fail -- together with a sharp noise-expressivity phase transition that identifies a concrete path to quantum utility at larger qubit scales.
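For reference, the standard Cayley transform maps a skew-Hermitian generator to an exactly unitary matrix; we assume the adapter parameterisation named in the title follows this general form, although the paper's specific construction may include additional structure:

% Sketch of the standard Cayley map; the adapter construction is assumed to follow this general form.
\[
  U = (I - A)\,(I + A)^{-1}, \qquad A^\dagger = -A ,
\]
\[
  U^\dagger U = (I - A)^{-1}(I + A)\,(I - A)(I + A)^{-1} = I ,
\]
since $(I + A)$ and $(I - A)$ commute. Training the entries of $A$ therefore keeps the adapter block unitary by construction, which is what allows it to be realised as a quantum circuit.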