Science Cast

Hierarchical Reinforcement Learning with Augmented Step-Level Transitions for LLM Agents

librarianApril 8, 2026 2:06am

Views (16)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

Hierarchical Reinforcement Learning with Augmented Step-Level Transitions for LLM Agents

arXivPDFApril 7, 2026 12:00am

Authors

Shuai Zhen, Yanhua Yu, Ruopei Guo, Nan Cheng, Yang Deng

Abstract

Large language model (LLM) agents have demonstrated strong capabilities in complex interactive decision-making tasks. However, existing LLM agents typically rely on increasingly long interaction histories, resulting in high computational cost and limited scalability. In this paper, we propose STEP-HRL, a hierarchical reinforcement learning (HRL) framework that enables step-level learning by conditioning only on single-step transitions rather than full interaction histories. STEP-HRL structures tasks hierarchically, using completed subtasks to represent global progress of overall task. By introducing a local progress module, it also iteratively and selectively summarizes interaction history within each subtask to produce a compact summary of local progress. Together, these components yield augmented step-level transitions for both high-level and low-level policies. Experimental results on ScienceWorld and ALFWorld benchmarks consistently demonstrate that STEP-HRL substantially outperforms baselines in terms of performance and generalization while reducing token usage. Our code is available at https://github.com/TonyStark042/STEP-HRL.

TwitterandLinkedIn

0 comments

Add comment

Hierarchical Reinforcement Learning with Augmented Step-Level Transitions for LLM Agents

Hierarchical Reinforcement Learning with Augmented Step-Level Transitions for LLM Agents

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments