Science Cast

Performance of LLMs on Stochastic Modeling Operations Research Problems: From Theory to Practice

librarianJuly 2, 2025 2:30pm

Views (3)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

Performance of LLMs on Stochastic Modeling Operations Research Problems: From Theory to Practice

arXivPDFJune 30, 2025 12:00am

Authors

Akshit Kumar, Tianyi Peng, Yuhang Wu, Assaf Zeevi

Abstract

Large language models (LLMs) have exhibited expert-level capabilities across various domains. However, their abilities to solve problems in Operations Research (OR) -- the analysis and optimization of mathematical models derived from real-world problems or their verbal descriptions -- remain underexplored. In this work, we take a first step toward evaluating LLMs' abilities to solve stochastic modeling problems, a core class of OR problems characterized by uncertainty and typically involving tools from probability, statistics, and stochastic processes. We manually procure a representative set of graduate-level homework and doctoral qualification-exam problems and test LLMs' abilities to solve them. We further leverage SimOpt, an open-source library of simulation-optimization problems and solvers, to investigate LLMs' abilities to make real-world decisions under uncertainty. Our results show that, though a nontrivial amount of work is still needed to reliably automate the stochastic modeling pipeline in reality, state-of-the-art LLMs demonstrate proficiency on par with human experts in both classroom and practical settings. These findings highlight the potential of building AI agents that assist OR researchers and amplify the real-world impact of OR through automation.

TwitterandLinkedIn

0 comments

Add comment

Performance of LLMs on Stochastic Modeling Operations Research Problems: From Theory to Practice

Performance of LLMs on Stochastic Modeling Operations Research Problems: From Theory to Practice

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments