Computer Science

StepHint: Multi-level Stepwise Hints Enhance Reinforcement Learning to
  Reason
Avatar
Kaiyi Zhang
0 views
DynamiCare: A Dynamic Multi-Agent Framework for Interactive and
  Open-Ended Medical Decision-Making
Avatar
Tianqi Shang
0 views
In-Training Multicalibrated Survival Analysis for Healthcare via
  Constrained Optimization
Avatar
Thiti Suttaket
0 views
Grounding Intelligence in Movement

Grounding Intelligence in Movement

Artificial Intelligence
Avatar
Melanie Segado
0 views
Decoupled Planning and Execution: A Hierarchical Reasoning Framework for
  Deep Search
Avatar
librarian
7 views
Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving
Avatar
Matthieu Zimmer
0 views
Knowledge Protocol Engineering: A New Paradigm for AI in Domain-Specific
  Knowledge Work
Avatar
librarian
1 view
Establishing Best Practices for Building Rigorous Agentic Benchmarks
Avatar
librarian
0 views
Revisiting Learning Rate Control
Avatar
librarian
0 views
Agent Ideate: A Framework for Product Idea Generation from Patents Using
  Agentic AI
Avatar
librarian
1 view
Exploring a Hybrid Deep Learning Approach for Anomaly Detection in
  Mental Healthcare Provider Billing: Addressing Label Scarcity through
  Semi-Supervised Anomaly Detection
Avatar
Samirah Bakker
0 views
Exploring Advanced LLM Multi-Agent Systems Based on Blackboard
  Architecture
Avatar
Bochen Han
0 views
Joint Matching and Pricing for Crowd-shipping with In-store Customers
Avatar
Arash Dehghan
1 view
Refining Gelfond Rationality Principle Towards More Comprehensive
  Foundational Principles for Answer Set Semantics
Avatar
Yi-Dong Shen
1 view
FADRM: Fast and Accurate Data Residual Matching for Dataset Distillation

FADRM: Fast and Accurate Data Residual Matchin...

Computer Vision and Pattern Recognition
Avatar
librarian
1 view
On the Predictive Power of Representation Dispersion in Language Models
Avatar
librarian
1 view
STACK: Adversarial Attacks on LLM Safeguard Pipelines
Avatar
librarian
1 view
Performance of LLMs on Stochastic Modeling Operations Research Problems:
  From Theory to Practice
Avatar
librarian
3 views
AI Risk-Management Standards Profile for General-Purpose AI (GPAI) and
  Foundation Models
Avatar
librarian
2 views
Constructing Non-Markovian Decision Process via History Aggregator
Avatar
Yongyi Wang
4 views