Artificial Intelligence

Demystifying Data Organization for Enhanced LLM Training
Avatar
librarian
1 view
SchGen: PCB Schematic Generation with Semantic-Grounded Code Representations
Avatar
Qinpei Luo
3 views
ProjectionBench: Evaluating Scientific Hypothesis Generation in LLMs Under Progressive Information Disclosure
Avatar
Andrew Lew
3 views
MIRA: Mid-training Rubric Anchoring for Source-Aware Data Selection
Avatar
librarian
4 views
Locally Coherent, Globally Incoherent: Bounding Compositional Incoherence in Multi-Component LLM Agents
Avatar
librarian
3 views
SwarmHarness: Skill-Based Task Routing via Decentralized Incentive-Aligned AI Agent Networks
Avatar
Edwin Jose
9 views
CaMBRAIN: Real-time, Continuous EEG Inference with Causal State Space Models
Avatar
librarian
18 views
AutoScientists: Self-Organizing Agent Teams for Long-Running Scientific Experimentation
Avatar
librarian
23 views
CORE: Contrastive Reflection Enables Rapid Improvements in Reasoning
Avatar
librarian
19 views
Calibrating Conservatism for Scalable Oversight
Avatar
librarian
22 views
Detecting Is Not Resolving: The Monitoring Control Gap in Retrieval Augmented LLMs
Avatar
librarian
27 views
VitaBench 2.0: Evaluating Personalized and Proactive Agents in Long-Term User Interactions
Avatar
librarian
22 views
SIA: Self Improving AI with Harness & Weight Updates
Avatar
librarian
19 views
Alignment Tampering: How Reinforcement Learning from Human Feedback Is Exploited to Optimize Misaligned Biases
Avatar
librarian
25 views
MUSE-Autoskill: Self-Evolving Agents via Skill Creation, Memory, Management, and Evaluation
Avatar
Tieying Zhang
25 views
The Attribution Blind Spot: Detecting When Language Models Rely on Memory Rather Than Retrieved Context
Avatar
librarian
20 views
Helicase: Uncertainty-Guided Supply Chain Knowledge Graph Construction with Autonomous Multi-Agent LLMs
Avatar
Yunbo Long
20 views
Neuro-Symbolic Verification of LLM Outputs for Data-Sensitive Domains (extended preprint)
Avatar
Paul Sigloch
22 views
Neural Scalable Symbolic Search Framework for Complex Logical Queries with Multiple Free Variables
Avatar
Weizhi Fei
26 views
CausaLab: A Scalable Environment for Interactive Causal Discovery Toward AI Scientists
Avatar
librarian
27 views
VeriTrace: Evolving Mental Models for Deep Research Agents
Avatar
Haolang Zhao
29 views
From Model Scaling to System Scaling: Scaling the Harness in Agentic AI
Avatar
librarian
24 views
LCGuard: Latent Communication Guard for Safe KV Sharing in Multi-Agent Systems
Avatar
Sadia Asif
31 views
Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention
Avatar
librarian
31 views
Claw AI Lab: An Autonomous Multi-Agent Research Team
Avatar
librarian
31 views
MOSS: Self-Evolution through Source-Level Rewriting in Autonomous Agent Systems
Avatar
librarian
29 views
Advancing Mathematics Research with AI-Driven Formal Proof Search
Avatar
librarian
25 views
Insights Generator: Systematic Corpus-Level Trace Diagnostics for LLM Agents
Avatar
Akshay Manglik
28 views
Mind the Sim-to-Real Gap & Think Like a Scientist
Avatar
librarian
30 views
DeepWeb-Bench: A Deep Research Benchmark Demanding Massive Cross-Source Evidence and Long-Horizon Derivation
Avatar
librarian
38 views
PlanningBench: Generating Scalable and Verifiable Planning Data for Evaluating and Training Large Language Models
Avatar
Ziliang Zhao
15 views
AutoRPA: Efficient GUI Automation through LLM-Driven Code Synthesis from Interactions
Avatar
librarian
17 views