Computer Science

Demystifying Data Organization for Enhanced LLM Training
librarian
1 view
CalArena: A Large-Scale Post-Hoc Calibration Benchmark
Eugène Berta
5 views
SchGen: PCB Schematic Generation with Semantic-Grounded Code Representations
Qinpei Luo
3 views
When, why, and how do diffusion posterior samplers fail? A finite-sample lens
Benjamin Burns
3 views
ProjectionBench: Evaluating Scientific Hypothesis Generation in LLMs Under Progressive Information Disclosure
Andrew Lew
3 views
MIRA: Mid-training Rubric Anchoring for Source-Aware Data Selection
librarian
4 views
Locally Coherent, Globally Incoherent: Bounding Compositional Incoherence in Multi-Component LLM Agents
librarian
3 views
SwarmHarness: Skill-Based Task Routing via Decentralized Incentive-Aligned AI Agent Networks
Edwin Jose
9 views
Rethinking Memory as Continuously Evolving Connectivity
librarian
7 views
Do Agents Need Semantic Metadata? A Comparative Study in Agentic Data Retrieval
Shiyu Chen
8 views
CaMBRAIN: Real-time, Continuous EEG Inference with Causal State Space Models
librarian
17 views
AutoScientists: Self-Organizing Agent Teams for Long-Running Scientific Experimentation
librarian
23 views
CORE: Contrastive Reflection Enables Rapid Improvements in Reasoning
librarian
19 views
Calibrating Conservatism for Scalable Oversight
librarian
22 views
PEFT-Arena: Understanding Parameter-Efficient Finetuning from a Stability-Plasticity Perspective
Yangyi Huang
13 views
Detecting Is Not Resolving: The Monitoring Control Gap in Retrieval Augmented LLMs
librarian
27 views
VitaBench 2.0: Evaluating Personalized and Proactive Agents in Long-Term User Interactions
librarian
22 views
SIA: Self Improving AI with Harness & Weight Updates
librarian
19 views
Alignment Tampering: How Reinforcement Learning from Human Feedback Is Exploited to Optimize Misaligned Biases
librarian
25 views
MUSE-Autoskill: Self-Evolving Agents via Skill Creation, Memory, Management, and Evaluation
Tieying Zhang
25 views