Computation and Language

AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
librarian
2 views
Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning
librarian
31 views
Constrained Entropic Unlearning: A Primal-Dual Framework for Large Language Models
librarian
84 views
Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback
librarian
84 views
ATLAS: Learning to Optimally Memorize the Context at Test Time
librarian
127 views
ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering
librarian
112 views
LoLA: Low-Rank Linear Attention With Sparse Caching
librarian
111 views
DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning
Jiahao Xu
108 views
Learning Composable Chains-of-Thought
librarian
108 views
"KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding
Alkis Koudounas
120 views
THiNK: Can Large Language Models Think-aloud?
Yongan Yu
115 views
Do Large Language Models Excel in Complex Logical Reasoning with Formal Language?
Jin Jiang
113 views
MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems
librarian
109 views
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
librarian
114 views
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space
librarian
109 views
A Federated Splitting Framework for LLMs: Security, Efficiency, and Adaptability
librarian
107 views
VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models
librarian
106 views
BIM-GPT: a Prompt-Based Virtual Assistant Framework for BIM Information Retrieval
Hervé Onguéné
127 views
Learning Dynamics in Continual Pre-Training for Large Language Models
librarian
115 views
ComPO: Preference Alignment via Comparison Oracles
librarian
115 views
Reasoning Models Don't Always Say What They Think
Yanda Chen
128 views
Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages
Hussein Kedir
130 views
Attention Is All You Need
경택 오
190 views
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
yorba
159 views
Attention Is All You Need
Ilya Baimetov
373 views
A Pipeline For Discourse Circuits From CCG
ScienceCast Board
310 views
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text
Yael Flax
317 views
Meta-path Augmented Response Generation
ScienceCast Board
303 views
CliNER 2.0: Accessible and Accurate Clinical Concept Extraction
Sasa Pure
284 views
A Hybrid Architecture for Multi-Party Conversational Systems
priaon-flag
293 views
Analyzing the Structure of Attention in a Transformer Language Model
levymoshe16
312 views
Direct Neural Machine Translation with Task-level Mixture of Experts models
Isidora Tourni
310 views