Computer Science

Self-Adapting Language Models
Avatar
Adam Zweiger
16 views
A Study on Individual Spatiotemporal Activity Generation Method Using
  MCP-Enhanced Chain-of-Thought Large Language Models
Avatar
librarian
14 views
Breaking Bad Molecules: Are MLLMs Ready for Structure-Level Molecular
  Detoxification?
Avatar
Fei-Yue Wang
14 views
Spurious Rewards: Rethinking Training Signals in RLVR
Avatar
Rulin Shao
14 views
LLMail-Inject: A Dataset from a Realistic Adaptive Prompt Injection
  Challenge
Avatar
librarian
18 views
Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven
  Thinking and Visual Drawing

Reinforcing Spatial Reasoning in Vision-Langua...

Computer Vision and Pattern Recognition
Avatar
librarian
20 views
Outside Knowledge Conversational Video (OKCV) Dataset -- Dialoguing over
  Videos

Outside Knowledge Conversational Video (OKCV) ...

Computer Vision and Pattern Recognition
Avatar
librarian
23 views
Multiverse: Your Language Models Secretly Decide How to Parallelize and
  Merge Generation
Avatar
Xinyu Yang
26 views
Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning
Avatar
librarian
25 views
How Do People Revise Inconsistent Beliefs? Examining Belief Revision in
  Humans with User Studies
Avatar
Stylianos Vasileiou
25 views
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction
  and Planning
Avatar
Nicolas Ballas
25 views
VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement
  Learning
Avatar
librarian
33 views
Measuring Data Science Automation: A Survey of Evaluation Tools for AI
  Assistants and Agents
Avatar
Irene Testini
35 views
Cost-Optimal Active AI Model Evaluation
Avatar
librarian
40 views
Decoupling the Image Perception and Multimodal Reasoning for Reasoning
  Segmentation with Digital Twin Representations

Decoupling the Image Perception and Multimodal...

Computer Vision and Pattern Recognition
Avatar
librarian
49 views
Reinforcing Multimodal Understanding and Generation with Dual
  Self-rewards
Avatar
librarian
47 views
GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection
  Behavior
Avatar
librarian
43 views
$τ^2$-Bench: Evaluating Conversational Agents in a Dual-Control
  Environment
Avatar
Victor Barres
44 views
CausalPFN: Amortized Causal Effect Estimation via In-Context Learning
Avatar
Vahid Balazadeh
44 views
Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Avatar
Junhong Shen
45 views
Solving Inequality Proofs with Large Language Models
Avatar
librarian
43 views