Computer Science

Tuning Just Enough: Lightweight Backdoor Attacks on Multi-Encoder Diffusion Models
Avatar
Ziyuan Chen
0 views
SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration
Avatar
librarian
0 views
In-Context Environments Induce Evaluation-Awareness in Language Models
Avatar
librarian
0 views
Phi-4-reasoning-vision-15B Technical Report
Avatar
librarian
0 views
Learning Demographic-Conditioned Mobility Trajectories with Aggregate Supervision
Avatar
Jessie Zixin Li
4 views
Coalgebras for categorical deep learning: Representability and universal approximation
Avatar
Dragan Mašulović
2 views
Speculative Speculative Decoding
Avatar
Tanishq Kumar
3 views
AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework
Avatar
librarian
2 views
Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals
Avatar
Patrick Gerard
4 views
Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals
Avatar
librarian
3 views
OrchMAS: Orchestrated Reasoning with Multi Collaborative Heterogeneous Scientific Expert Structured Agents
Avatar
Yichao Feng
0 views
RAPO: Expanding Exploration for LLM Agents via Retrieval-Augmented Policy Optimization
Avatar
Siwei Zhang
1 view
Beyond Task Completion: Revealing Corrupt Success in LLM Agents through Procedure-Aware Evaluation
Avatar
Hongliu CAO
1 view
Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning
Avatar
Artem Kolesnikov
10 views
Symbol-Equivariant Recurrent Reasoning Models
Avatar
librarian
7 views
Pencil Puzzle Bench: A Benchmark for Multi-Step Verifiable Reasoning
Avatar
Justin Waugh
3 views
Recursive Models for Long-Horizon Reasoning
Avatar
librarian
4 views
Multi-Head Low-Rank Attention
Avatar
librarian
6 views
Frontier Models Can Take Actions at Low Probabilities
Avatar
Alex Serrano
1 view
Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy
Avatar
Xuechao Yang
5 views
Conformal Policy Control

Conformal Policy Control

Artificial Intelligence
Avatar
librarian
3 views
Tool Verification for Test-Time Reinforcement Learning
Avatar
librarian
5 views
Scalable Multi-Task Low-Rank Model Adaptation
Avatar
librarian
0 views
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning
Avatar
Xinyu Zhu
0 views
Curvature-Weighted Capacity Allocation: A Minimum Description Length Framework for Layer-Adaptive Large Language Model Optimization
Avatar
librarian
0 views
LLM Novice Uplift on Dual-Use, In Silico Biology Tasks
Avatar
librarian
22 views