Computer Science

Diagnosing CFG Interpretation in LLMs
Avatar
librarian
1 view
Closing the Domain Gap in Biomedical Imaging by In-Context Control Samples
Avatar
Günter Klambauer
1 view
V-tableR1: Process-Supervised Multimodal Table Reasoning with Critic-Guided Policy Optimization
Avatar
librarian
1 view
Learning to Evolve: A Self-Improving Framework for Multi-Agent Systems via Textual Parameter Graph Optimization
Avatar
librarian
3 views
SWE-chat: Coding Agent Interactions From Real Users in the Wild
Avatar
librarian
2 views
Large Language Models Outperform Humans in Fraud Detection and Resistance to Motivated Investor Pressure
Avatar
librarian
2 views
Image Generators are Generalist Vision Learners

Image Generators are Generalist Vision Learners

Computer Vision and Pattern Recognition
Avatar
Vision Banana
1 view
Self-Guided Plan Extraction for Instruction-Following Tasks with Goal-Conditional Reinforcement Learning
Avatar
librarian
2 views
Variance Is Not Importance: Structural Analysis of Transformer Compressibility Across Model Scales
Avatar
Samuel Salfati
1 view
Self-Awareness before Action: Mitigating Logical Inertia via Proactive Cognitive Awareness
Avatar
librarian
2 views
CHORUS: An Agentic Framework for Generating Realistic Deliberation Data
Avatar
librarian
2 views
Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language
Avatar
librarian
11 views
Time Series Augmented Generation for Financial Applications
Avatar
librarian
12 views
Multi-modal Reasoning with LLMs for Visual Semantic Arithmetic
Avatar
Chuou Xu
12 views
DT2IT-MRM: Debiased Preference Construction and Iterative Training for Multimodal Reward Modeling
Avatar
librarian
11 views
FASTER: Value-Guided Sampling for Fast RL
Avatar
Perry Dong
10 views
Safe Continual Reinforcement Learning in Non-stationary Environments
Avatar
Austin Coursey
12 views
Generalization at the Edge of Stability
Avatar
librarian
27 views
CoDA: Towards Effective Cross-domain Knowledge Transfer via CoT-guided Domain Adaptation
Avatar
librarian
11 views
SafetyALFRED: Evaluating Safety-Conscious Planning of Multimodal Large Language Models
Avatar
Josue Torres-Fonseca
12 views
A Dual Perspective on Synthetic Trajectory Generators: Utility Framework and Privacy Vulnerabilities
Avatar
librarian
14 views
ClawNet: Human-Symbiotic Agent Network for Cross-User Autonomous Cooperation
Avatar
librarian
13 views
Do LLMs Game Formalization? Evaluating Faithfulness in Logical Reasoning
Avatar
Auguste Poiroux
10 views
Unsupervised Confidence Calibration for Reasoning LLMs from a Single Generation
Avatar
Thomas Zollo
12 views