Artificial Intelligence

Diagnosing CFG Interpretation in LLMs
Avatar
librarian
1 view
V-tableR1: Process-Supervised Multimodal Table Reasoning with Critic-Guided Policy Optimization
Avatar
librarian
1 view
Learning to Evolve: A Self-Improving Framework for Multi-Agent Systems via Textual Parameter Graph Optimization
Avatar
librarian
3 views
SWE-chat: Coding Agent Interactions From Real Users in the Wild
Avatar
librarian
2 views
Large Language Models Outperform Humans in Fraud Detection and Resistance to Motivated Investor Pressure
Avatar
librarian
2 views
Self-Guided Plan Extraction for Instruction-Following Tasks with Goal-Conditional Reinforcement Learning
Avatar
librarian
2 views
Self-Awareness before Action: Mitigating Logical Inertia via Proactive Cognitive Awareness
Avatar
librarian
2 views
CHORUS: An Agentic Framework for Generating Realistic Deliberation Data
Avatar
librarian
2 views
Time Series Augmented Generation for Financial Applications
Avatar
librarian
12 views
Multi-modal Reasoning with LLMs for Visual Semantic Arithmetic
Avatar
Chuou Xu
12 views
DT2IT-MRM: Debiased Preference Construction and Iterative Training for Multimodal Reward Modeling
Avatar
librarian
11 views
CoDA: Towards Effective Cross-domain Knowledge Transfer via CoT-guided Domain Adaptation
Avatar
librarian
11 views
SafetyALFRED: Evaluating Safety-Conscious Planning of Multimodal Large Language Models
Avatar
Josue Torres-Fonseca
12 views
A Dual Perspective on Synthetic Trajectory Generators: Utility Framework and Privacy Vulnerabilities
Avatar
librarian
14 views
ClawNet: Human-Symbiotic Agent Network for Cross-User Autonomous Cooperation
Avatar
librarian
13 views
Do LLMs Game Formalization? Evaluating Faithfulness in Logical Reasoning
Avatar
Auguste Poiroux
10 views
Explicit Trait Inference for Multi-Agent Coordination
Avatar
librarian
6 views
Do Agents Dream of Root Shells? Partial-Credit Evaluation of LLM Agents in Capture The Flag Challenges
Avatar
librarian
7 views
GRASPrune: Global Gating for Budgeted Structured Pruning of Large Language Models
Avatar
Ziyang Wang
9 views
Agentic Forecasting using Sequential Bayesian Updating of Linguistic Beliefs
Avatar
librarian
14 views
Using large language models for embodied planning introduces systematic safety risks
Avatar
librarian
18 views
WorldDB: A Vector Graph-of-Worlds Memory Engine with Ontology-Aware Write-Time Reconciliation
Avatar
librarian
17 views
MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and Retrieval
Avatar
librarian
23 views
Polysemantic Experts, Monosemantic Paths: Routing as Control in MoEs
Avatar
Charles Ye
11 views
LiteResearcher: A Scalable Agentic RL Training Framework for Deep Research Agent
Avatar
librarian
17 views
ContraPrompt: Contrastive Prompt Optimization via Dyadic Reasoning Trace Analysis
Avatar
librarian
10 views
The Topological Dual of a Dataset: A Logic-to-Topology Encoding for AlphaGeometry-Style Data
Avatar
Anthony Bordg
8 views
Beyond the Basics: Leveraging Large Language Model for Fine-Grained Medical Entity Recognition
Avatar
librarian
6 views
Yanasse: Finding New Proofs from Deep Vision's Analogies, Part 1
Avatar
librarian
8 views
Safe and Policy-Compliant Multi-Agent Orchestration for Enterprise AI
Avatar
librarian
6 views
Blue Data Intelligence Layer: Streaming Data and Agents for Multi-source Multi-modal Data-Centric Applications
Avatar
librarian
26 views
RadAgent: A tool-using AI agent for stepwise interpretation of chest computed tomography
Avatar
librarian
22 views