Recursive Agent Harness Enhances Long-Context Reasoning in AI Coding Agents

arXiv AI · Elias Lumer, Sahil Sen, Kevin Paul et al. · Saturday, June 13, 2026

Recursive language models (RLMs) have shown that recursion over model calls is an effective technique for long-context reasoning. Building on recent developments in production coding agents, particularly Anthropic's dynamic workflows, the authors introduce the Recursive Agent Harness (RAH), which employs a full agent harness equipped with filesystem tools, code execution, and planning capabilities. Evaluated against existing baselines on the Oolong-Synthetic dataset, RAH raised the accuracy of the Codex coding-agent baseline from 71.75% to 81.36%, a gain of 9.61 percentage points. With a Claude Sonnet 4.5 backbone, RAH reached 89.77% accuracy, which the summary presents as evidence that harness recursion improves coding-agent performance.
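
To make the general idea of recursion over calls concrete, here is a minimal sketch. It is not the RAH implementation and not the method from the paper: the names `recursive_call`, `solve_leaf`, `combine`, and `max_chars` are hypothetical, and a plain Python function stands in for what would be a model or agent call. The sketch assumes only that a long input can be split, each piece handled by a sub-call, and the partial results merged.

```python
from typing import Callable, List


def recursive_call(
    context: str,
    solve_leaf: Callable[[str], int],
    combine: Callable[[List[int]], int],
    max_chars: int = 1000,
) -> int:
    """Split `context` on line boundaries until each piece fits in `max_chars`,
    answer each piece with `solve_leaf`, and merge the answers with `combine`."""
    if len(context) <= max_chars:
        return solve_leaf(context)

    lines = context.splitlines(keepends=True)
    if len(lines) < 2:
        # A single oversized line cannot be split further here; treat it as a leaf.
        return solve_leaf(context)

    mid = len(lines) // 2
    left = "".join(lines[:mid])
    right = "".join(lines[mid:])
    return combine([
        recursive_call(left, solve_leaf, combine, max_chars),
        recursive_call(right, solve_leaf, combine, max_chars),
    ])


def count_errors(chunk: str) -> int:
    # Hypothetical stand-in for a model or agent call on a short context.
    return sum(1 for line in chunk.splitlines() if "ERROR" in line)


# Synthetic "long context": 500 log lines, every seventh one an error.
log = "".join(
    f"{'ERROR' if i % 7 == 0 else 'INFO'} event {i}\n" for i in range(500)
)
total = recursive_call(log, count_errors, sum, max_chars=200)
print(total)
```

Because the split happens on line boundaries and the per-chunk answers are simply summed, the recursive result matches what a single pass over the whole log would return. A real harness would replace `count_errors` with an agent invocation, and its merge step would generally be less trivial than a sum.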
