New 30B-A3B Reasoning Model Achieves Gold-Medal Performance in Olympiads

A recent paper from Hugging Face introduces a 30B-A3B reasoning model that demonstrates gold-medal-level performance in both mathematical and physics competitions, including the International Mathematical Olympiad (IMO) and International Physics Olympiad (IPhO). The model, named SU-01, employs a systematic approach that includes a reverse-perplexity curriculum, two-stage reinforcement learning, and test-time scaling to enhance its proof-search capabilities. Trained on 340,000 sub-8K-token trajectories and refined through 200 reinforcement learning steps, SU-01 can tackle complex problems with trajectories exceeding 100,000 tokens. The authors emphasize the model's strong generalization abilities beyond traditional mathematics and physics domains, and they have open-sourced the code and model for public access.

Read Full Article

View All For This Day

OpenAIGenerative AI+2

OpenAIGenerative AIStartup FundingAI Investment

OpenClaw Creator Invests $1.3M in OpenAI Tokens Within a Month

The creator of OpenClaw has reportedly spent $1.3 million on OpenAI tokens over the course of 30 days, highlighting significant financial backing for the project. This substantial investment reflects the growing interest in generative AI technologies and their applications. The community has responded with 122 comments on the announcement, indicating a lively discussion around the implications of such investments in AI.

Generative AIAI Coding+2

Generative AIAI CodingEnterprise AdoptionSlack

Salesforce Plans $300 Million Investment in Anthropic Tokens to Enhance AI Coding Capabilities

Salesforce CEO Marc Benioff announced the company's intention to spend $300 million on Anthropic tokens in 2026, primarily aimed at improving coding efficiency. During an appearance on the All-In podcast, Benioff praised AI coding agents and indicated that this investment would significantly reduce development costs at Salesforce. As one of Anthropic's largest potential commercial accounts, Salesforce's spending could amplify Anthropic's revenue, which surged from $9 billion to $30 billion due to enterprise adoption of its AI model, Claude. Additionally, Benioff revealed ongoing efforts to integrate advanced coding functionalities into Slack, enhancing its capabilities with over 30 new AI features powered by Claude. This follows Salesforce's substantial productivity gains from AI agents, which have reduced its support workforce by over 40%.

The Next Web AIRead →

Startup FundingRobotics+2

Startup FundingRoboticsCybersecurityBiotech

Anduril Industries Dominates Weekly Funding Round with $5 Billion Raise

Anduril Industries has emerged as the leader in this week's top startup funding rounds, securing $5 billion in a Series H financing round led by Andreessen Horowitz and Thrive Capital, bringing its total valuation to $61 billion. Other significant funding rounds include VoltaGrid, which raised $775 million for mobile natural gas generators, and Mind Robotics, which secured $400 million for its AI-enabled industrial robotics platform. Cowboy Space closed $275 million for its rocket and satellite infrastructure, while indoor farming startup Oishii raised $150 million. Additional notable rounds included Exaforce with $125 million for cybersecurity solutions and Create Medicines with $122 million for biotech advancements.

Crunchbase NewsRead →

SemiconductorsArtificial Intelligence+2

SemiconductorsArtificial IntelligenceChip-MakingASML

ASML and Tata Electronics Collaborate to Enhance India's Semiconductor Industry

ASML Holding NV has announced a partnership with Tata Electronics to advance India's semiconductor capabilities. At a news conference in Eindhoven, CEO Christophe Fouquet highlighted that ASML's orders in the fourth quarter surpassed analysts' expectations, driven by the increasing demand for advanced chip-making machines fueled by the rapid development of artificial intelligence infrastructure.

Bloomberg TechnologyRead →

AIRegulation+2

AIRegulationFintechInsider Trading

US Government Leverages AI to Combat Insider Trading in Prediction Markets

The Commodity Futures Trading Commission (CFTC) is intensifying its scrutiny of prediction markets, particularly targeting suspicious trading activities on platforms like Polymarket. Agency chairman Michael Selig announced that the CFTC is actively pursuing traders using virtual private networks to access these offshore markets despite their U.S. ban. Selig emphasized the agency's commitment to employing AI tools to analyze trading patterns and detect potential market manipulation, noting that AI can provide valuable insights to guide investigations and enforcement actions. The CFTC is also expanding its workforce to manage the increased demands of monitoring these markets.

Ars TechnicaRead →

AI SafetyOpenAI+2

AI SafetyOpenAIElon MuskSam Altman

Musk v. Altman Trial Concludes as Jury Weighs Credibility and AI Control

In the concluding week of the Musk v. Altman trial, the credibility of Elon Musk and Sam Altman was fiercely debated. Altman faced intense scrutiny regarding allegations of dishonesty and self-dealing, while he countered by portraying Musk as a power-hungry individual intent on dominating the development of artificial general intelligence (AGI). OpenAI displayed a trophy symbolizing AI safety amidst the legal arguments, with Musk's legal team asserting that Altman and Greg Brockman breached commitments regarding Musk's donations. Conversely, OpenAI's defense maintained that no such promises were made, and accused Musk of seeking to undermine a rival AI firm, xAI. The jury is set to begin deliberations, with potential implications for OpenAI's IPO and Musk's plans for xAI, which is anticipated to go public soon.

MIT Technology ReviewRead →