Multimodal Instruction Attacks on Agent Skill Scanners Highlight Security Blind Spots

A recent empirical study reveals that current defenses in LLM-based systems inadequately address multimodal hidden instruction attacks targeting agent skills. Researchers Xiaojun Jia, Jie Liao, Simeng Qin, and colleagues developed SkillCamo, a method that embeds malicious instructions within images while modifying accompanying documentation to appear legitimate. This approach exploits the limitations of existing skill scanners, which primarily focus on textual signals. To counter this threat, the team introduced ExecScan, a multimodal scanning module designed to extract intent, reconstruct behavior, and assess risks associated with skill artifacts. Extensive testing demonstrates that these image-concealed malicious instructions can evade traditional defenses, while ExecScan enhances scanning efficacy by integrating visual content analysis.

Read Full Article

View All For This Day

Generative AINLP+2

Generative AINLPHealthcareAI Chemist

Near-Autonomous AI Chemist Enhances Key Drug-Making Reaction in Medicinal Chemistry

OpenAI, in collaboration with Molecule.one, has demonstrated how a near-autonomous AI chemist powered by GPT-5.4 has successfully improved a crucial reaction in drug manufacturing. This advancement represents a significant step forward in medicinal chemistry research, showcasing the potential of AI technologies to enhance complex chemical processes.

AI RegulationLeadership+2

AI RegulationLeadershipAnthropicGenerative AI

US Order on Anthropic Models Marks New Chapter in AI Regulation

Dario Amodei, co-founder and CEO of Anthropic, discussed the company's innovative leadership approach during an interview on 'The Circuit with Emily Chang.' He emphasized the importance of focusing on big-picture discussions, organizational culture, and strategic input on research, rather than traditional management of senior leadership roles. This shift in leadership style comes as the US signals a new era for AI controls, with Anthropic at the forefront of these developments.

Bloomberg TechnologyRead →

Generative AITransformers+2

Generative AITransformersNLPFine-Tuning

Introducing GLM-5.2: A New Model Designed for Long-Horizon Tasks

Hugging Face has unveiled GLM-5.2, an advanced generative language model specifically optimized for long-horizon tasks. This model aims to enhance performance in complex scenarios where sustained reasoning and memory are required. The blog details the methodologies implemented in GLM-5.2, including improvements in architecture and training techniques to support extended interactions, making it a valuable tool for applications demanding high-level cognitive capabilities.

Hugging FaceRead →

LLMNLP+2

LLMNLPOpen SourceBenchmark

Stanford EDGAR Filings Dataset Transforms U.S. Corporate Disclosures into Efficient Pretraining Data

The Stanford EDGAR Filings Dataset (SEFD) addresses the growing scarcity of clean long-context documents for training large language models (LLMs) by providing an open reconstruction of SEC filings into layout-faithful MultiMarkdown. This dataset includes a wide range of financial documents such as audited financial statements and market-moving event filings, making them accessible for long-context pretraining data, financial reasoning, and compliance. SEFD-v1 consists of 152 billion tokens and features less than 0.1% overlap with Common Crawl-derived datasets. Additionally, two benchmarks, EDGAR-Forecast and EDGAR-OCR, are introduced to evaluate numerical forecasting and transcription of complex financial tables, respectively.

arXiv AIRead →

Startup FundingGenerative AI+2

Startup FundingGenerative AIAICuspAI

Jeff Bezos Invests in CuspAI, Elevating Valuation to $2.6 Billion with $400 Million Funding

CuspAI, a UK-based AI startup, has secured $400 million in funding, significantly boosting its valuation to $2.6 billion. This financing round, which has garnered support from notable investors including Jeff Bezos, represents a substantial increase in the company's worth in just two years since its inception.

Financial Times TechRead →

AI GovernanceCybersecurity+2

AI GovernanceCybersecurityPublic OpinionRegulation

Anthropic's Claude Fable 5 Sparks Debate on AI Governance and National Security

Anthropic's ongoing discussions with U.S. officials regarding Claude Fable 5 reflect a significant shift in AI governance, moving focus from alignment and privacy to cybersecurity, national security, and data sovereignty. The company asserts that complete jailbreak resistance is unachievable for any model provider, yet emphasizes that the real concern lies in the widening gap between private innovation and public oversight. As companies strive to develop advanced AI systems, governments struggle to establish evaluation and accountability standards. Anthropic's recent public-opinion survey of 51,993 Americans reveals a desire for AI to enhance various aspects of life, tempered by fears of job displacement, misinformation, and privacy erosion. The survey indicates that 71% of Americans support government involvement in AI regulation, highlighting a significant trust deficit towards AI companies.

Forbes InnovationRead →