Featured News

How Autonomous AI Systems Are Testing Governance in Physical Environments

2026-05-28 by AICC
Autonomous AI Systems

Autonomous AI systems are beginning to move beyond software environments and into warehouses, delivery networks, and public spaces. The development is drawing attention to whether current AI rules cover systems that operate in physical environments.

Most existing AI governance frameworks have focused on online harms and model outputs, including bias, misinformation, and harmful content. Embodied AI systems carry risks in physical environments, where failures can affect infrastructure, property, or human safety.

📋 Singapore's New AI Governance Framework

Singapore's Infocomm Media Development Authority published version 1.5 of its Model AI Governance Framework for Agentic AI on May 20. The framework sets out guidance for organisations deploying AI agents that can plan, make decisions, and take actions across multiple steps to complete user-defined goals.

The framework says agents can interact with tools, external systems, and other agents, including systems that update databases, write files, control devices, or perform transactions. It lists access controls, monitoring, and human approval among governance measures for deployment.

🤖 AI Moves Into Physical Systems

At an AI summit in Singapore last week, discussions around robotics and embodied AI focused on operational safety issues more commonly associated with aviation, industrial systems, and critical infrastructure oversight than conventional software regulation.

Speakers also discussed whether autonomous systems can operate safely and reliably in unpredictable real-world environments over extended periods.

Dr. Ya-Qin Zhang, founding dean of the Institute for AI Industry Research at Tsinghua University, said embodied AI systems amplify risks already associated with autonomous software. He said failures can directly affect transport systems, drones, logistics networks, and critical infrastructure.

"Any risk in the digital domain will be amplified in the physical domain, and the physical domain will have a physical consequence," Zhang told MLex on the sidelines of the summit.

He added that vehicles, drones, smart grids, and other infrastructure could become exposed as AI systems are embedded more deeply into physical operations.

Speakers discussed reliability, operational monitoring, and post-deployment assurance as governance concerns. Summit discussions pointed to deployment-based governance models built around simulation, telemetry, and iterative testing, rather than one-time certification alone.

📊 Monitoring Becomes a Deployment Issue

Grab, which is piloting autonomous vehicles and delivery robots in Singapore's Punggol district, said deployment governance depends heavily on simulation, testing, and continuous monitoring.

"We do a lot of simulation, we do a lot of testing in closed courses and open courses in order to make sure our robots are reliable," Suthen Thomas Paradatheth, Grab's chief technology officer, said during one of the summit panels.

"Before we scale to hundreds of robots, we make sure we crack it first in simulation and with a few robots," he added.

Grab also pointed to monitoring systems designed to track robot performance and detect unexpected failures after deployment. "There's a long tail of issues that could emerge," Paradatheth said.

⚙️ Key Framework Recommendations:

  • Assess agentic AI use cases based on data access and external system access
  • Evaluate autonomy levels and task complexity
  • Consider scope and reversibility of agent actions
  • Limit agent access to tools and systems
  • Apply least-privilege permissions
  • Define standard operating procedures for agent workflows
  • Set mechanisms to take agents offline when they malfunction

👥 Accountability Spreads Across More Actors

MLex reported that embodied AI systems can involve several parties across development, manufacturing, and deployment. These include AI developers, robotics manufacturers, semiconductor suppliers, and infrastructure operators.

MLex also noted that responsibility can be harder to assign when systems continue adapting after deployment through software updates, telemetry, and operational data.

IMDA says organisations and humans remain accountable for agent actions, even when agents operate autonomously. The framework calls for clear responsibility across the agentic AI value chain, from model and platform providers to deployers, tooling providers, and end users.

Applied Materials said large-scale robotics deployment is also tied to semiconductor economics and systems integration. Om Nalamasu, the company's chief technology officer, said robotics systems will depend on better sensors, energy efficiency, advanced packaging, and computing architectures.

🌏 Regional Approaches to AI Robotics

🇨🇳 China: Zhao Yuli, chief strategy officer of Chinese robotics startup Galbot, said Beijing is prioritising deployment scale and industrial commercialisation through government-backed testbeds, industrial partnerships, and long-term funding initiatives.

Galbot has deployed humanoid robotics systems in retail, warehouse, and pharmaceutical operations in China, including autonomous stores that operate around the clock. Zhao said semi-structured industrial environments are likely to become an early commercialisation path because they offer more controllable operating conditions.

🇯🇵 Japan: Japan is placing more focus on standards-setting, robotics datasets, and safety governance. Professor Yutaka Matsuo of the University of Tokyo's Graduate School of Engineering pointed to an "AI Association" project aimed at collecting 100,000 hours of robotics data to support robotic foundation models.

Matsuo also referred to Japan's AI Safety Institute and the Hiroshima AI Process as part of broader efforts to develop governance standards for embodied AI systems with Singapore and other Asian countries.

🔒 Singapore Sets Out Agent Controls

Singapore's framework sets out four governance areas for agentic AI:

  1. Upfront risk assessment
  2. Human accountability
  3. Technical controls
  4. End-user responsibility

The framework describes them as an iterative process rather than a one-time assessment.

The framework says human oversight has to be adapted for agentic systems because continuous review of all workflows becomes impractical at scale. It recommends human approval at significant checkpoints, including high-stakes actions, irreversible actions, and outlier behaviour.

IMDA also identifies automation bias and alert fatigue as risks when humans supervise capable agents. It recommends auditing oversight through indicators such as human override rates and response times, and using automated real-time monitoring to flag unexpected behaviour.

The framework says users should be told what actions an agent can take, what data it can access, and what responsibilities remain with the user. It also recommends employee training on human-agent interaction, oversight, and the professional skills needed to assess agent outputs.

🏦 Companies Test AI in Regulated Workflows

JPMorgan is implementing AI tools across its global investment banking business, Paul Uren, the bank's Asia Pacific head of investment banking, told Reuters. The bank said the tools help bankers access more information and synthesise it with internal systems. They are also being used to prepare content and support client engagement.

JPMorgan CEO Jamie Dimon told Bloomberg News that the bank would hire more AI specialists and fewer traditional bankers. Reuters reported that global banks are increasing AI investment, reshaping workforces, and changing job roles.

The bank is also among selected organisations permitted by Anthropic to use its Mythos cybersecurity model under a controlled initiative known as Project Glasswing. According to Anthropic, Mythos can detect old vulnerabilities in browsers, infrastructure, and software.

Reuters reported that Goldman Sachs, Citigroup, Bank of America, and Morgan Stanley also have access to, or are testing, Mythos, citing sources and company executives.

💼 Case Study: OCBC Bank of Singapore

IMDA's framework includes a case study from OCBC Bank on source-of-wealth analysis. The system parses income-related documents and drafts a source-of-wealth memo. It does not make credit, onboarding, or risk decisions autonomously. The workflow is limited to task-level autonomy and operates only when triggered by predefined workflows. Human review is required at critical decision points, and final validation remains with designated reviewers.

🏭 Robots Move Into Industrial Use

In Japan, one-third of companies are already using or considering AI-powered robots, according to a Reuters survey conducted by Nikkei Research from May 1 to May 15. The survey contacted 492 companies, with 220 responding on the condition of anonymity.

📈 Survey Results:

  • 4% already use AI robots
  • 5% plan to deploy them
  • 25% are considering deployment
  • 66% have no such plans

Transportation equipment manufacturers were the most active group in the survey, with 80% already using AI robots or considering deployment. By comparison, 94% of wholesale sector respondents said they had no plans to deploy AI robots.

Among companies using, planning to use, or considering AI robots:

  • 71% selected manufacturing as a use case
  • 19% selected dangerous tasks
  • 11% selected customer-facing services

The Japanese government expects AI robots to help address the country's chronic labour shortage and support its position in industrial robotics. Japan is home to robotics companies including Fanuc, Yaskawa Electric, and Kawasaki Heavy Industries, but faces competition from China and the United States in AI-enabled robotics.

🛒 Retail Agents Expand Beyond Search

Walmart has outlined plans to use agentic AI across shopping, employee, supplier, and developer workflows.

In July 2025, the retailer announced plans for four AI-powered "super agents" designed for shoppers, store employees, suppliers and sellers, and software developers. Walmart said these agents would become the main entry point for AI interactions across those groups.

🤖 Walmart's AI Super Agents:

  • Sparky: Shopping assistant (already available in Walmart's app) - will be expanded to reorder items, plan events, and use computer vision to suggest recipes based on fridge contents
  • Associate super agent: For store workers and corporate staff
  • Marty agent: For sellers, suppliers, and advertisers
  • Developer super agent: For testing, building, and launching future AI tools

One of the tools, Sparky, is already available in Walmart's app as a generative AI-powered shopping assistant. Hari Vasudev, Walmart's US chief technology officer, said its expanded version would be able to reorder items and plan events. It would also use computer vision to suggest recipes based on the contents of a shopper's fridge.

The company declined to say whether the agents would replace jobs. Dave Glick, senior vice president of enterprise business systems, said the tools would create new jobs, without giving further details.

(Photo by Growtika)

300+ AI Models for
OpenClaw & AI Agents

Save 20% on Costs