AI-Powered Multi-Agent System for Automated Specification Verification
Transform natural language system specifications into verified simulation code in seconds—automatically.
Manual specification translation is error-prone, time-consuming, and repetitive
Converting ambiguous natural language text into rigorous state machine logic requires significant engineering effort.
Writing repetitive simulation scaffolding for every requirement slows down development cycles.
No automated way to verify that generated code matches the original specification intent.
Hours spent debugging and validating correctness for each specification change.
See how Spec2Sim-Agent transforms natural language into verified simulation code
Three specialized agents working together with intelligent self-healing
Extracts structured JSON specifications from natural language requirements
Generates complete SimPy simulation code with state machines and test scenarios
Executes code in MCP sandbox and validates against original specification
When verification fails, the Verifier analyzes execution logs and identifies the root cause of errors.
Error details and execution traces are sent back to the Architect with specific suggestions for fixes.
The Architect uses the feedback to regenerate corrected code, learning from previous mistakes.
AI determines if the error is fixable (retry) or indicates a fundamental specification issue (abort).
Main process running three AI agents (Analyst, Architect, Verifier) coordinating workflow and managing the self-healing loop.
Subprocess running MCP server that executes SimPy code in an isolated sandbox and captures all logs.
Processes communicate via Model Context Protocol over stdio, ensuring secure and standardized code execution.
Simulation crashes don't affect agents. Potentially unsafe code runs in a separate isolated environment.
Google ADK with Gemini 2.5 Flash Lite for intelligent agent orchestration
SimPy for discrete-event simulation and state machine modeling
Model Context Protocol (MCP) for sandboxed code execution
Streamlit for real-time visualization and interactive demonstrations
Tested on 3 industrial control systems with 100% success rate
| Demo | Time | Status | Code Lines | Complexity |
|---|---|---|---|---|
| Traffic Light | 17.36s | PASS | 113 | Time-driven FSM |
| BMS Precharge | 79.89s | PASS | 205 | Voltage monitoring |
| Elevator Control | 26.91s | PASS | 136 | Multi-floor logic |
Successfully deployed to Google Cloud Vertex AI Agent Engine
Full Functionality: Dual-process architecture with complete SimPy execution via MCP server. All agents and self-healing loop fully operational.
Demonstration: Single-process deployment showcasing multi-agent conversation flow and explaining system architecture capabilities.
Constraint: Cannot launch subprocesses via subprocess.Popen(), preventing MCP server execution in the default container environment.
Solutions: Deploy to Cloud Run (with process management), GKE (Kubernetes), or separate services for full functionality.