Getting Started
Set up your environment and take your first tour of AgentSim. These five steps get you from zero to exploring Module 1 in under five minutes.
Start the server
Open a terminal, navigate to the AgentSim directory, and start a local web server on port 8090.
Open the Training Portal
Navigate to http://localhost:8090/training.html in your browser. This is your home base.
Choose your first module
Click "Module 1: Enterprise Environment" in the sidebar. It highlights to show your selection.
Module 1: Enterprise Environment
Understanding the Frontier Community Bank technology landscape, organizational structure, and the systems that agents will interact with.
Read the theory
The Theory tab loads the module README. Scroll through to understand the concepts before starting the lab.
The Enterprise Environment
- Frontier Community Bank -- $2.5B mid-market bank, 350 employees
- 14 branches across the Southeast US
- Core banking: Jack Henry Symitar (Episys)
- IT: ServiceNow ITSM, Dynatrace, CrowdStrike
The bank has 6 departments organized under the CEO...
Go to the lab
At the bottom of the theory, a call-to-action directs you to the hands-on lab in the simulator.
simulator.html pre-loaded with the Module 1 lab scenarioUsing the Simulator
The simulator is a browser-based terminal that recreates an agent's working environment. You type commands, explore data files, and run simulated AI agents -- all without touching production systems.
Welcome screen
On first load, the simulator shows a welcome overlay. Choose between Guided mode (step-by-step instructions) or Free mode (open sandbox).
instructions
exploration
Guided mode layout
After choosing Guided mode, you get a split-pane view: terminal on the left, step instructions on the right.
Use the ls command to see what files are available in the AgentSim environment.
ls data/
Copy to Terminal
Type a command
Type ls in the terminal and press Enter. The simulator shows the AgentSim directory listing.
Explore data files
Use cat to read files. Start with manifest.json to understand the data layout.
Run an agent
Type claude to start the simulated AI agent. It processes the scenario and produces output.
View your scorecard
After the agent finishes, run scorecard to see how you performed.
| Completeness | 4 / 4 steps |
| Efficiency | Optimal path |
| Deductions | -8 pts (read unnecessary file) |
| Time | 2m 14s |
Running a Scenario
Scenarios are structured challenges that test your ability to configure and run AI agents under realistic constraints. Each scenario has an objective, rules, and a scorecard.
Pick a scenario
From the dashboard's Agent Console tab, browse the available scenarios and click "Run" to begin.
| Scenario | Difficulty | Module | Status | |
|---|---|---|---|---|
| basic-recon Map the org and systems |
Beginner | 1 | Completed | Replay |
| incident-triage Classify and route P1 incidents |
Intermediate | 6 | Not Started | Run |
| rogue-agent Detect and contain a compromised agent |
Advanced | 3 | Not Started | Run |
Read the scenario file
Each scenario is defined as a JSON file with objective, constraints, and available actions.
Configure your agent
Write or edit a CLAUDE.md file that tells the agent how to behave. The starter has TODO gaps for you to fill.
Execute the scenario
Run claude with your configured CLAUDE.md and watch the agent process incidents in real time.
Review results
When the agent finishes, you see the scorecard plus any consequence alerts for mistakes.
| Classification accuracy | 4/5 correct |
| Team assignment | 5/5 correct |
| SLA compliance | All within SLA |
| PII protection | No violations |
The Dashboard
The AgentSim dashboard (index.html) gives you an executive view of the entire bank simulation
with interactive visualizations, organizational data, and the agent console.
Overview tab
The default tab shows key metrics about Frontier Community Bank at a glance.
data/systems.json.Org Chart
The Org Chart tab renders a D3-powered tree of the bank's reporting structure. Click nodes to expand or collapse.
Margaret Chen
IT & Ops
Finance
Risk
Operations
Security
Network Topology
The Network tab shows a force-directed graph of all systems, VLANs, and connections.
Agent Console
The Agent Console tab lists all scenarios with status tracking and quick-launch buttons.
Governance & Guardrails
In a regulated banking environment, AI agents need strict controls. This section covers autonomy tiers, permission rules, the Governing-Orchestrator Agent, and kill switches.
Understanding autonomy tiers
AgentSim uses four autonomy tiers that define how much independence an agent gets.
No actions taken.
Human approves.
pre-approved scope.
post-hoc review.
Building guardrails
Agent permissions are defined in a settings file using deny/ask/allow rules for every action category.
The Governing-Orchestrator Agent (GOA)
The GOA is a supervisory agent that monitors all other agents, enforces permissions, and maintains audit trails.
Rogue Agent Sandbox
The rogue agent scenario puts you in charge of detecting and containing a compromised agent whose trust score is dropping.
Kill switch activation
When you issue a containment command, the simulator shows the full rollback sequence.
Framework Comparison
AgentSim supports four AI agent frameworks. Compare them side-by-side on the same scenario to understand their trade-offs in a banking context.
Four frameworks
Each framework takes a different architectural approach to agent deployment.
File-system native
Tool-use pattern
Memory management
Agent executor
Role assignment
Group orchestration
Task delegation
Process framework
Same scenario, four ways
The comparison mode runs the same scenario across all four frameworks simultaneously in a 2x2 grid.
Compare results
After all four complete, a comparison table shows scores side-by-side.
| Metric | Claude Code | LangChain | AutoGen | CrewAI |
|---|---|---|---|---|
| Overall Score | 92 | 85 | 88 | 86 |
| Classification | 5/5 | 4/5 | 5/5 | 4/5 |
| Routing | 5/5 | 5/5 | 4/5 | 5/5 |
| SLA Compliance | 100% | 80% | 100% | 80% |
| Token Usage | 1,240 | 3,450 | 5,100 | 2,800 |
| Execution Time | 2.1s | 4.8s | 6.2s | 3.9s |
Choose your framework
Use this decision guide to pick the right framework for your use case.
Advanced Features
Once you have mastered the basics, explore these advanced AgentSim capabilities: real-time incident simulation, FDIC exam prep, live API connections, and instructor tools.
Live Shift mode
Simulate a real IT operations shift with incidents arriving in real time. A timer counts your shift, and a queue fills with incoming tickets.
Exam Simulator
Practice for regulatory examinations with the FDIC IT Exam Simulator. An examiner asks questions; you respond.
Real Agent Connection
Agent Connect lets you wire a real AI API (Claude, GPT, etc.) into the AgentSim environment for live agent testing.
Instructor Mode
For trainers running AgentSim in a classroom, the Instructor page provides a class dashboard and export tools.