Selected work
Each project carries the same through-line: state the capability, then state the leash. The two production agents lead because they are the only ones with real measured outcomes. The open-source repos are public and verifiable, and the agent-run car-rental business is the operator credential that makes the guardrails philosophy concrete against real money.
Production
2 case studiesFirst-pass labeling agent
An agent that auto-buckets incoming unlabeled data into first-pass categories, so human labelers open a triaged queue instead of raw input, and a person confirms every call.
+25% data grading outputRCA auto-remediation agent
An agent that reads root-cause-analysis tickets and runs scoped terminal commands to resolve known failure modes, autonomous only inside a known, safe, reversible remediation envelope.
+30% data uptimeOpen source
3 case studiesVantage OS
A Claude Code plugin where a coordinator routes every request to the right skill or sub-agent, then gates real-world actions through a QA filter and a permissions-tier system before they reach the human.
10 skills, 8 agents, publicAgent Skills
An optimizer that finds the single highest-leverage move in any context, and an orchestrator that plans a mission into a wave-based DAG of parallel sub-agents, gated by an auditor.
7 agents, standalone, publicPrediction-Market Automation
Scheduled, read-only samplers that build a time-series of Polymarket and Kalshi order books to test whether hourly prediction markets are ever actually market-makeable, with hard no-trade and no-write rails baked into every task.
Safety rail is the point