Built an open-source way to give GPT agents a real browser (not headless)
An open-source project called Otto enables GPT agents to interact with a real Chrome browser over a secure relay, addressing limitations of cloud-browser subscriptions and headless Chrome setups. This allows for more realistic logged-in flows. The project is MIT-licensed and includes an MCP server and JSON CLI for integration with tool-use loops. You can use it to give your GPT agents a real browser experience.
Key takeaways
- Enables GPT agents to drive a real Chrome tab securely.
- Deterministic code handles clicking and extraction, optimizing token usage.
- MIT-licensed, with MCP server and JSON CLI for tool integration.
An open-source project called Otto enables GPT agents to interact with a real Chrome browser over a secure relay, addressing limitations of cloud-browser subscriptions and headless Chrome setups. This allows for more realistic logged-in flows. The project is MIT-licensed and includes an MCP server and JSON CLI for integration with tool-use loops. You can use it to give your GPT agents a real browser experience.
Key takeaways
- Enables GPT agents to drive a real Chrome tab securely.
- Deterministic code handles clicking and extraction, optimizing token usage.
- MIT-licensed, with MCP server and JSON CLI for tool integration.