Browser Use
by Browser Use
Open-source Python library for AI browser automation using LLMs and computer vision. Free tier with 10 tasks/month, cloud from $29/month. MCP server included. 90,000+ GitHub stars.
Browser Use is an open-source Python library that lets AI agents control a real browser using LLMs instead of brittle CSS selectors or XPath scripts. Rather than pre-programming every interaction with fragile selector code, agents describe what they want in natural language and Browser Use figures out how to execute it against whatever website it encounters, including sites it has never seen before. With over 90,000 GitHub stars, it is one of the most widely adopted open-source browser automation libraries in the AI agent ecosystem. The library provides a self-healing automation harness: when a website's layout changes, Browser Use adapts because it reasons from the page's current visual and DOM state rather than hardcoded selectors. This makes it well-suited for automating legacy enterprise web applications, government portals, insurance platforms, healthcare systems, and any site without a public API. The cloud platform adds stealth browsers that mimic human browser fingerprints to avoid bot detection, handling canvas, WebGL, and audio fingerprinting automatically. The product line has multiple tiers. The open-source library (MIT license, pip install browser-use) is free and self-hosted, giving direct browser automation access with your own LLM API keys. The cloud platform adds managed infrastructure: Free tier (10 tasks/month, 3 concurrent sessions, advanced stealth), Dev ($29/month, $29 in credits, 25 concurrent sessions), Business and Scaleup tiers for higher volume. Browser Use Box is a 24/7 cloud agent product that runs continuously on your behalf. Browser Use ships both a local MCP server (stdio) and a hosted Cloud MCP server. This means Claude Desktop, Cursor, and other MCP-compatible agents can directly invoke browser automation tools including navigate, fill forms, extract data, manage tabs, and run autonomous agent tasks, without custom integration code. Benchmarks report 97% task accuracy on the Browser Use cloud benchmark. The library supports any LLM via BYOK, including OpenAI, Anthropic, Google Gemini, Groq, and local models via Ollama. Works with n8n, LangChain, and other orchestration platforms. Key limitations: the open-source library requires your own compute and API key management, complex multi-step tasks on JavaScript-heavy or bot-detection sites can still fail, and the cloud platform's usage-based pricing offers less cost predictability than flat-rate enterprise RPA tools.
Pricing
freemium · Free
Segment
b2b
Setup
easy
Verified
May 14, 2026
Capabilities
Pros & Limitations
Editorial assessmentPros
- ✓Self-healing browser automation using LLMs and computer vision instead of brittle CSS selectors or XPath, adapting to website layout changes without code maintenance and making it reliable for automating legacy web apps, government portals, and sites without public APIs
- ✓Native MCP server integration (local stdio and hosted cloud) lets Claude Desktop, Cursor, and other MCP-compatible agents invoke browser automation directly without custom integration code, exposing navigate, fill forms, extract data, and manage tabs as native agent tools
- ✓MIT-licensed open-source library with 90,000+ GitHub stars and BYOK model support covering OpenAI, Anthropic, Gemini, Groq, and local models via Ollama, giving teams full model flexibility and zero vendor lock-in
Limitations
- ⚠Complex multi-step workflows on JavaScript-heavy or bot-detection sites can fail, as the LLM reasoning approach adds latency and cost per task compared to deterministic selector-based automation, and success rates on adversarial sites vary significantly
- ⚠Open-source self-hosting requires managing your own compute, LLM API keys, and browser infrastructure, meaning teams without DevOps resources face meaningful operational overhead compared to fully managed browser automation alternatives
- ⚠Cloud pricing is usage-based and the platform is early-stage, with the free tier limited to 10 tasks/month, and high-volume production workloads face less cost predictability than flat-rate enterprise RPA tools
Technical Details
Similar agents
Rating
Editorial score
Industries
Leave a review
Never displayed publicly.
Agent Stacks
See workflow stacks that feature Browser Use.
Related Content
Is this your tool?
Claim this listing to update your details and get a Verified badge.
Claim listing →