AI Agent Index
Devin logo

Devin

4.5/ 5

by Cognition AI

FeaturedMCPEditorial review
Visit site →

Fully autonomous AI software engineer that plans, codes, tests, and submits pull requests independently. Core plan from $20/month at $2.25/ACU. Cognition acquired Windsurf for ~$250M in 2026.

Devin is a fully autonomous AI software engineer built by Cognition AI, founded in 2023 and backed by Founders Fund, Elad Gil, and others with over $174M raised. Where tools like Cursor and GitHub Copilot augment human developers in real time, Devin operates independently -- you assign it a task via Slack, GitHub issue, or the Devin web interface, and it plans, codes, tests, debugs, and submits a pull request without supervision. Devin operates in a persistent sandboxed cloud environment with its own terminal, code editor, and browser. It can read documentation, search Stack Overflow, install dependencies, run tests, interpret error logs, and iterate autonomously. The knowledge base feature lets it learn your codebase conventions and architectural patterns, improving output quality over time. Devin 2.0, launched December 2025, dropped entry pricing from $500/month to $20/month and introduced parallel task execution -- spinning up multiple Devin instances simultaneously on a single codebase. Cognition AI acquired Windsurf (formerly Codeium) for approximately $250 million in 2026, putting Devin and Windsurf under the same parent company. One positions autonomous agent execution, the other IDE-first agentic coding -- a strategic consolidation of both ends of the AI coding spectrum. Pricing uses Agent Compute Units (ACUs) -- Cognition''s normalised measure of compute resources (virtual machine time, model inference, networking). One ACU equals approximately 15 minutes of active Devin work. Core plan: $20/month, pay-as-you-go at $2.25/ACU. Team plan: $500/month with 250 ACUs included at $2.00/ACU plus API access. Enterprise: custom pricing with private cloud deployment, SSO, and dedicated SLA. Devin for Terminal launched April 2026 -- a CLI agent that keeps running after you close your laptop, extending Devin''s autonomous execution to terminal-first workflows. Cognition reports Devin successfully completes approximately 75% of assigned tasks, with the remaining 25% requiring human intervention. Best suited to well-scoped, testable tasks: bug fixes with clear reproduction steps, test coverage expansion, library migrations, boilerplate generation, and documentation updates. G2 rating: 4.2/5. Trustpilot (Cognition): 3.0/5 as of March 2026 -- recurring themes include task failures without clear explanation and compute limits at the $20/month tier.

Pricing

usage-based · $20

Segment

enterprise

Setup

moderate

Verified

May 2, 2026

Capabilities

autonomousagentic-codinggit-nativemulti-file-editingterminal-agentcode-generation

Pros & Limitations

Editorial assessment

Pros

  • Highest autonomous execution capability among coding agents -- Cognition reports 75% task completion on well-defined engineering tasks, handling the full loop from research to implementation to pull request without developer supervision
  • Parallel task execution via Devin 2.0 -- multiple Devin instances run simultaneously on the same codebase, allowing engineering teams to parallelise backlog items that would otherwise queue sequentially
  • Persistent sandboxed environment with terminal, browser, and code editor -- Devin reads live documentation, searches Stack Overflow, installs dependencies, and runs tests in context rather than generating code blindly

Limitations

  • ACU billing creates cost unpredictability -- the $20/month Core plan is pay-as-you-go at $2.25/ACU where 1 ACU equals ~15 minutes of active work, so complex multi-hour tasks can push monthly spend well past the entry price before teams realise
  • Performance degrades on ambiguous requirements and complex architectural decisions -- Devin excels on bounded, well-specified tasks but struggles with open-ended features and unusual codebases, with a 25% failure rate requiring human intervention
  • Slower feedback loop than IDE tools -- Devin operates asynchronously, meaning tasks take minutes to hours rather than the near-instant responses developers expect from Cursor or Copilot, making it unsuitable for tight iteration or pair programming

Technical Details

Deployment
webslack
Model architectureProprietary
Avg setup time< 5 minutes (Slack/web app, GitHub connection, first task assigned)
Autonomous rate75% task completion rate on well-defined engineering tasks (Cognition internal benchmark)
MCP compatibleYes
Integrations
GitHubSlackVS CodeJira
Security
SOC 2 Type II

Similar agents

Rating

4.5/ 5

Editorial score

Industries

EnterpriseSaaSDevToolsB2BStartups

Leave a review

Never displayed publicly.

Agent Stacks

See workflow stacks that feature Devin.

Compare

Related Content

Is this your tool?

Claim this listing to update your details and get a Verified badge.

Claim listing →