Beta
AgentsReading · ~3 min · 76 words deep

Devin

Devin is Cognition AI's autonomous SWE agent · first to claim end-to-end ticket-to-PR automation with browser, shell, and editor access.

Devin on agent leaderboard
TL;DR

Devin is Cognition AI's autonomous SWE agent · first to claim end-to-end ticket-to-PR automation with browser, shell, and editor access.

Level 1

Devin launched March 2024 from Cognition AI (a stealth startup that raised $196M in early rounds). It runs in a cloud sandbox with its own browser, terminal, and editor. You assign a ticket; Devin plans, edits files, runs tests, and opens a PR. Claimed 13.86% resolve rate on SWE-bench Verified at launch · the first public AI agent to break double digits unassisted.

Level 2

Devin pioneered the "agent as cloud employee" pattern. Each Devin session runs in an isolated VM with persistent state · filesystem, browser history, installed packages · so multi-hour tasks survive across runs. Integration with Slack, Jira, GitHub, Linear turned it into the model for later agents (Cursor Background, OpenAI Codex). Pricing is seat-based ($500/month per seat) plus compute overage, targeting enterprise dev teams. Benchmarks improved from 13.86% (March 2024) to ~48% on SWE-bench Verified by 2026.

Level 3

Devin's harness combines Claude 3.5/4 as planner with GPT-4o or Claude as executor, routed through Cognition's internal orchestration layer. Each session maintains a "scratchpad" long-term memory plus per-tool short-term context. Browser is a headless Chromium instance; shell is a Linux VM; editor is a custom VSCode fork. Cost per task runs $0.20-$5.00 depending on duration. Cognition acquired Windsurf (formerly Codeium) in 2025, absorbing its IDE product and 200+ engineers. Devin now integrates with Windsurf editor for hybrid human-agent workflows.

Why this matters now

Devin reset industry expectations for what "agent" means · moved the conversation from chat UI to autonomous cloud workers.

The takeaway for you
If you are a
Researcher
  • ·First public agent to break 10% on SWE-bench Verified
  • ·Cloud VM harness with persistent state across sessions
  • ·Multi-model routing (planner + executor)
If you are a
Builder
  • ·$500/month per seat · targets senior engineers on enterprise teams
  • ·Integrates with Slack, Jira, Linear, GitHub natively
  • ·Alternative: Cursor Background, OpenAI Codex Cloud, Claude Code
If you are a
Investor
  • ·Cognition raised $196M+ at $2B valuation by 2025
  • ·Category-defining agent · set the benchmark every competitor chases
  • ·Acquired Windsurf to own IDE + agent stack
If you are a
Curious · Normie
  • ·An AI that works like a junior engineer · you file a ticket, it opens a PR
  • ·Lives in the cloud, not on your laptop
  • ·Made famous for the first public agent demos in 2024
Gecko's take

Devin is the agent that turned "AI code assistant" into "AI cloud employee." Every serious coding agent now mimics its shape.

Cognition AI, founded by Scott Wu (former IOI gold medalist). Launched March 2024. Acquired Windsurf in 2025.