SF SaaS Flags
Vibe Coding & AI IDEs · Risk Score 78 · Severe

Devin (Cognition AI) Complaints: What Buyers Are Actually Reporting

Autonomous AI software engineer claiming to complete full engineering tasks end-to-end

3 documented complaints against Devin (Cognition AI) — paraphrased from BBB filings, Trustpilot reviews, Reddit threads, and public forum posts. Newest complaints first.

Our AI scanner searches Reddit, Trustpilot, BBB, and news sources for fresh complaints from the past year, paraphrases what it finds, and adds anything new to this page. Takes up to 90 seconds.

1
Critical
2
High
0
Medium
0
Low

Who reported these complaints

Sourced from public platforms across US, UK, and global markets — each report links to the original source.

Platform Reports Who's reporting
Reddit2US & global users
News1Public media
Misleading Marketing CRITICAL Source: News Updated 20d ago

Launch demo benchmark claims found to be misleading by independent researchers

The viral March 2024 demo claimed Devin achieved '13.86% on SWE-bench' — but independent researchers including those at Princeton found the statistic was from a non-standard subset of the benchmark, with unreported human assistance. The claim was not corrected in the marketing and continued to be cited in media coverage.

"We replicated the evaluation and found the 13.86% figure used a non-verified subset with human-in-the-loop assistance not disclosed in the original announcement."

Billing Problems HIGH Source: Reddit Updated 20d ago

Pricing at $500/month for capabilities that fall short of the demo

Early adopters describe paying $500/month and finding Devin reliable only for narrow, well-scoped tasks — not the end-to-end autonomous engineering in the launch demo. The price-to-actual-productivity ratio is the most common complaint in professional developer communities.

"It's an impressive tool but at $500/month I expected something closer to the demo. In reality I'm still reviewing and fixing almost everything it touches."

Customer Complaints HIGH Source: Reddit Updated 20d ago

Autonomous mode creates context drift on longer tasks

Developers who have given Devin longer tasks describe it losing track of constraints set at the beginning, making changes that contradict earlier instructions, and requiring significant human correction to bring the output back on track.