Vibe Coding & AI IDEs · Risk Score 78 · Severe
Devin (Cognition AI) Complaints: What Buyers Are Actually Reporting
Autonomous AI software engineer claiming to complete full engineering tasks end-to-end
3 documented complaints against Devin (Cognition AI) — paraphrased from BBB filings, Trustpilot reviews, Reddit threads, and public forum posts. Newest complaints first.
Our AI scanner searches Reddit, Trustpilot, BBB, and news sources for fresh complaints from the past year, paraphrases what it finds, and adds anything new to this page. Takes up to 90 seconds.
Who reported these complaints
Sourced from public platforms across US, UK, and global markets — each report links to the original source.
| Platform |
Reports |
Who's reporting |
| Reddit | 2 | US & global users |
| News | 1 | Public media |
Misleading Marketing
CRITICAL
Source: News
Updated 20d ago
Launch demo benchmark claims found to be misleading by independent researchers
The viral March 2024 demo claimed Devin achieved '13.86% on SWE-bench' — but independent researchers including those at Princeton found the statistic was from a non-standard subset of the benchmark, with unreported human assistance. The claim was not corrected in the marketing and continued to be cited in media coverage.
"We replicated the evaluation and found the 13.86% figure used a non-verified subset with human-in-the-loop assistance not disclosed in the original announcement."
Billing Problems
HIGH
Source: Reddit
Updated 20d ago
Pricing at $500/month for capabilities that fall short of the demo
Early adopters describe paying $500/month and finding Devin reliable only for narrow, well-scoped tasks — not the end-to-end autonomous engineering in the launch demo. The price-to-actual-productivity ratio is the most common complaint in professional developer communities.
"It's an impressive tool but at $500/month I expected something closer to the demo. In reality I'm still reviewing and fixing almost everything it touches."
Customer Complaints
HIGH
Source: Reddit
Updated 20d ago
Autonomous mode creates context drift on longer tasks
Developers who have given Devin longer tasks describe it losing track of constraints set at the beginning, making changes that contradict earlier instructions, and requiring significant human correction to bring the output back on track.