SaaS Flags

Is Devin (Cognition AI) Worth It? 3 Documented Issues Reviewed

Autonomous AI software engineer claiming to complete full engineering tasks end-to-end

Risk Score: 78 out of 100 (Severe)

TL;DR: Devin (Cognition AI) carries a Risk Score of 78/100 (Severe) based on 3 documented complaints, 2 sourced from Reddit and 1 from news coverage. The public record reflects a serious pattern of issues you should know about before buying.

Devin, from Cognition AI, is marketed as the 'world's first fully autonomous AI software engineer' — capable of handling full coding tasks from spec to deployment. It launched to enormous hype in March 2024 at $500/month (later adjusted). Unlike copilots, Devin runs autonomously in its own sandboxed environment.

Website: cognition.ai · Category: Vibe Coding & AI IDEs · Last scanned: 26 days ago

Should You Trust Devin (Cognition AI)?

Devin is the most aggressively marketed product in the AI coding category and the one with the largest documented gap between demo and real-world performance. The March 2024 launch claimed Devin achieved 13.86% on SWE-bench — a figure that independent researchers subsequently found was measured on a non-standard benchmark subset with human assistance not disclosed in the presentation. The claim was not corrected in subsequent marketing.

At $500/month — a price with no peer in this category — real-world reports describe a tool impressive for narrow, well-defined tasks but far from the autonomous software engineer the launch narrative implied. Developers describe reviewing and correcting almost everything it touches on complex work. The benchmark controversy is documented, specific, and worth reading in full before handing over $500.

Is Devin (Cognition AI) Worth the Cost? Complaint Category Breakdown

Each complaint type is weighted differently in the Risk Score. Billing and marketing deception weigh heaviest.

BBB Complaints: 0
Lawsuits & Legal Action: 0
Misleading Marketing: 1
Customer Complaints: 1
Churn & Retention: 0
Billing Problems: 1
Support Failures: 0

Where these complaints come from

Complaints are sourced from public platforms spanning US, UK, and global consumers. Each report links back to its original source.

Platform | Reports | Who's reporting
Reddit | 2 | US & global users
News | 1 | Public media

What Buyers Say About Devin (Cognition AI)

Documented pricing complaints, billing issues, and support failures — newest first.

Our AI scanner searches Reddit, Trustpilot, BBB, and news sources for fresh complaints from the past year, paraphrases what it finds, and adds anything new to this page.

Misleading Marketing · CRITICAL · Source: News · Updated 20d ago

Launch demo benchmark claims found to be misleading by independent researchers

The viral March 2024 demo claimed Devin achieved '13.86% on SWE-bench' — but independent researchers including those at Princeton found the statistic was from a non-standard subset of the benchmark, with unreported human assistance. The claim was not corrected in the marketing and continued to be cited in media coverage.

"We replicated the evaluation and found the 13.86% figure used a non-verified subset with human-in-the-loop assistance not disclosed in the original announcement."

Billing Problems · HIGH · Source: Reddit · Updated 20d ago

Pricing at $500/month for capabilities that fall short of the demo

Early adopters describe paying $500/month and finding Devin reliable only for narrow, well-scoped tasks — not the end-to-end autonomous engineering in the launch demo. The price-to-actual-productivity ratio is the most common complaint in professional developer communities.

"It's an impressive tool but at $500/month I expected something closer to the demo. In reality I'm still reviewing and fixing almost everything it touches."

Customer Complaints · HIGH · Source: Reddit · Updated 20d ago

Autonomous mode creates context drift on longer tasks

Developers who have given Devin longer tasks describe it losing track of constraints set at the beginning, making changes that contradict earlier instructions, and requiring significant human correction to bring the output back on track.

Frequently asked questions about Devin (Cognition AI)

Is Devin (Cognition AI) worth the price?

We've documented 1 billing complaint against Devin (Cognition AI), a signal worth weighing before committing to a paid plan. Its Risk Score of 78/100 puts it in the "Severe" band. See the full complaints breakdown before deciding.

Is Devin (Cognition AI) easy to cancel?

Cancellation difficulty is one of the top SaaS frustration patterns. Check the complaints page: we tag cancel-related issues under the "Billing" and "Contract Trap" categories. If none are documented yet, run a scan to surface what's currently out there.

Does Devin (Cognition AI) have hidden fees?

Hidden-fee complaints fall under our "Billing Problems" category. We've documented 1 billing complaint for Devin (Cognition AI) so far. See all complaints for the full picture.

How does Devin (Cognition AI) compare to its alternatives?

We track other Vibe Coding & AI IDEs tools and rank them by Risk Score. See our alternatives comparison to find lower-risk options in the same category.

What are the biggest complaints about Devin (Cognition AI)?

The highest-severity documented complaints involve misleading marketing. Read all 3 documented complaints on the complaints page →

Is Devin (Cognition AI) a scam?

Probably not in the strict legal sense — most SaaS products with bad reputations are real companies delivering a real (if disappointing) product. But "is it a scam" is the question people ask when they feel they were misled. Read our full scam analysis →

How does Devin (Cognition AI)'s Risk Score get calculated?

We weight each warning by severity (Low to Critical) and category, then aggregate. Lawsuits and misleading-marketing claims weigh heaviest. The current 78/100 score puts Devin (Cognition AI) in the "Severe" band. Full methodology →
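As a rough illustration of "weight by severity and category, then aggregate", here is a minimal sketch. Every number in it is hypothetical: the severity weights, the category multipliers, the MEDIUM level, and the saturation point are assumptions made for this example, not the published methodology, so it is not expected to reproduce the 78/100 shown above.

```python
from dataclasses import dataclass

# Hypothetical weights for illustration only; the real values are not published here.
SEVERITY_WEIGHT = {"LOW": 1.0, "MEDIUM": 2.0, "HIGH": 3.0, "CRITICAL": 4.0}

# Assumed multipliers: lawsuits and misleading marketing weigh heaviest.
CATEGORY_WEIGHT = {
    "Lawsuits & Legal Action": 2.0,
    "Misleading Marketing": 2.0,
    "Billing Problems": 1.5,
}
DEFAULT_CATEGORY_WEIGHT = 1.0

# Assumed number of weighted points that maps to a score of 100.
SATURATION_POINTS = 25.0


@dataclass(frozen=True)
class Complaint:
    category: str
    severity: str


def weighted_points(complaints):
    """Sum severity weight times category weight over all complaints."""
    return sum(
        SEVERITY_WEIGHT[c.severity]
        * CATEGORY_WEIGHT.get(c.category, DEFAULT_CATEGORY_WEIGHT)
        for c in complaints
    )


def risk_score(complaints):
    """Scale the weighted points onto 0..100 and cap at 100."""
    points = weighted_points(complaints)
    return min(100, round(100 * points / SATURATION_POINTS))


# The three complaints documented on this page, with their listed severities.
DEVIN_COMPLAINTS = [
    Complaint("Misleading Marketing", "CRITICAL"),
    Complaint("Billing Problems", "HIGH"),
    Complaint("Customer Complaints", "HIGH"),
]

if __name__ == "__main__":
    print(weighted_points(DEVIN_COMPLAINTS))
    print(risk_score(DEVIN_COMPLAINTS))
```

Under these made-up weights, one CRITICAL misleading-marketing complaint contributes more points than the two HIGH complaints combined, which is the kind of effect the category weighting is meant to produce.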

Where do these complaints come from?

Each warning is paraphrased from a public source — BBB filings, Trustpilot or G2 reviews, Reddit threads, Capterra ratings, court records, or news articles. The source URL is attached to every warning so you can verify it yourself. More on our methodology →

Other Vibe Coding & AI IDEs we track

Sibling products in the same category, ranked by Risk Score (lowest first).

See full alternatives comparison →

Devin (Cognition AI) vs Amazon Q Developer →
Devin (Cognition AI) vs Tabnine →
Devin (Cognition AI) vs Windsurf (Codeium) →
Devin (Cognition AI) vs v0 (Vercel) →