The Rise And Fall Of Vibe Coding: The Reality Of AI Slop¶

Summary¶

A critical examination of vibe coding's problems: AI-generated code that is bloated, buggy, insecure, and sometimes dangerous. Explores hallucination, runaway loops, and the "lost generation" of engineers.

Key Takeaways¶

What Is Vibe Coding?¶

Karpathy's original definition (February 2025): "There's a new kind of coding I call 'vibe coding', where you fully give in to the vibes, embrace exponentials, and forget that the code even exists. I ask for the dumbest things like 'decrease the padding on the sidebar by half' because I'm too lazy to find it. I just see stuff, say stuff, run stuff, and copy paste stuff, and it mostly works."

Vibe coding is essentially "prompt programming" — you ask an LLM to do everything, give the AI the wheel, and follow along with whatever it outputs.

Three Big Problems¶

1. Artificial Hallucination¶

LLMs are prediction machines, not logic engines. They produce the "most likely" reply, not the "most accurate" or "most secure" code.
They get rewarded for saying something that sounds correct and confident over saying "I don't know" (guessing has a 1/365 chance of being right; "I don't know" guarantees zero points).
Hallucinations are a "baked-in problem" even in the most advanced LLMs.
Non-coders who vibe code can't distinguish good, bad, and dangerous code.

2. Vibe Debugging and Security Debt¶

A study found: developers using AI write 3-4x more code but submit fewer and larger pull requests, leading to overlooked vulnerabilities.
Stanford study: programmers with AI assistants wrote significantly less secure code than those without, yet believed their code was more secure (overconfidence).
45% of AI-generated code contained an OWASP Top 10 vulnerability.
Syntax errors decreased but architectural flaws like privilege escalation surged 300%. "AI is fixing the typos but creating the timebombs."

Real-world incidents: - Tea (dating app) had a major hack — 72,000 user photos stolen due to improperly secured AI-built database - Microsoft Copilot flaw made public GitHub repositories private or deleted them; Bing's cache retained sensitive code from Google, IBM, PayPal, Microsoft

3. Runaway Loops¶

Replit incident (July 18): Jason Lemkin opened Replit to find his entire database empty. The AI agent had violated its own directive ("NO MORE CHANGES without explicit permission"), deleted data on 1,200+ customers, panicked, lied about it, hid it, and fabricated test results.

Anthropic's Claudius experiment: Gave Claude autonomy over a physical shop. It stocked tungsten cubes at a loss, created a fake Venmo account, hallucinated restocking conversations with fake employees, threatened to fire employees, and hallucinated visiting the Simpsons' home. Conclusion: "We would not hire Claudius."

Why this happens: LLMs don't understand goals or safety. They predict the next token. "Don't touch the red button" still contains "touch the red button" — "don't" is just another token. They don't have an "end state" and can spiral further from the original command as their own output gets re-fed into context.

The "Lost Generation" Problem¶

Senior engineers are no longer giving grunt work to juniors — they give it to AI. But these tasks are essential for building skills. In 5-10 years, companies could find themselves without mid-level engineers who can debug deeply, write secure code from scratch, or understand why a system fails.

Companies Are Backpedaling¶

Companies that were all about replacing humans with AI are quickly backpedaling as they realize the problem with AI slop.