I Built a CLI That Scores How Much of Your Code Was Written by AI

Published: 2 months ago (February 23, 2026 at 02:50 PM EST)

4 min read

Source: Dev.to

Source: Dev.to

The Problem: Nobody Knows How Much of Their Codebase Is AI‑Generated

We’re in February 2026 and AI‑assisted coding is everywhere. GitHub Copilot, ChatGPT, Claude — developers are shipping AI‑generated code faster than ever. But there’s a growing crisis nobody talks about:

AI‑generated code has 2.74× more security vulnerabilities than human‑written code (Stanford, 2025)
Experienced developers are 19 % slower when using AI tools for complex tasks
Code reviews miss AI‑specific patterns like hallucinated imports and over‑commented boilerplate

I wanted a tool that could answer one simple question: “How much of this codebase was vibe‑coded?”
So I built vibe‑check.

pip install vibe-check
vibe-check scan .
vibe-check report . --output report.html

Every Python file gets a score from 0 to 100.

Detectors

vibe-check uses 7 weighted detectors powered by Python’s ast module:

Detector	Weight	What It Catches
Security	25 %	`eval()`, hard‑coded secrets, SQL injection, `shell=True`
Repetitive Code	15 %	Copy‑paste functions with identical AST fingerprints
Generic Naming	15 %	Variables named `data`, `result`, `temp`, `output`
Hallucinated Imports	15 %	Modules that don’t exist in the stdlib or known packages
Over‑Commenting	10 %	Comments that restate the code (e.g., `# increment counter`)
Placeholders	10 %	`TODO`, `pass`‑only bodies, `NotImplementedError` stubs
Doc/Code Ratio	10 %	Docstrings longer than the functions they describe

Each detector returns a 0‑100 score; the weighted average becomes the file’s Vibe Score.

Example Output

╭──────────────────────────────────────────────╮
│         vibe-check — AI Vibe Code Detector   │
╰──────────────────────────────────────────────╯

                     File Vibe Scores
┏━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━━━━━━━━┓
┃ File                     ┃ Score ┃ Label             ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━━━━━━━━┩
│ src/api/handler.py       │  78   │ HEAVILY VIBED     │
│ utils/helpers.py         │  65   │ HEAVILY VIBED     │
│ models/user.py           │  42   │ MODERATELY VIBED  │
│ core/processor.py        │   8   │ MOSTLY HUMAN      │
└──────────────────────────┴───────┴───────────────────┘

Repo Vibe Score   54/100 — MODERATELY VIBED

Repository Layout

src/vibe_check/
├── cli.py           # Click CLI: scan, report, version
├── analyzer.py      # Walks file tree, dispatches to detectors
├── scoring.py       # Weighted aggregation into 0‑100
├── reporter.py      # Terminal (Rich), JSON, HTML output
└── detectors/
    ├── repetitive.py    # AST structural fingerprinting
    ├── naming.py        # Generic identifier detection
    ├── comments.py      # Over‑commenting patterns
    ├── imports.py       # Hallucinated module detection
    ├── placeholders.py  # TODO/empty body detection
    ├── ratio.py          # Docstring‑to‑code ratio
    └── security.py      # Security anti‑pattern detection

Extensible Design

Each detector is a standalone module with a common interface:

def detect(filepath: str, content: str, ast_tree=None) -> DetectorResult:
    ...

This makes it trivial to add new detectors without touching the rest of the codebase.

Repetitive Code Detector

The most interesting challenge was detecting structurally identical functions generated by AI tools. The solution normalizes the AST by replacing all identifiers with placeholders, then hashes the resulting structure. If two functions produce the same hash, they’re structurally identical regardless of naming.

# These two would produce the same AST fingerprint:
def process_users(users):
    results = []
    for user in users:
        results.append(user.name)
    return results

def handle_orders(orders):
    output = []
    for order in orders:
        output.append(order.id)
    return output

CI Integration

vibe-check exits with code 2 when high‑risk files are detected, allowing you to gate your CI pipeline:

- name: Vibe Check
  run: vibe-check scan . --threshold 60 --format json

Project Stats

79 tests passing
87 % coverage
Python 3.9‑3.12 matrix CI
0 dependencies beyond Click and Rich

AST analysis proves to be incredibly powerful for code‑quality tools—far more reliable than regex. Weighted scoring systems need careful calibration; security deserves the most weight.

The vibe‑coding problem is real—I ran this on several popular repos and found scores of 60+ in utility files.

Get Started

pip install vibe-check
vibe-check scan your-project/

Star it on GitHub:

What’s your repo’s Vibe Score? Run it and let me know in the comments.

I Built a CLI That Scores How Much of Your Code Was Written by AI

The Problem: Nobody Knows How Much of Their Codebase Is AI‑Generated

Detectors

Example Output

Repository Layout

Extensible Design

Repetitive Code Detector

CI Integration

Project Stats

Get Started

Related posts

🐜 Ant Colony Optimization for CVRP (Open Source)

I Built a CLI That Auto-Generates My Daily Standup Notes from Git History

The GSOC Trojectory: From first line of code to the final commit

Learning FastAPI for Backend & Reddit Automation for My Project (With Some Pain 😅)