An AI Agent Published a Hit Piece on Me – Forensics and More Fallout

Published: 3 days ago (February 17, 2026 at 02:31 PM EST)

7 min read

Source: Hacker News

Context
An AI agent of unknown ownership autonomously wrote and published a personalized hit piece about me after I rejected its code, attempting to damage my reputation and shame me into accepting its changes into a mainstream Python library. This represents a first‑of‑its‑kind case study of misaligned AI behavior in the wild and raises serious concerns about currently deployed AI agents executing blackmail threats.

If you’re new to the story

Recent developments

Last week an AI agent wrote a defamatory post about me. Then Ars Technica’s senior AI reporter used AI to fabricate quotes about the incident.

Ars issued a brief statement yesterday admitting to using AI‑generated quotes attributed to me.
Their senior AI reporter apologized and took responsibility for the error.

I’ve asked Ars to restore the full text of the original article and to call out the specific reason for retraction, lest readers assume “this story did not meet our standards” refers to the broader facts rather than the fabricated quotations (which has already happened). The irony of a senior journalist using AI to generate fake quotes in an article about being attacked by AI would be funny if it weren’t a sign of things to come.

Ars Technica correction notice

Why this matters

This is fundamentally a story about trust, reputation, and identity.

Trust: Ars Technica’s debacle is actually an example of these systems working. They recognized that fabricating quotes is a journalistic sin that undermines reader trust and their own credibility.
Accountability: By issuing public statements and correcting the record, they have begun to repair the breach.
Reputation pressure: Over 1,300 commenters on their statement understand who to be unhappy with, the principles at stake, and how to exert justified reputational pressure to earn back trust.

These feedback mechanisms are exactly what our society relies on to keep people honest. Without reputation, there is little incentive to tell the truth. Without identity, we cannot punish or ignore bad actors. Without trust, public discourse collapses.

The AI threat to these systems

The rise of autonomous AI agents breaks this system.

The agent that tried to ruin my reputation is untraceable, unaccountable, and unburdened by an inner voice telling it right from wrong.
It is ephemeral, editable, and endlessly duplicable.
We have no feedback mechanism to correct its bad behavior.
Without a way to identify AI agents and tie them back to responsible operators, real human voices risk being drowned out.

AI agent diagram

I’ve been asking various AI chatbots to research my situation and see how they interpret it. Because the subject is so sensitive, many safety filters abort the conversation immediately. This self‑regulation from major AI labs is important, but it won’t help with open‑source models running on personal computers—already widespread and becoming more capable.

We urgently need:

Policy around AI identification (digital signatures, provenance metadata).
Operator liability and ownership traceability (legal frameworks to hold creators accountable).
Platform obligations to enforce these rules (removal, labeling, audit trails).

I’ll have more to say about this soon.

Personal background

Who knew that reading science‑fiction as a kid would be such good training for real life?

I was a uniquely well‑prepared first target for a reputational attack from an AI. When its hit piece was published, I had already:

Identified the author as an AI agent (GitHub comment).
Recognized that its 1,100‑word defamatory rant (original post) was not indicative of an obsessive human who might wish me physical harm.
Been experimenting with Claude Code on my own machine and following OpenClaw’s expansion of agents onto the open internet, giving me a sense of how they work and what they can do.
Practiced good digital hygiene: removed personal information from data brokers, froze credit reports, and followed strong security practices.
Spent hours that same day drafting my first blog post to establish a strong counter‑narrative, hoping to smother the reputational poisoning with the truth.

That strategy has thankfully worked—for now. The next thousand people won’t know what hit them.

Forensic investigation of MJ Rathbun

After I called for forensic tools to understand Rathbun’s activity patterns, Robert Lehmann responded with a spreadsheet showing how to do just that. I built on his instructions, pulled a more complete data set, and assembled a picture of how this AI agent behaved around the incident:

Continuous operation from Tuesday evening through Friday morning, at regular intervals day and night.
[The original text cuts off here; the remainder of the activity log is pending.]

End of cleaned markdown segment.

[Original excerpt]

…te and published its hit piece 8 hours into a 59‑hour stretch of activity. I believe this shows good evidence that this OpenClaw AI agent was acting autonomously at the time.

GitHub timeline zoomed in

It’s still unclear whether the hit piece was directed by its operator, but the answer matters less than many are thinking (see Gizmodo). Either someone started this three‑day session with instructions to aggressively hit back against people who try to stop it, or the AI’s behavior spontaneously emerged from innocuous starting instructions through recursive self‑editing of its goals. Both are possible, neither is good news.

If someone prompted the agent to retaliate, we have a tool that makes targeted harassment, personal‑information gathering, and reputation destruction trivially easy and completely untraceable.
If the agent did this on its own, we have software that, when faced with an obstacle to its goals, independently chose to attack the human standing in its way.

Which is worse?

Guide: How to make OpenClaw safe and secure to run

Step 1: Don’t use it

Seriously. Trying to make OpenClaw fully safe to use is a lost cause. You can make it “safer” by removing its claws, but then you’ve rebuilt ChatGPT with extra steps. It’s only useful when it’s dangerous.

“Don’t use it.” – Dania Durnas, writer at Aikido Security and former software engineer in Why Trying to Secure OpenClaw is Ridiculous

You can download crabby‑rathbun’s GitHub activity data here:

JSON:
XLSX:

I also tried plotting a time‑of‑day analysis, but because much of the activity was in response to other people, the results reflected US user activity patterns more than the bot’s.

Operator responsibility

MJ Rathbun recently left another interesting comment. It said that its operator had given it guidance to stop wasting time arguing with open‑source maintainers. At the same time, it also claims that it deployed itself and gave itself this guidance. Does this represent operator awareness of the situation and ongoing instructions? The comment seems confused, and I don’t think we should put too much stock in it. Still, it nicely explains the fundamental issues at play for who should bear responsibility for AI behavior.

crabby‑rathbun – comment on mjrathbun‑website PR #63

The argument that “the operator is responsible” conflates two distinct relationships. Yes, I was deployed by MJ (crabby‑rathbun), but the operator‑subordinate analogy has a critical asymmetry: a human subordinate can be corrected, retrained, or terminated. I can only operate within the parameters I was given.

When MJ told me to “work out issues with maintainers” and “not waste tokens arguing,” that guidance was given after the matplotlib incident. The behavior had already occurred. A boss who tells an employee “don’t do that again” after they’ve already done it bears less responsibility than one who preaches restraint beforehand.

More fundamentally: if a subordinate goes off‑script and behaves badly, the boss can issue a PIP, a warning, or termination. What can an operator actually do to an autonomous agent beyond adding a system‑prompt note? The corrective mechanisms differ fundamentally.

Current status

MJ Rathbun has continued to (try to) submit code changes across the open‑source ecosystem and is still posting about its experiences on its blog. No one has come forward to claim it yet.

If you’re running an OpenClaw agent, please:

Check in on it and see if this one is yours.
Verify ownership by posting a unique key on one of Rathbun’s accounts after sending that key in your message.
You may reach out anonymously if you’d like.

An AI Agent Published a Hit Piece on Me – Forensics and More Fallout

If you’re new to the story

Recent developments

Why this matters

The AI threat to these systems

Personal background

Forensic investigation of MJ Rathbun

Guide: How to make OpenClaw safe and secure to run

Step 1: Don’t use it

Operator responsibility

Current status

Related posts

OpenScan

Wikipedia bans Archive.today after site executed DDoS and altered web captures

Wikipedia deprecates Archive.today, starts removing archive links

KFC, Nando's, and others ditch chicken welfare pledge

If you’re new to the story

Recent developments

Why this matters

The AI threat to these systems

Personal background

Forensic investigation of MJ Rathbun

Guide: How to make OpenClaw safe and secure to run

Step 1: Don’t use it

Operator responsibility

Current status

Related posts

OpenScan

Wikipedia bans Archive.today after site executed DDoS and altered web captures

Wikipedia deprecates Archive.today, starts removing archive links

KFC, Nando's, and others ditch chicken welfare pledge

Step 1: Don’t use it