# unai — humanize AI-generated text and code

AI writes like AI. unai fixes that.

$ echo "Certainly! Let me delve into this comprehensive topic." | unai
Let me explore this thorough topic.

You know the patterns when you read them: Certainly!, leveraging, delve, meticulous, it is worth noting. Every model writes this way. RLHF bakes it in — human raters reward formal-sounding output, so models overfit to a vocabulary that sounds polished but reads robotic.

unai is a CLI humanizer for AI text and code. It strips those patterns out, replaces what it can automatically, and flags the rest. Pipe it into your workflow and your output reads like a person wrote it — not a chatbot trying to sound helpful.

Unlike web-based AI humanizers, unai is deterministic: same input, same output, every time. No cloud. No API calls. No model in the loop. Just regex rules grounded in corpus data, fast enough to run on every keystroke.
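That claim is easy to verify from the shell: run the same file through twice and compare.

```sh
# bash: two runs over the same input are byte-identical
diff <(unai draft.md) <(unai draft.md) && echo "deterministic"
```

diff exits 0 only when the outputs match, so this doubles as a regression check in scripts.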


## What it humanizes

| Input | Mode | What gets cleaned |
| --- | --- | --- |
| Blog posts, docs, emails | Text | Openers, filler words, hedges, clichés |
| Source code | Code | Comments, variable names, docstrings, test boilerplate |
| Git commit messages | Commit | Past tense, vague subjects, bloated bodies |

Everything in one binary. No config files. No dependencies.


## Install

### Prebuilt binary (fastest)

Download from GitHub Releases:

| Platform | File |
| --- | --- |
| Linux x86_64 | `unai-v*-x86_64-unknown-linux-gnu.tar.gz` |
| Linux arm64 | `unai-v*-aarch64-unknown-linux-gnu.tar.gz` |
| macOS x86_64 | `unai-v*-x86_64-apple-darwin.tar.gz` |
| macOS arm64 (M1/M2/M3) | `unai-v*-aarch64-apple-darwin.tar.gz` |
| Windows | `unai-v*-x86_64-pc-windows-msvc.zip` |

Extract and put unai anywhere in your $PATH.
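For example, on Linux x86_64 (assuming the binary sits at the top level of the archive; adjust if the layout differs):

```sh
tar -xzf unai-v*-x86_64-unknown-linux-gnu.tar.gz
sudo mv unai /usr/local/bin/
```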

### From crates.io

```sh
cargo install unai
```

Requires Rust 1.75+. The binary ends up at ~/.cargo/bin/unai.

No Rust? One line:

```sh
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
```

Then `cargo install unai`.

### From source

```sh
git clone https://github.com/HugoLopes45/unai
cd unai
make install
```

### Claude Code skill

```sh
mkdir -p ~/.claude/skills/unai
curl -sL https://raw.githubusercontent.com/HugoLopes45/unai/main/prompts/claude-code.md \
  > ~/.claude/skills/unai/SKILL.md
```

Type /unai in any Claude Code session to humanize the current file or selection.

Or from the repo: `make install-skill`

### Cursor

```sh
make install-cursor   # copies prompts/cursor.mdc → .cursor/rules/unai.mdc
```

Cursor picks up the rules on every generation in that project.

### Any LLM (ChatGPT, Gemini, API)

Copy prompts/system-prompt.md into your system prompt. Done.
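For API use, load the file and send it as the system message. A minimal sketch against the OpenAI chat completions endpoint (the model name and user message are placeholders, not anything unai ships):

```sh
# requires jq 1.6+ for --rawfile
jq -n --rawfile sys prompts/system-prompt.md '{
  model: "gpt-4o-mini",
  messages: [
    {role: "system", content: $sys},
    {role: "user",   content: "Draft a short release note."}
  ]
}' | curl -s https://api.openai.com/v1/chat/completions \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d @-
```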


## Usage

### Clean text and move on

```sh
# pipe through unai, get humanized text back
echo "Certainly! We should utilize this approach." | unai
# → "We should use this approach."

# humanize a file
unai draft.md > draft-clean.md

# overwrite in place
unai draft.md | sponge draft.md
```
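sponge comes from moreutils. If you don't have it installed, a temp file does the same job:

```sh
# in-place rewrite without sponge
unai draft.md > draft.md.tmp && mv draft.md.tmp draft.md
```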

Only patterns with an auto-fix get replaced. Everything else passes through unchanged. Use --report to see what was flagged but couldn't be auto-fixed.

### `--report` — what's wrong and why

Inspect findings without changing anything. Every finding cites the corpus study that measured it.

```sh
unai --report draft.md

# high-priority only
unai --report --min-severity high draft.md
```

The unfiltered run prints:

```
Mode: text  |  11 finding(s)

CRITICAL (2)
  line 1: Sycophantic opener: 'Certainly!' (RLHF-induced, Juzek 2025)
  line 1: LLM tell: 'delve' (25× excess frequency, Kobak 2025)

HIGH (3)
  line 2: LLM filler: 'leveraging' (Kobak 2025)
  line 2: LLM tell: 'pivotal' (Kobak 2025, Liang 2024)
  line 3: LLM cliché: 'stands as a testament' (Neri 2024)

MEDIUM (3)
  line 1: LLM filler: 'comprehensive' (Kobak 2025)
  line 2: LLM filler: 'robust' (Kobak 2025)
  line 3: LLM filler: 'innovative' (Kobak 2025)

LOW (3)
  line 1: LLM hedge: 'it is worth noting' (Kobak 2025)
  line 2: LLM connector: 'furthermore' (Rosenfeld 2024)
  line 3: LLM connector: 'in conclusion' (Rosenfeld 2024)
```

### `--diff` — preview changes before applying them

```sh
unai --diff draft.md
```

```diff
--- original
+++ cleaned
@@ -1,3 +1,3 @@
-It is worth noting that this comprehensive approach is pivotal to our success.
-Furthermore, leveraging these robust methodologies facilitates seamless collaboration.
+This thorough approach is key to our success.
+Using these robust methodologies facilitates collaboration.
 In conclusion, this innovative solution stands as a testament to meticulous engineering.
```

Patterns without an auto-fix (like meticulous, innovative) won't show in the diff. Run --report alongside to see the full picture.

### `--dry-run` — list every change before applying

```sh
unai --dry-run draft.md
```

```
--- Auto-fixable (4) ---
  line  1: "utilize" → "use"  — LLM filler: 'utilize' (Kobak 2025)
  line  2: "leveraging" → "using"  — LLM filler: 'leveraging' (Kobak 2025)
  line  3: "facilitate" → "help"  — LLM filler: 'facilitate' (Kobak 2025)
  line  4: "in order to" → "to"  — Filler: 'in order to'

--- Flagged (no auto-fix) (2) ---
  line  5: "meticulous"  — LLM tell: 'meticulous' (Kobak 2025, Neri 2024)
  line  6: "innovative"  — LLM filler: 'innovative' (Kobak 2025, lower ratio)
```

Original text prints unchanged after the list — safe to pipe elsewhere.

### `--annotate` — mark findings inline

```sh
unai --annotate draft.md
```

Prints each line, with finding markers at the exact column on stderr. Good for spotting where the patterns cluster in a longer document.
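Since the markers go to stderr, you can merge the streams to page through a long document with its annotations interleaved:

```sh
unai --annotate draft.md 2>&1 | less
```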

### Humanize code

```sh
# explicit code mode
unai --mode code --report service.py

# only flag naming and comments
unai --mode code --rules naming,comments --report service.ts

# docstrings only
unai --mode code --rules docstrings --report api.go
```

Available `--rules` values: `comments`, `naming`, `commits`, `docstrings`, `tests`, `errors`, `api`

Before:

```python
def getUserDataObject(userDataObject):
    # This function serves as the core handler for processing user data
    # Initialize the result variable to store the output
    result = None
    return result
```

Findings:

```
HIGH   line 2: LLM docstring boilerplate: 'this function serves as'
MEDIUM line 1: Type-in-name anti-pattern: use 'user' instead of 'userDataObject'
LOW    line 3: Over-explaining comment: states what the code already shows
```

### Humanize commit messages

unai fires commit-specific rules automatically on COMMIT_EDITMSG and MERGE_MSG files.

```sh
# check a commit message
unai --report COMMIT_EDITMSG

# pipe one through
echo "Added new authentication feature" | unai --mode code --rules commits --report
# HIGH: Past tense in commit subject — use imperative mood ('add' not 'added')
```
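The same rules work on history you've already written; pull any commit message back out with git and pipe it through:

```sh
# lint the most recent commit message
git log -1 --format=%B | unai --mode code --rules commits --report
```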

### Severity levels

| Level | Meaning | Example |
| --- | --- | --- |
| critical | Statistically extreme (r > 10×) or RLHF-specific pattern | delve, Certainly!, feel free to |
| high | Strong corpus signal (r > 3×) or clear LLM boilerplate | leveraging, pivotal, meticulous |
| medium | Elevated in LLM text, also common in marketing copy | comprehensive, robust, innovative |
| low | Filler and hedging language that reads as padded | in order to, moreover, furthermore |

Filter with `--min-severity`:

```sh
unai --report --min-severity high draft.md   # High + Critical only
unai --report --min-severity critical post.md  # Critical only
```

### Git hooks

Drop unai into your commit flow and catch LLM-isms before they land.

Non-blocking (report only):

```sh
#!/bin/sh
# .git/hooks/commit-msg
unai --mode code --rules commits --report --min-severity high "$1"
```

Blocking (reject bad commit messages):

```sh
#!/bin/sh
# .git/hooks/commit-msg
findings=$(unai --mode code --rules commits --report --min-severity high "$1" 2>&1)
if echo "$findings" | grep -q "finding(s)"; then
  echo "$findings" >&2
  echo "" >&2
  echo "Fix the commit message and try again." >&2
  exit 1
fi
```

Either way, make the hook executable:

```sh
chmod +x .git/hooks/commit-msg
```
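To confirm the blocking hook fires, try a subject that trips the past-tense rule; the commit should be rejected:

```sh
git commit --allow-empty -m "Added new authentication feature"
# expected: findings print to stderr and the commit is aborted
```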

## What it catches

### Text patterns

| Pattern | Severity | Source |
| --- | --- | --- |
| Certainly!, Of course!, Absolutely!, Great question! | Critical | Juzek 2025 (RLHF) |
| I'd be happy to, Happy to help, feel free to | Critical | Juzek 2025 (RLHF) |
| delve, delves | Critical | Kobak 2025 (r=25×) |
| leveraging, utilize, facilitate, commence | High | Kobak 2025 |
| pivotal, meticulous, intricate, realm | High | Kobak 2025, Liang 2024 |
| stands as a testament, tapestry | High | Neri 2024 |
| comprehensive, robust, seamlessly, innovative | Medium | Kobak 2025 |
| in order to, moreover, furthermore, in conclusion | Low | Rosenfeld 2024 |

### Code patterns

| Pattern | Severity | Rule |
| --- | --- | --- |
| `# This function serves as...`, `# This class represents...` | High | docstrings |
| `# Initialize the X variable to Y` | Low | comments |
| `UserDataManager`, `ErrorHandler`, `ProcessingHelper` | High | naming |
| `userDataObject`, `configurationSettings` | Medium | naming |
| `Added X` in commit subject | High | commits |
| `assert result is not None` with no message | Medium | tests |
| bare `except: pass` | High | errors |

137 patterns across 9 rule files. Browse them in rules/.

## What unai doesn't touch

- Content inside fenced code blocks in Markdown (``` fences)
- Inline backtick spans (`code`)
- Bare URL lines
- Non-English text: "pivotale" (French), "pivotaler" (German), "这是pivotal决策" (Chinese, "this is a pivotal decision"). Word-boundary matching is Unicode-safe, so foreign words containing an English flagged word pass through unchanged.
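For example, a flagged word inside an inline code span survives a round trip:

```sh
echo 'Run the `delve` debugger, then commit.' | unai
# → unchanged: `delve` is inside a backtick span
```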

## Why a CLI, not a web app

Most AI text humanizers are web apps: you paste text, a model rewrites it, you copy the output. That works for one-off documents. It breaks for everything else.

unai is a Unix filter. It composes:

```sh
# humanize every markdown file in a repo
find . -name "*.md" | xargs -I{} sh -c 'unai {} | sponge {}'

# gate a CI pipeline on AI writing quality
unai --report --min-severity high release-notes.md || exit 1

# pre-commit hook that blocks AI-written commit messages
unai --mode code --rules commits --report COMMIT_EDITMSG

# integrate into your editor's format-on-save (in Vim, % expands to the current file)
unai % | sponge %
```

Other humanizers can't do any of this. They're also non-deterministic: the same sentence in, different output each run. unai gives you the same result every time — which means you can review diffs, write tests, and trust it in automation.

Zero dependencies. No Python runtime. No Node. No Docker. One static binary, ~2MB.


## Why this exists

LLMs generate the statistical median of "correct writing." That median is padded, over-hedged, and full of words that sound confident while saying nothing specific.

The patterns are predictable because RLHF rewards formal-sounding text. Leveraging because business writing uses it. Furthermore because academic papers use it. Certainly! because it sounds agreeable to a human rater grading a response.

None of it is wrong, exactly. It just doesn't sound like a person wrote it. A person would say using instead of leveraging. A person's code comment would say # don't touch this — it works and I don't know why instead of # This function serves as the core processing handler for user data operations.

unai draws a line. Write like yourself, not like a chatbot trying to sound helpful.


## Research basis

Rules are grounded in peer-reviewed corpus studies, not intuition.

Kobak et al. (2025), Science Advances — 15 million PubMed abstracts (2010–2024). delves appeared 25× more often post-ChatGPT than baseline. 280+ excess words identified, 66% verbs, 18% adjectives. This is the core word list. arXiv:2406.07016

Liang et al. (2024), Nature Human Behaviour — 950,000+ scientific papers. pivotal, intricate, showcasing, realm approximately doubled post-2023. arXiv:2404.01268

Juzek & Ward (COLING 2025) — Explains why the vocabulary diverges: RLHF, not training data. Human raters prefer formal-sounding outputs. Models overfit to that preference. arXiv:2412.11385

Rosenfeld & Lazebnik (2024) — 6 LLMs vs 13,371 NYT articles. LLMs produce less sentence length variation, more auxiliary verbs, fewer complex noun phrases. PMC11422446

Bisztray et al. (AISec 2025) — Comment density is the #1 feature for LLM code authorship attribution. Removing it drops accuracy 7.2pp. arXiv:2506.17323

Lopes & Klotzman (ICSE 2024) — LLM commit messages are 20× longer than human commits (median 381 vs 19 chars). LLM commits always include "why" explanations. arXiv:2401.17622

This tool is a style linter, not an AI detector. It cannot tell you whether a document was written by an LLM — no surface-level tool can do that reliably. What it can do is help you write in a voice that reads as distinctly human.


## Rules

Nine files, 137 patterns. Each has a before/after and a one-line fix.

## Contributing

See CONTRIBUTING.md. Open an issue with the new rule template before writing code.

The bar: if a human developer would write it unprompted, it's not a style violation.

## License

MIT.