A tool designed to flag authors writing without human judgment can remove human judgment from the reader using it turning a detection score into a digital scarlet letter.
In a recent post, I wrote about how AI can flatten writing when writers outsource too many decisions. That includes the subtle choices authors give away to AI that make a piece recognizably theirs.
But there’s a second kind of flattening that may be even more subtle. This one happens to readers.
AI detection tools can be useful. I use them. Educators need help identifying cases where students submit work they didn’t do. Publishers need to protect readers from synthetic articles produced at scale. The internet and social media is already flooded with low-effort AI content, or what some call AI slop.
But the important question isn’t only whether AI detection can identify patterns in writing. We also need to ask,
What happens when a detector score or label becomes a shortcut for judgment?
What AI Detection Can and Can’t Tell Us
Pangram is one of the more sophisticated AI detection tools. Rather than flagging a few suspicious phrases, it looks for patterns across a full piece of writing — what CEO Max Spero describes as identifying “mode collapse,” the tendency of large language models to make similar structural choices repeatedly.
The company claims a false-positive rate of one in 10,000. That does matter because few detectors perform at Pangram’s level. A 2024 higher-education study tested six AI text detectors and found a baseline accuracy of only 39.5%.
Most people, though, are not making decisions with the best detector under ideal conditions. They’re using whatever tool is built into an LMS, SAAS, found by search, avaible for free. Accuracy varies widely by tool, text length, writing context, and whether the writing has been revised, translated, or mixed with human work.
Yet even if Pangram performs better than most, the level of accuracy still doesn’t address an uderlying concern with AI detection..
When A Signal Becomes A Verdict
A useful signal can become a problem if people treat if as a final verdict. A label can tell us something about patterns in prose. It cannot tell us whether an author had something real to say, whether the experience is authentic, whether the judgment is sound, or whether the writing is worth reading.
That part still requires a human reader. We already read in enviroments shaped by algorithms. Our feeds descide what rises, what disappears, and what we are likley to see at all. We’re already not fully in confrol of what worthy reading reaches us.
AI detection adds another layer. It doesn’t just shape what appears in front of us. It can shape what we trust once it gets there.
The contrast shows up in my classroom. As a professor, I don’t look at AI detection scores before reading student papers. I read them first, forming my own opinion about the writing, argument, and thinking.
If something raises a concern (such as voice that doesn’t match a student’s earlier work, a structure that feels imported versus developed, or a stat that feels unbelievable) then I may consult a detection tool and do further checking on my own.
When evidence points to AI use in a way not permitted in the assignment, that becomes a teaching moment. I explain why using AI for that task was not allowed, how it affects the grade, and how it impacts their learning and skill development.
AI detection tools should support human judgment. They shouldn’t replace it.
Universities are beginning to see this underlying issues. Vanderbilt disabled Turnitin’s AI detector after raising concerns about false positives, transparency, privacy, and the possibility that non-native English writers could be disproportionately flagged. Washington State University canceled its Turnitin AI Detection contract in 2026, citing concerns about false positives, student distress over false accusations, and lack of transparency.
The problem is made harder by the detection-evasion arms race. AI detectors try to identify machine-generated pattersn, while AI writing and “humanizers’ tools promise to make AI-assisted text sound more natural. For example, Grammarly, now offers both an AI detector and an AI humanzer that it sells to both universities and students. Research has found that humanizing tools can make AI-generated text harder to detect.
That does not mean AI detection has no value. It means institutions are starting to ask the same question individual readers should ask: Is this tool helping us exercise better judgment, or is it tempting us to outsource judgment to a score?
The Label Appears Before You Read
Professors can see AI scores before reading student work. Editors and publichser can run manuscripts through automatic screening tools before considering the argument. Reviewers, managers, or readers may be a label or score before they have formed their own judgement.
Pangram’s new Chrome extension brings the same concern into broader public view.
The extension scans posts as you scroll, automatically labeling them human, AI-assisted, or AI-generated. Green means the writing appears human. Yellow means AI-assisted. Red means AI-generated.
You don’t have to stop, open a separate tool, paste the text, and consider the result. The judgment appears instantly, automatically, and without your involvement.
AI detection tools should support human judgement, not replace it.
The concern is the immediacy. The label gets there first. Before you read the opening sentence, before you hear the writers voice, before you consider the argument, the classifier has already suggested how susicious you should be. If you accept that label as a verdict, your judgement has been shaped before the writing has had a chance.
A yellow or red label risks becoming a digital scarlet letter: a visible mark of suspicion that encourages readers to judge the writer before they have read the words.
Click on graphic to download a PDF. Graphic created by ChatGPT.The Pope Leo Moment
As Wired reported, Pangram promoted its new Chrome extension by flagging multiple posts from Pope Leo XIV’s official X account, including posts raising concerns about artificial intelligence.
The irony is hard to miss. A post warning that AI could weaken human discernment was itself labeled AI-generated by an AI detector that can encourage readers to outsource discernment.
Maybe the label was accurate. AI could have played some role in shaping the posts. Public figures have communications teams. Social media posts are often drafted, reviewed, revised, and approved by more than one person.
And in marketing communications, using AI to help adapt, edit, or optimize social posts is becomming part of the normal workflow. Many social media management tools now build AI assistance into the process.
But does that automatically dismiss the importance of the message?
Will we let AI make judgements about other humans and thier message without any thought of our own?
The Pope example became an effective publicity hook for Pangram’s browser extension, drawing articles and reposts. That’s what product launches try to do. But I felt a difference.
A tool built by an AI company was publicly questioning the authenticity of one of the most visible voices on the human risks of AI. That doesn’t make the detector illegitimate. But it does make the judgment around how it is used worth examining.
The tool isn’t simply identifying synthetic spam hidden in the corners of the internet. It’s placing visible labels next to people’s words, encouraging instant judgment as we scroll past them.
What the Pope Said Next
That question became even more relevant a month later when Pope Leo released his first encyclical, Magnifica Humanitas: On Safeguarding the Human Person in the Time of Artificial Intelligence.
In it the Pope warns against fully delegating decisions that affect people’s rights, opportunities, status, freedom, or reputation to automated systems. He also writes that “every technology shapes those who use it.”
That applies to writers. It also applies to readers.
If we automatically skip anything labeled AI-assisted, we’re no longer exercising our own judgment. We’re asking an imperfect AI classifier to decide what deserves our attention before we’ve read the first paragraph.
What’s more, Spero himself notes in the same interview that the tool becomes more reliable the longer the text gets. More words mean more branching decisions to analyze and more pattern data to work with. A post on X is about as short as text gets. Yet the extension applies the same confident color label regardless.
The tool designed to flag writing with little human judgment can end up removing human judgment from the reader.
AI Assistance Is Not Human Replacement
Writers have never worked entirely alone. Books and magazine articles have editors. Academic articles have reviewers. Journalists have fact-checkers. Copywriters have creative directors and proof readers.
I became a better writer because other people challenged my work, questioned my logic, and told me when an idea was not there yet. AI can play a useful supporting role. It can identify an unclear paragraph, find a source, suggest a counterargument, or help a writer get unstuck.
That’s different from publishing thousands of synthetic posts under fake names or filling the internet with auto-generated comments no human thought deeply about.
Disclosure matters. Readers should know when AI played a meaningful role. But collapsing every kind of AI assistance into a single category doesn’t help us think more clearly about authorship.
A yellow label can’t tell you whether a writer spent five minutes, five days, or even five month shaping the final piece. It can’t tell you whether the experience is real, the judgment is sound, or the idea is worth your time.
Pope Leo’s encyclical calls for an “ecology of communication” where reasoned argument and verification carry more weight than immediate reaction. That’s a useful standard, whether you’re a writer deciding how much to lean on AI or a reader deciding what author to trust.
AI can flatten writing when writers outsource too many decisions.
AI detection can flatten reading when readers do the same.
If you want to know whether a writer has something real to say, you still have to read them.
AI use disclosure: The central idea for this post was mine. It began with my reaction to seeing writers quickly dismissed through Pangram’s color-coded browser labels, a kind of digital scarlet letter applied before their work had been read. I used ChatGPT and Claude as thought partners to pressure-test the argument, identify possible supporting sources, and follow new tributaries opened by the Wired article, the 2024 higher-education detector study, and Pope Leo XIV’s encyclical on AI. I opened and checked the sources myself before using them. I then rewrote, reorganized, added, cut, and edited through multiple rounds. The experiences, ideas, opinions, and final decisions are my own. If an AI detector flags this as AI-assisted, I hope readers will still read before judging. I first had this thought on May 1st, and the research, thinking, writing, and back-and–forth with ChatGPT as my thought partner unfolded over two months. This was not a single-prompt, copy-and-paste article.

