Detection of AI-created content

As AI has progressed at generating code, writing, music, and other language-based output through LLMs, there has been a parallel growth in the detection of AI-generated content.

Like everything in AI, detection is a probability game. A detector estimates how likely a given combination of words or tokens is and compares that with what a model would produce. When the text comes from a standard model, such as one from OpenAI, the comparison is easier. Beyond that basic idea, detectors use many different methods.

There are several signals an AI detector could use: frequency patterns, attention patterns, and variability or entropy across the document. Human writing may have periods of drift, whereas AI-written text follows a model's distribution. Note, however, that the detector itself also needs to be trained as another model: its builders obtain texts written by AI, compare them with texts written by humans, and fit a model to the difference. This trained model can then be used to detect AI-written text.
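The variability and entropy signals described above can be illustrated with a small sketch. It assumes two hand-picked features, the Shannon entropy of the word distribution and the spread of sentence lengths, computed with the Python standard library only. The function names (`word_entropy`, `sentence_length_spread`, `style_features`) are hypothetical and not taken from any real detector, and no particular value of either feature is claimed to indicate AI authorship.

```python
import math
import re
from collections import Counter
from statistics import pstdev


def word_entropy(text: str) -> float:
    """Shannon entropy, in bits per word, of the word-frequency distribution."""
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return 0.0
    counts = Counter(words)
    total = len(words)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def sentence_length_spread(text: str) -> float:
    """Population standard deviation of sentence lengths, measured in words."""
    # Naive sentence split on ., ! and ?; good enough for an illustration.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    lengths = [len(s.split()) for s in sentences]
    return pstdev(lengths) if len(lengths) > 1 else 0.0


def style_features(text: str) -> dict:
    """Bundle the two illustrative features for one document."""
    return {
        "word_entropy": word_entropy(text),
        "sentence_length_spread": sentence_length_spread(text),
    }


if __name__ == "__main__":
    sample = "I wrote this quickly. Then I stopped, thought for a while, and added much more."
    print(style_features(sample))
```

In a trained detector, features like these would be computed for labeled human and AI samples and passed to a classifier, rather than compared against a fixed threshold.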

This is a continuous progression, since the models are changing fast and so is the output they produce. The models also keep learning new techniques, so a signal that works today may not work tomorrow. For example, a feature that was often used for detection was perplexity: lower perplexity usually meant the text was closer to what a model expected, and hence more likely to be AI-generated.
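To make perplexity concrete, here is a minimal sketch that scores text against a toy unigram model with add-one smoothing. This is a stand-in chosen for brevity: a real detector would score tokens with an LLM's probabilities, not word counts from a tiny reference corpus. The class name `UnigramModel` and the example corpus are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


class UnigramModel:
    """Toy language model: word probabilities from counts, add-one smoothed."""

    def __init__(self, corpus: str) -> None:
        self.counts = Counter(tokenize(corpus))
        self.total = sum(self.counts.values())
        # One extra vocabulary slot reserves probability mass for unseen words.
        self.vocab_size = len(self.counts) + 1

    def prob(self, word: str) -> float:
        return (self.counts[word] + 1) / (self.total + self.vocab_size)

    def perplexity(self, text: str) -> float:
        """exp of the average negative log-probability per word."""
        words = tokenize(text)
        if not words:
            raise ValueError("text contains no words")
        log_prob = sum(math.log(self.prob(w)) for w in words)
        return math.exp(-log_prob / len(words))


if __name__ == "__main__":
    model = UnigramModel("the cat sat on the mat")
    # Text the model "expects" scores lower than text full of unseen words.
    print(model.perplexity("the cat sat"))
    print(model.perplexity("purple zebra dances"))
```

Text made of words the model has seen often gets a lower score than text made of unseen words, which is the sense in which low perplexity means "expected" text.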

I believe that writing composed entirely with AI using a standard model will be relatively easy to detect. But as models become more complex and advanced and adopt more “human” techniques, it will become progressively more difficult to detect small segments of AI text embedded within a larger piece of human writing.

If you would like to try a detector on the web, GPTZero (gptzero.me) is one option.
