Burstiness
What Burstiness Means
Burstiness captures the pattern of variation in sentence complexity throughout a text. The term comes from information theory, where "bursty" data shows irregular patterns rather than uniform distribution.
Human writing naturally exhibits high burstiness. A skilled writer might follow a long, complex sentence with a short, direct one for emphasis. Paragraphs shift between dense analytical passages and simple declarative statements. This variation is a hallmark of natural human composition.
AI-generated text tends to have lower burstiness. Language models produce text word-by-word based on probability distributions, resulting in more uniform sentence lengths and complexity levels. While AI can produce individual short or long sentences, the overall pattern across a passage tends to be smoother and less variable than human writing.
Burstiness as a Detection Signal
GPTZero popularized the use of burstiness as a detection signal, combining it with perplexity to classify text. The two metrics complement each other: perplexity measures how predictable individual word choices are, while burstiness measures how varied the overall structure is.
Like perplexity, burstiness works better on longer texts where there is enough sentence-to-sentence variation to measure. On short texts (under 200 words), burstiness measurements become unreliable.