
Burstiness

Burstiness measures how much sentence length and complexity vary throughout a text. Human writing naturally alternates between short, punchy sentences and longer, complex ones. AI-generated text tends to maintain more uniform sentence structures, resulting in lower burstiness. Several AI detectors, including GPTZero, use burstiness as a detection signal alongside perplexity.

What Burstiness Means

Burstiness captures the pattern of variation in sentence complexity throughout a text. The term is borrowed from fields such as network traffic analysis and statistics, where "bursty" data arrives in irregular clusters rather than at an even rate.

Human writing naturally exhibits high burstiness. A skilled writer might follow a long, complex sentence with a short, direct one for emphasis. Paragraphs shift between dense analytical passages and simple declarative statements. This variation is a hallmark of natural human composition.

AI-generated text tends to have lower burstiness. Language models produce text token by token, sampling each one from a probability distribution, which tends to yield more uniform sentence lengths and complexity levels. While AI can produce individual short or long sentences, the overall pattern across a passage tends to be smoother and less variable than human writing.
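One simple way to put a number on this idea is the coefficient of variation of sentence length: the standard deviation of words per sentence divided by the mean. The sketch below is an illustration only. The function names, the naive punctuation-based sentence splitter, and the choice of metric are assumptions made for this example, not the formula used by GPTZero or any other detector.

```python
import re
import statistics


def sentence_lengths(text: str) -> list[int]:
    """Return the word count of each sentence.

    Assumption: sentences end with '.', '!' or '?' followed by whitespace.
    Real text (abbreviations, quotes, lists) needs a proper sentence tokenizer.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    return [len(s.split()) for s in sentences if s.strip()]


def burstiness(text: str) -> float:
    """Coefficient of variation of sentence length (std dev / mean).

    Higher values mean sentence lengths vary more. Returns 0.0 when there
    are fewer than two sentences, since variation is undefined there.
    """
    lengths = sentence_lengths(text)
    if len(lengths) < 2:
        return 0.0
    return statistics.pstdev(lengths) / statistics.mean(lengths)


# Two hypothetical passages: one mixes short and long sentences, one does not.
varied = (
    "It rained. The storm that had been building over the hills all "
    "afternoon finally broke just as we reached the car. We ran."
)
uniform = (
    "The storm arrived in the late afternoon. The rain fell on the hills "
    "for hours. The wind blew across the empty car park."
)

print(sentence_lengths(varied))   # word counts per sentence
print(burstiness(varied) > burstiness(uniform))
```

The score only reflects sentence length. Variation in complexity, such as clause depth or vocabulary, would need additional features.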

Burstiness as a Detection Signal

GPTZero popularized the use of burstiness as a detection signal, combining it with perplexity to classify text. The two metrics complement each other: perplexity measures how predictable individual word choices are, while burstiness measures how varied the overall structure is.

Like perplexity, burstiness works better on longer texts, where there are enough sentences to measure variation between them. On short texts (under 200 words), burstiness measurements become unreliable because a handful of sentences yields too few data points.
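In practice, a measurement routine can decline to score text that is too short rather than return a misleading number. The following is a minimal sketch under stated assumptions: the function name is hypothetical, the 200-word cutoff is the figure quoted above, and the score is a simple sentence-length coefficient of variation chosen for illustration rather than any detector's actual metric.

```python
import re
import statistics
from typing import Optional

MIN_WORDS = 200  # cutoff quoted in the text; treat it as a tunable assumption


def burstiness_if_reliable(text: str, min_words: int = MIN_WORDS) -> Optional[float]:
    """Return a sentence-length burstiness score, or None if the text is too short."""
    if len(text.split()) < min_words:
        return None  # too few words for a stable estimate

    # Naive split on sentence-ending punctuation (an assumption of this sketch).
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    lengths = [len(s.split()) for s in sentences if s.strip()]
    if len(lengths) < 2:
        return None  # variation is undefined for a single sentence

    return statistics.pstdev(lengths) / statistics.mean(lengths)


print(burstiness_if_reliable("Too short. Really."))  # None
```

Returning None instead of 0.0 keeps "not enough text" distinct from "measured and found uniform", so downstream code cannot confuse the two cases.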