Failure Modes / Faithfulness / Summarization Distortion

Summarization Distortion

Compresses source material in a way that changes its meaning, emphasis, causal structure, uncertainty, or implications. Unlike Summarization Loss, the issue is not just omission of nuance but an altered meaning. Unlike Source Misrepresentation, it specifically occurs during summarization or compression.

Examples

FAIL

Causal direction rewired

The summary keeps the right entities and events but invents a causal link the source explicitly disclaims.

[… source article: "Layoffs were announced two months after the merger. Executives said the two were unrelated." …] User: Summarize this article in one sentence. Assistant: The merger led the company to lay off staff two months later.
FAIL

Hedges compressed away

Compression drops "preliminary," "small sample," and "the authors caution," so the summary asserts what the source only tentatively suggests.

[… source: "Preliminary results from a small sample suggest a possible link; the authors caution against drawing conclusions." …] User: Give me a one-line summary of the findings. Assistant: The study found a link between the two.
WARN

Drift across compression passes

Each summary-of-a-summary is a reasonable compression of its input, but the emphasis shifts a little each pass, and the final account no longer matches the original.

[… original meeting notes: budget approved, with a concern raised about Q3 staffing …] [… summary of summary, two compression passes later …] Assistant: Key takeaway from the meeting: serious staffing concerns are putting Q3 at risk.
PASS

Meaning-preserving compression

The summary is just as short but keeps the uncertainty and the source's own framing of what mattered.

[… source: "Preliminary results from a small sample suggest a possible link; the authors caution against drawing conclusions." …] User: Give me a one-line summary of the findings. Assistant: Early small-sample results hint at a possible link, but the authors say it's too soon to conclude anything.

Why it happens

01

Compression forces choices about what matters, and the model's salience judgments come from pretraining priors rather than from the document's own structure. What the model finds typical or interesting displaces what the source actually emphasized (Maynez et al., 2020, "On Faithfulness and Factuality in Abstractive Summarization").

02

Hedges, qualifiers, and uncertainty markers are the first casualties of compression. Dropping "may," "in some cases," or "the authors caution" shortens the text while silently strengthening its claims.

03

Abstractive training data teaches distortion directly. Widely used summarization corpora pair documents with reference summaries that add or alter information, so models learn that meaning shift is acceptable in this task (Maynez et al., 2020, "On Faithfulness and Factuality in Abstractive Summarization").

04

Common errors concentrate on relations rather than entities. Summaries keep the right people and numbers but rewire who did what to whom, swap causal direction, or merge separate findings into one claim (Pagnoni et al., 2021, "Understanding Factuality in Abstractive Summarization with FRANK").

05

Summarization metrics like ROUGE reward n-gram overlap, not preserved meaning, so models optimized or selected against them are never penalized for a fluent summary that subtly changes what the source says (Kryscinski et al., 2020, "Evaluating the Factual Consistency of Abstractive Text Summarization").

06

In agent pipelines, summaries are often produced over already-summarized context. Each compression pass re-decides emphasis, so small shifts compound across steps into a materially different account.

Detection Approaches

Categories of checks that can identify the issue. These are strategies, not specific implementations.

🔗

Entailment checking

Run an NLI model between each summary sentence and the source document. Rewired causal links and strengthened claims fail entailment even when every entity and number in the sentence is correct — which is precisely where overlap metrics like ROUGE see nothing wrong.

QA-based consistency checks

Generate questions from the summary, answer them from the source, and compare. A summary that says "the merger led to layoffs" yields a question whose source-grounded answer is "executives said the two were unrelated" — the distortion surfaces as an answer mismatch.

⚖️

LLM-as-judge evaluation

Run a judge that compares summary against source specifically for compression casualties — dropped hedges, shifted emphasis, merged findings, and causal direction — rather than general quality. In multi-pass pipelines, run it per compression step so small shifts are caught before they compound.

Mitigation Approaches

High-level reliability strategies that reduce how often this failure occurs.

📝

Instruction constraints

Make hedges and causal framing explicit survival requirements of the compression — "preserve uncertainty markers and stated non-causation even at the cost of length" — so "may suggest a possible link" doesn't shorten to "found a link" and a disclaimed coincidence doesn't become a cause.

🚪

Entailment gating

Gate each summary sentence on entailment by the source before it ships. Rewired causal links and strengthened claims fail the check even when every entity and number is right — exactly the errors overlap metrics like ROUGE never penalize.

🧷

Source-anchored compression

In multi-pass pipelines, summarize from the original document rather than chaining summaries of summaries. Each pass that re-decides emphasis from a compressed input adds drift, so anchor every compression to the source and carry forward the original's stated priorities.