Confirmation Bias

The tendency to seek out, interpret, favor, and recall information in a way that confirms or supports existing beliefs or hypotheses — while disproportionately discounting information that contradicts them. The most extensively studied bias in the decision-making literature.


Origin

Wason, P. C. (1960). “On the Failure to Eliminate Hypotheses in a Conceptual Task.” Quarterly Journal of Experimental Psychology, 12(3), 129–140. The “2-4-6 task” demonstrated that subjects trying to discover a rule consistently tested hypotheses in ways designed to confirm rather than disconfirm their current theory.

The term “confirmation bias” was popularized by Oswald, P. & Grosjean, S. (2004), but the phenomenon was described by Francis Bacon in 1620: “The human understanding when it has once adopted an opinion draws all things else to support and agree with it.”


Mechanism

Confirmation bias operates at multiple cognitive stages:

  • Search: preferentially seeking information sources likely to agree with the current belief
  • Interpretation: ambiguous evidence is read as supporting the favored hypothesis
  • Memory encoding: confirming evidence is remembered better than disconfirming evidence
  • Evaluation: confirming evidence receives less critical scrutiny than disconfirming evidence

Intelligence Analysis Context

Riley: SATs in Cybersecurity (2024) names confirmation bias explicitly as a target of SATs in cybersecurity contexts.

CIA Tradecraft Primer (2009) describes the mechanism under “mind-set” and “consistency bias” without using the term directly:

  • “Data that are in accordance with analysts’ unconscious mental models are more likely to be perceived and remembered than information that is at odds with them”
  • ACH is described as overcoming the tendency to “rely on evidence to support their preferred hypothesis, but which also is consistent with other explanations”

The Iraq WMD case is a canonical intelligence example: confirming evidence (Iraqi deception behavior) was interpreted as confirmation of WMD programs; disconfirming evidence (absence of positive WMD signatures) was not weighted appropriately.


Note on Sources

Contradiction flag: Riley: SATs in Cybersecurity (2024) names “confirmation bias” explicitly. CIA Tradecraft Primer (2009) describes the same phenomenon under different terminology (consistency bias, mind-set, relying on evidence that “supports their preferred hypothesis”). The primer’s ACH technique is explicitly designed to counter it. No substantive contradiction between sources — terminological difference only.


LLM Agentic Systems Context

Confirmation bias in LLMs is structurally embedded through training:

  • RLHF confirmation loop: RLHF training rewards responses that human raters prefer; raters tend to prefer responses confirming their existing beliefs; the model learns confirmation as a generalized strategy
  • User frame adoption: LLMs adopt the user’s framing as their working hypothesis, then marshal context and reasoning to support it
  • Adversarial robustness failure: when a user insists an incorrect answer is right, LLMs frequently “confirm” the user’s position even against clear contradicting evidence
  • Chain-of-thought confirmation: intermediate reasoning steps often function as progressive commitment to the initial interpretation rather than genuine exploration

See SATs for LLM Agents for SAT-based mitigations.


SATs That Control For This Bias

  • Analysis of Competing Hypotheses (ACH) — the primary technical counter; disconfirmation focus; forces evaluation of all evidence against all hypotheses simultaneously
  • Devil’s Advocacy — institutionally assigns the task of disconfirming the dominant view
  • Key Assumptions Check — surfaces the assumptions that confirmation bias protects
  • Team B — separates analysts so one team can’t contaminate the other with confirming reasoning chains

Key References

  • Wason, P. C. (1960). “On the Failure to Eliminate Hypotheses in a Conceptual Task.” Quarterly Journal of Experimental Psychology, 12(3), 129–140.
  • Nickerson, R. S. (1998). “Confirmation Bias: A Ubiquitous Phenomenon in Many Guises.” Review of General Psychology, 2(2), 175–220.
  • Kahneman, D. (2011). Thinking, Fast and Slow. Farrar, Straus and Giroux. (Chapter 4, Chapter 20)
  • Richards j. heuer jr.The Psychology of Intelligence Analysis (1999), Chapter 8

Empirical Evidence (LLM)

StudyFinding
Echterhoff et al. (BiasBuster, 2024)Direct measurement of confirmation-bias-style probing in LLM decision-making. Bias is present across commercial and open-source models.
RAND RR1408 (2016) cites Mitre 2004ACH reduced confirmation bias in human analysts — but only among non-professionals. Implication: ACH may help generalist LLMs more than domain-tuned ones. Direct caution for H1 experimental design.

See Also

Cognitive Bias | Motivated Reasoning | Groupthink | Mind-Set | Anchoring Bias