Saurav Kadavath
Researcher at Anthropic. Lead author of the foundational 2022 study on LLM self-knowledge, showing that large models can be queried for calibrated assessments of their own correctness.
Researcher at Anthropic. Lead author of the foundational 2022 study on LLM self-knowledge, showing that large models can be queried for calibrated assessments of their own correctness.