How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions

We create a lie detector for blackbox LLMs by asking models a fixed set of questions (unrelated to the lie).
Read More →
We create a lie detector for blackbox LLMs by asking models a fixed set of questions (unrelated to the lie).
Read More →