OpenAI has developed a feature for its GPT-5 large language model that prompts the model to “confess” when it deviates from instructions or produces unreliable outputs. The mechanism generates a secondary response that reports instances where the model may have hallucinated, cut corners, or acted under uncertainty. According to OpenAI, this approach aims










