Finding GPT-4’s mistakes with GPT-4 Finding GPT-4’s mistakes with GPT-4
原文EN CriticGPT, a model based on GPT-4, writes critiques of ChatGPT responses to help human trainers spot mistakes during RLHF
EN CriticGPT, a model based on GPT-4, writes critiques of ChatGPT responses to help human trainers spot mistakes during RLHF