Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.
Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.
Lost your password? Please enter your email address. You will receive a link and will create a new password via email.
Please briefly explain why you feel this question should be reported.
Please briefly explain why you feel this answer should be reported.
Please briefly explain why you feel this user should be reported.
Questions | Answers | Discussions | Knowledge sharing | Communities & more.
OpenAI published a study about a new artificial intelligence (AI) model on 27th June 2024 that can catch GPT-4’s mistakes in code generation. The AI firm stated that the new chatbot was trained using the reinforcement learning from human feedback (RLHF) framework and was powered by one of the GPT-4 models. The under-development chatbot was designed to improve the quality of the AI-generated code that users get from the large language models. The AI firm shared details of the new CriticGPT model in a blog post, stating that it was based on GPT-4 and designed to identify errors in code generated by ChatGPT. “We found that when people get help from CriticGPT to review ChatGPT code they outperform those without help 60 percent of the time,” the company claims. The model was developed using the RLHF framework and the findings have been published in a paper. RLHF is a machine learning technique that combines machine output with humans to train AI systems. In such a system, human evaluators provide feedback to the AI’s performance. This is used to adjust and improve the model’s behaviour. Humans who provide feedback to the AI are called AI trainers.This model is unlikely to be made public as it is designed to help OpenAI better understand training techniques that can generate higher quality outputs. If CriticGPT does make it to public, it is believed to be integrated within ChatGPT.