This AI paper from Meta and NYU presents self-rewarding language models, which align themselves by judging their own generations and then training on them.
To advance the development of superhuman agents, future models will need superior feedback that provides an effective training signal. ...