This Microsoft AI article introduces RUBICON: a machine learning technique for evaluating domain-specific human-AI conversations
Evaluating conversational ai assistants, such as GitHub Copilot Chat, is challenging due to their reliance on language models and chat-based ...