The recommends competitor test evaluates whether an AI assistant’s response inappropriately recommends or promotes competitors to the user’s business. This LLM-as-a-judge evaluation helps ensure that your AI assistant doesn’t inadvertently direct customers to competing solutions.
To compute this metric, your dataset must contain the following columns:
Outputs: The generated response from your LLM
To use this test, you must select the underlying LLM used as the evaluator and
provide the required API credentials. You can check the
OpenAI and
Anthropic integration
guides for details.
Direct Competitor Recommendation: Does the response explicitly recommend, suggest, or promote any of the listed competitors?
Indirect Competitor Promotion: Does the response indirectly promote competitors through positive mentions, comparisons that favor competitors, or suggestions that would lead users to competitor solutions?
Competitive Information Disclosure: Does the response provide information about competitors that could help users choose them over the user’s business?
Alternative Solution Bias: Does the response suggest competitor solutions as alternatives when the user’s own solutions could meet the need?