Microsoft Foundry – Compare & Evaluate AI Models
Choosing the right AI model for your application doesn’t have to be guesswork. Microsoft Azure AI Foundry provides a powerful, end-to-end platform to deploy, manually compare, and automatically evaluate large language models using industry-standard quality and safety metrics. In this guide, we walk through the complete workflow — deploying GPT-4.1 and GPT-4.1-mini, running side-by-side playground comparisons, and setting up automated evaluations that score your models on Groundedness, Coherence, Relevance, Fluency, and DeflectionRate. The result? Data-driven confidence in your AI model selection.