When AI evaluates itself, the results are better
Quick answers are not enough when it comes to strategic decisions. Copilot Researcher’s new ‘Critique’ approach demonstrates how AI delivers robust analyses through self-assessment.
Do you have any questions about Microsoft Copilot?
Are you interested in the Critique approach in Copilot Researcher, or do you have other questions about AI and Copilot? Speak to the experts at Telekom and get an informed assessment tailored to your specific situation. Take advantage of our free introductory consultation on Copilot and AI.
From rapid output to robust analysis
In recent years, artificial intelligence has provided businesses with one thing above all else: speed. Content is produced more quickly, summaries are available instantly, and answers are provided without delay. This is helpful for operational tasks. However, it is often not reliable enough for strategic decisions.
Copilot Researcher itself is not a new product. What is new is the approach Microsoft is now taking to significantly improve the quality of its analysis. With the introduction of the Critique mechanism, Copilot Researcher is making a clear shift in direction: away from quick, single-model answers, towards a verified, structured analysis using a multi-model approach.
The new approach: critique rather than a model answer
In automatic mode, Researcher no longer uses a single AI model. The tasks are deliberately separated:
- An OpenAI model handles planning, research and content synthesis
- A second model specifically checks the result for logical consistency, the quality of the reasoning and the reliability of the sources
This 'Critique' step applies a tried-and-tested business principle to the world of AI: results are actively scrutinised before they are used. The focus moves away from the notion of a single 'correct' answer and towards a genuinely robust analysis.
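The pattern behind this division of labour can be sketched in a few lines of code. The sketch below is purely illustrative: Microsoft has not published Copilot Researcher's internal orchestration, so the function names, the critique criteria and the revision loop here are all assumptions, standing in for a generator model, a reviewing model and a bounded revise-until-clean loop.

```python
# Hypothetical sketch of a generate-then-critique loop.
# The model calls are stand-ins; Copilot Researcher's actual
# orchestration is not public.

def generate_answer(question: str) -> str:
    # Stand-in for the primary model: planning, research and synthesis.
    return f"Draft answer to: {question}"

def critique(answer: str) -> list[str]:
    # Stand-in for the reviewing model: returns a list of objections
    # (logical gaps, weak sourcing). An empty list means the answer passes.
    issues = []
    if "source" not in answer.lower():
        issues.append("No sources cited")
    return issues

def revise(answer: str, issues: list[str]) -> str:
    # Stand-in for a revision step that addresses each objection.
    return answer + " [revised: " + "; ".join(issues) + "]"

def answer_with_critique(question: str, max_rounds: int = 2) -> str:
    # Generate a draft, then let the critic challenge it until it
    # passes review or the round budget is exhausted.
    answer = generate_answer(question)
    for _ in range(max_rounds):
        issues = critique(answer)
        if not issues:
            break
        answer = revise(answer, issues)
    return answer
```

The key design point the sketch captures is that the critic never writes the answer itself: it only raises objections, which keeps the two roles independent and makes each revision traceable to a named issue.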
Why is this relevant for businesses?
Companies make decisions under uncertainty, and this is precisely where traditional AI chatbots have reached their limits. Multi-model approaches are fundamentally changing this:
- Risks and contradictions become apparent at an earlier stage
- Chains of reasoning become more consistent
- Source quality is systematically assessed
- Decisions are easier to explain and justify
This represents a significant step forward for strategy, management, security, compliance and transformation programmes. AI is evolving from a mere provider of information into a supportive analytical tool.
An overview of the regulatory framework in the DACH region and the EU
One point deserves particular attention. The Critique mechanism currently also uses models from Anthropic. As things stand, these are operated outside the EU. This is relevant for regulated companies in the DACH region.
The current assessment is as follows:
- The primary processing is carried out using OpenAI models within the Microsoft architecture
- Governance, access control and orchestration are entirely managed by Microsoft
- Company data remains within the established Microsoft security and compliance framework
Nevertheless, automatic mode should be used judiciously and assessed against one's own regulatory requirements.
Conclusion
The real value lies not in the fact that Copilot Researcher is new, but in how it is evolving. The multi-model approach with Critique shows the direction in which enterprise AI is heading: away from speed as an end in itself, and towards quality, transparency and better decision support. When AI checks itself, the results are better.
Author: Julien Cléro
Julien Cléro is a sought-after expert and speaker specialising in Microsoft AI and security. With 25 years of professional experience in the IT and telecommunications industry, Julien brings a wealth of knowledge and expertise to the table. Benefit from his extensive experience, including major projects with top clients.