Q: Will today’s evaluation methods still be relevant in 5-10 years given how fast AI is changing?

LLMs

evals

faq

faq-individual

Teams will still need error analysis as models and prompt techniques change.

Authors

Hamel Husain

Shreya Shankar

Published

July 27, 2025

Modified

July 27, 2025

Yes. Even with perfect models, you still need to verify they’re solving the right problem. The need for systematic error analysis, domain-specific testing, and monitoring will still be important.

Today’s prompt engineering tricks might become obsolete, but you’ll still need to understand failure modes. Additionally, a LLM cannot read your mind, and research shows that people need to observe the LLM’s behavior in order to properly externalize their requirements.

For deeper perspective on this debate, see these two viewpoints: “The model is the product” versus “The model is NOT the product”.

“The model is the product”:

“The model is NOT the product”:

↩︎ Back to main FAQ

This article is part of our AI Evals FAQ, a collection of common questions (and answers) about LLM evaluation. View all FAQs or return to the homepage.