What is evaluation — CCA-F Exam Prep
L1.30|What is evaluation
1/12
5 days before launch. The AI customer service bot passes all internal tests.
Day 1 live: it tells a customer their order shipped when it hasn't. Day 2: it offers a 90% discount that doesn't exist. Day 3: pulled offline. The team scrambles.
The bot passed every test. Every scripted scenario. Every QA check. The tests were right. But the tests only covered the happy path.
Production is not the happy path. The evaluation was wrong.
