evaluation

55 articles

Testing AI quality beyond accuracy: rubrics, judges, and alignment evaluation. BLEU scores and accuracy metrics don't tell you whether your AI is actually helping users. These posts cover what to measure instead.
