A listing of all my blog posts.
FAQ from our course on AI Evals.
Evaluation methods, data-driven improvement, and experimentation techniques from 30+ production implementations.
What I’ve seen work and what doesn’t.
A step-by-step guide with lessons from 30+ AI implementations.
A free survey course on LLMs, taught by practitioners.
Quickly detect a common bug in AI products using an automated technique.
How to construct domain-specific LLM evaluation systems.
A reaction to a recent trend of disillusionment with fine-tuning.
Quickly understand inscrutable LLM frameworks by intercepting API calls.
Best practices for debugging axolotl with an example VSCode config.
Like Heroku, but you own it.
Learning K8s can give you an unreasonable advantage as an MLE and unblock your team.