Hello, I’m Hamel Husain. I’m a machine learning engineer who loves building machine learning infrastructure and tools 👷🏼‍♂️. I lead or contribute to many popular open-source machine learning projects. Furthermore, I have extensive experience (20+ years) as a machine learning engineer across various industries, including large tech companies like Airbnb and GitHub.

I’m currently an independent consultant helping companies operationalize LLMs. At GitHub, I lead CodeSearchNet, a large language model for semantic search that was a precursor to CoPilot, a large language model used by millions of developers.

💼 Get In Touch

Do you need help operationalizing ML or large language models?

I’m open to consulting work and other forms of advisory. Email me at hamel@parlance-labs.com if you’d like to chat!


📮 Feed

A curated collection of blog posts and shorter form notes.

Date Title
3/27/24 Is Fine-Tuning Still Valuable?
2/14/24 Fuck You, Show Me The Prompt.
1/11/24 How To Debug Axolotl
1/9/24 Dokku: my favorite personal serverless platform
12/17/23 Tokenization Gotchas
11/15/23 Tools for curating LLM Data
10/28/23 vLLM & Large Models
10/15/23 Optimizing LLM latency
5/30/23 On commercializing nbdev
1/16/23 Why Should ML Engineers Learn Kubernetes?
7/28/22 nbdev + Quarto: A new secret weapon for productivity
2/9/22 Notebooks in production with Metaflow
12/18/20 ghapi, a new third-party Python client for the GitHub API
11/20/20 Nbdev: A literate programming environment that democratizes software engineering best practices
9/1/20 fastcore: An Underrated Python Library
9/1/20 Data Science Meets Devops: MLOps with Jupyter, Git, & Kubernetes
3/6/20 GitHub Actions: Providing Data Scientists With New Superpowers.
2/21/20 Introducing fastpages, An easy to use blogging platform with extra features for Jupyter Notebooks.
2/5/20 Python Concurrency: The Tricky Bits
9/20/19 CodeSearchNet Challenge: Evaluating the State of Semantic Code Search
4/10/19 How to Automate Tasks on GitHub With Machine Learning for Fun and Profit
5/29/18 How To Create Natural Language Semantic Search for Arbitrary Objects With Deep Learning
1/18/18 How To Create Magical Data Products Using Sequence-to-Sequence Models
12/16/17 How Docker Can Help You Become A More Effective Data Scientist
5/10/17 Automated Machine Learning — A Paradigm Shift That Accelerates Data Scientist Productivity @ Airbnb
No matching items

📬 Subscribe

Subscribe via RSS.