Blog
Notes
Work With Me
OSS
Talks
📚 Hamel’s Notes
Prompt engineering
📚 Hamel's Notes
Python Concurrency
CUDA Version Management
How to learn
pandoc filters
Docker
dbt
programming languages
Video Editing
LLMs
Inference
Optimizing latency
Max Inference Engine
vLLM & large models
OpenAI
Function prompts
Function Calling
Llama-3 Func Calling
Fine-tuning
Dataset Basics
LangChain
DocumentLoaders
Estimating vRAM
Curating LLM data
Tokenization Gotchas
Template-free axolotl
Open Office Hours
Multi-Turn Chat Evals
ML Serving
TF Serving
Basics
GPUs & Batching
TorchServe
Basics
Serving Your Own Model
FastAPI
K8s
Basics
Secrets
Storage
Storage Basics
Dynamic Provisioning
Scaling
ReplicaSets
Scaling
StatefulSet
Jobs & CronJobs
Rollouts
Multi-Container Pods
Multi-Container Pods
Ambassador Sidecars
Restart Conditions
Sharing Processes in MC Pods
Helm
Helm Intro
Creating Helm Charts
Helm Upgrades & Rollbacks
Testing With Helm
Developer tips
Pod restart vs. replacement
Probes
Resource Limits
Requesting resources
Logging
Monitoring
Ingress
Cluster Components
Security
Network Security
Securing Containers
Webhooks
Updating a K8s Cluster
RBAC
Workload Placement
Auto Scaling
Preemption
Random TILs
Open Questions
fastai
Fundamentals
Image Classification
Data
Batch Predictions
Linux
Cheatsheet
Cookbook
Misc Utilities
OSX Shell Tips
Processes, Permissions and Moving Data
GitHub Actions
ocotokit.js
Resources
Prompt engineering
Course
Guidelines for Prompting
Iterative Prompt Development
Summarizing
Inferring
Transforming
Expanding
The Chat Format
Web Scraping
Browser requests to code
Transcribe & Diarize Videos
FastHTML
Concurrency For Starlette Apps (e.g FastAPI / FastHTML)
Quarto
Syntax Highlighting
Listings from data
Merge listings
Jupyter
Launch Jupyter on a remote server
Fix Jupyter CUDA cache
Remote Browser For Jupyter
My Jupyter Shortcuts
📚 Hamel’s Notes
Prompt engineering
Prompt engineering
Prompt engineering notes
Title
Description
Course
Notebooks from the…
No matching items