These are a set of minimal notes and code snippets. I wrote these notes for myself for quick reference. These notes are a work in progress.
topics
ssm
These notes focus on my use cases of state space models. These are not the linear deterministic kind of SSM used in big bad LLMs like Mamba, but instead Bayesian inference in stochastic (non)linear dynamical models.
- state space model
- linear gaussian ssm
- nonlinear gaussian ssm
- kalman filtering and smoothing
- ssm in dynamax
- online learning using ssm
- structural time series models
- state space gaussian process
- ssm resources
llm
Read Ravin’s GenAI guidebook or Vicki Boykis’ normcore-llm.md instead.
- language modelling
- transformer architecture
- multi-headed self-attention
- llm pre-training
- llm fine-tuning
- word2vec
- retrieval augmented generation
research engineering
File | date modified | date created |
---|---|---|
infrastructure as code | 6:56 PM - December 14, 2024 | 8:47 AM - May 03, 2024 |
mac setup | 7:23 PM - November 18, 2024 | 7:59 AM - January 30, 2024 |
beartype | 5:08 PM - November 10, 2024 | 9:51 PM - October 12, 2024 |
pandera | 10:43 PM - October 12, 2024 | 10:32 PM - October 12, 2024 |
docker | 9:51 PM - October 12, 2024 | 10:10 AM - February 08, 2024 |
d3.js | 9:17 AM - October 02, 2024 | 6:23 AM - September 25, 2024 |
observable plot | 9:09 AM - October 02, 2024 | 8:42 AM - October 02, 2024 |
python versioning, virtual environments and packaging | 3:54 PM - September 09, 2024 | 12:00 AM - January 30, 2024 |
serverless functions | 10:58 AM - September 04, 2024 | 10:01 AM - September 04, 2024 |
jax.vmap | 9:17 AM - August 30, 2024 | 8:30 AM - February 04, 2024 |
splitting keys in jax | 2:12 AM - June 21, 2024 | 1:59 AM - June 21, 2024 |
python generators | 1:59 AM - June 21, 2024 | 1:42 AM - June 21, 2024 |
pytensor | 10:56 AM - June 02, 2024 | 3:35 AM - June 01, 2024 |
mlflow | 8:58 AM - May 30, 2024 | 7:13 AM - May 30, 2024 |
tqdm to replace training loop | 7:20 AM - May 03, 2024 | 7:16 AM - May 03, 2024 |
Typer to replace argparse | 3:57 AM - April 13, 2024 | 3:42 AM - April 13, 2024 |
populate a PostgreSQL database with SQLModel | 3:40 AM - April 13, 2024 | 10:03 AM - March 03, 2024 |
sqlmodel (sqlalchemy) cascade | 3:39 AM - April 13, 2024 | 3:14 AM - April 13, 2024 |
mocker | 9:03 AM - March 24, 2024 | 1:51 PM - March 22, 2024 |
shorten FastAPI app.py with APIRouter | 1:38 PM - March 11, 2024 | 2:23 PM - March 07, 2024 |
other
File | date modified | date created |
---|---|---|
index | 6:56 PM - December 14, 2024 | 2:57 AM - January 31, 2024 |
probability integral transform | 4:03 PM - December 14, 2024 | 1:22 PM - December 14, 2024 |
archive | 2:33 PM - December 14, 2024 | 3:07 AM - January 31, 2024 |
copula | 2:32 PM - December 14, 2024 | 12:01 PM - December 14, 2024 |
automatic differentiation | 6:02 PM - December 01, 2024 | 5:45 AM - August 03, 2024 |
treescope | 3:06 PM - September 01, 2024 | 6:37 AM - August 31, 2024 |
neural networks in jax | 4:54 AM - September 01, 2024 | 7:02 AM - August 29, 2024 |
linear model as a neural network | 11:31 AM - August 29, 2024 | 5:32 AM - February 04, 2024 |
fixed point iteration | 7:14 AM - August 03, 2024 | 3:43 AM - July 30, 2024 |
implicit function theorem | 7:09 AM - August 03, 2024 | 3:46 AM - August 03, 2024 |
newton’s method | 7:07 AM - August 03, 2024 | 8:43 AM - July 29, 2024 |
explicit and implicit layers | 6:55 AM - August 03, 2024 | 3:48 AM - August 03, 2024 |
INLA | 9:40 AM - May 10, 2024 | 9:22 AM - May 03, 2024 |
pg_dump | 8:58 AM - March 24, 2024 | 1:47 PM - March 22, 2024 |
getting these notes online | 1:53 PM - February 25, 2024 | 12:53 PM - February 25, 2024 |
kernel trick | 11:58 AM - February 25, 2024 | 12:00 PM - February 14, 2024 |