this is a space to document and distill my thoughts on publications and articles within the research community. literature reviews are personal and messy; blog posts are intended for a wider audience (marked with a dot).

i write for my own edification, and posts are subject to my interest in the topic at the time of writing.

things i want to do

September 28, 2024

research

- abliteration; probing; what makes a model good?
- entropix code
- more open source contributions. but what projects? i want to write c++, torch, and jax.
- cuda; gpu programming.
- jax shard_map. xmap and pmap which i've used extensively are deprecated, let's learn modern spmd
- implement deepseek mla. why hasn't this caught on yet?
- evaluate contextual retrieval on mindex. use local oss model for context. prompt caching on local models?
- colpali to handle pdfs in mindex.

life

- ~~plan a ski trip to...