this is a space to document and distill my thoughts on publications and articles within the research community. literature reviews are personal and messy; blog posts are intended for a wider audience (marked with a dot).
i write for my own edification, and posts are subject to my own interest in the topic at the time of writing.
September 28, 2024
research
- abliteration; probing; what makes a model good?
- entropix code
- more open source contributions. but what projects? i want to write c++, torch, and jax.
- cuda; gpu programming.
- jax
- implement deepseek mla. why hasn't this caught on yet?
- evaluate contextual retrieval on mindex. use local oss model for context. prompt caching on local models?
- colpali to handle pdfs in mindex.

life
- ~~plan a ski trip to...

shard_map. xmap and pmap, which i've used extensively, are deprecated; let's learn modern spmd
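
to make "modern spmd" concrete, here is a minimal sketch of the kind of thing shard_map replaces pmap for: a global sum where each device reduces its own block and a `psum` combines the partial results. the mesh axis name `"i"`, the helper name `local_sum_then_psum`, and the array size are my own choices for illustration, and the import fallback is an assumption about which jax version is installed (newer releases expose `jax.shard_map`, older ones only `jax.experimental.shard_map`). it runs on a single cpu device too, where the mesh just has one entry.

```python
import jax
import jax.numpy as jnp
import numpy as np
from jax.sharding import Mesh, PartitionSpec as P

# assumption: newer jax exposes shard_map at the top level,
# older versions only under jax.experimental.
try:
    from jax import shard_map
except ImportError:
    from jax.experimental.shard_map import shard_map

# 1-d mesh over whatever devices are available; "i" is an arbitrary axis name.
devices = np.array(jax.devices())
mesh = Mesh(devices, axis_names=("i",))


def local_sum_then_psum(x_block):
    # x_block is this device's shard of the input (per-device view).
    partial = jnp.sum(x_block, keepdims=True)  # shape (1,)
    # all-reduce across the "i" axis so every device holds the global sum.
    return jax.lax.psum(partial, axis_name="i")


# in_specs=P("i"): split the input's leading axis across the mesh.
# out_specs=P(): the output is replicated, so no concatenation happens.
global_sum = shard_map(
    local_sum_then_psum, mesh=mesh, in_specs=P("i"), out_specs=P()
)

# length is a multiple of the device count so the shards are even.
x = jnp.arange(8.0 * len(devices))
total = global_sum(x)
print(total)
```

the main difference from pmap as i understand it: the function is written against per-device blocks, but the sharding is declared explicitly through the mesh and the in/out specs instead of being implied by a leading device axis on the array.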