Leon Ericsson | Learning...

about me

hi! i'm leon, i'm a research engineer living in stockholm, lived here for almost 2 years now with my sambo. big fan of ml research, and just understanding how things work. otherwise i enjoy eating food, making food, traveling and working out.

i've got a background in software engineering, with a master's in compsci. but today i'm focused on language models, reinforcement learning, mech interp., and reasoning. although i have limited experience with it i'm also very keen on health/medicine as a field of application for the aforementioned fields. if you're curious about my current fixations, skim the blog.

i like making things

healthy mix of research, experiments, and scratch projects.

paper implementations in omni
this website, including my blog
mindex, a local semantic search engine for your mind index
alphax, scalable + fast alphazero in jax. facilitates spmd, simd, jit-composition etc
exploring limitations of llm-as-a-judge
llmcontext (fork of original), a context window pressure tester of open llms
a framework for super-resolution of clinical 4d flow mri. paper.
alphafour , minimal alphazero for connect four in pure pytorch

i like open source

these are my favorite, some of which i contribute to.

inference engine vllm
finetuning with unsloth
finetuning with trl (several PRs)
ggerganov's llama.cpp (#4484)
mlx is an array framework for apple silicon (#202 #237 #456 #19 #276)
tensor puzzles and cuda puzzles from sasha rush
community driven ohmyzsh
jax: the lovechild of autograd and xla

i like reading papers

selection of lesser known ones that stand out to me.

ideas

things / projects i'm thinking about lately.

visual reasoning and visual spatial reasoning. non-text cot?
long-form code generation benchmark.
long-context multi-needle reasoning benchmark.