I'm susun — LLM inference engineer, formerly database storage engineer. Writing about systems, inference, and engineering. More about me.
Posts
- pegainfer (5): One Size Can't Fit All
- How I Vibe Code
- Pegaflow (1): RDMA From Zero to Data Transfer
- pegainfer (4): From Pre-allocation to Graph Replay
- pegainfer (3): From Launch Overhead to CUDA Graph (Part 1)
- pegainfer (2): Adding a Sampler to the Inference Engine
- pegainfer: A Native Rust Inference Engine from Scratch