F3D / AI Infrastructure
Building storage systems for AI-scale workloads.
Distributed storage, cloud-native compute, vector search, and infra automation.
NEXT MiniMax / LLM Distributed Storage
STACK JuiceFS / TiKV / Spark / ANNS
MODE Performance / Reliability / Cost
吴松林,基础设施研发。
我做 AI-scale systems 背后的存储、计算和调度:分布式文件系统、Spark on K8s、向量检索、GPU infra,以及把成本和稳定性做成系统能力。
Next stop: MiniMax · LLM Distributed Storage.
Focus
- Distributed storage for LLM workloads
- Cloud-native compute and Spark on Kubernetes
- Metadata systems, CSI, object storage, FinOps
- Disk ANN search, I/O pipeline, SIMD, cache design
- Agentic tooling for infra diagnosis and migration
Signals
- MiniMax:LLM 分布式存储方向。
- Trip.com:Spark on K8s、JuiceFS/TiKV、AI storage、GPU infra、cost governance。
- Milvus:Starling disk-based vector search.
- Open source:JuiceFS / TiKV 相关性能与稳定性优化贡献。
- Education:Tongji University · M.S. Computer Science.
Stack
C++ Go Python Kubernetes Spark JuiceFS TiKV Milvus ANNS Prometheus Grafana pprof Codex
Writing
这里记录 infra notes:问题、判断、实验、复盘。
不写流水账,不写万能方法论。只写我真的拆过、跑过、踩过、重新设计过的系统问题。
Contact
GitHub 在页面底部。欢迎聊 storage、AI infra、systems performance。