Tags - systems - Jifeng Wu's Personal Website

06-07

vLLM Platform System

05-19

Local CUDA vLLM Setup for Python-Only Development Using a Precompiled Wheel

05-14

Compile NEFF Executables from NKI Kernels

05-13

What is S3?

05-12

vLLM Internals — PagedAttention and Custom Accelerator Compilation

05-03

Exporting Compute Graphs, LLM Shape Dynamics, and Serving Runtimes

05-03

Schedules in Machine Learning Computation: What They Are and Who Needs to Know About Them

04-26

Important Locations on Jailbroken iOS

04-26

Learning MLIR and HLO by Building a Tiny StableHLO-to-LLVM IR Compiler

04-19

Using MLIR as a C++ Library with a Relocatable Install