Hugging Face just dropped Kernel Builder, a full-stack toolchain for building, versioning, and shipping custom CUDA kernels as native PyTorch ops.
Kernels are architecture-aware, semantically versioned, and pullable straight from the Hub. Kernel Builder tracks changes with lockfiles and supports Docker deploys out of the box.
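
To make "semantically versioned" and "lockfiles" concrete, here is a rough Python sketch of the general idea: pick the newest version inside an allowed range, then pin it with a content hash. This is not Kernel Builder's actual resolver or lockfile format. The function names, the `example-org/example-kernel` repo name, and the entry fields are all made up for illustration.

```python
import hashlib
import json

def parse_version(text):
    """Turn '1.4.2' into (1, 4, 2) so versions compare numerically, not as strings."""
    return tuple(int(part) for part in text.split("."))

def resolve(available, minimum, below):
    """Return the highest version v with minimum <= v < below, or None if nothing fits."""
    lo, hi = parse_version(minimum), parse_version(below)
    matches = [v for v in available if lo <= parse_version(v) < hi]
    return max(matches, key=parse_version) if matches else None

def lock_entry(name, version, payload):
    """Pin the exact version plus a content hash (hypothetical lockfile fields)."""
    return {
        "name": name,
        "version": version,
        "sha256": hashlib.sha256(payload).hexdigest(),
    }

# Hypothetical published versions of one kernel.
available = ["0.9.0", "1.0.0", "1.2.0", "1.10.1", "2.0.0"]

# Accept any 1.x release: at least 1.0.0, strictly below 2.0.0.
chosen = resolve(available, "1.0.0", "2.0.0")

# Placeholder bytes stand in for the compiled kernel artifact.
entry = lock_entry("example-org/example-kernel", chosen, b"compiled kernel bytes")
print(json.dumps(entry, indent=2))
```

Comparing versions as integer tuples matters here: a plain string comparison would rank `1.2.0` above `1.10.1`. Storing the hash next to the pinned version is what lets a later install detect that an artifact has changed.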