NVIDIA Dynamo Snapshot: Fast Startup for Inference Workloads on Kubernetes
NVIDIA introduces Dynamo Snapshot, a checkpoint/restore system for Kubernetes that uses CRIU and cuda-checkpoint to significantly reduce cold-start latency for single-GPU AI inference workloads.