Join us

ContentUpdates and recent posts about kueue..
Link
@faun shared a link, 7 months, 2 weeks ago
FAUN.dev()

Multi-Cluster Orchestrator for cross-region Kubernetes workloads

Multi-Cluster Orchestratorslices through multi-cluster deployments like a hot knife through butter. Ideal for AI/ML champs and GitOps mavens itching for optimized resource allocation and snappier deployment workflows... read more  

Multi-Cluster Orchestrator for cross-region Kubernetes workloads
Link
@faun shared a link, 7 months, 2 weeks ago
FAUN.dev()

How to add Kubernetes-powered leader election to your Go apps

k8s.io/client-go/tools/leaderelectionturns any Kubernetes app into a leader with a snap of your fingers. But there's a catch—brace yourself for possible twin leaders. It's the Wild West of leadership where balance hangs by a thread... read more  

Link
@faun shared a link, 7 months, 2 weeks ago
FAUN.dev()

KubeCon Europe: How Kubernetes Handles 6G, LLMs and Deep Space

Kuberneteswrangles with the chaos of infrastructure stretching across continents, but the community's already crafting clever fixes... read more  

Link
@faun shared a link, 7 months, 2 weeks ago
FAUN.dev()

Cold-Starting LLMs on Kubernetes in Under 30 Seconds

RedesigningLLM cold start strategy sliced launch times from 10 minutes tounder 30 secondsby exploitingFUSEandobject storagefor on-demand GPU loading—a revelation for Kubernetes scaling... read more  

Cold-Starting LLMs on Kubernetes in Under 30 Seconds
Link
@faun shared a link, 7 months, 2 weeks ago
FAUN.dev()

The Next Evolution of DigitalOcean Kubernetes: Introducing Features that Unlock Superior Scalability for Growing Businesses

DigitalOceanjust cranked up the cluster game to a cool1,000nodes, injectedeBPF-based routingfor a performance boost, and rolled outManaged Ciliumto keep things rock steady. Scale orchestration? Now it's on rocket fuel... read more  

The Next Evolution of DigitalOcean Kubernetes: Introducing Features that Unlock Superior Scalability for Growing Businesses
Link
@faun shared a link, 7 months, 2 weeks ago
FAUN.dev()

Introducing kube-scheduler-simulator

kube-scheduler-simulatorlets you peek into the mind of Kubernetes’ scheduler. You can poke and prod at scheduling decisions without risking a real cluster meltdown. Add custom plugins like a pro, no sweat. Forget blindsiding surprises. The simulator mirrors production with eerie accuracy—sync resour.. read more  

Introducing kube-scheduler-simulator
Link
@faun shared a link, 7 months, 2 weeks ago
FAUN.dev()

CKA Prep: CKA Exam Overview and Preparation Strategy

CKA exam:Juggle up to 6 Kubernetes clusters like a pro. Command rolling updates, Ingress, and persistent storage with flair. Imperative commands? Your secret weapon to snatch victory... read more  

Link
@faun shared a link, 7 months, 2 weeks ago
FAUN.dev()

Google Cloud unveils AI-focused updates to Kubernetes Engine

Meet theCluster Director for GKE. This beast masters GPU/TPU clusters seamlessly, herding them with Kubernetes APIs like a rodeo star. Meanwhile, theGKE Inference Gatewayramps up AI model performance. It's like magic but real: Serving costs tumble by up to30%. Tail latency? Chopped by up to60%... read more  

Link
@faun shared a link, 7 months, 2 weeks ago
FAUN.dev()

Optimize Gemma 3 Inference: vLLM on GKE 🏎️💨

GKE Autopilot's GPUmeans business—AI inference tasks don’t stand a chance. Just two arguments and, bam, you’ve unleashed NVIDIA's beastly Gemma 3 27B model, which chugs a massive46.4GB VRAM. ⚡️ Meanwhile, vLLM squeezes the models with bf16 precision, though optimization requires wrestling with algor.. read more  

Optimize Gemma 3 Inference: vLLM on GKE 🏎️💨
Link
@faun shared a link, 7 months, 2 weeks ago
FAUN.dev()

Kubernetes 1.33 – What you need to know

Kubernetes 1.33 shakes things up with game-changing updates.LIST streaming encodingtrims down API Server memory like a chef with a sharp knife. Deliberate deletion orders lock down security tighter than a drum. And get this:in-place updatesfor Pod resources ditch those annoying restarts! Finally, us.. read more  

Kubernetes 1.33 – What you need to know
Kueue is a Kubernetes-native job queueing and workload management system designed for large-scale, mixed compute environments such as AI/ML training, batch workloads, and HPC workflows. Instead of scheduling individual Pods, Kueue operates at the job level, deciding when a job should run based on resource quotas, fair-sharing policies, cluster availability, and workload priorities.

Kueue integrates tightly with Kubernetes, working alongside the default scheduler rather than replacing it. It provides features such as all-or-nothing (gang) admission, workload preemption, quota-based sharing across teams or tenants, and support for advanced frameworks like JobSet and Ray. Its goal is to help Kubernetes clusters run efficiently under heavy load while ensuring that critical, latency-sensitive, or large training jobs receive the resources they need without starving lower-priority workloads.