Aaron Tokelove (@tokelove) on FAUN.dev()

Content

Posts from @tokelove..

Link

@kaptain shared a link, 2 hours ago

FAUN.dev()

How GKE Inference Gateway improved latency for Vertex AI

Vertex AI now plays nice withGKE Inference Gateway, hooking into the Kubernetes Gateway API to manage serious generative AI workloads. What’s new:load-awareandcontent-aware routing. It pulls from Prometheus metrics and leverages KV cache context to keep latency low and throughput high - exactly what.. read more

How GKE Inference Gateway improved latency for Vertex AI

Link

@kaptain shared a link, 2 hours ago

FAUN.dev()

How Kubernetes Learned to Resize Pods Without Restarting Them

Kubernetes v1.35 introduces in-place Pod resizing, allowing dynamic adjustments to CPU and memory limits without restarting containers. This feature addresses the operational gap of vertical scaling in Kubernetes by maintaining the same Pod UID and workload identity during resizing. With this breakt.. read more

How Kubernetes Learned to Resize Pods Without Restarting Them

Link

@kala shared a link, 2 hours ago

FAUN.dev()

Nathan Lambert: Open Models Will Never Catch Up

Open models will be the engine for the next ten years of AI research, according to Nathan Lambert, a research scientist at AI2. He explains that while open models may not catch up with closed ones due to fewer resources, they are still crucial for innovation. Lambert emphasizes the importance of int.. read more

Nathan Lambert: Open Models Will Never Catch Up

Link

@kala shared a link, 2 hours ago

FAUN.dev()

My AI Adoption Journey

A dev walks through the shift from chatbot coding toagent-based AI workflows, think agents that read files, run code, and double-check their work. Things only clicked once they built outcustom tools and configsto help agents spot and fix their own screwups. That’s the real unlock... read more

Link

@kala shared a link, 2 hours ago

FAUN.dev()

Generative Pen-trained Transformer

MeetGPenT, an open-source, wall-mounted polargraph pen plotter with a flair for generative art. It blends custom hardware, Marlin firmware, a Flask web UI running on Raspberry Pi, and Gemini-generated drawing prompts. The stack? Machina + LLM. Prompts go in, JSON drawing commands come out. That driv.. read more

Link

@kala shared a link, 2 hours ago

FAUN.dev()

Towards self-driving codebases

OpenAI spun up a swarm of GPT-5.x agents - thousands of them. Over a week-long sprint, they cranked out runnable browser code and shipped it nonstop. The system hit 1,000 commits an hour across 10 million tool calls. The architecture? A planner-worker stack. Hierarchical. Recursive. Lean on agent ch.. read more

Towards self-driving codebases

Link

@kala shared a link, 2 hours ago

FAUN.dev()

Self-Optimizing Football Chatbot Guided by Domain Experts on

Generic LLM judges and static prompts fail to capture domain-specific nuance in football defensive analysis. The architecture for self-optimizing agents built on Databricks Agent Framework allows developers to continuously improve AI quality using MLflow and expert feedback. The agent, such as a DC .. read more

Link

@devopslinks shared a link, 3 hours ago

FAUN.dev()

Demystifying : Why You Shouldn’t Fear Observability in Traditional Environments

OpenTelemetry is friendly with the past. It now pipesreal-time observability into legacy systems- no code rewrite, no drama. Pull structured metrics straight from raw logs, Windows PDH counters, or SQL Server stats. It doesn’t stop there. Got MQTT-based IoT gear? OTLP export or lightweight adapters .. read more

Demystifying : Why You Shouldn’t Fear Observability in Traditional Environments

Link

@devopslinks shared a link, 3 hours ago

FAUN.dev()

CloudBees CEO: Why Migration Is a Mirage Costing You Millions

A new CloudBees survey shows 57% of enterprises dropped over $1M on cloud migrations last year. Each effort blew past budget by an average of $315K. The kicker? Many teams still treatmodernization as migration- a shortcut that usually leads to drained budgets, burned-out devs, and delays in shipping.. read more

Link

@devopslinks shared a link, 3 hours ago

FAUN.dev()

The only Terraform pipeline you will ever need: GitHub Actions for Multi-Environment Deployments

A sharp new GitHub Actions pipeline can now sniff out which Terraform environments changed - anywhere in the repo, no matter how nested - and run them in parallel. Fast, clean, and automatic. It leans onmatrix jobs,Checkovfor static analysis,Workload Identity Federationfor secure cloud access (no ha.. read more

The only Terraform pipeline you will ever need: GitHub Actions for Multi-Environment Deployments