Join us

ContentUpdates and recent posts about Google Kubernetes Engine (GKE)..
Link
@kala shared a link, 2 months ago
FAUN.dev()

200k Tokens Is Plenty

Amp’s team isn’t chasing token limits. Even with ~200k available via Opus 4.5, they stick toshort, modular threads, around 80k tokens each. Why? Smaller threads are cheaper, more stable, and just work better. Instead of stuffing everything into a single mega-context, they slice big tasks into focuse.. read more  

200k Tokens Is Plenty
Link
@kala shared a link, 2 months ago
FAUN.dev()

Google tests new Gemini 3 models on LM Arena

Google’s been quietly field-testing two shadow models,Fierce FalconandGhost Falcon, on LM Arena. Early signs? They're probably warm-ups for the next Gemini 3 Flash or Pro drop. Classic Google move: float a checkpoint, stir up curiosity, then go GA... read more  

Google tests new Gemini 3 models on LM Arena
Link
@kala shared a link, 2 months ago
FAUN.dev()

Prompts for Open Problems

The author, Ben Recht, proposes five research directions inspired by his graduate machine learning class, arguing for different research rather than just more. These prompts include adopting a design-based view for decision theory, explaining the robust scaling trends in competitive testing, and mov.. read more  

Link
@kala shared a link, 2 months ago
FAUN.dev()

A trillion dollars is a terrible thing to waste

OpenAI co-founder Ilya Sutskever just said the quiet part out loud: scaling laws are breaking down. Bigger models aren’t getting better at thinking, they’re getting worse at generalizing and reasoning. Now he’s eyeingneurosymbolic AIandinnate inductive constraints. Yep, the “just make it huge” era m.. read more  

A trillion dollars is a terrible thing to waste
Link
@kala shared a link, 2 months ago
FAUN.dev()

Practical LLM Security Advice from the NVIDIA AI Red Team

NVIDIA’s AI Red Team nailed three security sinkholes in LLMs:reckless use ofexec/eval,RAG pipelines that grab too much data, andmarkdown that doesn't get cleaned. These cracks open doors to remote code execution, sneaky prompt injection, and link-based data leaks. The fix-it trend:App security’s lea.. read more  

Link
@kala shared a link, 2 months ago
FAUN.dev()

Roses are red, violets are blue, if you phrase it as poem, any jailbreak will do

A new study just broke the safety game wide open: rhymed prompts slipped past filters in25 major LLMs, including Gemini 2.5 Pro and Deepseek - withup to 100% success. No clever chaining, no jailbreak soup. Just single-shot rhyme. Turns out, poetic language isn’t just for bard-core Twitter. When it c.. read more  

Roses are red, violets are blue, if you phrase it as poem, any jailbreak will do
Link
@devopslinks shared a link, 2 months ago
FAUN.dev()

Advancing Our Chef Infrastructure: Safety Without Disruption

Slack pulled back the curtain onSlack AI, its LLM-powered assistant built with a fortress mindset. Every customer gets their ownisolated environment. Any data passed tovendor LLMs? It'sephemeral. Gone before it can stick. No fine-tuning. No exporting data outside Slack. And there’s a wholemiddle-lay.. read more  

Link
@devopslinks shared a link, 2 months ago
FAUN.dev()

Why we're leaving serverless

Every millisecond matters in the critical path of API authentication. After two years of battling serverless limitations, the entire API stack was rebuilt to reduce end-to-end latency. The move from Cloudflare Workers to stateful Go servers resulted in a 6x performance improvement and simplified arc.. read more  

Why we're leaving serverless
Link
@devopslinks shared a link, 2 months ago
FAUN.dev()

Declarative Action Architecture

The Declarative Action Architecture (DAA) is a scalable E2E testing pattern that separates concerns across three distinct layers. TheTest Layeris 100% declarative, statingwhatis being tested without any procedural logic, making tests read like documentation. The coreAction Layerimplements the execut.. read more  

Declarative Action Architecture
Link
@devopslinks shared a link, 2 months ago
FAUN.dev()

Failure is inevitable: Learning from a large outage, and building for reliability in depth at

Datadog ditched its “never fail” mindset after a March 2023 meltdown knocked out half its Kubernetes nodes and took major user features down with them. The fix? A full-stack rethink built aroundgraceful degradation. The team addeddisk-based persistence at intake,live-data prioritization,QoS-aware re.. read more  

Failure is inevitable: Learning from a large outage, and building for reliability in depth at
Google Kubernetes Engine (GKE) offers a Kubernetes experience on Autopilot that manages the underlying compute infrastructure without the need for manual configuration or monitoring. It provides container-native networking and security features, prebuilt Kubernetes applications and templates, pod and cluster autoscaling, and automated tools for workload migration. GKE clusters consist of a control plane and nodes that run the services supporting the containers. Autopilot mode manages the complexity of the cluster while allowing you to deploy and run your apps easily. The common uses of GKE include continuous integration and delivery, migrating workloads, and deploying and running applications. GKE pricing is based on the mode of operation, cluster management fees, and applicable multi-cluster ingress fees, with a free tier and a pricing calculator available to estimate costs. You can also connect with Google's sales team to get a custom quote for your organization or start your proof of concept.