Updates and recent posts about Pelagia
Link
@faun shared a link, 3 months ago

AI inference supercharges on Google Kubernetes Engine

Google Cloud's pushing GKE beyond container orchestration, framing it as an AI inference engine. Meet the new crew: the Inference Gateway (smart load balancer, talks models and hardware), custom compute classes, and a Dynamic Workload Scheduler that tunes for both speed and spend. The setup handles GPU an..

Link
@faun shared a link, 3 months ago

How I eliminated networking complexity

A fresh pattern’s gaining traction: Docker + Tailscale sidecars replacing old-school reverse proxies and clunky VPNs. Each service runs as its own mesh-routed node, containerized and independent. The trick? Network namespace sharing. App containers hook into the Tailscale mesh with no exposed ports, no ..

Link
@faun shared a link, 3 months ago

Cloud native is not just for hyperscalers

CNCF just dropped an AI workload conformance program, built like the Kubernetes one, so AI tools play nice across clusters. Portability, meet your referee. It’s tightening the loop between OpenTelemetry and OpenSearch, turning ad-hoc hacks into actual cross-project coordination. And Backstage and GitOp..

Link
@faun shared a link, 3 months ago

Accessing the Kubernetes API from SQL Server 2025

SQL Server 2025 rolls out sp_invoke_external_rest_endpoint, a new way to hit REST APIs straight from T-SQL. That includes calling the Kubernetes API, thanks to a reverse proxy in front. The setup’s not exactly plug-and-play. You’ll need custom TLS certs, an nginx reverse proxy, and Kubernetes RBAC to kee..

Link
@faun shared a link, 3 months ago

How We Saved $1.22 Million Annually on GCP Costs in a Few Simple Steps

Arpeely chopped $140K/month off their cloud bill using a surgical mix of GCP tricks. Committed Use Discounts (CUDs) for high-availability services? Check. Smarter Kubernetes HPA configs? Definitely. Archiving old BigQuery data into GCS Archive? That one alone slashed storage costs 16x. The real kicker..

Link
@faun shared a link, 3 months ago

Cloudera Acquires Taikun for Managing Kubernetes and Cloud

Cloudera acquired Taikun for seamless deployment of data and AI workloads in any environment. This move reinforces Cloudera's commitment to flexibility and innovation in managing complex IT infrastructures...

Link
@faun shared a link, 3 months ago

Introducing Headlamp AI Assistant

Headlamp just dropped an AI Assistant plugin that folds LLM-driven actions and queries straight into the Kubernetes UI. It taps into context-aware prompts to spot issues, restart deployments, and hunt down flaky pods without leaving the interface. System shift: This pushes Kubernetes toward intent-based..

Link
@faun shared a link, 3 months ago

Kubernetes Observability: Pillars, Tools & Best Practices

Kubernetes observability isn’t just about catching metrics or tailing logs. It’s about stitching together metrics, logs, and traces to see what’s actually happening across services, over time, and through the chaos. Thing is, Kubernetes doesn’t come with this built in. So teams hack together toolchai..

Link
@faun shared a link, 3 months ago

Kubernetes Will Solve YAML Headaches with KYAML

Kubernetes is eyeing a YAML remix. Version 1.34 may bring in KYAML, a stricter, YAML-compatible subset built to cut down on sloppy configs and sneaky formatting bugs. KYAML keeps the good parts: comments, trailing commas, unquoted keys. But it dumps YAML’s whitespace drama. Existing manifests and Hel..

Link
@faun shared a link, 3 months ago

Introducing Kubernetes for Snowflake

Snowflake just leveled up its workload scheduler, now driven by LLMs and reinforcement learning. Instead of locking jobs to static warehouses, it predicts where to send them in real-time. Smarter routing, tighter hardware use, over 40% shaved off compute bills. Bigger picture: Another nod toward ML-bas..

Pelagia is a Kubernetes controller that provides all-in-one management for Ceph clusters installed by Rook. It delivers two main features:

Aggregates all Rook Custom Resources (CRs) into a single CephDeployment resource, simplifying the management of Ceph clusters.
Provides automated lifecycle management (LCM) of Rook Ceph OSD nodes for bare-metal clusters. Automated LCM is driven by a dedicated CephOsdRemoveTask resource.

It is designed to simplify the management of Ceph clusters in Kubernetes installed by Rook.

As heavy Rook users, we had dozens of Rook CRs to manage. So one day we decided to create a single resource that would aggregate all Rook CRs and deliver a smoother LCM experience. This is how Pelagia was born.

It supports almost the entire Rook CR API, including CephCluster, CephBlockPool, CephFilesystem, CephObjectStore, and others, aggregating them into a single specification. We continuously work on improving Pelagia's API, adding new features, and enhancing existing ones.
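To make the aggregation idea concrete, here is a minimal Python sketch of how one combined specification could be fanned out into individual Rook manifests. It is an illustration only, not Pelagia's implementation: the input layout (the `cluster`, `blockPools`, `filesystems`, and `objectStores` section names) and the function `expand_to_rook_crs` are hypothetical, while the `ceph.rook.io/v1` API version and the Rook kind names are the real ones.

```python
from typing import Any, Dict, List

ROOK_API_VERSION = "ceph.rook.io/v1"  # Rook's API group and version

# Hypothetical section names of the aggregated spec, mapped to the
# real Rook kinds they would expand into.
SECTION_TO_KIND = {
    "cluster": "CephCluster",
    "blockPools": "CephBlockPool",
    "filesystems": "CephFilesystem",
    "objectStores": "CephObjectStore",
}


def expand_to_rook_crs(deployment: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Expand one aggregated spec (hypothetical layout) into Rook CR manifests."""
    namespace = deployment.get("namespace", "rook-ceph")
    manifests: List[Dict[str, Any]] = []
    for section, kind in SECTION_TO_KIND.items():
        # Each section is a list of {"name": ..., "spec": {...}} entries.
        for entry in deployment.get(section, []):
            manifests.append({
                "apiVersion": ROOK_API_VERSION,
                "kind": kind,
                "metadata": {"name": entry["name"], "namespace": namespace},
                "spec": entry.get("spec", {}),
            })
    return manifests


# Usage: one document describes the cluster and a pool together.
example = {
    "namespace": "rook-ceph",
    "cluster": [{"name": "rook-ceph", "spec": {"mon": {"count": 3}}}],
    "blockPools": [{"name": "replicapool", "spec": {"replicated": {"size": 3}}}],
}
for manifest in expand_to_rook_crs(example):
    print(manifest["kind"], manifest["metadata"]["name"])
```

The point of the sketch is the shape of the workflow: operators edit one document, and a controller derives the per-kind resources from it.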

Pelagia collects the Ceph cluster state and the statuses of all Rook CRs into a single CephDeploymentHealth CR. This resource highlights any issues with the Ceph cluster or the Rook APIs.
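The following Python sketch illustrates the general idea of rolling several statuses into one health summary. It is not the CephDeploymentHealth schema: the function `summarize_health` and the output fields `state` and `issues` are hypothetical, and the sketch assumes that each Rook CR reports a `status.phase` of `Ready` when it is healthy and that the cluster health is one of Ceph's `HEALTH_OK`, `HEALTH_WARN`, or `HEALTH_ERR` strings.

```python
from typing import Any, Dict, List


def summarize_health(ceph_health: str, rook_crs: List[Dict[str, Any]]) -> Dict[str, Any]:
    """Fold the Ceph health string and per-CR phases into one summary (hypothetical layout)."""
    issues: List[str] = []
    if ceph_health != "HEALTH_OK":
        issues.append(f"ceph cluster health is {ceph_health}")
    for cr in rook_crs:
        # Assumption: a missing status or phase is treated as "Unknown".
        phase = cr.get("status", {}).get("phase", "Unknown")
        if phase != "Ready":
            name = cr["metadata"]["name"]
            issues.append(f"{cr['kind']}/{name} is in phase {phase}")
    return {"state": "Failed" if issues else "Ok", "issues": issues}


# Usage: one degraded pool is enough to flag the whole deployment.
summary = summarize_health(
    "HEALTH_WARN",
    [
        {"kind": "CephBlockPool", "metadata": {"name": "replicapool"},
         "status": {"phase": "Failure"}},
        {"kind": "CephFilesystem", "metadata": {"name": "myfs"},
         "status": {"phase": "Ready"}},
    ],
)
print(summary["state"])
for issue in summary["issues"]:
    print("-", issue)
```

A single summary object like this means an operator checks one place instead of inspecting every CR in turn.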

Another important feature of Pelagia is the automated lifecycle management of Rook Ceph OSD nodes for bare-metal clusters. It is delivered by the CephOsdRemoveTask resource, which automates the removal of OSD disks and nodes from the cluster. We use this feature in our routine day-2 operations.