Pelagia - Pelagia is a Kubernetes controller that implements

Join us

All FAUN.Sensei() courses are available with a 25% discount using the coupon SENSEI2525, valid until December 31.

Content

Updates and recent posts about Pelagia..

Posts
Description

Link

@kaptain shared a link, 1 month, 3 weeks ago

FAUN.dev()

How to build highly available Kubernetes applications with Amazon EKS Auto Mode

Amazon EKS Auto Mode now runs the cluster for you—handling control plane updates, add-on management, and node rotation. It sticks to Kubernetes best practices so your apps stay up through node drains, pod failures, AZ outages, and rolling upgrades. It also respectsPod Disruption Budgets,Readiness Ga.. read more

How to build highly available Kubernetes applications with Amazon EKS Auto Mode

Link

@kaptain shared a link, 1 month, 3 weeks ago

FAUN.dev()

Building a Kubernetes Platform — Think Big, Think in Planes

Thinking in planes, as introduced by the Platform Engineering reference model, helps teams describe their platform in a simple, shared language, turning a collection of tools into a platform. It forces you to think horizontally, connecting teams and technologies instead of adding more layers, creati.. read more

Link

@kaptain shared a link, 1 month, 3 weeks ago

FAUN.dev()

Helm 4 Overview

Helm 4 ditches the old plugin model for a sharper, plugin-first architecture powered by WebAssembly. That means isolation/control, and deeper customization - if you're ready to adapt! Post-renderers are now plugins. That breaks compatibility with earlier exec-based setups, so expect some rewiring. .. read more

Link

@kaptain shared a link, 1 month, 3 weeks ago

FAUN.dev()

Unlocking next-generation AI performance with Dynamic Resource Allocation on Amazon EKS and Amazon EC2 P6e-GB200

Amazon just droppedEC2 P6e-GB200 UltraServers, packingNVIDIA GB200 Grace Blackwellchips. Built for running trillion-parameter AI models onAmazon EKSwithout losing sleep over scaling. Under the hood:NVLink 5.0,IMEX, andEFAv4stitch up to 72 Blackwell GPUs into one memory-coherent cluster per UltraServ.. read more

Unlocking next-generation AI performance with Dynamic Resource Allocation on Amazon EKS and Amazon EC2 P6e-GB200

Link

@kaptain shared a link, 1 month, 3 weeks ago

FAUN.dev()

The State of OCI Artifacts for AI/ML

OCI artifacts quietly leveled up. Over the last 18 months, they’ve gone from a niche hack to production muscle for AI/ML workloads on Kubernetes. The signs? Clear enough:KitOpsandModelPacklanded in the CNCF Sandbox. Kubernetes 1.31 got native support forImage Volume Source. Docker pushedModel Runner.. read more

The State of OCI Artifacts for AI/ML

Link

@kala shared a link, 1 month, 3 weeks ago

FAUN.dev()

Build AI Agents Worth Keeping: The Canvas Framework

MIT and McKinsey found a gap the size of the Grand Canyon: 80% of companies claim they’re using generative AI, but fewer than 1 in 10 use cases actually ship. Blame it on scattered data, fuzzy goals, and governance that's still MIA. A new stack is stepping in:product → agent → data → model. It flips.. read more

Build AI Agents Worth Keeping: The Canvas Framework

Link

@kala shared a link, 1 month, 3 weeks ago

FAUN.dev()

Detect inappropriate images in S3 with AWS Rekognition + Terraform

A serverless AWS pipeline runs image moderation on autopilot - withS3,Lambda,Rekognition,SNS, andEventBridgeall wired up throughTerraform. When a photo gets flagged, it’s tagged, maybe quarantined, and triggers an email alert. Daily scan? Handled... read more

Detect inappropriate images in S3 with AWS Rekognition + Terraform

Link

@kala shared a link, 1 month, 3 weeks ago

FAUN.dev()

Grokipedia

Grokipedia just dropped - a Wikipedia remix built from LLM output, pitched as an escape from "woke" bias. The pitch? Bold. The execution? Rough. Entries run long. Facts bend. Citations wander. And the tone? Cold, context-free, and unmistakably machine-made. The usual LLM suspects are here: hallucina.. read more

Link

@kala shared a link, 1 month, 3 weeks ago

FAUN.dev()

Why GPUs accelerate AI learning: The power of parallel math

Modern AI eats GPUs for breakfast - training, inference, all of it. Matrix ops? Parallel everything. Models like LLaMA don’t blink without a gang of H100s working overtime... read more

Why GPUs accelerate AI learning: The power of parallel math

Link

@kala shared a link, 1 month, 3 weeks ago

FAUN.dev()

New trend: Programming by kicking off parallel AI agents

Senior engineers are starting to spin upparallel AI coding agents- think Claude Code, Cursor, and the like - to run tasks side by side. One agent sketches boilerplate. Another tackles tests. A third refactors old junk. All at once. Is it "multitasking on steroids"? Not just this as it messes with ho.. read more

Pelagia is a Kubernetes controller that provides all-in-one management for Ceph clusters installed by Rook. It delivers two main features:

Aggregates all Rook Custom Resources (CRs) into a single CephDeployment resource, simplifying the management of Ceph clusters.
Provides automated lifecycle management (LCM) of Rook Ceph OSD nodes for bare-metal clusters. Automated LCM is managed by the special CephOsdRemoveTask resource.

It is designed to simplify the management of Ceph clusters in Kubernetes installed by Rook.

Being solid Rook users, we had dozens of Rook CRs to manage. Thus, one day we decided to create a single resource that would aggregate all Rook CRs and deliver a smoother LCM experience. This is how Pelagia was born.

It supports almost all Rook CRs API, including CephCluster, CephBlockPool, CephFilesystem, CephObjectStore, and others, aggregating them into a single specification. We continuously work on improving Pelagia's API, adding new features, and enhancing existing ones.

Pelagia collects Ceph cluster state and all Rook CRs statuses into single CephDeploymentHealth CR. This resource highlights of Ceph cluster and Rook APIs issues, if any.

Another important thing we implemented in Pelagia is the automated lifecycle management of Rook Ceph OSD nodes for bare-metal clusters. This feature is delivered by the CephOsdRemoveTask resource, which automates the process of removing OSD disks and nodes from the cluster. We are using this feature in our everyday day-2 operations routine.

Do you use Pelagia?

We believe tools are best managed by their creators. Claim this page if you are the developer of Pelagia.

Claim this page

Alternative Tools

Kubernetes Dashboard

Azure Kubernetes Service (AKS)

Google Kubernetes Engine (GKE)

Amazon Elastic Container Service for Kubernetes (EKS)

Build or update your ToolBox

Publish on FAUN.dev() and Gain Infulence Points

⚡ Grow your network, share your updates and reach thousands of readers!

FAUN.amplify()

👋 Developers trust FAUN.dev() to stay up to date. Sponsor us and put your product, service, or event in front of thousands of highly engaged developers.!

FAUN.hbc() - Humans Behind Code

🧑‍💻 Are you developing a project? Join the "Humans Behind Code" project and showcase your work to the world!