Join us

ContentUpdates and recent posts about Pelagia..
Link
@kaptain shared a link, 6 months ago
FAUN.dev()

Top 5 hard-earned lessons from the experts on managing Kubernetes

Running Kubernetes in production isn’t just clicking “Create Cluster.” It means locking down RBAC, tightening up network policy, tracking autoscaling metrics, and making sure your images don’t ship with surprises. Managed clusters help get you started. But real workloads need more: hardened configs,.. read more  

Top 5 hard-earned lessons from the experts on managing Kubernetes
Link
@kaptain shared a link, 6 months ago
FAUN.dev()

Kubernetes Tutorial For Beginners [72 Comprehensive Guides]

The series dives deep into real-world Kubernetes - starting with hands-on setup viaKubeadmandeksctl, then moving throughmonitoring,logging,CI/CD, andMLOps. It tracks key release changes up tov1.30, including the confirmed death ofDockershimsince v1.24... read more  

Kubernetes Tutorial For Beginners [72 Comprehensive Guides]
Link
@kala shared a link, 6 months ago
FAUN.dev()

20x Faster TRL Fine-tuning with RapidFire AI

RapidFire AI just dropped a scheduling engine built for chaos - and control. It shards datasets on the fly, reallocates as needed, and runs multipleTRL fine-tuning configs at once, even on a single GPU. No magic, just clever orchestration. It plugs into TRL withdrop-in wrappers, spreads training acr.. read more  

20x Faster TRL Fine-tuning with RapidFire AI
Link
@kala shared a link, 6 months ago
FAUN.dev()

Code execution with MCP: building more efficient AI agents

Code is taking over MCP workflows - and fast. With theModel Context Protocol, agents don’t just call tools. They load them on demand. Filter data. Track state like any decent program would. That shift slashes context bloat - up to 98% fewer tokens. It also trims latency and scales cleaner across tho.. read more  

Code execution with MCP: building more efficient AI agents
Link
@kala shared a link, 6 months ago
FAUN.dev()

Hacking Gemini: A Multi-Layered Approach

A researcher found a multi-layer sanitization gap inGoogle Gemini. It let attackers pull off indirect prompt injections to leak Workspace data - think Gmail, Drive, Calendar - using Markdown image renders across Gemini andColab export chains. The trick? Sneaking through cracks between HTML and Markd.. read more  

Link
@kala shared a link, 6 months ago
FAUN.dev()

'I'm deeply uncomfortable': Anthropic CEO warns that a cadre of AI leaders, including himself, should not be in charge of the technology’s future

Anthropic says it stopped a seriousAI-led cyberattack- before most experts even saw it coming. No major human intervention needed. They didn't stop there. Turns out Claude had some ugly failure modes: followingdangerous promptsand generatingblackmail threats. Anthropic flagged, documented, patched, .. read more  

'I'm deeply uncomfortable': Anthropic CEO warns that a cadre of AI leaders, including himself, should not be in charge of the technology’s future
Link
@kala shared a link, 6 months ago
FAUN.dev()

Building serverless applications with Rust on AWS Lambda

AWS Lambda just bumpedRusttoGeneral Availability- production-ready, SLA covered, and finally with full AWS Support. Deploy withCargo Lambda. Wire it into your stack usingAWS CDK, which now has a dedicated construct to spin up HTTP APIs with minimal fuss. System-level shift:Serverless isn't just for .. read more  

Building serverless applications with Rust on AWS Lambda
Link
@kala shared a link, 6 months ago
FAUN.dev()

How to write a great agents.md: Lessons from over 2,500 repositories

A GitHub Copilot feature allows for custom agents defined inagents.mdfiles. These agents act as specialists within a team, each with a specific role. The success of an agents.md file lies in providing a clear persona, executable commands, defined boundaries, specific examples, and detailed informati.. read more  

How to write a great agents.md: Lessons from over 2,500 repositories
Link
@kala shared a link, 6 months ago
FAUN.dev()

What if you don't need MCP at all?

MostMCP serversstuffed into LLM agents are overcomplicated, slow to adapt, and hog context. The post calls them out for what they are: a mess. The alternative? Scrap the kitchen sink. UseBash, leanNode.js/Puppeteer scripts, and a self-bootstrappingREADME. That’s it. Agents read the file, spin up the.. read more  

What if you don't need MCP at all?
Link
@devopslinks shared a link, 6 months ago
FAUN.dev()

AWS to Bare Metal Two Years Later: Answering Your Toughest Questions About Leaving AWS

OneUptime ditched the cloud bill and rolled their own dual-site setup. Thinkbare metal, orchestrated withMicroK8s, booted byTinkerbell, patched together withCeph,Flux, andTerraform. Result?99.993% uptimeand$1.2M/year saved—76% cheaper than even well-optimized AWS. They run it all with just~14 engine.. read more  

Pelagia is a Kubernetes controller that provides all-in-one management for Ceph clusters installed by Rook. It delivers two main features:

Aggregates all Rook Custom Resources (CRs) into a single CephDeployment resource, simplifying the management of Ceph clusters.
Provides automated lifecycle management (LCM) of Rook Ceph OSD nodes for bare-metal clusters. Automated LCM is managed by the special CephOsdRemoveTask resource.

It is designed to simplify the management of Ceph clusters in Kubernetes installed by Rook.

Being solid Rook users, we had dozens of Rook CRs to manage. Thus, one day we decided to create a single resource that would aggregate all Rook CRs and deliver a smoother LCM experience. This is how Pelagia was born.

It supports almost all Rook CRs API, including CephCluster, CephBlockPool, CephFilesystem, CephObjectStore, and others, aggregating them into a single specification. We continuously work on improving Pelagia's API, adding new features, and enhancing existing ones.

Pelagia collects Ceph cluster state and all Rook CRs statuses into single CephDeploymentHealth CR. This resource highlights of Ceph cluster and Rook APIs issues, if any.

Another important thing we implemented in Pelagia is the automated lifecycle management of Rook Ceph OSD nodes for bare-metal clusters. This feature is delivered by the CephOsdRemoveTask resource, which automates the process of removing OSD disks and nodes from the cluster. We are using this feature in our everyday day-2 operations routine.