Join us

ContentUpdates and recent posts about Slurm..
Link
@anjali shared a link, 1 year, 1 month ago
Customer Marketing Manager, Last9

The Ultimate Guide to Ubuntu Performance Monitoring

A practical guide to monitoring performance on Ubuntu—tools, tips, and commands to keep your system running efficiently.

journalctl
Link
@anjali shared a link, 1 year, 1 month ago
Customer Marketing Manager, Last9

API Latency: Definition, Measurement, and Optimization Techniques

Learn what API latency really means, how to measure it the right way, and practical ways to make your APIs respond faster.

latency
Story
@laura_garcia shared a post, 1 year, 1 month ago
Software Developer, RELIANOID

🌐 Understanding the Five Eyes Coalition and Embracing Secure Innovation 🔒

The Five Eyes (FVEY) Coalition, an alliance of the US, UK, Canada, Australia, and New Zealand, has been a cornerstone of global intelligence sharing since WWII. Over the decades, its mission has evolved to address modern challenges like cybersecurity, critical infrastructure protection, and counteri..

The Five Eyes Coalition_ Origins, Evolution, and Principles of Secure Innovation Solutions
Story
@laura_garcia shared a post, 1 year, 2 months ago
Software Developer, RELIANOID

🚀 We’re heading to QCon London 2025! 🚀

From April 7th to 10th, RELIANOID will be joining some of the brightest minds in software development at QCon London, where pioneers and senior engineers share the latest trends, best practices, and real-world case studies. 🔹 What to Expect at QCon London? ✅ Emerging trends in software architecture,..

qcon london 2025
Link
@anjali shared a link, 1 year, 2 months ago
Customer Marketing Manager, Last9

How to Configure ContainerPort in Kubernetes (The Easy Way)

Learn how ContainerPort works in Kubernetes, why it matters, and how to configure it correctly for simplified container networking.

container
Link
@anjali shared a link, 1 year, 2 months ago
Customer Marketing Manager, Last9

Log4j vs Log4j2: Which Logging Framework Should You Choose

Choosing between Log4j and Log4j2? Log4j2 offers better performance, security, and flexibility. Here's why it might be the right choice for you.

logging_framework
Story
@laura_garcia shared a post, 1 year, 2 months ago
Software Developer, RELIANOID

🚀 Moving from Alteon to a Modern Load Balancer: Why and How? 🚀

As Alteon load balancers become obsolete, organizations are moving to more advanced, cloud-native solutions. One great option is the RELIANOID load balancer, designed to handle modern, high-traffic environments with superior flexibility, scalability, and security. Here’s how to make the switch: 1️⃣ ..

kb Alteon configuration migration
Link
@anjali shared a link, 1 year, 2 months ago
Customer Marketing Manager, Last9

Breaking Down Splunk Costs for SREs and DevOps Teams

Explore Splunk's pricing and how it impacts SREs and DevOps teams. Learn how to manage costs while maintaining performance.

splunk
Story
@laura_garcia shared a post, 1 year, 2 months ago
Software Developer, RELIANOID

🚀 A Busy April Ahead for RELIANOID!

April is shaping up to be an exciting and action-packed month for us at RELIANOID! Our team will be making a big effort to attend multiple key industry events, connecting with experts, partners, and clients to discuss the latest in cybersecurity, networking, and ADC solutions. Want to know where to ..

Recap Events April
Link
@anjali shared a link, 1 year, 2 months ago
Customer Marketing Manager, Last9

Reliability vs Availability: A Simple Breakdown

Reliability and availability are crucial concepts in DevOps. Here's a simple breakdown to help you understand their key differences and importance.

linux
Slurm Workload Manager is an open-source, fault-tolerant, and highly scalable cluster management and scheduling system widely used in high-performance computing (HPC). Designed to operate without kernel modifications, Slurm coordinates thousands of compute nodes by allocating resources, launching and monitoring jobs, and managing contention through its flexible scheduling queue.

At its core, Slurm uses a centralized controller (slurmctld) to track cluster state and assign work, while lightweight daemons (slurmd) on each node execute tasks and communicate hierarchically for fault tolerance. Optional components like slurmdbd and slurmrestd extend Slurm with accounting and REST APIs. A rich set of commands—such as srun, squeue, scancel, and sinfo—gives users and administrators full visibility and control.

Slurm’s modular plugin architecture supports nearly every aspect of cluster operation, including authentication, MPI integration, container runtimes, resource limits, energy accounting, topology-aware scheduling, preemption, and GPU management via Generic Resources (GRES). Nodes are organized into partitions, enabling sophisticated policies for job size, priority, fairness, oversubscription, reservation, and resource exclusivity.

Widely adopted across academia, research labs, and enterprise HPC environments, Slurm serves as the backbone for many of the world’s top supercomputers, offering a battle-tested, flexible, and highly configurable framework for large-scale distributed computing.