
Posts from the community tagged with DevOps
Story
@vmihailenco shared a post, 1 year, 8 months ago
@uptrace

Monitoring CPU/RAM/disk metrics with OpenTelemetry and Uptrace

OpenTelemetry Collector is an open source data collection pipeline that lets you monitor CPU, RAM, disk, and network metrics, among many others.

The Collector itself does not include built-in storage or analysis capabilities, but you can export the data to Uptrace and ClickHouse, using them as a replacement for Grafana and Prometheus.

Compared to Prometheus, ClickHouse can offer a smaller on-disk data size and better query performance when analyzing millions of time series.

Story
@mohammad_zaigam shared a post, 1 year, 10 months ago
Technical Solutions Specialist, Logiq.ai

THE 5 STAGES OF THE OBSERVABILITY MATURITY MODEL

The unprecedented growth of data in recent years has led to a demand for evolution in traditional monitoring practices.

The current observability maturity model is a good solution but needs further augmentation.

The widely accepted model includes the following stages:

1) Monitoring (Is everything in working order?)

2) Observability (Why is it not working?)

3) Full-Stack Observability (What is the origin of the problem, and what are its consequences?)

4) Intelligent Observability (How to predict anomalies and automate response?)

LOGIQ supports the next stage in the model, i.e., Federated Observability. In other words, data is available to consumers on demand.

Story
@squadcast shared a post, 1 year, 11 months ago

Strategies for Kubernetes Cluster Administrators: Understanding Pod Scheduling

As the complexity of a Kubernetes cluster grows, managing resources such as CPU and memory becomes more challenging. Efficient pod scheduling is critical to ensure optimal resource utilization and enable a stable and responsive environment for applications to run in. In this blog, we will delve into the intricacies of pod scheduling, including optimization of resource allocation and balancing workloads.
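As a rough illustration of the resource-aware placement described above, here is a minimal sketch of a filter-then-score scheduling loop. It is not the Kubernetes scheduler and not taken from the linked post; the `Node`, `Pod`, `fits`, `score`, and `schedule` names are hypothetical, and the scoring rule (prefer the node with the most free capacity left after placement) is just one possible strategy.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    name: str
    cpu_millicores: int        # allocatable CPU on the node
    memory_mib: int            # allocatable memory on the node
    cpu_requested: int = 0     # CPU already requested by placed pods
    memory_requested: int = 0  # memory already requested by placed pods


@dataclass
class Pod:
    name: str
    cpu_request: int     # in millicores
    memory_request: int  # in MiB


def fits(node: Node, pod: Pod) -> bool:
    """Filter step: does the node have enough unrequested CPU and memory?"""
    cpu_free = node.cpu_millicores - node.cpu_requested
    mem_free = node.memory_mib - node.memory_requested
    return cpu_free >= pod.cpu_request and mem_free >= pod.memory_request


def score(node: Node, pod: Pod) -> float:
    """Score step: average fraction of CPU and memory left free after placement."""
    cpu_left = node.cpu_millicores - node.cpu_requested - pod.cpu_request
    mem_left = node.memory_mib - node.memory_requested - pod.memory_request
    return (cpu_left / node.cpu_millicores + mem_left / node.memory_mib) / 2


def schedule(pod: Pod, nodes: List[Node]) -> Optional[Node]:
    """Place the pod on the best-scoring node that fits, or return None."""
    candidates = [n for n in nodes if fits(n, pod)]
    if not candidates:
        return None  # the pod would stay pending
    best = max(candidates, key=lambda n: score(n, pod))
    best.cpu_requested += pod.cpu_request
    best.memory_requested += pod.memory_request
    return best
```

Preferring the emptiest node spreads workloads across the cluster; a bin-packing policy would invert the score to fill nodes up first.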

Story
@squadcast shared a post, 2 years ago

What are Webhooks and why should developers use them?

Webhooks and APIs are a developer-friendly approach to building modern-day web applications. In this blog, we explain what a webhook is, do a detailed webhooks vs. API comparison, and explain why we recommend developers use them with Squadcast.
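To make the idea concrete: with a webhook, the provider sends an HTTP POST to a URL you register whenever an event occurs, instead of your code polling an API. The sketch below is a minimal receiver using only the Python standard library. The `event_type` field, the port, and the function names are assumptions for illustration, not Squadcast's actual payload format or API.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer
from typing import Tuple


def process_event(body: bytes) -> Tuple[int, str]:
    """Validate a webhook payload and return (HTTP status, response text)."""
    try:
        event = json.loads(body)
    except ValueError:  # covers malformed JSON and undecodable bytes
        return 400, "invalid JSON"
    # "event_type" is a hypothetical field name used for this example.
    if not isinstance(event, dict) or "event_type" not in event:
        return 400, "missing event_type"
    return 200, f"received {event['event_type']}"


class WebhookHandler(BaseHTTPRequestHandler):
    def do_POST(self) -> None:
        # Read exactly the number of bytes the sender announced.
        length = int(self.headers.get("Content-Length", 0))
        status, message = process_event(self.rfile.read(length))
        self.send_response(status)
        self.send_header("Content-Type", "text/plain")
        self.end_headers()
        self.wfile.write(message.encode("utf-8"))


def serve(port: int = 8080) -> None:
    """Start the receiver on localhost; blocks until interrupted."""
    HTTPServer(("127.0.0.1", port), WebhookHandler).serve_forever()
```

Calling `serve()` starts the receiver locally. A production receiver would also need to authenticate the sender, which this sketch leaves out.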

Story
@emile shared a post, 2 years ago
Co-founder, Nebuly

Tutorial on Dynamic GPU Partitioning with MIG to Maximize the Utilization of GPUs in Kubernetes

Partitioning is a way to divide GPU resources into smaller slices. This allows Pods to be scheduled only on the memory/compute resources they actually need, thus increasing GPU utilization and reducing infrastructure costs in Kubernetes clusters.

nos, an open source tool to maximize GPU utilization in Kubernetes
Story
@squadcast shared a post, 2 years ago

Introducing our open source SLO Tracker - A simple tool to track SLOs and Error Budget

Check out our open-source SLO tracker and set up your SLOs so that you can accurately track your error budgets. Automate your SRE with Squadcast's SLO tool!
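For readers new to the term, an error budget is the amount of failure an SLO tolerates: a 99.9% availability target over one million requests allows roughly 1,000 failures. The sketch below shows that arithmetic. It assumes a request-based SLO, and the `error_budget` function and its field names are hypothetical, not part of the SLO Tracker.

```python
from typing import Dict


def error_budget(slo_target: float, total_requests: int, failed_requests: int) -> Dict[str, float]:
    """Compute the error budget for a request-based SLO.

    slo_target is a fraction strictly between 0 and 1, e.g. 0.999 for 99.9%.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")

    # Failures the SLO tolerates over this window.
    allowed_failures = (1.0 - slo_target) * total_requests
    return {
        "allowed_failures": allowed_failures,
        # Negative once the budget is exhausted.
        "remaining": allowed_failures - failed_requests,
        # 1.0 means the whole budget has been spent.
        "consumed_fraction": failed_requests / allowed_failures,
    }
```

A `consumed_fraction` above 1.0 means the SLO was breached for the window.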

Story
@squadcast shared a post, 2 years ago

What are Network Operation Centers (NOC) and how do NOC teams work?

In highly competitive markets, businesses have to work hard to stay available and operational at all times. Hence, they invest heavily in dedicated Network Operations Centers (NOC) that constantly monitor the performance of an organization’s IT resources. In this blog, we will explore the NOC and its importance.

Story
@squadcast shared a post, 2 years, 1 month ago

Demystifying Kubernetes RBAC

The more prominent and complex Kubernetes deployments become, the more important it is to define strict access controls and tighter security. In this blog, Kasun explains how RBAC can be implemented in Kubernetes clusters to restrict user permissions to relevant resources only.

Story
@squadcast shared a post, 2 years, 1 month ago

Introduction to Automation Testing Strategies For Microservices

The complex nature of microservices architecture requires a systematic testing strategy to ensure end-to-end (E2E) testing for any given use case. This blog explains some of the most widely adopted automation testing strategies with the help of the Testing Triangles for Microservices.

Story
@huseyni shared a post, 2 years, 1 month ago
System Engineer, azfiber

AWS services 101: A tour of popular AWS services and how to use them

Introduction

Amazon Web Services (AWS) is a cloud computing platform that provides a wide range of services for building, deploying, and managing applications on the cloud. AWS was one of the first cloud computing platforms and has since become the largest and most popular, with millions of customers around the world.
