Join us

heart Posts from the community tagged with SRE...
Story
@squadcast shared a post, 1 year, 11 months ago

Top Five Pitfalls of On-Call Scheduling

On-call schedules ensure someone is always available to fix or escalate any issues that may arise, so things keep running smoothly. This blog post explores five common challenges organizations face when handling on-call schedules and discusses how to alleviate these challenges.

6253f8945392e15bfabc7505_TopFivePitfalls-570x330.png
Story
@squadcast shared a post, 1 year, 11 months ago

Anti-patterns in Incident Response that you should unlearn | Squadcast

Ignoring anti-patterns can be far worse than settling for safe and rigid processes. This blog will explore anti-patterns in incident response and tell you why you need to unlearn those.

62e913a7e3970364d0a6b873_Aniti_Pattern-570x330.png
Story
@squadcast shared a post, 2 years ago

How important is Observability for SRE?

Observability is what defines a strong SRE team. In this blog, we have covered the importance of observability, and how SREs can leverage it to enhance their business.

How important is Observability for SRE?
Story
@squadcast shared a post, 2 years ago

Strategies for Kubernetes Cluster Administrators: Understanding Pod Scheduling

As the complexity of a Kubernetes cluster grows, managing resources such as CPU and memory becomes more challenging. Efficient pod scheduling is critical to ensure optimal resource utilization and enable a stable and responsive environment for applications to run in. In this blog, we will delve into the intricacies of pod scheduling, including optimization of resource allocation and balancing workloads.

Squadcast - Strategies for Kubernetes Cluster Administrators: Understanding Pod Scheduling
Story
@squadcast shared a post, 2 years, 1 month ago

What are Webhooks and why should developers use them?

Webhooks and APIs are a developer-friendly approach to building modern-day web applications. In this blog, we explain what a webhook is, do a detailed webhooks vs. API comparison, and explain why we recommend developers use them with Squadcast.

459cdva7lqi7q714timj.png.jpeg
Story
@squadcast shared a post, 2 years, 2 months ago

Introducing our open source SLO Tracker - A simple tool to track SLOs and Error Budget

Check out our open-source SLO tracker and set up your SLO's so that you can accurately track your error budgets. Automate your SRE, with Squadcast's SLO tool!

squadcast .webp
Story
@squadcast shared a post, 2 years, 2 months ago

What are Network Operation Centers (NOC) and how do NOC teams work?

In highly competitive markets, businesses have to strive hard to be always available & operational. Hence businesses invest heavily in dedicated Network Operations Centers (NOC) that constantly monitor the performance of an organization’s IT resources. In this blog, we will explore NOC and its importance.

Incident Management and SRE
Story
@squadcast shared a post, 2 years, 2 months ago

Demystifying Kubernetes RBAC

The more prominent and complex Kubernetes deployments become, the more important it is to define strict access controls and tighter security. In this blog, Kasun has explained how RBAC can be implemented in Kubernetes clusters to restrict user permissions to relevant resources only.

Kubernetes_RBAC.png
Story
@squadcast shared a post, 2 years, 2 months ago

Introduction to Automation Testing Strategies For Microservices

The complex nature of Microservices architecture requires a systematic testing strategy to ensure end-to-end (E2E) testing for any given use case. This blog explains some of the most adopted automation testing strategies with the help of the Testing Triangles for Microservices.

Automation Testing Strategies For Microservices
Story
@squadcast shared a post, 2 years, 3 months ago

What are Runbooks? And why are they needed?

Runbooks are documented procedures for the maintenance and upgrades of systems. Leverage runbooks during incident response. Save your team's invaluable time. Learn more.

What are runbook
loading...