Posts from @waseemahammed
@faun shared a link, 4 weeks ago

LLM Evaluation: Practical Tips at Booking.com

Booking.com built Judge-LLM, a framework where strong LLMs evaluate other models against a carefully curated golden dataset. Clear metric definitions, rigorous annotation, and iterative prompt engineering make evaluations more scalable and consistent than relying solely on humans. **The takeaway**:..
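
One way to picture the golden-dataset check is to score a judge model's labels against the human labels. The snippet below is a minimal sketch, not Booking.com's implementation: the `pass`/`fail` labels, the function names, and the choice of raw agreement plus Cohen's kappa are all assumptions made for illustration.

```python
from collections import Counter


def agreement(golden, judged):
    """Fraction of items where the judge label equals the golden label."""
    if len(golden) != len(judged) or not golden:
        raise ValueError("label lists must be non-empty and the same length")
    return sum(g == j for g, j in zip(golden, judged)) / len(golden)


def cohen_kappa(golden, judged):
    """Agreement corrected for chance, from the two label distributions."""
    n = len(golden)
    p_observed = agreement(golden, judged)
    golden_counts = Counter(golden)
    judged_counts = Counter(judged)
    labels = set(golden_counts) | set(judged_counts)
    # Chance agreement: probability both raters pick the same label independently.
    p_expected = sum(golden_counts[l] * judged_counts[l] for l in labels) / (n * n)
    if p_expected == 1.0:
        return 1.0  # both raters used a single identical label
    return (p_observed - p_expected) / (1 - p_expected)


# Hypothetical labels: human annotations vs. what a judge model returned.
golden_labels = ["pass", "pass", "fail", "fail"]
judge_labels = ["pass", "fail", "fail", "fail"]

print(agreement(golden_labels, judge_labels))    # 0.75
print(cohen_kappa(golden_labels, judge_labels))  # 0.5
```

Tracking a chance-corrected score alongside raw agreement helps when one label dominates the golden set, since a judge that always answers with the majority label can still look accurate.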

@faun shared a link, 4 weeks ago

Introducing the MCP Registry

The new **Model Context Protocol (MCP) Registry** just dropped in preview. It’s a public, centralized hub for finding and sharing MCP servers—think phonebook, but for AI context APIs. It handles public and private subregistries, publishes OpenAPI specs so tooling can play nice, and bakes in communit..

@faun shared a link, 4 weeks ago

Accelerate serverless testing with LocalStack integration in VS Code IDE

The AWS Toolkit for VS Code now hooks straight into **LocalStack**. Run full end-to-end tests for **serverless workflows**—Lambda, SQS, EventBridge, the whole crew—without bouncing between tools or writing boilerplate. Just deploy to LocalStack from the IDE using the **AWS SAM CLI**. It feels like ..

@faun shared a link, 4 weeks ago

PostgreSQL maintenance without superuser

PostgreSQL’s moving in on superusers. As of recent releases—starting way back in v9.6 and maturing through PostgreSQL 18 (coming 2025)—there are now **15+ built-in admin roles**. No need to hand out superuser just to get things done. These roles cover the ops spectrum: monitoring, backups, fil..

@faun shared a link, 4 weeks ago

Writing an operating system kernel from scratch

A barebones time-sharing OS kernel, written in Zig, running on RISC-V. It leans on OpenSBI for console I/O and timer interrupts. Threads? Statically allocated, each running in user mode (U-mode). The kernel stays in supervisor mode (S-mode), where it catches system calls and context switches via timer ticks. ..

@faun shared a link, 4 weeks ago

Scaling Prometheus: Managing 80M Metrics Smoothly

Flipkart ditched its creaky StatsD + InfluxDB stack for a federated Prometheus setup, built to handle 80M+ time-series metrics without choking. The move leaned into pull-based collection, PromQL's firepower, and hierarchical federation for smarter aggregation and long-haul queries. Why it matters: Prometheus..

@faun shared a link, 4 weeks ago

Magical systems thinking

AI now writes over **25% of Google’s** and as much as **90% of Anthropic’s** code. That’s not a trend—it’s a regime change. Still, the mess in large public systems reminds us: clever analysis isn’t enough. Complex systems don’t behave; they misbehave. When the machines are churning out code, the ..

@faun shared a link, 4 weeks ago

SLI Evolution Stages

A new SLI evolution model lays out a maturity roadmap—from rebranded latency/error metrics to ones that actually track business impact. It replaces shallow signals and pulls in the stuff that matters: how service failures hit user goals, tasks, and bottom lines...
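
To make the contrast concrete, here is a minimal sketch of the same ratio-style SLI computed two ways: once from technical signals and once from a business outcome. The field names, the 300 ms threshold, and the `task_completed` flag are hypothetical and are not taken from the linked model.

```python
from dataclasses import dataclass


@dataclass
class Request:
    latency_ms: float
    http_ok: bool
    task_completed: bool  # hypothetical signal: did the user finish what they came to do?


def ratio_sli(events, is_good):
    """SLI as good events divided by all events."""
    events = list(events)
    if not events:
        raise ValueError("need at least one event")
    return sum(1 for e in events if is_good(e)) / len(events)


def technically_good(r):
    # Early-stage SLI: a successful response under an assumed 300 ms threshold.
    return r.http_ok and r.latency_ms <= 300


def task_good(r):
    # Later-stage SLI: the user's task actually completed.
    return r.task_completed
```

With a handful of requests, the two views can diverge: a response may be fast and successful at the HTTP level while the user's task still fails, which is the gap the later stages are meant to expose.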

@faun shared a link, 4 weeks ago

%CPU Utilization Is A Lie

Stress tests on the Ryzen 9 5900X uncovered a big gap between **reported CPU utilization** and what the chip actually pushes. Around 50% on paper? Could mean close to full throttle in reality—thanks to sneaky behaviors from **SMT resource sharing** and **Turbo frequency scaling**. **Takeaway:** Raw..

@faun shared a link, 4 weeks ago

Best 20 Linux Commands for Daily Use in Production Servers

A fresh roundup drops 20 go-to Linux commands for production sysadmins, dialing in on modern defaults like htop > top, ss > netstat, and ip > ifconfig. The shift? Faster tools that actually get updates. Built with systemd in mind, too. Expect the usual suspects (journalctl, rsync, crontab), all still pulling..
