ContentPosts from @boqingren..
Link
@faun shared a link, 4 hours ago

Implementing Vector Search from Scratch: A Step-by-Step Tutorial

Search is a fundamental problem in computing, and vector search aims to match meanings rather than exact words. By converting queries and documents into numerical vectors and calculating similarity, vector search retrieves contextually relevant results. In this tutorial, a vector search system is bu..

Link
@faun shared a link, 4 hours ago

Inside NVIDIA GPUs: Anatomy of high performance matmul kernels

NVIDIA Hopper packs serious architectural tricks. At the core: **Tensor Memory Accelerator (TMA)**, **tensor cores**, and **swizzling**—the trio behind async, cache-friendly matmul kernels that flirt with peak throughput. But folks aren't stopping at cuBLAS. They're stacking new tactics: **warp-gro..

Inside NVIDIA GPUs: Anatomy of high performance matmul kernels
Link
@faun shared a link, 4 hours ago

Jupyter Agents: training LLMs to reason with notebooks

Hugging Face dropped an open pipeline and dataset for training small models—think **Qwen3-4B**—into sharp **Jupyter-native data science agents**. They pulled curated Kaggle notebooks, whipped up synthetic QA pairs, added lightweight **scaffolding**, and went full fine-tune. Net result? A **36% jump ..

Jupyter Agents: training LLMs to reason with notebooks
Link
@faun shared a link, 4 hours ago

Building a Natural Language Interface for Apache Pinot with LLM Agents

MiQ plugged **Google’s Agent Development Kit** into their stack to spin up **LLM agents** that turn plain English into clean, validated SQL. These agents speak directly to **Apache Pinot**, firing off real-time queries without the usual parsing pain. Behind the scenes, it’s a slick handoff: NL2SQL ..

Building a Natural Language Interface for Apache Pinot with LLM Agents
Link
@faun shared a link, 4 hours ago

Shai-Hulud npm Supply Chain Attack

Malicious npm packages just leveled up: this one dropped a self-spreading worm that hijacks repos and leaks secrets the moment it lands. It abuses `postinstall` scripts to run TruffleHog and swipe tokens straight from your codebase. Then it uses GitHub Actions to exfiltrate the loot and auto-publis..

Shai-Hulud npm Supply Chain Attack
Link
@faun shared a link, 4 hours ago

Demystifying Log Retention in Azure

Azure logs come in three flavors: **Activity Logs**, **Diagnostic Logs**, and **Log Analytics**. Each with its own rules for retention and billing. The catch? Those differences aren’t quirks—they’re baked in...

Link
@faun shared a link, 4 hours ago

Introducing DigitalOcean Organizations, a new and comprehensive account layer

DigitalOcean just dropped **Organizations**—a real upgrade for anyone juggling multiple Teams. Think one top-level account to rule them all: centralized user control, one invoice to track, and org-wide settings for taxes, credits, and permissions...

Introducing DigitalOcean Organizations, a new and comprehensive account layer
Link
@faun shared a link, 4 hours ago

How FinOps Drives Value for Every Engineering Dollar

Duolingo’s FinOps crew didn’t just track cloud costs—they wired up sharp, automated observability across 100+ microservices. Real-time alerts now catch AI and infra spend spikes before they torch the budget. They sliced TTS costs by 40% with in-memory caching. Dumped pricey CloudWatch metrics for P..

How FinOps Drives Value for Every Engineering Dollar
Link
@faun shared a link, 4 hours ago

Observability for the Invisible: Tracing Message Drops in Kafka Pipelines

When an event drops silently in a distributed system, it is not a bug, it is an architectural blind spot. Detect, debug, and prevent message loss in Kafka-based streaming pipelines using tools like OpenTelemetry, Fluent Bit, Jaeger, and dead-letter queues. Make sure observability gaps in event strea..

Link
@faun shared a link, 4 hours ago

Top 30 Argo CD Anti-Patterns to Avoid When Adopting Gitops

A teardown of Argo CD anti-patterns calls out 28 common misfires—stuff like skipping Git for Application CRDs or stuffing Helm/Kustomize config right into Argo CD manifests. Yikes. It pushes for a cleaner setup: use **ApplicationSets** instead of rolling your own YAML, turn on **auto-sync/self-heal..