Posts from @kala
Link
@kala shared a link, 3 weeks, 4 days ago
FAUN.dev()

Multi-Agent System Reliability

LLMs are unreliable out of the box, but multi-agent systems can improve reliability by dividing work among specialized agents. Building robust systems involves borrowing patterns from human organizations, such as hierarchy, consensus, adversarial debate, and knock-out, in a multi-agent architecture to ensure correctness and re..
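As a rough illustration of the consensus pattern named in the summary above, here is a minimal sketch. It assumes each agent is a plain callable that maps a task string to an answer string; the `consensus` function and its `quorum` parameter are hypothetical names for this example and do not come from the linked article.

```python
from collections import Counter
from typing import Callable, Optional, Sequence

# Assumption: an "agent" is any callable that takes a task and returns an answer.
Agent = Callable[[str], str]


def consensus(agents: Sequence[Agent], task: str, quorum: float = 0.5) -> Optional[str]:
    """Ask every agent the same task and accept the most common answer
    only if its share of the votes is strictly greater than `quorum`."""
    answers = [agent(task) for agent in agents]
    if not answers:
        return None
    winner, votes = Counter(answers).most_common(1)[0]
    return winner if votes / len(answers) > quorum else None


if __name__ == "__main__":
    # Stub agents standing in for real LLM calls.
    agents = [lambda t: "4", lambda t: "4", lambda t: "5"]
    print(consensus(agents, "What is 2 + 2?"))
```

Returning `None` when no answer clears the quorum lets the caller escalate, for example to a human reviewer or a higher-level agent, instead of accepting a split vote.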

Link
@kala shared a link, 3 weeks, 4 days ago
FAUN.dev()

The AI engineering stack we built internally - on the platform we ship

Cloudflare wired AI into the engineering stack. LLM traffic funnels through a proxy Worker and AI Gateway. It shipped Workers AI and the Agents SDK. Daily users hit 3,683 (93% R&D). MR throughput climbed to ~10,952/week. Workers AI handled 51B input tokens and cut a security agent's inference spend by 77%...

Link
@kala shared a link, 3 weeks, 4 days ago
FAUN.dev()

An open-weights Chinese model just beat Claude, GPT-5.5, and Gemini in a programming challenge

The AI Coding Contest Day 12 matched ten models on a sliding‑letter puzzle. Open‑weights Kimi K2.6 took first: 22 match points (7‑1‑0). MiMo V2‑Pro scored second by blasting claims for intact ≥7‑letter seeds (43 points). GPT‑5.5 and Claude Opus 4.7 landed third and fifth. Grids ran from 10×10 to 30×30. Heavy scrambl..

Link
@kala shared a link, 3 weeks, 4 days ago
FAUN.dev()

Monitoring LLM behavior: Drift, retries, and refusal patterns

Traditional software is predictable because it is deterministic, while generative AI is not. Engineers need a new infrastructure layer, the AI Evaluation Stack, to ship enterprise-ready AI products. The stack includes deterministic assertions and model-based assertions to ensure structural integrit..
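To make the idea of a deterministic assertion concrete, here is a minimal sketch of a structural check on a model response. It assumes the model was asked to reply with a JSON object; the `REQUIRED_KEYS` schema and the `check_structure` name are hypothetical and are not taken from the linked article.

```python
import json

# Hypothetical schema, for illustration only.
REQUIRED_KEYS = {"answer", "sources"}


def check_structure(raw_output: str) -> list[str]:
    """Return a list of failure messages; an empty list means the check passed."""
    try:
        data = json.loads(raw_output)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc.msg}"]
    if not isinstance(data, dict):
        return ["top-level value is not an object"]

    failures = []
    missing = REQUIRED_KEYS - data.keys()
    if missing:
        failures.append(f"missing keys: {sorted(missing)}")
    if "sources" in data and not isinstance(data["sources"], list):
        failures.append("'sources' is not a list")
    return failures


if __name__ == "__main__":
    print(check_structure('{"answer": "42"}'))
```

Checks like this give the same verdict for the same input on every run, which is why they can be run on every response, while a model-based assertion would judge qualities that cannot be tested mechanically.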

Link
@kala shared a link, 3 weeks, 4 days ago
FAUN.dev()

Introducing the Agent Readiness score. Check to see if your site is agent-ready

Cloudflare launched IsItAgentReady. It scans 200k domains, scores agent readiness, publishes weekly adoption charts, and exposes results via an API. It checks robots.txt, llms.txt, content negotiation via Accept: text/markdown, API Catalog, .well-known/mcp.json, OAuth discovery, and x402 payments. Cloudflare ov..
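A few of the checks listed above can be sketched as plain HTTP probes. The example below is an assumption about how such probes might look, not a description of how IsItAgentReady works: `build_probes`, `serves_markdown`, and `PROBE_PATHS` are hypothetical names, and the code only builds the requests without sending them.

```python
from urllib.parse import urljoin
from urllib.request import Request

# Paths named in the summary; placing them at the site root is an assumption.
PROBE_PATHS = ["/robots.txt", "/llms.txt", "/.well-known/mcp.json"]


def build_probes(origin: str) -> list[Request]:
    """Build one GET request per well-known path, plus one request to the
    origin itself that asks for Markdown via content negotiation."""
    probes = [Request(urljoin(origin, path), method="GET") for path in PROBE_PATHS]
    probes.append(Request(origin, headers={"Accept": "text/markdown"}, method="GET"))
    return probes


def serves_markdown(content_type_header: str) -> bool:
    """True if a response Content-Type header names text/markdown,
    ignoring parameters such as charset."""
    media_type = content_type_header.split(";", 1)[0].strip().lower()
    return media_type == "text/markdown"


if __name__ == "__main__":
    for probe in build_probes("https://example.com"):
        print(probe.get_method(), probe.full_url, dict(probe.header_items()))
```

Each request could then be sent with `urllib.request.urlopen`, and the Content-Type of the negotiated response passed to `serves_markdown`.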

Activity
@kala added a new tool, Ollama, 3 weeks, 5 days ago.
Link
@kala shared a link, 1 month ago
FAUN.dev()

Introducing Coregit

Coregit reimplements Git's object model in TypeScript and runs on Cloudflare Workers as a serverless edge Git API. Its commit endpoint accepts up to 1,000 file changes per request and replaces 105+ GitHub calls with one. Yes - one. It acknowledges writes in Durable Objects (~2ms), then flushes objects to R..

Link
@kala shared a link, 1 month ago
FAUN.dev()

Introducing Ternary Bonsai: Top Intelligence at 1.58 Bits

PrismML unveils Ternary Bonsai: a family of 1.58-bit LMs in 1.7B, 4B, and 8B sizes. Models use ternary weights {-1,0,+1} with group-wise quantization. Each group of 128 weights shares an FP16 scale. That cuts memory by ~9x versus 16-bit and boosts benchmark scores. The 8B hits 75.5..
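The "1.58-bit" and "~9x" figures follow from simple arithmetic: a ternary value carries log2(3) ≈ 1.58 bits, and one 16-bit scale spread over 128 weights adds 0.125 bits per weight. The sketch below works through that and shows one possible group quantizer. The rounding rule (absmean scaling, then rounding to the nearest ternary level) is an assumption for illustration; the summary does not say how PrismML computes scales.

```python
import math

GROUP_SIZE = 128  # from the summary: each group of 128 weights shares one scale
SCALE_BITS = 16   # one FP16 scale per group


def bits_per_weight(group_size: int = GROUP_SIZE, scale_bits: int = SCALE_BITS) -> float:
    """Ideal storage per weight: log2(3) bits for the ternary value
    plus the group's scale amortized over its weights."""
    return math.log2(3) + scale_bits / group_size


def quantize_group(weights: list[float]) -> tuple[list[int], float]:
    """Map one group to {-1, 0, +1} with a shared scale.

    Assumption: the scale is the mean absolute value of the group (absmean)
    and each weight is rounded to the nearest ternary level.
    """
    scale = sum(abs(w) for w in weights) / len(weights)
    if scale == 0.0:
        return [0] * len(weights), 0.0
    ternary = [max(-1, min(1, round(w / scale))) for w in weights]
    return ternary, scale


def dequantize_group(ternary: list[int], scale: float) -> list[float]:
    """Reconstruct approximate weights from ternary codes and the shared scale."""
    return [t * scale for t in ternary]


if __name__ == "__main__":
    bpw = bits_per_weight()
    print(f"{bpw:.2f} bits/weight, {16 / bpw:.1f}x smaller than 16-bit")
```

Under these assumptions the cost is about 1.71 bits per weight, roughly 9.4 times smaller than 16-bit storage, which is consistent with the ~9x figure in the summary.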

Link
@kala shared a link, 1 month ago
FAUN.dev()

How LLMs Work — A Visual Deep Dive

A complete walkthrough of how large language models like ChatGPT are built, from raw internet text to a conversational assistant...

Link
@kala shared a link, 1 month ago
FAUN.dev()

A GitHub agentic workflow

The developer automated parsing of unstructured release notes with GitHub agentic workflows. The pipeline compiles Markdown to YAML, then runs an agent. The setup requires a fine-grained Copilot token. It enforces a hardened sandbox policy and forbids Marketplace actions. CI runs a compile-then-compare che..
