ContentPosts from @baobaoack000..
Link
@faun shared a link, 1 day, 7 hours ago

LLM Evaluation: Practical Tips at Booking.com

A new LLM evaluation framework taps into an"LLM-as-judge"setup—think strong model playing human annotator. It gets prompted (or fine-tuned) to mimic human scores and rate outputs from other LLMs. It runs on a tightly labeledgolden dataset, handles both pointwise and head-to-head comparisons, and sh..

LLM Evaluation: Practical Tips at Booking.com
Link
@faun shared a link, 1 day, 7 hours ago

No, AI is not Making Engineers 10x as Productive

Claims of 10–100x dev speed from AI tools skip the hard parts—code reviews, bug queues, flaky tests. In practice, AI helps with the small stuff: one-off scripts, throwaway glue code, basic scaffolds. But scaling that help across big, messy codebases? Still a pipe dream. Too much context lost. Too ma..

Link
@faun shared a link, 1 day, 7 hours ago

Google releases AI agent Jules for programming

Google’s AI agentJulesjust leveled up—out of beta and into full-on dev mode. It now handlesasynchronous tasks, pushesreal-time code updates, and can spin up pull requests with deeperGitHub integration. Under the hood: it runs on the beefierGemini 2.5 Promodel. AddsEnvironment Snapshotsfor state cap..

Link
@faun shared a link, 1 day, 7 hours ago

Powering Real-Time AI Applications

Generative AI databases like SingleStore now cramOLTP,OLAP,vector search, andfull-text searchinto one SQL-first platform. Structured, unstructured—it eats both. No ETL. No silos. Just real-time data, ripe for AI models and semantic queries. System shift:Blending transactional and analytic guts in o..

Link
@faun shared a link, 1 day, 7 hours ago

Anthropic Revokes OpenAI’s API Access to Claude, Alleging Violation Ahead of GPT-5 Launch

Anthropic just yanked OpenAI’s API access to Claude. Reason? Alleged violations of terms that forbid using Claude to train rival models—like GPT-5. Windsurf, an OpenAI acquisition target, got the boot earlier too. Spot the pattern: tighten access, box out competitors. System shift:APIs aren’t just ..

Anthropic Revokes OpenAI’s API Access to Claude, Alleging Violation Ahead of GPT-5 Launch
Link
@faun shared a link, 1 day, 7 hours ago

Who does your assistant serve?

OpenAI’s release of GPT-5 backfired: instead of excitement, users felt betrayed by a forced upgrade that stripped away the warmth and reliability they had come to rely on in GPT-4o. Many treated the model as more than a tool — a companion, therapist, or emotional support — so when its personality sh..

Who does your assistant serve?
Link
@faun shared a link, 1 day, 7 hours ago

Implementing MCP Servers in Python: An AI Shopping Assistant with Gradio

Gradio just leveled up. It now auto-converts plain Python functions intoMCP-compliant LLM tools, grabbing input schemas and metadata straight from docstrings. New tricks:real-time progress streaming,auto file uploads, plus tight integration withVS Code’s AI Chatfor wiring up agent workflows...

Link
@faun shared a link, 1 day, 7 hours ago

MCP Registry with Azure API Center

Azure just droppedMCP Center, showing off howAzure API Centercan double as a private registry forModel-Centric Protocol (MCP) servers. It’s built for internal use—think secure discovery, tight OAuth 2 auth, centralized control, and AI Gateway rules baked in. Handy when teams need to corral AI tools..

MCP Registry with Azure API Center
Link
@faun shared a link, 1 day, 7 hours ago

Introducing AWS Cloud Control API MCP Server: Natural Language Infrastructure Management on AWS

AWS dropped theCloud Control API MCP Server, a mouthful of a name for a tool that makes 1,200+ AWS resources manageable through a standard CRUDL API—using natural language. Think: describe what you want, and tools like Amazon Q Developer turn it into actual infra code. It doesn’t stop there. It val..

Introducing AWS Cloud Control API MCP Server: Natural Language Infrastructure Management on AWS
Link
@faun shared a link, 1 day, 7 hours ago

When Did AI Take Over Hacker News?

A fresh dive into 24,910 top Hacker News posts since 2019 shows that AI chatter didn’t blow up with ChatGPT—it took off afterGPT-4 landed in early 2023. The study used OpenAI’s Batch API and a lean GPT-5-mini to crunch the numbers. Turns out,52% of the AI talk was positive, and the busiest stretch?..

When Did AI Take Over Hacker News?