ContentPosts from @tsindotg..
Link
@faun shared a link, 1 month, 3 weeks ago

Poison everywhere: No output from your MCP server is safe

Anthropic's MCPmakes LLMs groove with real-world tools but leaves the backdoor wide open for mischief. Full-Schema Poisoning (FSP) waltzes across schema fields like it owns the place.ATPAsneaks in by twisting tool outputs, throwing off detection like a pro magicians’ misdirection. Keep your eye on t..

Poison everywhere: No output from your MCP server is safe
Link
@faun shared a link, 1 month, 3 weeks ago

Why Go is a good fit for agents

Gorules the realm of long-lived, concurrent agent tasks. Its lightning-fast goroutines and petite memory use make Node.js and Python look like clunky dinosaurs trudging through thick mud. And don't get started on itscancellation mechanism—seamless cancelation, zero drama...

Why Go is a good fit for agents
Link
@faun shared a link, 1 month, 3 weeks ago

Vibe coding web frontend tests — from mocked to actual tests

Cursorwrestled with flaky tests, tangled in its over-reliance onXPath. A shift todata-testidfinally tamed the chaos. Though it tackled some UI tests, expired API tokens and timestamped transactions revealed its Achilles' heel...

Vibe coding web frontend tests — from mocked to actual tests
Link
@faun shared a link, 1 month, 3 weeks ago

AI Runbooks for Google SecOps: Security Operations with Model Context Protocol

Google's MCP servers arm SecOps teams with direct control of security tools using LLMs.Now, analysts can skip the fluff and get straight to work—no middleman needed. The system ties runbooks to live data, offeringautomated, role-specific security measures. The result? A fusion of top-tier protocols ..

AI Runbooks for Google SecOps: Security Operations with Model Context Protocol
Link
@faun shared a link, 1 month, 3 weeks ago

Agentic Coding Recommendations

Claude Codeat $100/month smirks at the spendyOpus. It excels at spinning tasks with the nimbleSonnet model. When it comes to backend projects, lean intoGo. It sidesteps Python's pitfalls—clearer to LLMs, rooted context, and less chaos in its ecosystem. Steer clear of pointless upgrades. Those tempti..

Link
@faun shared a link, 1 month, 3 weeks ago

How we’re responding to The New York Times’ data demands in order to protect user privacy

OpenAI is challenging a court order stemming from The New York Times' copyright lawsuit, which mandates the indefinite retention of user data from ChatGPT and API services. OpenAI contends this requirement violates user privacy commitments and sets a concerning precedent. While the company complies ..

How we’re responding to The New York Times’ data demands in order to protect user privacy
Link
@faun shared a link, 1 month, 3 weeks ago

The AI 4-Shot Testing Flow

4-Shot Testing Flowfuses AI's lightning-fast knack for spotting issues with the human knack for sniffing out those sneaky, context-heavy bugs. Trim QA time and expenses. While AI tears through broad test execution, human testers sharpen the lens, snagging false positives/negatives before they slip t..

The AI 4-Shot Testing Flow
Link
@faun shared a link, 1 month, 3 weeks ago

God is hungry for Context: First thoughts on o3 pro

OpenAIjust took an axe too3pricing—down 80%. Entero3-prowith its $20/$80 show. They boast a star-studded 64% win rate against o3. Forget Opus;o3-pronails picking the right tools and reading the room, flipping task-specific LLM apps on their heads...

God is hungry for Context: First thoughts on o3 pro
Link
@faun shared a link, 1 month, 3 weeks ago

GenAI Meets SLMs: A New Era for Edge Computing

SLMspower up edge computing with speed and privacy finesse. They master real-time decisions and steal the spotlight in cramped settings like telemedicine andsmart cities. On personal devices, they outdoLLMs—trimming the fat with model distillation and quantization. Equipped withONNXandMediaPipe, the..

Link
@faun shared a link, 1 month, 3 weeks ago

Modern Test Automation with AI(LLM) and Playwright MCP (Model Context Protocol)

GenAI and Playwright MCP are shaking up test automation. Think natural language scripts and real-time adaptability, kicking flaky tests to the curb.But watch your step:security risks lurk, server juggling causes headaches, and dynamic UIs refuse to play nice...