Join us

FAUN.dev() is where engineers from GitHub, Netflix, and Shopify go to stay ahead — fast.

An effortless, straightforward way to keep up with technologies...so you can keep your tabs closed and your mind open!

70,000+ developers already joined our ecosystem ⭐⭐⭐⭐⭐
Trusted by engineers at:

Google • Microsoft • AWS • Netflix

GPT-5.4

GPT-5.4 is OpenAI’s latest frontier AI model designed for professional and developer workflows. It combines advanced reasoning, strong coding capabilities, and improved tool use into a single system …

Featured Course(s)

Practical MCP with FastMCP & LangChain

Engineering the Agentic Experience

> Get Your Copy

Content

Updates and recent posts about GPT-5.4..

Posts
Description

Link

@varbear shared a link, 3 weeks, 2 days ago

FAUN.dev()

How Microsoft Vaporized a Trillion Dollars

A former Azure Core engineer recounts arriving on his first day to find a 122-person org seriously planning to port Windows-based VM management agents - 173 of them, which nobody could fully explain - onto a tiny, low-power ARM chip running Linux. The stack was already failing to scale on server-gra.. read more

How Microsoft Vaporized a Trillion Dollars

Link

@kaptain shared a link, 3 weeks, 2 days ago

FAUN.dev()

Sandboxes: Run Agents in YOLO Mode, Safely

Over a quarterof production code is now AI-authored, with agents boosting pull requests by 60% when allowed to run autonomously in YOLO mode. Docker Sandboxes provide a safe boundary for agents, enabling fully autonomous operation without risking your machine or data... read more

Sandboxes: Run Agents in YOLO Mode, Safely

Link

@kaptain shared a link, 3 weeks, 2 days ago

FAUN.dev()

Kubernetes v1.36 Sneak Peek

Kubernetes v1.36, coming inApril 2026, will feature removals and deprecations, with enhancements that include retirement of the Ingress NGINX project and thedeprecation of .spec.externalIPs in Service.Additionally, the release will remove the gitRepo volume driver and introduce enhancements like fas.. read more

Link

@kaptain shared a link, 3 weeks, 2 days ago

FAUN.dev()

Broadcom Makes Its Pitch To Run Kubernetes On VMware VCF

Broadcom's $69 billion acquisition of virtualization pioneer VMware in late 2023 brought about significant price increases and a shift towards subscription-based licensing. The company aims to establish VMware Cloud Foundation (VCF) as the foundation for enterprise workloads gravitating towards priv.. read more

Broadcom Makes Its Pitch To Run Kubernetes On VMware VCF

Link

@kaptain shared a link, 3 weeks, 2 days ago

FAUN.dev()

Docker Offload now Generally Available: The Full Power of Docker, for Every Developer, Everywhere.

Docker Offload is a managed cloud service that moves the container engine to Docker’s secure cloud, allowing developers to run Docker from any environment without changing their workflows. With Docker Offload, developers can keep using the same commands and workflows they are accustomed to in Docker.. read more

Docker Offload now Generally Available: The Full Power of Docker, for Every Developer, Everywhere.

Link

@kaptain shared a link, 3 weeks, 2 days ago

FAUN.dev()

llm-d officially a CNCF Sandbox project

At Google Cloud, the llm-d project has been accepted as a Cloud Native Computing Foundation (CNCF) Sandbox project. This collaboration with industry leaders like Red Hat, IBM Research, CoreWeave, and NVIDIA aims to provide a framework for any model, accelerator, or cloud. The introduction of GKE Inf.. read more

llm-d officially a CNCF Sandbox project

Link

@kala shared a link, 3 weeks, 2 days ago

FAUN.dev()

From zero to a RAG system: successes and failures

An engineer spun up an internal chat with a localLLaMAmodel viaOllama, a PythonFlaskAPI, and aStreamlitfrontend. They moved off in-memoryLlamaIndexto batch ingestion intoChromaDB(SQLite). Checkpoints and tolerant parsing went in to stop RAM disasters. Indexing produced 738,470 vectors (~54 GB). They.. read more

From zero to a RAG system: successes and failures

Link

@kala shared a link, 3 weeks, 2 days ago

FAUN.dev()

Why we're rethinking cache for the AI era

Cloudflare data shows that 32% of network traffic originates from automated traffic, including AI assistants fetching data for responses. AI bots often issue high-volume requests and access rarely visited content, impacting cache efficiency. Cloudflare researchers propose AI-aware caching algorithms.. read more

Why we're rethinking cache for the AI era

Link

@kala shared a link, 3 weeks, 2 days ago

FAUN.dev()

Our most intelligent open models, built from Gemini 3 research and technology to maximize intelligence-per-parameter

Built from Gemini 3 research and technology, Gemma 4 offers maximum compute and memory efficiency for mobile and IoT devices. Develop autonomous agents, multimodal applications, and multilingual experiences with Gemma 4's unprecedented intelligence-per-parameter... read more

Our most intelligent open models, built from Gemini 3 research and technology to maximize intelligence-per-parameter

Link

@kala shared a link, 3 weeks, 2 days ago

FAUN.dev()

Qwen3.6-Plus: Towards Real World Agents

Qwen3.6-Plus, the latest release following Qwen3.5 series, offers enhanced agentic coding capabilities and sharper multimodal reasoning. The model excels in frontend web development and complex problem-solving, setting a new standard in the developer ecosystem. Qwen3.6-Plus is available via Alibaba .. read more

GPT-5.4 is OpenAI’s latest frontier AI model designed to perform complex professional and technical work more reliably. It combines advances in reasoning, coding, tool use, and long-context understanding into a single system capable of handling multi-step workflows across software environments. The model builds on earlier GPT-5 releases while integrating the strong coding capabilities previously introduced with GPT-5.3-Codex.

One of the defining features of GPT-5.4 is its ability to operate as part of agent-style workflows. The model can interact with tools, APIs, and external systems to complete tasks that extend beyond simple text generation. It also introduces native computer-use capabilities, allowing AI agents to operate applications using keyboard and mouse commands, screenshots, and browser automation frameworks such as Playwright.

GPT-5.4 supports context windows of up to one million tokens, enabling it to process and reason over very large documents, long conversations, or complex project contexts. This makes it suitable for tasks such as analyzing codebases, generating technical documentation, working with large spreadsheets, or coordinating long-running workflows. The model also introduces a feature called tool search, which allows it to dynamically retrieve tool definitions only when needed. This reduces token usage and makes it more efficient to work with large ecosystems of tools, including environments with dozens of APIs or MCP servers.

In addition to improved reasoning and automation capabilities, GPT-5.4 focuses on real-world productivity tasks. It performs better at generating and editing spreadsheets, presentations, and documents, and it is designed to maintain stronger context across longer reasoning processes. The model also improves factual accuracy and reduces hallucinations compared with previous versions.

GPT-5.4 is available across OpenAI’s ecosystem, including ChatGPT, the OpenAI API, and Codex. A higher-performance variant, GPT-5.4 Pro, is also available for users and developers who require maximum performance for complex tasks such as advanced research, large-scale automation, and demanding engineering workflows. Together, these capabilities position GPT-5.4 as a model aimed not just at conversation, but at executing real work across software systems.