February 21–27, 2026 (Apache Iceberg PMC, Nvidia v AMD, etc.)
This week on The Data, Lakehouse and AI Show, we are covering one of the most significant weeks in the history of the open data ecosystem. Apache Polaris has officially graduated from the Apache Incubator to become a Top-Level Apache Software Foundation project. Co-created by Dremio and built around the Apache Iceberg REST Catalog specification, Polaris received 27 binding +1 votes in its graduation round, reflecting the community confidence built through six releases, more than 100 contributors, and nearly 3,000 merged pull requests. With graduation complete, the new Polaris PMC now operates with full ASF oversight and a community-driven roadmap focused on credential vending for non-AWS storage backends, deeper Delta Lake support, and idempotent commit operations. This is the open catalog moment the lakehouse ecosystem has been building toward.
Over on the Apache Iceberg dev list, the community is deep in V4 planning. The hottest thread this week is the proposal to make the root metadata.json file optional — for streaming write workloads on Hadoop and HMS catalog backends, writing that file on every commit creates real performance bottlenecks, and two paths are under debate. We also saw the first sync for Iceberg's native index support feature, a key V4 capability enabling faster lookups without full table scans. And a concurrency bug was flagged in the snapshot expiration logic where a race window between the ExpireSnapshots job and concurrent ref additions could cause live snapshots to be incorrectly removed. The iceberg-go project already has a fix, and the Java team is investigating.
On the Apache Arrow side, a fascinating thread challenged the use of QUIC for IPC stream multiplexing — QUIC's independent delivery model conflicts with use cases requiring explicit ordering across batches from different logical streams. Arrow is also joining Google Summer of Code 2026 and published a formal security model this week. Apache Parquet continued advancing the Adaptive Lossless floating-Point encoding spec, which enables more compact storage of floating-point columns — a meaningful win for ML feature tables and financial datasets. Parquet Java 1.17.0, which dropped Java 8 support, is now moving into broader production adoption, so if your team is still on Java 8, it is time to plan that upgrade.
In the AI chipset world, NVIDIA gave CNBC an exclusive first look at the Vera Rubin system — 72 Rubin GPUs and 36 Vera CPUs in a fully liquid-cooled rack-scale NVL72, set to ship in the second half of 2026. NVIDIA claims Rubin delivers a 10x reduction in inference token cost and 4x fewer GPUs needed to train MoE models compared to Blackwell. Jensen Huang teased a chip that will "surprise the world" at GTC on March 16. Meta expanded its NVIDIA partnership in a deal worth tens of billions, locking in millions of GPUs, Vera Rubin rack systems, and becoming the first company to deploy NVIDIA's Grace CPUs as standalone data center chips.
In the agentic AI tooling space, Apple released Xcode 26.3 with native support for Claude Agent and OpenAI Codex, exposing Xcode's capabilities through the Model Context Protocol — one of the clearest mainstream signals yet that MCP is becoming the plumbing layer for agentic developer tools. OpenAI launched a macOS Codex app alongside GPT-5.2-Codex, their most advanced agentic coding model, with improvements in long-horizon context compaction, large-scale refactors, and cybersecurity capabilities.
On the Dremio front, the big news is STACKIT Dremio entering public preview — a sovereign, EU-operated lakehouse service built on Apache Iceberg and Apache Polaris for organizations that need EU data residency without sacrificing open standards. STACKIT's own internal deployment achieved 40% lower TCO and 4x faster time to insight. And the Forrester Q1 2026 Data Lakehouses Landscape confirms it: lakehouses are no longer experimental — they are the default architectural choice for modern analytics and AI.