#performance
// 11 transmissions tagged with #performance

Cloudflare launches SOCKMAP, a new TCP splicing mechanism
Cloudflare unveiled SOCKMAP, a kernel‑level TCP splicing feature that cuts latency and streamlines connection handling for high‑performance networking stacks. The code is open for testing and review.

Microsoft's new Outlook adds 10‑second delay versus Outlook Classic
The latest Windows version of Outlook introduces a performance regression, with common tasks taking up to 10 seconds compared to the instant response of Outlook Classic, raising concerns about productivity and competitiveness.

Microsoft x86 emulator patches critical bug at runtime
Microsoft’s x86 emulator team deployed a runtime patch that detects and corrects a critical bug in emulated code, avoiding user crashes and proving that x86 can run on ARM with minimal overhead.

Google releases DiffusionGemma, a model that generates text four times faster
Google’s DiffusionGemma model cuts per‑token latency by a factor of four while preserving text quality, opening the door to real‑time NLP workloads on modest hardware.

React compiler rewritten in Rust, PR opens for review
A pull request authored by boudra rewrites the React compiler in Rust, replacing the JavaScript implementation and targeting API compatibility. The PR is under review on GitHub.

Linear's speed dissected in a technical breakdown
A performance.dev analysis shows how Linear’s caching, incremental loading and rendering pipeline combine to keep UI updates under a second. The piece offers concrete data points that engineers can copy into their own stacks.

Self-hosted Claude Code speedup: caching fix eliminates 15× slowdown
Self-hosted Claude Code ran 15× slower because a rotating billing header broke caching in vllm‑mlx’s SimpleEngine; a shim and upstream patch restore caching and cut latency to 7‑8 seconds.

Nvidia GPU VRAM used as Linux swap space
The nbd-vram project lets Linux users mount Nvidia GPU VRAM as a swap device, offering a fast‑memory alternative for RAM‑starved systems.

Fast mode for Opus 4.7 on AI Gateway cuts latency 2.5x at 6x cost
Vercel's AI Gateway now supports fast mode for Claude Opus 4.7, delivering 2.5x faster output token generation with full model intelligence, priced at $30 input and $150 output per 1M tokens.

Postgres 18's async I/O subsystem hits 3x on sequential scans
Postgres 18 introduced an asynchronous I/O subsystem that issues parallel I/O requests instead of waiting on each one. Benchmarks show up to 3x gains on seq scans, bitmap heap scans, and vacuum.

Bun 2-3x ahead of Node on RPS. The runtime question is now serious.
2026 benchmarks put Bun at 30-50K RPS on standard HTTP workloads vs Node's 13-20K. Deno 2 sits in the middle around 22K. With Deno's full Node-compat and Bun's drop-in story, the runtime decision is no longer academic.