<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0">
  <channel>
    <title>InfoQ - GPU - News</title>
    <link>https://www.infoq.com</link>
    <description>InfoQ GPU News feed</description>
    <item>
      <title>Google New TPU Generation is Specifically Designed for Agents and SOTA Model Training</title>
      <link>https://www.infoq.com/news/2026/05/google-8th-tpu-generation/?utm_campaign=infoq_content&amp;utm_source=infoq&amp;utm_medium=feed&amp;utm_term=GPU-news</link>
      <description>&lt;img src="https://res.infoq.com/news/2026/05/google-8th-tpu-generation/en/headerimage/google-8th-gen-tpus-1778060595193.jpeg"/&gt;&lt;p&gt;Google has unveiled a new generation of Tensor Processing Units (TPUs), featuring two specialized chips designed to accelerate model training and agent workflows, which require continuous, multi-step reasoning, and action loops distributed across multiple models. The new TPUs deliver better performance, memory, and energy efficiency, the company says.&lt;/p&gt; &lt;i&gt;By Sergio De Simone&lt;/i&gt;</description>
      <category>Agents</category>
      <category>Google</category>
      <category>Large language models</category>
      <category>GPU</category>
      <category>Development</category>
      <category>AI, ML &amp; Data Engineering</category>
      <category>news</category>
      <pubDate>Wed, 06 May 2026 10:00:00 GMT</pubDate>
      <guid>https://www.infoq.com/news/2026/05/google-8th-tpu-generation/?utm_campaign=infoq_content&amp;utm_source=infoq&amp;utm_medium=feed&amp;utm_term=GPU-news</guid>
      <dc:creator>Sergio De Simone</dc:creator>
      <dc:date>2026-05-06T10:00:00Z</dc:date>
      <dc:identifier>/news/2026/05/google-8th-tpu-generation/en</dc:identifier>
    </item>
    <item>
      <title>Cloudflare Builds High-Performance Infrastructure for Running LLMs</title>
      <link>https://www.infoq.com/news/2026/05/cloudflare-llm-infrastructure/?utm_campaign=infoq_content&amp;utm_source=infoq&amp;utm_medium=feed&amp;utm_term=GPU-news</link>
      <description>&lt;img src="https://res.infoq.com/news/2026/05/cloudflare-llm-infrastructure/en/headerimage/generatedHeaderImage-1776661318905.jpg"/&gt;&lt;p&gt;Cloudflare has recently announced new infrastructure designed to run large AI language models across its global network. As these models rely on costly hardware and must handle large volumes of incoming and outgoing text, Cloudflare separates the model's input processing and output generation onto different optimized systems.&lt;/p&gt; &lt;i&gt;By Renato Losio&lt;/i&gt;</description>
      <category>Optimization</category>
      <category>Big Data Infrastructure</category>
      <category>AI Architecture</category>
      <category>Cloudflare</category>
      <category>Large language models</category>
      <category>GPU</category>
      <category>Development</category>
      <category>AI, ML &amp; Data Engineering</category>
      <category>news</category>
      <pubDate>Sun, 03 May 2026 10:58:00 GMT</pubDate>
      <guid>https://www.infoq.com/news/2026/05/cloudflare-llm-infrastructure/?utm_campaign=infoq_content&amp;utm_source=infoq&amp;utm_medium=feed&amp;utm_term=GPU-news</guid>
      <dc:creator>Renato Losio</dc:creator>
      <dc:date>2026-05-03T10:58:00Z</dc:date>
      <dc:identifier>/news/2026/05/cloudflare-llm-infrastructure/en</dc:identifier>
    </item>
  </channel>
</rss>
