What's new in TensorFlow 2.21

Published: 3 days ago (March 6, 2026 at 07:53 PM EST)

2 min read

Source: Google Developers Blog

What’s new in the LiteRT stack?

At Google I/O ‘25, we shared a preview of the evolution to LiteRT: a high‑performance runtime designed specifically for advanced hardware acceleration. Today, we are excited to announce that these advanced acceleration capabilities have fully graduated into the LiteRT** production stack**, available now for all developers.

This milestone solidifies LiteRT as the universal on‑device inference framework for the AI era, representing a significant leap over TFLite for being:

Faster – delivers 1.4× faster GPU performance than TFLite, and introduces new, state‑of‑the‑art NPU acceleration.
Simpler – provides a unified, streamlined workflow for GPU and NPU acceleration across edge platforms.
Powerful – supports superior cross‑platform GenAI deployment for popular open models like Gemma.
Flexible – offers first‑class PyTorch/JAX support via seamless model conversion.

All of this is delivered while maintaining the same reliable, cross‑platform deployment you trust since TFLite.

Read the full announcement and get started.

tf.lite

Several operators now support lower‑precision data types for better performance and efficiency, including int8 and int16x8 for the SQRT operator and int16x8 for comparison operators.
Support for smaller data types has been extended across multiple operators: tfl.cast now supports conversions involving INT2 and INT4; tfl.slice has added support for INT4; and tfl.fully_connected now includes support for INT2.

Community updates

We’ve heard from the community about the need for fixing bugs quickly and providing more timely dependency updates, so we are increasing resources toward these efforts. Going forward, we will exclusively focus on:

Security and bug fixes – accelerating the resolution of security vulnerabilities and critical bugs, with minor and patch releases as required.
Dependency updates – releasing minor versions as needed to support new Python releases and other dependencies.
Community contributions – continuing to review and accept critical bug fixes from the open‑source community.

These commitments apply to TF.data, TensorFlow Serving, TFX, TensorFlow Data Validation, TensorFlow Transform, TensorFlow Model Analysis, TensorFlow Recommenders, TensorFlow Text, TensorBoard, and TensorFlow Quantum.

Note: The TF Lite project has been renamed to LiteRT and is in active development separately.

While TensorFlow continues to provide stability for production, we recommend exploring our latest updates for Keras 3, JAX, and PyTorch for new work in generative AI.

What's new in TensorFlow 2.21

What’s new in the LiteRT stack?

tf.lite

Community updates

Related posts

You can't stream the energy: A developer's guide to Google Cloud Next '26 in Vegas

On-Device Function Calling in Google AI Edge Gallery

Get ready for Google I/O 2026

Introducing Wednesday Build Hour