What's new in TensorFlow 2.21
Source: Google Developers Blog
What’s new in the LiteRT stack?

At Google I/O ‘25, we shared a preview of the evolution to LiteRT: a high‑performance runtime designed specifically for advanced hardware acceleration. Today, we are excited to announce that these advanced acceleration capabilities have fully graduated into the LiteRT** production stack**, available now for all developers.
This milestone solidifies LiteRT as the universal on‑device inference framework for the AI era, representing a significant leap over TFLite for being:
- Faster – delivers 1.4× faster GPU performance than TFLite, and introduces new, state‑of‑the‑art NPU acceleration.
- Simpler – provides a unified, streamlined workflow for GPU and NPU acceleration across edge platforms.
- Powerful – supports superior cross‑platform GenAI deployment for popular open models like Gemma.
- Flexible – offers first‑class PyTorch/JAX support via seamless model conversion.
All of this is delivered while maintaining the same reliable, cross‑platform deployment you trust since TFLite.
Read the full announcement and get started.
tf.lite
- Several operators now support lower‑precision data types for better performance and efficiency, including int8 and int16x8 for the
SQRToperator and int16x8 for comparison operators. - Support for smaller data types has been extended across multiple operators:
tfl.castnow supports conversions involvingINT2andINT4;tfl.slicehas added support forINT4; andtfl.fully_connectednow includes support forINT2.
Community updates
We’ve heard from the community about the need for fixing bugs quickly and providing more timely dependency updates, so we are increasing resources toward these efforts. Going forward, we will exclusively focus on:
- Security and bug fixes – accelerating the resolution of security vulnerabilities and critical bugs, with minor and patch releases as required.
- Dependency updates – releasing minor versions as needed to support new Python releases and other dependencies.
- Community contributions – continuing to review and accept critical bug fixes from the open‑source community.
These commitments apply to TF.data, TensorFlow Serving, TFX, TensorFlow Data Validation, TensorFlow Transform, TensorFlow Model Analysis, TensorFlow Recommenders, TensorFlow Text, TensorBoard, and TensorFlow Quantum.
Note: The TF Lite project has been renamed to LiteRT and is in active development separately.
While TensorFlow continues to provide stability for production, we recommend exploring our latest updates for Keras 3, JAX, and PyTorch for new work in generative AI.