Look to these key metrics and benchmarks to evaluate the performance, capability, reliability, and safety of your AI models ...
🎉 2026-02-14 · v0.1.3 Released. The v0.1.3 release introduces full support for the latest GLM-5 model, achieving up to 500 tokens/s on GLM-5-FP8 and up to 600 tokens/s on DeepSeek-V3.2. TileRT is a ...
👉 Try Gemma4-E4B with MTP on Linux, macOS, Windows or Raspberry Pi with the LiteRT-LM CLI: litert-lm run \ --from-huggingface-repo=litert-community/gemma-4-E2B-it ...