Qwen2.5-Coder-3B-Instruct LiteRT-LM Model

This repository contains LiteRT-LM variant of Qwen/Qwen2.5-Coder-3B-Instruct optimized for on-device text generation.

Available Artifact

File Quantization Recipe Context Size
Qwen2.5_Coder_3B_It.litertlm dynamic_wi8_afp32 - 3.4 GB

Integration

Ready to integrate this into your product? Get started in the LiteRT-LM documentation.

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support