Gemma 4 with quantization-aware training
Since releasing Gemma 4 two months ago, we’ve been continuously working to expand its capabilities. First, we introduced Multi-Token Prediction (MTP) to accelerate inference, and just a couple of days ago, we released a 12B model to bridge the gap between our E4B and 26B MoE models. Today, we are releasing new checkpoints optimized with […]