Google's Gemma 4 open AI models use "speculative decoding" to get up to 3x faster - TrendCloud