Ars Technica

Google's Gemma 4 open AI models use "speculative decoding" to get up to 3x faster

Up to 3x the speed with no loss of quality. Is it too good to be true?

May 6, 3:44 PM