How NVIDIA’s Inference Software Stack Powers the Lowest Token Cost - TrendCloud