GPU Time-Slicing for Concurrent LLM Agents on Kubernetes - TrendCloud