Improving AI agents through better evaluations - TrendCloud