Tech Week Singapore 2025

From GPU Waste to Smart Scaling: Building Cost-Effective Private AI Infrastructure

08 Oct 2025
Productivity Optimisation & AI Adoption Theatre

GPU hardware dominates AI infrastructure costs, yet most private deployments suffer from chronically low utilization because resources are allocated statically. This session demonstrates how open-source elastic inference technology lets a shared GPU pool serve multiple models dynamically, reducing infrastructure costs while maintaining production-grade performance.

Speaker(s)
Yanzhen Yu, R&D Manager - Arcfra