How Trax Cut 75% of Kubernetes spend with PerfectScale by DoiT
Meet Trax
Traxโs mission is to enable brands and retailers to harness the power of digital technologies to produce the best shopping experiences for customers. Its industry-leading innovations and excellence in developing advanced technologies and autonomous data collection methods are driving positive shopper experiences and unlocking revenue opportunities at all points of sale.
Trax’s solution portfolio provides mission-critical metrics, analytics, and services that help customers save time and money by improving their shopping experience. Kubernetes is a key component of their infrastructure, enabling Trax to continuously innovate its solution while providing the scalability to consistently meet demand. Trax has grown to a large-scale multicloud, multi-cluster environment supporting customers in over 90 countries, including some of the globe’s largest enterprises.
The Challenge
At the beginning of the year, the Chief Financial Officer (CFO) laid out strong efficiency and unit-economics goals throughout the whole organization. For Mark Serdze, Director of Cloud Infrastructure, and his team, this meant quickly taking action to optimize their cloud costs.
Trax achieved fast results outside of Kubernetes, but hit roadblocks safely optimizing at scale within its clusters.
โWe started optimizing manually with available metrics, using Vertical Pod Autoscaler (VPA), cluster logs, and our monitoring solutions,โ explained Serdze. โThis approach didn’t give us proper clarity and was challenging to scale efficiently without big development needs. This left us taking ad-hoc, reactive actions that were having minimal impact on our goals.โ
Traxโs existing tooling also introduced friction into the optimization process. Even when team members identified potential opportunities, the approval process to validate the best course of action was based on instinct rather than evidence. Due to the lack of visibility, some missteps led to extra engineering toil, created internal tension, and introduced risks that could compromise service resilience.
It quickly became evident that their toolset lacked real-time intelligence and guardrails to optimize Kubernetes effectively.
The Solution
Shortly after deploying PerfectScale by DoiT, the team gained real-time, workload-level visibility they were missing. Across Traxโs 200+ microservice environment, they had clear visibility into what resources each service needed and identified the most significant opportunities to eliminate waste without risking performance.
The platformโs AI-guided intelligence allowed the team to take safe, evidence-backed action and improve unit economics quickly. By evaluating efficiencyโresilience trade-offs, they could adjust resources safely and efficiently while protecting SLOs.
โThe cost optimization recommendations were key for us, telling us what actions to take with a clear understanding of the impact each change would have,โ Serdze explained. โIn one of our clusters, we were able to reduce cost by 75%, saving us over a hundred thousands dollars in yearly expenses.โ
Furthermore, Trax was impressed by the comprehensive data and intelligence the solution provided across its entire environment. This implementation allowed it to upgrade its cost visibility toolset without impacting its budget.
โWe were able to replace a FinOps tool we were using that didn’t provide granular cost details or offer guidance on how to optimize our environment,โ explained Serdze. โPerfectScale is a tool built for the engineering teams, not just for finance, which made it easier for us to make the cost impacts we wanted.โ
Kubernetes optimization to improve business metrics
After eliminating wasted resources, Trax focused on identifying additional opportunities to improve efficiency and unit economics. The team delved into PerfectScale’s data, seeking ways to meaningfully impact their cost-centric business metrics.
โA key metric for us is โcost per processing,โ which is heavily affected by our Kubernetes efficiency,โ said Serdze. โIf it gets over a certain amount, we are under a lot of pressure to figure out why and to take actions to reduce it.โ
PerfectScale has a unique feature that consolidates every replica of a service into a single view to provide a clear picture of the utilization trends across all replicas, which is especially useful for ephemeral workloads like Spark or Flink jobs. Trax leveraged this capability to understand better the heterogeneous utilization across the replicas of several of its heavily used services. This level of visibility helped Trax re-architect services to drive additional efficiency gains without impacting resilience or availability.
โWe were able to build multiple flavors of the service with different levels of resources and route the incoming requests to the proper service based on the size of the data,โ explained Serdze. โThis made a big impact on our โcost per processingโ metric. PerfectScale surfaced this data instantly, and without them, we would have spent countless hours evaluating hundreds of replicas to generate the same results.โ
The Results
By adopting PerfectScale by DoiT, Trax achieved rapid, measurable results that aligned with both technical and financial goals:
- 75% reduction in Kubernetes costs within one cluster, over six figures in annual savings, while maintaining performance
- Replaced an underperforming FinOps tool with an engineer-first, prescriptive platform, without increasing budget
- Improved โcost per processingโ by rearchitecting services based on real-time, replica-level insights
- Accelerated decision-making with AI-driven recommendations, eliminating guesswork and reducing the risk of service disruption
Beyond the numbers, Trax gained strategic clarity. What once required hours of manual analysis and risky trial-and-error is now automated, guided, and precise. With PerfectScale by DoiT, Trax not only met significant cost targets but also set the foundation for long-term Kubernetes efficiency at scale.
โThe support during the proof of concept (POC) made a big impact in helping us drive quick results,โ said Serdze. โThe PerfectScale team sat with us, helped us optimize, and ensured our success in using the platform. I have not seen this level of commitment from other vendors, and I am glad we found a partner we can rely on to help us keep our Kubernetes cost in check as we continue to scale.โ