Cloud Masters Episode #108
The cost impact of Large Language Models (LLMs) in production
We cover the ever-growing importance of Large Language Models (LLMs) in applications, how LLM costs can easily compound once in production, and break down the costs associated with using LLMs.
With DoiT Spot Scaling, automate your AWS Spot Instances to save up to 90% on compute spend without compromising reliability.

Episode notes

About the guests

Gad Benram
Gad is the founder and CTO of TensorOps, which offers expert services for AI-driven applications as well as AIOps and AI cost optimization.
Gabriel Gonçalves
Gabriel is an ML Solutions Architect at TensorOps. He specializes in crafting intelligent solutions and architectures for Large Language Model applications.
Sascha Heyer
Sascha Heyer is a Senior Machine Learning Specialist at DoiT, a Google Developer Expert, and a Google Cloud Innovator. He has helped over 306 companies grow in the field of Machine Learning. Sascha believes in keeping things simple, a mindset that helps him clarify complex tech concepts, and he demystifies complex topics for a broad audience through engaging YouTube presentations and insightful Medium articles (https://medium.com/@saschaheyer).
Related content

No longer a pipe dream — Gen AI and data pipelines
Exploring the impact that Gen AI will have on data pipelines and data engineering overall.
Observability of LLMs in Google Cloud
ML and AI specialists Eduardo Mota and Sascha Heyer join us to explore the complexities of observability of LLM-powered features. Packed with tons of real-life customer anecdotes and best practices, they discuss the challenges and strategies for monitoring Gen AI systems, emphasizing the importance of metrics in understanding system interactions, especially given Gen AI’s non-deterministic nature.
Gaining visibility over your LLMs with LLMStudio
Mentioned in the podcast, LLMStudio is designed to streamline interactions with large language models (LLMs). It focuses on prompt engineering, a critical skill for getting the most out of LLMs.

Connect With Us