Cloud Intelligence™Cloud Intelligence™

Read stories from our Forward Deployed Engineers.

Engineering Blog

Trusted by platform engineering teams

Luxury Escapes
Personio
Loop Returns
Kargo
Home Chef
Entain
ClickUp
Exiger
Synthesia
Promptly
Extenda Retail
PlayHQ
Monta
Wicked Reports

Latest

More Posts

From VM Tagging to Token Tracing: The FinOps Code for AI Cost Control
By Ant WeissNov 10, 20255 min read

From VM Tagging to Token Tracing: The FinOps Code for AI Cost Control

LLMs So Hot They Burn A Hole In Your Pocket

LLMs in production: optimising from multi-second to sub-second latency and getting 50x cost reductions for free
By Matthias BaetensAug 6, 20255 min read

LLMs in production: optimising from multi-second to sub-second latency and getting 50x cost reductions for free

When you’re dealing with a critical cloud infrastructure issue, every second counts. You need help fast, and you need it to be accurate. But even when you’re not in a rush, you don’t want to spend your precious time going through long forms asking to be filled out; you want to describe your problem and […]

LLMs in production: optimising from multi-second to sub-second latency and getting 50x cost…
By Matthias BaetensAug 6, 20255 min read

LLMs in production: optimising from multi-second to sub-second latency and getting 50x cost…

When you’re dealing with a critical cloud infrastructure issue, every second counts. You need help fast, and you need it to be accurate…

DoiT launches its local MCP server for DoiT Cloud Intelligence™: Explore your cloud costs & usage wherever you use AI
By Tal CohenMay 22, 20256 min read

DoiT launches its local MCP server for DoiT Cloud Intelligence™: Explore your cloud costs & usage wherever you use AI

DoiT's MCP server transforms how you analyze cloud costs through AI assistants. Ask about spending spikes, service outages, or billing anomalies in plain language and get instant, data-driven insights from your actual cloud environment.

Hosting Your LLM Model on Amazon SageMaker for AI-Assisted Coding
By Dr. Richard KangMar 26, 20258 min read

Hosting Your LLM Model on Amazon SageMaker for AI-Assisted Coding

Empowering Enterprise Developer Productivity with Secure, Self-Hosted AI Coding Assistants on Amazon SageMaker

Google Cloud LLM implementation: Key takeaways from our live Q&A
By Matan BordoNov 20, 202411 min read

Google Cloud LLM implementation: Key takeaways from our live Q&A

Learn how to implement LLMs on Google Cloud from DoiT's AI experts. Get practical insights on model selection, cost management, RAG implementation with Google Workspace, API testing strategies, and step-by-step guidance for your GenAI journey.

Purr-fecting Data Orchestration: 🐈 BasePaws Data Meets Cloud Composer and LLMs
By Matthew PorterNov 11, 202418 min read

Purr-fecting Data Orchestration: 🐈 BasePaws Data Meets Cloud Composer and LLMs

Analyze whole cat genome sequencing data scalable, reliably, and with greater ease using Cloud Composer and Claude 3.5 Sonnet

From Ideation to Production with AWS
By Rupal BhattJul 29, 202418 min read

From Ideation to Production with AWS

Here is a brief map for starting your journey for implementing LLM in your workload. The journey from ideation to production is an exciting…

Anatomy of an LLM
By Eduardo MotaJun 12, 202429 min read

Anatomy of an LLM

Large language models (LLMs) like Claude, Cohere, and Llama2 have exploded in popularity recently. But what exactly are they and how can you leverage LLMs to build impactful AI applications? This article is an in-depth look at LLMs and how to use them effectively on AWS. What are LLMs and How Do They Work? LLMs […]

GenAI: Anatomy of an LLM
By Eduardo MotaMay 30, 202428 min read

GenAI: Anatomy of an LLM

Maximizing Impact with Large Language Models (LLMs): Strategies for Building AI Applications on AWS

DoiT and Google Join Forces to Advance Generative AI Development
By Vadim SoloveyMay 29, 20243 min read

DoiT and Google Join Forces to Advance Generative AI Development

DoiT International and Google Cloud are strengthening their partnership to help enterprises accelerate generative AI adoption through hands-on workshops and expert guidance.