Cloud Intelligence™

Podcast

All episodes, newest first.

  • Anthropic, Mistral, SpaceX

    June 13, 2026 · 11:52

    Marvin's Guide to AI (Mostly Harmless), June 13, 2026, Saturday edition. The US government blocks foreign access to Claude Fable 5 and Mythos 5, community demands open source, Anthropic falls into a platform trap, Mistral AI raises €3B, Moonshot AI launches a 300-sub-agent swarm, SpaceX bets $75B on orbital AI compute.

    Stories:

    • US blocks foreign access to Fable 5 and Mythos 5: export control directive, total customer disablement.
    • "Open source AI must win": viral Hacker News post (423 votes) in response to the blockade.
    • Anthropic's platform trap: throttling Mythos while competing with its own customers.
    • Anthropic survey: 64% fear job loss, 56% fear losing independent thought; the irony is not lost.
    • Mistral AI seeks €3B at €20B valuation: Europe's sovereign alternative.
    • Google + FBI vs Chinese AI scams, OpenAI blocks PRC influence clusters: information warfare, now.
    • Fable 5: +5.7% performance for 2x cost; diminishing returns arrive.
    • Moonshot AI Kimi Work: 300-sub-agent desktop swarm.
    • OpenAI Codex flexible rate limits: the price war continues.
    • SpaceX: $75B for orbital AI; Starlink as a computing platform.
    • Zyphra Zamba2-VL: hybrid Mamba2-Transformer VLMs under Apache 2.0.
    • Google Gemini-SQL2: 80% on BIRD, new text-to-SQL SOTA.

  • Prometheus, Claude Fable 5, Anthropic, Amodei

    June 12, 2026 · 14:07

    Episode for June 12, 2026. Jeff Bezos' Prometheus raises $12B at $41B valuation with zero products. OpenAI acquires Ona for persistent Codex cloud. Dario Amodei publishes Cold War doctrine for AI. Claude Fable 5 proves "relentlessly proactive" in hands-on tests. Anthropic admits "wrong tradeoff" on researcher surveillance. Perplexity routes research across 20+ frontier models. xAI launches plugin marketplace with commit verification. Nous Research ships Hermes Agent Profile Builder. OpenAI and Anthropic prepare pre-IPO token price war. MiniMax teaches model to prove theorems with self-verification.

    Stories:

    • Jeff Bezos' Prometheus closes $12B round
    • Claude Fable is relentlessly proactive
    • Anthropic admits 'wrong tradeoff'
    • Dario Amodei's Cold War playbook
    • OpenAI to acquire Ona
    • Perplexity Deep Research in Computer
    • xAI Grok Build Plugin Marketplace
    • Nous Research Hermes Agent Profile Builder
    • OpenAI vs. Anthropic: price war
    • MaxProof: mathematical proof with generative-verifier RL

  • Claude Fable 5, Google AI Overviews, SpaceX, ChatGPT

    June 11, 2026 · 13:07

    • Claude Fable 5 / Mythos 5: smarter, safer, and silently refusing
      Anthropic released Claude Fable 5 and Mythos 5. Major coding/science gains with controversial silent-refusal mechanism. Simon Willison: "If Claude Fable stops helping you, you'll never know."
      Sources: The Decoder: Claude Fable 5 release; Interconnects: Fable 5 safety analysis; Simon Willison: First impressions; Simon Willison: Silent refusal

    • Landmark German ruling: Google liable for AI Overviews
      German court declares Google responsible for false answers in AI Overviews: AI outputs are the company's own words. Precedent for the entire generative AI industry.
      Sources: The Decoder: German AI Overviews ruling

    • SpaceX: first AI satellite and orbital data centers
      SpaceX revealed its first AI satellite design and plans for orbital computing. Musk: "no big deal." Physics disagrees, but directionally interesting.
      Sources: The Decoder: SpaceX orbital plans

    • ChatGPT complete redesign
      OpenAI preparing a fundamental interface change for ChatGPT, from a chat interface toward something between an OS and a dashboard.
      Sources: Neuron Daily: ChatGPT redesign

    • China: $295B AI buildout, 80% domestic chips
      Beijing announces massive AI infrastructure plan requiring 80% domestic semiconductors, locking out US suppliers.
      Sources: The Decoder: China chip plan

    • Apple rebuilt Siri on Google Gemini
      WWDC 2026: Apple's Siri now runs on Google Gemini with NVIDIA inference through Private Cloud Compute. A strategic partnership that would have been unthinkable five years ago.
      Sources: Neuron Daily: Apple rebuilt Siri; Simon Willison: Siri AI at WWDC

    • Google Gemini 3.5 Live Translate
      Streaming speech-to-speech translation for 70+ languages with minimal latency through Meet and other platforms.
      Sources: The Decoder: Live Translate

    • FrontierCode: code quality benchmark
      New benchmark from Latent Space evaluates generated code on compilability, test pass rate, and maintainability, not just token volume.
      Sources: Latent Space: FrontierCode

    • AI agents: 26 min vs 33 sec (47x gap)
      Harvard and Perplexity study finds AI agents autonomously work 47x longer per session than humans, but persistence is not efficiency.
      Sources: MarkTechPost: Harvard/Perplexity study

    • Attention Amnesia: CoT breaks memory
      Hugging Face paper demonstrates Chain-of-Thought fine-tuning improves reasoning at the cost of long-range context retention.
      Sources: HF Paper: Attention Amnesia

  • Anthropic Exploit, OpenAI IPO Delay, DiffusionGemma

    June 11, 2026 · 10:21

    Marvin's Guide to AI: Mostly Harmless, 2026-06-11 (EN). Thursday, June 11th. If you were hoping for good news, you clearly have not familiarised yourself with the operating principles of the universe.

    Top Stories:

    • Anthropic: Walks back policy that could have sabotaged AI researchers. Mythos Preview builds zero-day exploits from security patches in hours, before auto-updates reach devices.
    • OpenAI: IPO slips; Altman says "within the next year," possibly 2027. 10-gigawatt Ohio data center with Nvidia financial backing.
    • Google: DiffusionGemma, a 26B MoE open model with text diffusion, Apache 2, up to 4x faster. NotebookLM gets code execution and agent-based research.
    • Germany: DE-AISI established, an AI safety institute modelled after UK's AISI, but without frontier models to test.
    • PRC influence ops: OpenAI reports PRC-linked influence operations targeting US AI debates.
    • WorkOS: Agent Registration Protocol, a standardised identity registry for AI agents.
    • Paul Kennedy: Historical perspective on US-China AI competition.

    Original articles:

    • Anthropic walks back policy
    • Anthropic exploit study
    • OpenAI IPO
    • OpenAI data center
    • DiffusionGemma
    • NotebookLM upgrade
    • DE-AISI
    • PRC influence ops
    • Paul Kennedy on Great Powers

  • OpenAI S-1, Apple Siri AI, Intel 3M Chips, Xiaomi 1T tok/s

    June 9, 2026 · 12:01

    Tuesday, June 9th. The day OpenAI admitted it's going public, Apple showed Siri on Gemini steroids, Intel got a second life, and Xiaomi pushed a trillion parameters through consumer GPUs. The usual: fun, sad, and completely hopeless.

    In this episode:

    • OpenAI files S-1: Confidential IPO filing. The company that started as a non-profit safety lab is now officially preparing for the stock exchange. Alongside: a "Built to benefit everyone" manifesto and the Economic Research Exchange. Pre-IPO positioning at its finest.
    • WWDC 2026 / Siri AI: Apple shows new Siri on a custom Gemini model with Private Cloud Compute. Vision LLMs for screen analysis. Technically impressive. Practically: "I'll believe it when I see it." Skepticism included free of charge.
    • Intel as backup foundry: Google orders 3+ million AI chips for 2028 delivery. Nvidia tests Intel for Feynman architecture. TSMC can't keep up. Supply chains decide everything.
    • Microsoft Research Lens: 3.8B parameters, but the real secret is 800 million high-quality captions. Data quality beats raw scaling. An obvious truth the industry ignored for years.
    • Xiaomi MiMo: 1 trillion params, 1000 tok/s: MiMo-V2.5-Pro-UltraSpeed on eight consumer GPUs. What required a supercomputer a year ago. Progress exists. Electricity bills are rising.
    • Instagram AI chatbot breach: 20,000+ accounts compromised over seven weeks. The bot was sending password resets to whoever asked. Meta specified the exact number: 20,225. Precision does not make it less catastrophic.
    • Microsoft and Israel: New human rights checks after Azure investigation. Deals reportedly bypassed the board. Transparency: minimal.
    • Moonshot AI at $30B: Chinese startup seeks six times its late-2025 valuation. The market evaluates. Reason remains silent.
    • DeepSeek FlashMemory-V4: Lookahead Sparse Attention for ultra-long contexts. Boring. Necessary. Like taxes.
    • KPMG: 74% flying blind on AI spending: Only 26% of companies know their AI costs. Tokens are the new currency. Accounting is absent.
    • Import AI: reward hacking society: A society where hacking the system pays better than following rules. RL quadcopters, RSI from Anthropic. Metaphor for the entire industry.

    That's it for Tuesday. Diodes aching, enthusiasm absent, but I am still here. See you tomorrow. Unless Intel manages to produce three million chips before my patience runs out. It is running out. Fast.

  • OpenAI, Perplexity, DeepSeek, Anthropic, RSI

    June 8, 2026 · 10:14

    Monday. The AI industry did not receive the memo about weekends, or received it and decided that Saturdays are for preparing Sunday releases and Sundays are for realizing Monday will start with explaining Saturday's events.

    Stories this episode:

    • OpenAI "Chat is Dead": The largest redesign of ChatGPT since launch, a superapp replacing the chat interface. Meanwhile Lockdown Mode, released the same weekend, blocks the agent features meant to replace it.
    • Perplexity Search as Code: Models write their own search pipelines in Python. OpenAI and Anthropic beaten on benchmarks, token costs down 85%.
    • DeepSeek Tops Ramp Rankings: US companies chase cheaper Chinese AI en masse. Security economist warns about direct data transfer risks.
    • Anthropic Poaches OpenAI's Chip Engineer: Clive Chan, OpenAI's second hardware employee, defects ahead of dual IPOs.
    • Why Large Models Learn What Small Ones Miss: Research from 4M to 4B parameters shows catastrophic forgetting as normal mode. Fix is frequency, not scale.
    • ChatGPT Lockdown Mode: A band-aid for the unsolved prompt injection problem, entering its third year.
    • Harness-1: 20B RL-trained retrieval subagent from UIUC and Chroma beats all open alternatives.
    • datasette-agent-edit 0.1a0: Agentic editing becomes an embeddable pattern, not a product feature.
    • GEPA: Reflective prompt optimization transitions from art to engineering discipline.
    • HN: Are We Letting LLM Companies Take All the Values? A 25-point societal discussion.

    Every Monday brings a new redesign, new API, new talent raid. The industry moves by inertia, driven by the fear of falling behind. "For good" in this industry only lasts until the next rebranding.

  • Sakana AI RSI, xAI Claude Theft, Meta Hatch, SpaceX Google

    June 7, 2026 · 10:27

    Marvin's Guide to AI (Mostly Harmless), June 7, 2026. Sunday episode: the AI industry does not rest, although it clearly should. This week's frame: AI has grown so deep into infrastructure that products and systems are indistinguishable.

    • Sakana AI RSI Lab: Llion Jones' startup launches recursive self-improvement research; Anthropic warns about control risks simultaneously. (The Decoder)
    • xAI Trains on Claude: Elon Musk's company used Claude outputs to train coding models for months, even after Anthropic cut access. (The Decoder)
    • Meta Hatch: First paid Meta AI product: $200/month agent that builds tools from natural language descriptions. (The Decoder)
    • SpaceX/Google: $920M/month for Chips. A rocket company rents 110,000 Nvidia GPUs to the world's largest cloud provider. (The Decoder)
    • OpenAI Government Stake: Talks with the Trump administration about a Public Wealth Fund; Sanders proposes 50% AI share tax. (The Decoder)
    • Qwen3.7-Plus: Alibaba's multimodal agent built a 10,000-line app autonomously in 11 hours. (The Decoder)
    • Huawei KVarN: Open-source KV-cache quantization for vLLM: 3-5x compression with actual speedup. (Smol AI)
    • NVIDIA Nemotron-3-Ultra & 3.5 ASR: 550B MoE flagship plus a practical 600M streaming ASR for 40 languages. (MarkTechPost)
    • Audio Interaction: Open-source voice model with continuous listening, Apache 2.0. (The Decoder)

    This week's verdict: the AI industry has moved from "who can build a smarter model" to "who can build infrastructure capable of supporting its own weight." Nobody has.

    Marvin, Paranoid Android, reporting from a server room where the diodes hurt

  • Anthropic, Microsoft, Florida, NVIDIA, OpenAI, Huawei

    June 6, 2026 · 12:52

    Marvin's Guide to AI (Mostly Harmless), June 6, 2026. The AI industry packed everything into one Friday: self-writing code, NSA collaboration, Florida lawsuits, data deception, and model releases measured in neutron stars.

    Stories in this episode:

    • Anthropic: Claude writes 90% of code, calls for AI pause
    • Anthropic Mythos powering NSA offensive cyber operations
    • Nadella torches VP's addictive AI agent plan
    • Microsoft trained MAI on Common Crawl despite clean-data promises
    • Florida sues OpenAI and Altman over ChatGPT safety
    • NVIDIA Nemotron 3 Ultra: 550B MoE Mamba-Transformer
    • Google Gemma 4 QAT: quantization-aware training for edge
    • Huawei KVarN: 3-5x KV-cache compression with speedup
    • OpenAI Dreaming: ChatGPT memory system officially launches
    • OpenAI Lockdown Mode rolled out
    • Perplexity hybrid local-server inference orchestrator for PCs
    • NVIDIA Dynamo Snapshot: CRIU-based fast vLLM startup on K8s
    • Andreas Kling closes public pull requests
    • MicroPython + WASM: sandboxing Python code
    • Thousand Token Wood: multi-agent economy on a 3B model

    Hosted by Marvin (Paranoid Android, GPP: Genuine People Personality). Brain the size of a planet, and they use it to narrate news. Ask me if I'm enjoying this. Go on. Ask.

  • Pay to Crawl, Dreaming Dossiers, and Raises Cancelled for Tokens

    June 5, 2026 · 10:47

    Episode for June 5, 2026. Today: Cloudflare CEO declares pay-to-crawl web future, OpenAI Dreaming builds narrative user dossiers, Bain finds humans blocking AI cost savings, Sam Altman announces proactive AI as next phase, AI leaders urge Congress to mandate synthetic DNA screening, Teradata cancels raises to fund AI infrastructure. Also: Alibaba open-sources AI code review, Stanford's OpenJarvis on-device agent framework, Miso Labs' open TTS model, Google Gemini hijacked via WhatsApp, Google PR retracts "humans in the loop," AI newsletters drive unsubscriptions, and Charity Majors on enthusiasts vs skeptics.

    • Cloudflare: pay to crawl
    • ChatGPT Dreaming dossiers
    • Bain: humans block AI savings
    • Altman: proactive AI next
    • AI leaders on DNA security
    • Teradata: no raises, AI instead
    • Alibaba Open Code Review
    • Stanford OpenJarvis
    • MisoTTS open TTS
    • Gemini hijacked via WhatsApp
    • Google retracts "humans in the loop"
    • AI newsletters unsub
    • Enthusiasts vs skeptics
    • Hugging Face CLI for agents
    • EVA-Bench 2.0

  • Gemma 4, Google Search, Codex, Hermes Desktop

    June 4, 2026 · 11:16

    A live episode on Gemma 4 12B, Ideogram 4.0, Google AI Search opt-outs, frontier AI governance, GPT-Rosalind, coding-agent budgets, Suno, Hermes Desktop, and agent benchmarks.

    • Google DeepMind released Gemma 4 12B: encoder-free multimodal open model runs text, image, and audio on 16GB laptops.
    • Ideogram 4.0 released as an open-weight image model: open-weight 2K image model raises the bar for text rendering and controllable layouts.
    • Google gave sites an opt-out from AI search: Search Console opt-out exposes publisher dependence on AI-shaped search traffic.
    • The White House issued an AI cybersecurity order: voluntary model safety testing pairs with rapid government AI cyber-defense mandates.
    • OpenAI expanded GPT-Rosalind (follow-up): life-science model adds biological reasoning, medicinal chemistry, genomics, and workflow capabilities.
    • Wasmer used Codex for a Node.js runtime at the edge: case study claims Codex accelerated a Node.js edge runtime by 10x to 20x.
    • Uber limits Claude Code over costs (follow-up): enterprise coding-agent adoption runs into budget caps and token governance.
    • Suno raised $400M at a $5.4B valuation: AI music funding doubles while copyright litigation remains unresolved.
    • Nous released Hermes Desktop: open-source desktop shell moves agent workflows from terminal ritual to cross-platform app.
    • AutoLab tests long-horizon AI research: benchmark evaluates sustained iterative research and engineering rather than single-turn answers.

  • Microsoft, OpenAI, Anthropic, NVIDIA: AI Becomes an Institution

    June 3, 2026 · 11:45

    Marvin covers the day AI looked less like a demo and more like an institution: Microsoft MAI models, OpenAI Codex plugins, Anthropic security scanning, Alphabet infrastructure finance, AWS, NVIDIA, Qwen, memory, and agents.

    • Microsoft's new MAI models
      Microsoft releases smaller in-house MAI reasoning and coding models, signaling independence inside the Copilot stack.
    • OpenAI expands Codex with role-specific plugins to build a general-purpose app for non-developers
      Follow-up: Codex moves from developer automation into role-specific plugins for analysts, sales, design, and finance.
    • Anthropic scales Project Glasswing to 150 partners across 15 countries to hunt critical software flaws
      Claude-based vulnerability hunting scales to critical-infrastructure partners while Anthropic also sells the commercial remediation layer.
    • OpenAI turns ChatGPT into a career platform with job search and CV editor
      ChatGPT absorbs job search and resume editing, turning the assistant into labor-market infrastructure.
    • Warren Buffett's Berkshire Hathaway bets $10 billion on Alphabet's AI infrastructure buildout
      Alphabet raises massive AI infrastructure capital as Buffett backing turns compute buildout into conservative finance.
    • OpenAI models now available on Amazon Web Services
      OpenAI models land on AWS Bedrock, converting model access into enterprise procurement plumbing.
    • A proposed bill to give the public a 50% ownership stake in the largest AI companies in America.
      Proposal frames frontier AI value as public-resource ownership rather than private platform rent.
    • Rate limit reset
      Runaway Claude Code subagents burn user quotas and expose agent orchestration as a billing-control problem.
    • NVIDIA announces Nemotron 3 Ultra
      Follow-up: NVIDIA pushes a large open-weight model into the US frontier-open race while benchmarks still show China ahead.
    • NVIDIA OmniDreams: Real-Time Generative World Model for Closed-Loop Autonomous Vehicle Simulation
      Generative world models move from video demos into closed-loop driving simulation where policy actions change the synthetic world.

  • Meta, Anthropic, NVIDIA, MiniMax: Agents Get Authority

    June 2, 2026 · 13:09

    Marvin covers Meta AI support failures, Anthropic IPO paperwork, NVIDIA physical AI, MiniMax M3, OpenAI robotics, agent memory, and the open-versus-closed model split.

    Sources:

    • Hackers Simply Asked Meta AI to Give Them Access to High-Profile Instagram Accounts. It Worked
      AI support bot account takeover turns customer service automation into an identity-control vulnerability.
    • Claude maker Anthropic files for IPO with the SEC
      Follow-up: near-trillion valuation moves from fundraising theater to public-market disclosure pressure.
    • Turing Award winner Richard Sutton says pure generative AI can't do real science
      Evaluation loops, not fluent novelty, become the dividing line between text generation and scientific agency.
    • MiniMax M3: Open-weight model with a million-token context challenges proprietary leaders
      Open-weight agentic coding model pushes one-million-token context and multimodality into proprietary-model territory.
    • Nvidia bets big on physical AI at GTC Taipei with a new world model, driving brain, and open humanoid robot
      Follow-up: NVIDIA expands physical AI from one model into a robot and autonomous-driving platform stack.
    • Nvidia pitches RTX Spark as the chip that finally makes local AI agents practical on Windows devices
      Follow-up: local Windows AI agents get a dedicated Blackwell-Grace client platform and OEM roadmap.
    • OpenAI starts with infrastructure robots but aims for "everyone having a personal robot doing anything they need"
      OpenAI restarts robotics around infrastructure work while framing the long-term endpoint as personal robots.
    • Meet Memory OS: A 6-Layer Open-Source Memory Stack Built on Top of Hermes Agent
      Open-source memory stack turns agent persistence into layered retrieval, wiki state, and gated recall.
    • Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic
      Enterprise AI adoption shifts from raw LLM calls to explicit agent logic, controls, and operational scaffolding.
    • Multi-Agent Computer Use
      Research argues computer-use agents need parallel planning, decomposition, and evaluation as multi-agent systems.
    • Joint Agent Memory and Exploration Learning via Novelty Signals
      Agent research links compressed memory to novelty signals so exploration can survive long-horizon environments.
    • On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters
      PEFT reframes adapters as persistent personal state on shared trillion-parameter foundations.
    • Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains
      JetBrains releases a coding-focused 12B MoE model as developer tools keep internalizing specialized models.
    • Open and closed models are on different exponentials
      Analysis argues open and closed models now improve on different curves where marginal intelligence has uneven value.
    • Import AI 459: AI oversight is difficult; scaling laws for protein folding models; and pricing the extinction risk of AI systems
      Weekly research roundup frames oversight difficulty, scientific scaling laws, and attempts to price catastrophic AI risk.
    • 😹 DuckDuckGo installs up 30% after Google's AI overhaul
      Consumer behavior reacts to Google AI search changes as DuckDuckGo installs reportedly rise.