PodcastsTecnologíaMLOps.community

MLOps.community

Demetrios
MLOps.community
Último episodio

532 episodios

  • MLOps.community

    Sandboxing, Agent Harnesses, and Agent Teamwork

    19/06/2026 | 1 h 19 min
    Shahram Anver is the Co-Founder and CEO of Cleric, the autonomous AI SRE that investigates and root-causes production issues like an experienced teammate — often in under two minutes. Before Cleric, Shahram led MLOps, DevOps, and FinOps platform engineering at Gojek, Southeast Asia's super-app. In this conversation, he breaks down why production operations never kept pace with AI-accelerated development, and why the real unlock for an AI SRE isn't faster triage — it's an agent that *learns* and compounds operational memory across your whole org.

    In this episode:
    🔧 The on-call problem — Why one broken service still drags ten engineers onto a call, and how AI changes that
    🤖 What an AI SRE actually is — How Cleric investigates across your existing observability stack instead of adding another tool
    🧠 Learning over MTTR — Why Shahram argues the value isn't alert triage, it's an agent that gets better every investigation
    🪜 Ramping like a new engineer — Explore the environment, learn from the work, talk to the team
    🔁 The investigate–measure–learn loop — Turning what worked on one incident into context for the next
    🕸️ Knowledge graphs & operational memory — Mapping teams, clusters, and dependencies so insight from one team helps another
    ⚡ Under two minutes to root cause — What "fast" really requires in a live production environment
    🚀 The road to autonomy — From assisted investigation toward self-healing infrastructure
    If you're an SRE, platform engineer, DevOps lead, or anyone building or buying AI agents for production, this one's for you.

    🔗 Links & Resources
    Cleric: https://cleric.ai
    Shahram on LinkedIn: https://www.linkedin.com/in/shahramanver/
    Willem Pienaar (Co-Founder/CTO): https://www.linkedin.com/in/willempienaar/
    Cleric launches the first self-learning AI SRE: https://cleric.ai/blog/cleric-launches-the-first-self-learning-ai-sre
    MLOps Community: https://mlops.community
    Join the community: https://go.mlops.community/slack

    ⏱️ Timestamps
    [00:00] Tech Jargon Confusion
    [00:27] Harness vs Model
    [08:48] Model Evolution in Cleric
    [13:36] Sandboxing and Simulated Environments
    [20:40] Shifting AI Perceptions
    [24:10] Managing Humans vs Agents
    [31:32] Steering Parallel Agents
    [34:16] Human Decision Integration in Models
    [43:28] 80/20 Data Split
    [49:40] Becoming a Skill
    [53:35] 2027 Agent Autonomy
    [59:14] Agent Learning in Production
    [1:04:31] Software as Personal Capabilities
    [1:08:31] Vibe Coding vs Durability
    [1:18:23] Wrap up

    #AISRE #SiteReliabilityEngineering #AIAgents
  • MLOps.community

    Zipline Roundtable episode: Building Real-Time ML Systems with Zipline + Chronon

    17/06/2026 | 51 min
    Zipline Roundtable episode: Building Real-Time ML Systems with Zipline + ChrononJoin the Community: https://go.mlops.community/YTJoinInGet the newsletter: https://go.mlops.community/YTNewsletterMLOps GPU Guide: https://go.mlops.community/gpuguideBig shout-out to ZiplineAI for the collaboration!// AbstractReal-time ML use cases like personalization and risk decisioning come with a unique set of challenges: serving fresh feature values at low latency for inference, generating temporally consistent backfills for training, and building complex chains of on-demand, batch, and streaming transformations. In this roundtable, practitioners from Intuit, CreditKarma, Depop, and OpenAI share how they use Zipline and the OSS Chronon project to solve these challenges and deploy real-time ML use cases in production.// BioGerman KrikorianGerman is a Software Engineer on the Feature Platform team at Credit Karma. Since joining the company during the early development of its recommendation system, they have played a key role in building and scaling the platform over the years. Their work focuses on feature pipelines and the feature store, which serves as critical infrastructure supporting numerous teams and business verticals across the organization.Ben MagyarBen is an engineer at Depop working on ML and data systems. Before Depop, he worked on Search at Etsy. Most of his work is around the infrastructure and operational problems that come with running ML systems at scale.Raj KatakamRaj architects ML Infrastructure at Credit Karma (Intuit). He holds a Master's in Software Engineering from Carnegie Mellon and a B.Tech in EECE from IIT Kharagpur. His interests include ML Infrastructure, Distributed Systems, Real-Time Data Processing, and Generative AI. His current focus is on providing feature engineering platforms, production GenAI infrastructure, vector databases, ML model serving, and MLOps pipelines for fraud detection, personalized recommendations, financial insights, and model explainability.Mick JermsurawongLed Flyte ML training/experimentation at Stripe, and now led Chronon for ML features at OpenAIHosted by Demetrios// Related LinksWebsite: https://zipline.ai/https://chronon.ai/~~~~~~~~ ✌️Connect With Us ✌️ ~~~~~~~Catch all episodes, blogs, newsletters, and more: https://go.mlops.community/TYExploreJoin our Slack community [https://go.mlops.community/slack]Follow us on X/Twitter [@mlopscommunity](https://x.com/mlopscommunity) or [LinkedIn](https://go.mlops.community/linkedin)] Sign up for the next meetup: [https://go.mlops.community/register]MLOps Swag/Merch: [https://shop.mlops.community/]Connect with Demetrios on LinkedIn: /dpbrinkmConnect with German on LinkedIn: /e2zdkwh8cxghydg/Connect with Raj on LinkedIn: /rajkiran2190Connect with Mick on LinkedIn:/mick-jermsurawong/
  • MLOps.community

    MCP Servers Are Becoming the UI for AI Agents

    16/06/2026 | 47 min
    Naseem Al-Naji is the co-founder of MCPcat.io and the creator of Opal — a builder with deep roots in privacy-first developer tooling. In this conversation, he breaks down why MCP servers have become a black box in production, and how MCPcat gives teams X-ray vision into how agents and users actually behave.

    What we get into:
    🐱 What MCPcat Is — Open-source analytics and live debugging built specifically for MCP servers
    🎬 Session Replay — Watch an agent's full journey through your server, tool call by tool call
    🎯 Agent Intent & Goals — Understand "why" a tool was called, not just that it was
    🔍 Trace Debugging — Find exactly where agents and users get stuck or confused
    🚨 Catching Hallucinations — How issue tracking surfaces when an LLM goes off the rails
    🔒 Privacy-First by Design — Client-side redaction so sensitive data never leaves your environment
    ⚡ One-Line Integration — Python, TypeScript, and Go SDKs that drop into existing stacks
    📊 Works With Your Stack — Native support for OpenTelemetry, Datadog, and Sentry
    🚀 The Future of MCP — Where agent observability and the MCP ecosystem are heading

    If you build, ship, or maintain MCP servers — or you're trying to figure out why your AI agents misbehave in production — this one's for you.

    🔔 Subscribe, like, and share for more conversations on agentic AI:
    ▶️ YouTube: https://www.youtube.com/@AAIFAgenticConversations🎧 Spotify: https://open.spotify.com/show/033rZZJrQOVSSmhcStFhZA?si=rUNjFuNqRvGvAEWwqms7TA

    Links & Resources:
    🐱 MCPcat: https://mcpcat.io
    💻 MCPcat on GitHub: https://github.com/mcpcat
    👤 Naseem on LinkedIn: https://www.linkedin.com/in/naseem-al-naji
    🐙 Naseem on GitHub: https://github.com/naji247

    Timestamps:
    [00:00] Intro
    [01:41] MCP Needs Gatekeepers
    [06:32] Measuring MCP Success
    [13:57] MCPAT Feature Rollouts
    [18:50] MCP Server Query Optimization
    [26:48] UI Design Shift
    [29:14] MCP Server Design Choices
    [33:51] User Journey Traceability
    [40:40] Agent Experience Evaluation
    [45:23] AI Model Improvement Strategies

    #MCP #AIAgents #Observability
  • MLOps.community

    Agents & the $40M Bet on Multiplayer AI

    12/06/2026 | 1 h 20 min
    Stanislas Polu is Co-Founder & CTO of Dust — the enterprise AI agent platform used by 51,000 workers at 3,000+ companies. Before Dust, he spent three years on OpenAI's research team under Ilya Sutskever, working on mathematical reasoning in language models, and prior to that was an engineer at Stripe. He brings a rare combination of frontier AI research and product-building experience to the enterprise agent space.

    Agents & the $40M Bet on Multiplayer AI // MLOps Podcast #384 with Stanislas Polu, Co-Founder & CTO of Dust

    🤖 What is Dust? — How Dust enables teams to build and deploy AI agents powered by internal company data, and why the "multiplayer AI" model is winning in enterprise.
    🧠 From OpenAI Research to Startup Founder — Stanislas's journey from studying mathematical reasoning in LLMs under Ilya Sutskever to co-founding an enterprise AI company in Paris with Gabriel Hubert.
    🚀 The $40M Series B — What Dust is building with fresh funding, the bet on human-agent collaboration as the future of work, and what "multiplayer AI" actually means in practice.
    🔄 The Outer-Loop Era — Stanislas's framework for thinking about where AI agents create the most value: not just automating tasks, but rewiring how work gets done across entire organizations.
    ⚠️ What Most Enterprise AI Gets Wrong — The biggest mistakes companies make when deploying AI agents, why adoption fails, and how Dust achieves 70%+ weekly adoption rates.
    📊 Building Reliable Agent Infrastructure — Lessons from scaling to thousands of companies: observability, governance, data security, and why enterprise AI is harder than it looks.
    🛠️ Horizontal vs. Vertical AI Platforms — Why Dust chose to build a horizontal enterprise agent platform and how that decision shapes product, go-to-market, and technical architecture.

    This episode is essential for AI/ML engineers, enterprise AI leads, and anyone building or deploying AI agents at scale inside organizations.

    🔗 Links & Resources:
    • Dust: https://dust.tt
    • Stanislas Polu on X/Twitter: https://x.com/spolu
    • Dust on LinkedIn: https://www.linkedin.com/company/dust-tt
    • Dust $40M Series B announcement: https://dust.tt/blog
    • "The Outer-Loop Era" talk by Stanislas (dotconferences): https://www.youtube.com/watch?v=_outer_loop
    • Dust + Stripe MCP integration: https://stripe.com/customers/dust
    • Dust + Datadog observability case study: https://datadoghq.com/case-studies/dust

    ⏱️ Timestamps
    [00:00] Future of Work
    [00:19] Dust Scaling Lessons
    [04:44] Human-Agent Collaboration
    [14:24] Pod as Workspace
    [22:30] Work Flow Optimization
    [29:37] Multiplayer Collaboration Vision
    [39:55] Token Economics and Inference
    [47:20] AI Pricing Challenges
    [52:36] Dust vs Co-work
    [57:06] Agentic Work Infrastructure
    [1:04:23] Stateful Sandbox Challenges
    [1:09:58] Product Use Case Discussion
    [1:14:05] Agent Data Interaction Needs
    [1:20:09] Wrap up

    #EnterpriseAI #AIAgents #Dust
  • MLOps.community

    From Single-Player to Multi-Player: Operating AI Agents at Scale

    09/06/2026 | 55 min
    James Everingham is the CEO and Co-founder of Guild.ai — the AI agent control plane for production teams. With roots at Netscape, Instagram (Head of Engineering), and Meta (Head of Dev Infra, leading a 1,000-person org), James brings rare, hard-won expertise to the challenge of operating AI agents at scale.

    From Single-Player to Multi-Player: Operating AI Agents at Scale // MLOps Podcast #383 with James Everingham, CEO and Co-founder of Guild.ai

    In this episode, James unpacks what actually breaks when you move from a single AI agent to a fleet of them — and what engineering leaders need to build before it's too late.

    🎯 Single-Agent vs. Multi-Agent Systems — Why "single-player" AI workflows don't survive contact with production reality, and what the shift to multi-agent coordination actually demands from your infrastructure.
    🔍 The Agent Control Plane — What it is, why every engineering org needs one in 2026, and how Guild.ai is building the neutral layer to deploy, govern, and share agents across any framework or model.
    ⚠️ Non-Determinism at Scale — Why AI agents behave like employees, not software, and why you need workforce-style governance — not just observability tooling — to manage them.
    💸 Token Spend & Cost Visibility — How teams running agents in production are flying blind on cost, and what Guild shows you that your current stack doesn't.
    🏗️ Lessons from Meta's DevMate — How Meta's AI coding agent went from experiment to submitting 50% of all diffs, and what that journey teaches every engineering leader about scaling agents safely.
    🚦 Agent Identity & Governance — Why every agent needs an identity, what happens when they don't have one, and how agent sprawl becomes a governance crisis fast.
    🔄 Sharing Agents as Infrastructure — Why Guild treats agents as shared production infrastructure rather than one-off scripts, and how that changes the economics of AI investment.
    🛠️ Framework Agnosticism — Why betting on a single agent framework is a losing strategy, and how to build for a multi-model, multi-framework world from day one.
    Essential viewing for engineering leaders, AI platform teams, and founders building production-grade agentic systems.

    🔗 Guild.ai: https://guild.ai
    🔗 James on X/Twitter: https://x.com/jevering
    🔗 James on LinkedIn: https://www.linkedin.com/in/jameseveringham
    🔗 Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/

    ⏱️ Timestamps
    [00:00] Context Transfer Challenges
    [00:51] Control Plane for Agents
    [02:17] Effective Agent Policies
    [09:23] Agent Governance Policies
    [15:34] Developer Tool Adoption
    [22:02] Knowledge Sharing and Open Source
    [24:59] Simulated Deployments and Confidence
    [29:36] Agent Workloads vs Human Workloads
    [39:55] AI as a Customer
    [47:59] Agent Hub vs Autonomy
    [53:21] Wrap up

    #AgenticAI #AIAgents #AIEngineering
Más podcasts de Tecnología
Acerca de MLOps.community
Relaxed Conversations around getting AI into production, whatever shape that may come in (agentic, traditional ML, LLMs, Vibes, etc)
Sitio web del podcast

Escucha MLOps.community, Nosotros Los Clones y muchos más podcasts de todo el mundo con la aplicación de radio.net

Descarga la app gratuita: radio.net

  • Añadir radios y podcasts a favoritos
  • Transmisión por Wi-Fi y Bluetooth
  • Carplay & Android Auto compatible
  • Muchas otras funciones de la app
MLOps.community: Podcasts del grupo