Agentic AI in DevOps

Agentic AI in DevOps

Sirob Technologies
Negara Amerika Syarikat
Genre Technology
Bahasa EN
Episod 11
Terkini 05.06.2026

Insights on applying agentic AI to DevOps and software development automation, by practitioners who build and operate agentic AI for DevOps at Sirob Technologies — the team behind B.O.R.I.S, an AI DevOps teammate.

Episod

  • #11 — Base of Record for Intelligent Systems 05.06.2026
    The hosts of Agentic AI in DevOps make the case that the "second brain" idea — long a personal-productivity meme — is exactly what AI agents need to be useful inside a real engineering environment. Without a system of reference, the agent burns its context window rediscovering what is running where, hallucinates the gaps, and needs credentials it should not have. The episode also marks a pivot: B.O.R.I.S is no longer pitched as a replacement DevOps engineer, but as a context layer for engineering systems — and the acronym now stands for Base of Record for Intelligent Systems. Fernando Gonçalves rejects pure RAG as enough on its own (the relationships are what matter), Andrey Devyatkin turns "do you take notes?" into a hiring question, and the half-hour closes with a token-maxing post-mortem and the reminder that you can outsource the legwork to agents but not the decisions. Episode page (show notes and links): https://getboris.ai/insights/011-base-of-record-for-intelligent-systems/
  • #10 — What Changed in Our Daily AI Workflow 31.05.2026
    In a less-structured episode, the hosts of Agentic AI in DevOps compare day-to-day workflows: what they actually run, what they have stopped doing, and how their habits have shifted since the early autumn. Fernando Gonçalves keeps the classical engineering ritual — linting, tests, coverage targets — and bakes it into a skill so the AI cannot skip past quiet bugs. Vladimir Samoylov has rewired the harness with hooks for audit logs, command-failure journals, and a daily "dead code" sweep, and tracks a new personal metric: average working hours without a human. Andrey Devyatkin argues that the second shift at 9 PM is the wrong time to drive an agent — the sharpest decisions belong to a fresh 5 AM mind — and shells Claude Code out to Codex to cross-review its own plans. The closing exchange wanders into whether AI is "the last thing humans will invent," and the hosts disagree: imagination, accidents, and champagne all live outside the training data. Episode page (show notes and links): https://getboris.ai/insights/010-what-changed-in-our-daily-ai-workflow/
  • #9 — Code with Claude: Routines, Agents, and the AWS Catch 15.05.2026
    Anthropic's Code with Claude developer conference in San Francisco on May 6, 2026 dropped a wave of platform features aimed squarely at coding teams, and the hosts walk through what actually matters for builders. Andrey Devyatkin reads the SpaceX–Anthropic compute deal as one of the cleverest business moves of the year — xAI is sitting on underutilized GPUs, and "the enemy of my enemy" gets to raise everyone's usage limits. The episode dissects routines, outcomes/goals, multi-agent orchestration, the advisor pattern, dreaming, and the new Anthropic-on-AWS path that is *not* Bedrock, with Fernando Gonçalves flagging an easy-to-miss compliance gotcha: on that path, data still leaves AWS for Anthropic. The closing read on OpenAI is sharp — a two-month free Codex trial for new Codex users on eligible enterprise accounts, which the hosts call "a little bit of a desperate attempt" after Anthropic's enterprise adoption crossed OpenAI's. Episode page (show notes and links): https://getboris.ai/insights/009-code-with-claude-routines-agents-and-the-aws-catch/
  • #8 — DevOps Jobs Agentic AI Can Actually Do 08.05.2026
    After seven foundation-laying episodes, the hosts of Agentic AI in DevOps take the practitioner's tour: which DevOps jobs agentic AI actually does well, and which still fight back. Andrey Devyatkin reframes the "AI deleted my production database" headlines, arguing they are functionally identical to "my terminal deleted my database" — the human gave the credentials and confirmed the action — and walks through why infrastructure-as-code is harder for agents than application code (one word: state). The hosts dig into the gap between C-suite adoption claims and practitioner reality, with Fernando Gonçalves noting that "use AI" is now a manager KPI, and they land on documentation, runbooks, and postmortems as the place where agents quietly earn their keep right now. Episode page (show notes and links): https://getboris.ai/insights/008-devops-jobs-agentic-ai-can-actually-do/
  • #7 — When Agent Memory Helps and When It Hurts 29.04.2026
    Every session, your agent wakes up with amnesia — the same mistakes, the same rediscovery, the same wasted tokens. Memory is how teams solve this, but as the hosts of Agentic AI in DevOps argue, it is both a superpower and a liability. Andrey Devyatkin goes so far as to say that semantic memory makes an already non-deterministic LLM "even less deterministic," while Fernando Gonçalves warns that a poisoned memory file is trusted implicitly — just like a human trusts their own recollections. In this final episode of the foundations series, the hosts unpack the full memory lifecycle — capture, management, retrieval — and share hard-won lessons from building B.O.R.I.S, their agentic DevOps teammate where memory plays a central role. Episode page (show notes and links): https://getboris.ai/insights/007-when-agent-memory-helps-and-when-it-hurts/
  • #6 — The Big AI Squeeze 22.04.2026
    LLM subsidies are drying up, subscription limits are tightening, and the astronomical data center CapEx has to be paid by someone — spoiler: it is you. In this episode, Fernando Gonçalves, Andrey Devyatkin, and Vladimir Samoylov tackle what they call "the big squeeze": the two-sided pressure of rising AI costs and emerging local-inference technologies that could reshape how teams budget for and deploy AI. Along the way, they debate whether buying a Mac Mini is a rational investment or just hype, reveal the mental gymnastics required to get meaningful work out of a twenty-dollar subscription, and argue that if your AI vendor has no pricing page, that tells you everything you need to know about their business model. Episode page (show notes and links): https://getboris.ai/insights/006-the-big-ai-squeeze/
  • #5 — Stop Your Agent Before It Breaks Prod 17.04.2026
    Imagine your agent just deleted a production database — could you have stopped it? The hosts argue that yes, three lines of bash in a single hook could have prevented it, and yet most teams have never configured one. In this episode, Andrey Devyatkin, Vladimir Samoylov, and Fernando Gonçalves pull apart the agentic loop — the repeating cycle of reason, act, observe that makes coding agents appear to "think" — and show exactly where hooks slot in to give humans deterministic or automated harness-level control over non-deterministic AI behavior, depending on the hook type. Along the way, they unpack Anthropic's rapid-fire release week (Managed Agents, Routines, Opus 4.7, and the advisor tool pattern), debate whether LLM provider commoditization is real or an illusion, and warn that the same hook mechanism that protects your infrastructure can be weaponized through supply chain attacks. Episode page (show notes and links): https://getboris.ai/insights/005-stop-your-agent-before-it-breaks-prod/
  • #4 — Harness Engineering: What Claude Code Accidentally Taught Everyone 13.04.2026
    A packaging mistake exposed Claude Code's full source tree to the world — and instead of scandal, the community got a masterclass in how agentic coding tools actually work under the hood. In this episode, Andrey Devyatkin, Vladimir Samoylov, and Fernando Gonçalves unpack what the disclosure revealed about the engineering behind coding agents, introduce the emerging discipline of "harness engineering," and argue that the model is only the horsepower — it is the harness that determines whether the agent gallops toward the right destination or off a cliff. Along the way, they weigh in on NVIDIA's OpenShell sandboxing, Cursor 3.0's agent-first interface, and why Docker containers are not the security blanket many DevOps engineers assume them to be. Episode page (show notes and links): https://getboris.ai/insights/004-harness-engineering-what-claude-code-accidentally-taught-everyone/
  • #3 — Skills, Powers, SOPs 30.03.2026
    What happens when your AI coding tool quietly starts billing like a cloud service — and your team burns through a thousand dollars in a week? Vladimir Samoylov returns as the hosts share their sticker-shock moment with Cursor's new pricing before diving into agent skills. From Claude Code skills to Kiro Powers to AWS Strands SOPs, the naming varies but the idea is the same — plugging structured knowledge into an agent's brain on demand. Episode page (show notes and links): https://getboris.ai/insights/003-skills-powers-sops/
  • #2 — The Tool Layer: What Makes Agentic AI Possible 23.03.2026
    What happens when your AI coding assistant forgets what it was just working on? Andrey Devyatkin and Fernando Gonçalves break down context windows, how MCP servers can consume a large share of the session before the first message, and why over-specifying agent behavior in project rules often hurts output. Episode page (show notes and links): https://getboris.ai/insights/002-the-tool-layer-what-makes-agentic-ai-possible/
  • #1 — AI in DevOps, 2022 to 2026: From Autocomplete to Action 17.03.2026
    What if most AI tools sold into DevOps fail because they only see part of the stack? In this first episode of Agentic AI in DevOps, Andrey Devyatkin and Fernando Gonçalves trace AI tooling from ChatGPT's launch in late 2022 through 2025: they treat context—not model hype—as the constraint, and explain why assistants cut off from source, logs, metrics, and docs tend to guess wrong under real operational load. Episode page (show notes and links): https://getboris.ai/insights/001-ai-in-devops-2022-to-2026-from-autocomplete-to-action/

Popular di

Podcast ini turut muncul dalam senarai podcast negara-negara ini.