Agentic AI in DevOps

#15 — Can Origin and Entire Replace GitHub? 18.07.2026

GitHub outages and agent-scale workloads are exposing the limits of centralized development platforms—learn where Cursor Origin and Entire could help, and which bottlenecks they cannot fix. Andrey Devyatkin, Vladimir Samoylov, and Fernando Gonçalves examine AI-native Git forges, distributed mirrors, API and Actions limits, repository sovereignty, and whether Git is even the real constraint. They weigh agent-scale throughput claims against human review, CI, internet bandwidth, and the token cost of best-of-N agent fleets, asking whether teams need to leave GitHub or simply reduce their dependence on it. Episode page (show notes and links): https://getboris.ai/insights/015-can-origin-and-entire-replace-github/

#14 — Loop Engineering in DevOps 09.07.2026

Loop engineering solves the coding-agent babysitting problem by giving DevOps teams a way to run larger tasks with evidence, constraints, and a clear definition of done. Andrey Devyatkin, Vladimir Samoylov, and Fernando Gonçalves unpack outer loops around agents, context-window limits, unattended runs, greenfield versus brownfield work, alert batching with SNS and SQS, model mixing, token costs, and why bad code still gets worse when you automate it faster. Episode page (show notes and links): https://getboris.ai/insights/014-loop-engineering-in-devops/

#13 — cmux vs iTerm with Viktor Vedmich 26.06.2026

cmux terminal workflows: when iTerm slows down AI-agent work, Viktor Vedmich shows how workspaces, session restore, and socket APIs make Claude Code more practical. Andrey Devyatkin and Fernando Gonçalves talk with Viktor Vedmich about his terminal-first agentic stack, Obsidian vault setup, Claude Code on Bedrock, AWS Kiro, spec-driven workflows, MCP search tools, rising token bills, and why open-weight model competition matters for DevOps teams. Episode page (show notes and links): https://getboris.ai/insights/013-cmux-vs-iterm-with-viktor-vedmich/

#12 — Semantic Layers, Context Layers, and Agents That Stop Guessing 12.06.2026

Two terms keep colliding in infrastructure AI: semantic layer and context layer. They're not the same thing, and conflating them is how teams end up with agents that understand vocabulary but have no idea what's actually running — or agents that know the environment but still guess wrong about what the data means. Episode page (show notes and links): https://getboris.ai/insights/012-semantic-layers-context-layers-and-agents-that-stop-guessing/

#11 — Base of Record for Intelligent Systems 05.06.2026

The hosts of Agentic AI in DevOps make the case that the "second brain" idea — long a personal-productivity meme — is exactly what AI agents need to be useful inside a real engineering environment. Without a system of reference, the agent burns its context window rediscovering what is running where, hallucinates the gaps, and needs credentials it should not have. The episode also marks a pivot: B.O.R.I.S is no longer pitched as a replacement DevOps engineer, but as a context layer for engineering systems — and the acronym now stands for Base of Record for Intelligent Systems. Fernando Gonçalves rejects pure RAG as enough on its own (the relationships are what matter), Andrey Devyatkin turns "do you take notes?" into a hiring question, and the half-hour closes with a token-maxing post-mortem and the reminder that you can outsource the legwork to agents but not the decisions. Episode page (show notes and links): https://getboris.ai/insights/011-base-of-record-for-intelligent-systems/

#10 — What Changed in Our Daily AI Workflow 31.05.2026

In a less-structured episode, the hosts of Agentic AI in DevOps compare day-to-day workflows: what they actually run, what they have stopped doing, and how their habits have shifted since the early autumn. Fernando Gonçalves keeps the classical engineering ritual — linting, tests, coverage targets — and bakes it into a skill so the AI cannot skip past quiet bugs. Vladimir Samoylov has rewired the harness with hooks for audit logs, command-failure journals, and a daily "dead code" sweep, and tracks a new personal metric: average working hours without a human. Andrey Devyatkin argues that the second shift at 9 PM is the wrong time to drive an agent — the sharpest decisions belong to a fresh 5 AM mind — and shells Claude Code out to Codex to cross-review its own plans. The closing exchange wanders into whether AI is "the last thing humans will invent," and the hosts disagree: imagination, accidents, and champagne all live outside the training data. Episode page (show notes and links): https://getboris.ai/insights/010-what-changed-in-our-daily-ai-workflow/

#9 — Code with Claude: Routines, Agents, and the AWS Catch 15.05.2026

Anthropic's Code with Claude developer conference in San Francisco on May 6, 2026 dropped a wave of platform features aimed squarely at coding teams, and the hosts walk through what actually matters for builders. Andrey Devyatkin reads the SpaceX–Anthropic compute deal as one of the cleverest business moves of the year — xAI is sitting on underutilized GPUs, and "the enemy of my enemy" gets to raise everyone's usage limits. The episode dissects routines, outcomes/goals, multi-agent orchestration, the advisor pattern, dreaming, and the new Anthropic-on-AWS path that is *not* Bedrock, with Fernando Gonçalves flagging an easy-to-miss compliance gotcha: on that path, data still leaves AWS for Anthropic. The closing read on OpenAI is sharp — a two-month free Codex trial for new Codex users on eligible enterprise accounts, which the hosts call "a little bit of a desperate attempt" after Anthropic's enterprise adoption crossed OpenAI's. Episode page (show notes and links): https://getboris.ai/insights/009-code-with-claude-routines-agents-and-the-aws-catch/

#8 — DevOps Jobs Agentic AI Can Actually Do 08.05.2026

After seven foundation-laying episodes, the hosts of Agentic AI in DevOps take the practitioner's tour: which DevOps jobs agentic AI actually does well, and which still fight back. Andrey Devyatkin reframes the "AI deleted my production database" headlines, arguing they are functionally identical to "my terminal deleted my database" — the human gave the credentials and confirmed the action — and walks through why infrastructure-as-code is harder for agents than application code (one word: state). The hosts dig into the gap between C-suite adoption claims and practitioner reality, with Fernando Gonçalves noting that "use AI" is now a manager KPI, and they land on documentation, runbooks, and postmortems as the place where agents quietly earn their keep right now. Episode page (show notes and links): https://getboris.ai/insights/008-devops-jobs-agentic-ai-can-actually-do/

#7 — Agent Memory: When It Helps, When It Hurts, and How to Manage It 29.04.2026

Your AI agent just forgot everything. Again. You taught it your stack, your conventions, your preferences — and tomorrow it wakes up like it never happened. Memory is supposed to fix that. Here's what nobody tells you: it can also make things worse. This blog post breaks down exactly how agent memory works, when it helps, when it quietly poisons your sessions, and what to do about it. Episode page (show notes and links): https://getboris.ai/insights/007-when-agent-memory-helps-and-when-it-hurts/

#6 — The Big AI Squeeze 22.04.2026

LLM subsidies are drying up, subscription limits are tightening, and the astronomical data center CapEx has to be paid by someone — spoiler: it is you. In this episode, Fernando Gonçalves, Andrey Devyatkin, and Vladimir Samoylov tackle what they call "the big squeeze": the two-sided pressure of rising AI costs and emerging local-inference technologies that could reshape how teams budget for and deploy AI. Along the way, they debate whether buying a Mac Mini is a rational investment or just hype, reveal the mental gymnastics required to get meaningful work out of a twenty-dollar subscription, and argue that if your AI vendor has no pricing page, that tells you everything you need to know about their business model. Episode page (show notes and links): https://getboris.ai/insights/006-the-big-ai-squeeze/

#5 — Stop Your Agent Before It Breaks Prod 17.04.2026

Imagine your agent just deleted a production database — could you have stopped it? The hosts argue that yes, three lines of bash in a single hook could have prevented it, and yet most teams have never configured one. In this episode, Andrey Devyatkin, Vladimir Samoylov, and Fernando Gonçalves pull apart the agentic loop — the repeating cycle of reason, act, observe that makes coding agents appear to "think" — and show exactly where hooks slot in to give humans deterministic or automated harness-level control over non-deterministic AI behavior, depending on the hook type. Along the way, they unpack Anthropic's rapid-fire release week (Managed Agents, Routines, Opus 4.7, and the advisor tool pattern), debate whether LLM provider commoditization is real or an illusion, and warn that the same hook mechanism that protects your infrastructure can be weaponized through supply chain attacks. Episode page (show notes and links): https://getboris.ai/insights/005-stop-your-agent-before-it-breaks-prod/

#4 — Harness Engineering: What Claude Code Accidentally Taught Everyone 13.04.2026

A packaging mistake exposed Claude Code's full source tree to the world — and instead of scandal, the community got a masterclass in how agentic coding tools actually work under the hood. In this episode, Andrey Devyatkin, Vladimir Samoylov, and Fernando Gonçalves unpack what the disclosure revealed about the engineering behind coding agents, introduce the emerging discipline of "harness engineering," and argue that the model is only the horsepower — it is the harness that determines whether the agent gallops toward the right destination or off a cliff. Along the way, they weigh in on NVIDIA's OpenShell sandboxing, Cursor 3.0's agent-first interface, and why Docker containers are not the security blanket many DevOps engineers assume them to be. Episode page (show notes and links): https://getboris.ai/insights/004-harness-engineering-what-claude-code-accidentally-taught-everyone/

#3 — Skills, Powers, SOPs 30.03.2026

What happens when your AI coding tool quietly starts billing like a cloud service — and your team burns through a thousand dollars in a week? Vladimir Samoylov returns as the hosts share their sticker-shock moment with Cursor's new pricing before diving into agent skills. From Claude Code skills to Kiro Powers to AWS Strands SOPs, the naming varies but the idea is the same — plugging structured knowledge into an agent's brain on demand. Episode page (show notes and links): https://getboris.ai/insights/003-skills-powers-sops/

#2 — The Tool Layer: What Makes Agentic AI Possible 23.03.2026

What happens when your AI coding assistant forgets what it was just working on? Andrey Devyatkin and Fernando Gonçalves break down context windows, how MCP servers can consume a large share of the session before the first message, and why over-specifying agent behavior in project rules often hurts output. Episode page (show notes and links): https://getboris.ai/insights/002-the-tool-layer-what-makes-agentic-ai-possible/

#1 — AI in DevOps, 2022 to 2026: From Autocomplete to Action 17.03.2026

What if most AI tools sold into DevOps fail because they only see part of the stack? In this first episode of Agentic AI in DevOps, Andrey Devyatkin and Fernando Gonçalves trace AI tooling from ChatGPT's launch in late 2022 through 2025: they treat context—not model hype—as the constraint, and explain why assistants cut off from source, logs, metrics, and docs tend to guess wrong under real operational load. Episode page (show notes and links): https://getboris.ai/insights/001-ai-in-devops-2022-to-2026-from-autocomplete-to-action/

Agentic AI in DevOps

Episod

Podcast serupa

Popular di