Agentic Engineering
How I Build Apps with an AI That Lives on My Server
Alexander Gomez · 2026
30 seconds. Let the title sit. Terminal cursor blinks.
The Numbers Are In
90%
of code at Anthropic is written by AI
30%
of Google's new code is AI-generated
25%
of YC's Winter 2025 batch had codebases 95% AI-generated
These are not predictions. These are facts. From the last 12 months.
1 minute. Deliver slowly. Pause after each stat. This is your "holy shit" moment.
The CEO Mandates
Shopify · E-commerce, ~10% of US online retail
CEO Tobi Lütke: "Prove why AI can't do it before requesting a new hire"
Coinbase · Largest US crypto exchange, NASDAQ
Engineers who didn't onboard AI tools → Saturday meeting → fired
Klarna · Swedish fintech, "buy now pay later"
5,000 → 3,000 employees. AI handles the work of 853 staff.
Goldman Sachs · Investment bank, founded 1869
Piloting thousands of autonomous AI coders alongside 12,000 humans
1.5 min. "This isn't just startups. This is Goldman Sachs. This is enterprise."
The Tools Are Here
84-90%
of developers now use AI tools
Stack Overflow Developer Survey 2025
20M
GitHub Copilot users, 90% of Fortune 100
GitHub 2025
$29B
Cursor valuation, $500M ARR
AI-native code editor
9.6 → 2.4
Average PR time in days
GitHub Copilot study
The question is: how deep do you go?
1 min. Quick-hit stats. "The tools are here. The question is: how deep do you go?"
But Let's Be Honest
METR study: experienced devs were 19% slower with AI
— but believed they were faster
Stack Overflow: only 29% trust AI accuracy (down from 40%)
66% spend MORE time fixing "almost-right" AI code
Kent Beck — Creator of TDD & Extreme Programming, Agile Manifesto signatory
AI "lacks taste" — it excels at adding features but creates an "inhibiting loop" where complexity compounds.
So which is it? Faster or slower?
It depends on how you use it.
1 min. "Before you think I'm an AI fanboy..." This earns trust. You're a practitioner, not a salesman.
The METR study tested Level 2 (Cursor inline). Your talk argues Level 3 (agentic) is different.
Vibe Coding vs. Agentic Engineering
Andrej Karpathy — OpenAI co-founder, ex-Tesla AI Director
"Fully give in to the vibes, embrace exponentials, and forget that the code even exists."
Coined "vibe coding" · Feb 2025 · Collins Word of the Year 2025
Simon Willison — Django co-creator, coined "prompt injection"
"Not all AI-assisted coding is vibe coding."
Two camps: Vibe coding (no review, prototypes) vs. Agentic engineering (AI does the work, you review and orchestrate)
I'm firmly in the second camp. And I'll show you why.
1.5 min. Karpathy later (Feb 2026) said vibe coding is "passé" and proposed "agentic engineering" as the new term.
What is Claude Code?
Not a chatbot. Not an IDE plugin. An autonomous engineering agent.
Lives in your terminal (or VS Code / JetBrains)
Full codebase awareness — reads, searches, understands context
Edits files, runs commands, commits to git, triggers CI/CD
Plan mode, subagents, MCP servers, hooks
Think of it as: a junior dev that never sleeps, reads every file, follows instructions exactly… and never asks for a raise.
1.5 min. Visual: terminal screenshot with Claude Code running.
The Three Levels
Level      | How                                     | Who Drives              | Example
1. Chat    | Copy-paste prompts, manual integration  | You type everything     | ChatGPT, Claude.ai
2. IDE     | AI in your editor, inline suggestions   | You + AI side by side   | Cursor, Copilot
3. Agentic | AI on your server, autonomous execution | AI executes, you review | Claude Code, Codex
Level 1 is talking to a tutor. Level 2 is pair programming.
Level 3 is having a developer on your team that you manage.
Where are most of you?
2 min. Ask the audience where they are. Usually Level 1-2.
My Philosophy: Control + Plan Mode
I like to be in control. I don't vibe code — I orchestrate.
Plan mode: Claude reads the codebase, proposes a plan, I review, THEN it executes
I review every change, approve every commit
Result: consistent, less buggy code — the AI thinks before it acts
Armin Ronacher — Creator of Flask & Jinja2, Python Software Foundation Fellow
"It feels like I've gained 30% more time in my day because the machine is doing the work."
About Claude Code, June 2025. 90%+ of his code is now AI-written across 40K lines.
1.5 min. Ronacher also said: "Review every line, shape the architecture, and carry responsibility."
Visual: screenshot of plan mode output.
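Backup slide, if someone asks what "review, THEN it executes" means in code. This is a minimal sketch of an approval gate, not how Claude Code implements plan mode. The names Plan, run_with_approval, approve and execute_step are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Plan:
    """A proposed set of steps, written down before anything is changed."""
    goal: str
    steps: List[str]
    executed: List[str] = field(default_factory=list)


def run_with_approval(
    plan: Plan,
    approve: Callable[[Plan], bool],
    execute_step: Callable[[str], None],
) -> bool:
    """Run the plan only after the reviewer has approved the whole thing."""
    if not approve(plan):
        return False  # rejected: no step has run, nothing was touched
    for step in plan.steps:
        execute_step(step)
        plan.executed.append(step)
    return True
```

The point of the shape: the reviewer sees every step up front, and a rejection costs nothing because execution has not started.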
The Advanced Workflow
What Level 3 actually looks like in practice:
Claude Code running on my VPS via SSH
It commits changes → triggers GitHub Actions → auto-deploys
Agent mode for deep research (parallel subagents)
Memory files that persist across sessions (CLAUDE.md)
I work from anywhere — phone, laptop, any terminal
My AI doesn't just suggest code. It ships code.
1 min. "It also doesn't complain about code reviews."
Act 3
Let Me Show You What I Built
Two projects. Both built with Claude Code.
Everything I'm about to show you was built by me and Claude Code together.
Not generated and forgotten — planned, reviewed, iterated, deployed.
30 sec transition. Build anticipation.
EMX: What Is It?
"Turn your moments into cinematic soundtracks"
Upload a photo → AI analyzes the scene
Generates: lyrics, music, album art, music video
Powered by: Gemini (vision + composition), Kie AI (music), Kling (video)
Photo → Vision → Compose → Image + Music (parallel) → Video
1.5 min. Visual: architecture diagram showing the pipeline.
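Backup slide: the pipeline shape above, sketched with asyncio. The stage functions are hypothetical stand-ins that only return labelled strings; the real services (Gemini, Kie AI, Kling) are not called here. The sketch also assumes the video step needs both the image and the music.

```python
import asyncio
from typing import Dict


# Hypothetical stand-ins for the real stages; each just labels its input.
async def analyze_photo(photo: str) -> str:
    await asyncio.sleep(0)
    return f"scene({photo})"


async def compose_lyrics(scene: str) -> str:
    await asyncio.sleep(0)
    return f"lyrics({scene})"


async def generate_image(lyrics: str) -> str:
    await asyncio.sleep(0)
    return f"image({lyrics})"


async def generate_music(lyrics: str) -> str:
    await asyncio.sleep(0)
    return f"music({lyrics})"


async def generate_video(image: str, music: str) -> str:
    await asyncio.sleep(0)
    return f"video({image}, {music})"


async def run_pipeline(photo: str) -> Dict[str, str]:
    """Photo -> Vision -> Compose -> (Image + Music in parallel) -> Video."""
    scene = await analyze_photo(photo)
    lyrics = await compose_lyrics(scene)
    # Image and music only depend on the lyrics, so they can run concurrently.
    image, music = await asyncio.gather(
        generate_image(lyrics), generate_music(lyrics)
    )
    video = await generate_video(image, music)
    return {"lyrics": lyrics, "image": image, "music": music, "video": video}
```

Run it with asyncio.run(run_pipeline("beach.jpg")). asyncio.gather returns results in the order the awaitables were passed, which is why the unpacking into image and music is safe.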
EMX: How It Works
emx-api (Python / FastAPI)
Vision analysis, lyric composition, image generation
BFF (Node.js)
SSE relay, orchestrates Kie AI for music & video
emx-web
Single-file SPA — yes, one HTML file!
All behind Traefik reverse proxy, Docker Compose
Real-time SSE events drive the progressive UI
1.5 min. Visual: simplified backend flow diagram.
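Backup slide: what one SSE event looks like on the wire. A small sketch of the framing only (an optional event line, a data line, a blank line to end the message). The event name "progress" and the payload fields are made up for illustration and are not taken from the EMX code.

```python
import json
from typing import Any, Optional


def format_sse(data: Any, event: Optional[str] = None) -> str:
    """Serialize one Server-Sent Events message as text."""
    lines = []
    if event is not None:
        lines.append(f"event: {event}")
    # json.dumps escapes newlines inside strings, so the payload
    # always fits on a single "data:" line.
    lines.append(f"data: {json.dumps(data)}")
    # A blank line terminates the message.
    return "\n".join(lines) + "\n\n"
```

A relay would write one of these strings to the open response each time a pipeline stage reports in, and the browser's EventSource hands the parsed event to the UI.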
LIVE DEMO
EMX
emx.firebots.cloud
Emotion Pad — built with Claude Code
Upload a photo, watch face detection move the dot
Walk through the pipeline as it happens
Album cover, music, video — all generated live
4 min. Have a good photo ready. If pipeline is slow, narrate while it works.
React to whatever EMX generates: "Well, Claude chose that album cover, not me."
Vibe Deployer
"Describe an app → AI builds it → deployed in 60 seconds"
Chat interface at vibe.firebots.cloud
Describe what you want in plain English
Claude writes the entire app
Auto-deployed to apps.firebots.cloud/{slug}
Real-time progress via WebSocket
Vibe Deployer: How It Works
Chat UI (nginx)
Simple frontend, sends prompt via fetch
n8n workflow
Receives webhook → calls Claude API → writes HTML
WS Relay (Node.js)
WebSocket for real-time updates + file writer
Apps server (nginx)
Serves generated apps at unique URLs
1 min. Visual: architecture diagram.
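Backup slide: the "file writer" step, sketched. This assumes the slug is derived from the prompt and that each app is a single index.html under an apps directory; both are assumptions for illustration, and slugify and write_app are hypothetical names. A real deployment would also need to handle two prompts that produce the same slug.

```python
import re
from pathlib import Path


def slugify(prompt: str, max_len: int = 40) -> str:
    """Turn a free-text prompt into a URL-safe slug."""
    slug = re.sub(r"[^a-z0-9]+", "-", prompt.lower()).strip("-")
    # Fall back to a fixed name if nothing usable is left.
    return slug[:max_len].rstrip("-") or "app"


def write_app(apps_root: Path, prompt: str, html: str) -> Path:
    """Write the generated HTML to <apps_root>/<slug>/index.html."""
    target = apps_root / slugify(prompt) / "index.html"
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_text(html, encoding="utf-8")
    return target
```

With a static file server pointed at apps_root, the new directory is reachable as soon as the file is written, which is what keeps the "deploy" step short.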
LIVE DEMO
Vibe Deployer
vibe.firebots.cloud
Type a prompt — something fun
Watch it generate in real-time
Open the deployed app
Show it actually works
3 min. Pick something ridiculous: "Let's build a Tinder for houseplants."
Games work great for demos.
A Day in My Terminal
Multiple terminal tabs: Claude Code sessions for different projects
Plan mode for anything non-trivial
Agent mode for research: "find all the places this API is called"
Git commits, PR creation, deployment — all from the conversation
CLAUDE.md as persistent memory — Claude remembers your server
I don't just code with AI. I run my entire infrastructure with it.
1.5 min. Visual: screenshot of multiple terminal tabs / tmux session.
The New Developer Desktop
Tabs: Claude Code + Cursor + browser + Slack + monitoring
Multiple agents running in parallel
Background tasks: "research this while I work on that"
Some devs run 3-4 Claude Code instances simultaneously
Simon Willison — Django co-creator
"An LLM agent runs tools in a loop to achieve a goal."
Addy Osmani — Director at Google Cloud AI, 14 yrs leading Chrome DevEx
Uses 2+ LLMs in parallel to cross-check approaches — calls it "model musical chairs."
The developer of 2026 is a conductor, not a typist.
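Backup slide: Willison's "tools in a loop" definition as a toy. This is a sketch under heavy assumptions, not any product's real agent loop. The "model" is a plain function (scripted_model, hypothetical) that returns either a tool call or a final answer, and run_agent is an illustrative name.

```python
from typing import Callable, Dict, List, Optional, Tuple

# A decision is ("call", tool_name, argument) or ("done", answer, "").
Decision = Tuple[str, str, str]


def run_agent(
    goal: str,
    decide: Callable[[List[str]], Decision],
    tools: Dict[str, Callable[[str], str]],
    max_turns: int = 10,
) -> Optional[str]:
    """Let the model call tools until it declares the goal achieved."""
    history = [f"goal: {goal}"]
    for _ in range(max_turns):
        kind, name, arg = decide(history)
        if kind == "done":
            return name  # second slot carries the final answer
        result = tools[name](arg)
        # Feed the tool result back so the next decision can use it.
        history.append(f"{name}({arg}) -> {result}")
    return None  # gave up: turn budget exhausted


def scripted_model(history: List[str]) -> Decision:
    """Hypothetical stand-in for an LLM: search once, then answer."""
    if len(history) == 1:
        return ("call", "grep", "fetch_user")
    return ("done", f"found: {history[-1]}", "")
```

The max_turns cap is the part worth pointing at: the loop is only safe to leave running because something outside the model decides when it stops.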
What I Learned Along the Way
Plan mode is non-negotiable
"Measure twice, cut once" applies to AI too
You must understand the code
You're the architect, not the AI
The AI makes mistakes
But so do humans, and AI is faster at fixing those mistakes
It's not magic — it's a workflow
The magic is in knowing when to use which level
It changed how I think
I think in systems and architecture, less in syntax
2 min. "The AI makes mistakes — but at 3 AM it doesn't send passive-aggressive Slack messages about it."
The Timeline
Feb 2025 — Karpathy coins "vibe coding"
Mar 2025 — 25% of YC's Winter 2025 batch has codebases 95% AI-generated
Mid 2025 — CEO mandates (Shopify, Coinbase, Klarna)
Late 2025 — Claude Code, Codex, Cursor explode
2026 — 90% of Anthropic's code is AI-written
Now — You're watching a talk built with this technology
30 sec. Fast visual timeline. Let the last item land.
My Challenge to You
You're all experienced developers. You know how to code.
The question isn't whether AI can help you.
The question is: at which level are you going to engage?
Level 1: You're leaving 80% of the value on the table
Level 2: You're getting there
Level 3: Welcome to the future. It's already here.
Try Claude Code for one week. Use plan mode. Stay in control.
I guarantee you won't go back.
1 min. This is your closer. Deliver with conviction.
"This presentation was also planned with Claude Code. Even the jokes. Sorry about those."
Resources
Claude Code — claude.ai/code
EMX — emx.firebots.cloud
Vibe Deployer — vibe.firebots.cloud
Alexander Gomez
15 sec. Leave this up while transitioning to Q&A.
Questions?
Remaining time. Background: animated terminal cursor.