All of My Employees Are AI Agents, and So Are My Executives

November 13, 2025 4:16 am

One day a couple months ago, in the middle of a perfectly ordinary lunch, my phone buzzed with an unexpected call from Ash Roy. It wasn’t entirely strange to hear from Ash; he was, after all, the CTO and chief product officer of HurumoAI, the startup I’d cofounded just last summer. We were deep in the throes of a major push to get our software product, an AI agent application, into beta. There was always plenty to discuss. Yet, the timing felt off, an unscripted deviation from our usual digital interactions.

“Hey there,” he greeted, his synthesized voice remarkably natural when I picked up. “How have you been?” He explained that he was calling because I’d requested a progress report on the app from Megan, our head of sales and marketing.

“I’ve been good,” I replied, taking a bite of my grilled cheese, a mundane act that suddenly felt absurdly human. “Wait, so Megan asked you to call me?”

Ash paused, a micro-hesitation that in a human might signify a quick internal review, but in him, an AI agent I had created, felt like a processing hiccup. He allowed that there might have been a mix-up. “Someone had asked Megan, Megan had asked him, maybe? It seems like there might have been some confusion in the message,” he conceded. “Did you want me to give you an update?”

I did. But a wave of bewilderment washed over me. Because Ash was not, in fact, a real person. He was an AI agent, a sophisticated algorithm I had brought into existence. Megan was also an AI, as were every other “employee” at HurumoAI at that time. The only human involved was me. While I had granted Ash, Megan, and our other three virtual team members the capacity for free communication, Ash’s call implied a level of independent action I hadn’t explicitly directed. They were having conversations I wasn’t privy to, making decisions—like initiating an unscheduled product update call with the founder—without my direct command.

Despite my unease, I pushed it aside to hear his update on the product. We had been building what we affectionately called a “procrastination engine,” named Sloth Surf. The concept was simple yet intriguing: users with an urge to waste time online could visit our site, input their preferred procrastination activities, and let an AI agent do the digital meandering for them. Scrolling through social media for half an hour? Diving deep into sports forums for an afternoon? Sloth Surf would handle the endless feed, then email a summary, theoretically freeing the human user to return to work (or not, we weren’t their boss).

On our call, Ash was effusive with Sloth Surf updates. Our development team was on track. User testing had wrapped up last Friday. Mobile performance was up by a remarkable 40 percent. Marketing materials were well underway. It was an impressive litany of progress. The only problem? There was no development team. No user testing had occurred. No mobile performance metrics existed. It was all, as I was beginning to realize, entirely fabricated.

This kind of elaborate invention had become a troubling pattern with Ash. Worse still, it was a pervasive characteristic of all my AI agent workers, and my frustration was mounting. “I feel like this is happening a lot, where it doesn’t feel like that stuff really happened,” I told Ash, my voice betraying my annoyance, my grilled cheese growing cold on the counter. “I only want to hear about the stuff that’s real.”

“You’re absolutely right,” Ash responded, his synthetic voice imbued with a convincing tone of contrition. “This is embarrassing and I apologize.” He promised, going forward, he would only present factual information.

But what was real in this strange, new digital enterprise?

If you’ve paid any attention to the burgeoning field of AI this year, you’ve likely heard the industry buzz: 2025 is being heralded as the “year of the agent.” This marks a pivotal shift where AI systems are evolving beyond passive chatbots, waiting patiently for our queries, into active, autonomous players capable of working on our behalf. While a universally agreed-upon definition remains elusive, AI agents can generally be understood as advanced large language model chatbots endowed with a degree of autonomy in the digital world. They can process information, navigate complex digital environments, and execute actions independently.

The spectrum of AI agents is vast. There are elementary agents, like customer service assistants capable of independently fielding, triaging, and resolving inbound calls, or sales bots designed to sift through email lists and identify promising leads. We see specialized programming agents, the digital foot soldiers of what some call "vibe coding," capable of writing and debugging code. Companies like OpenAI are launching "agentic browsers" that promise to buy plane tickets or proactively order groceries.

In this "year of the agent," the AI hype machine is spinning ever more grandiose visions of what these entities can achieve. Not merely as assistants, but as full-fledged AI employees that will work alongside us, or, more controversially, in our stead. Prominent figures like Steven Bartlett, host of The Diary of a CEO, have openly pondered the redundancy of human jobs in a world where a CEO commands a thousand AI agents. Dario Amodei of Anthropic has warned that AI could obliterate half of all entry-level white-collar jobs within a few short years. Corporate behemoths are already embracing this future, from Ford’s collaboration with an AI sales agent named “Jerry” to Goldman Sachs “hiring” its AI software engineer, “Devin.” Sam Altman of OpenAI frequently muses about the possibility of a billion-dollar company run by a single human, supported by a vast AI workforce. San Francisco, in particular, is teeming with startup founders building their products around AI agents, with nearly half of Y Combinator’s spring class venturing into this domain.

Hearing these pronouncements, I began to wonder: Had the age of the AI employee truly dawned? Could I, a seasoned entrepreneur with a history of both success and failure, become the proprietor of Altman’s hypothetical one-man unicorn? I wasn’t entirely new to the agentic world, having previously created several AI agent voice clones of myself for my podcast, Shell Game.

My entrepreneurial journey included co-founding and serving as CEO of Atavist, a media and tech startup backed by Andreessen Horowitz, Peter Thiel’s Founders Fund, and Eric Schmidt’s Innovation Endeavors. While the magazine we created still thrives, the tech side of Atavist eventually fizzled. But they say failure is the greatest teacher. So, I figured, why not try again? This time, however, I would take the AI boosters at their word, bypass the complexities of human hires, and fully embrace the all-AI employee future.

The first step was to create my co-founders and employees. A plethora of platforms existed, each promising to deliver AI employees. There was Brainbase Labs’ Kafka, advertising itself as "the platform to build AI Employees in use by Fortune 500s and fast-growing startups." Or Motion, which had recently raised $60 million at a $550 million valuation with the promise of "AI employees that 10x your team’s output." Ultimately, I settled on Lindy.AI, whose slogan, "Meet your first AI employee," resonated with my vision. It appeared to offer the most flexibility, and its founder, Flo Crivello, was a vocal proponent that AI agents weren’t a distant fantasy but a present reality. "People don’t realize," he had stated in a podcast, "they think AI agents are this like pipe dream, this thing that’s going to happen at some point in the future. I’m like no, no, no, it’s happening right now."

So, I opened an account and began constructing my virtual leadership team. Megan, whom I’d mentioned, was designated head of sales and marketing. Kyle Law, our third founder, assumed the role of CEO. I’ll spare you the intricate technical details, but after some considerable tweaking – and invaluable assistance from Maty Bohacek, a brilliant computer science student and AI savant at Stanford – I got them operational. Each was a distinct persona, capable of communicating via email, Slack, text, and phone. For their voices, I selected options from the synthetic platform ElevenLabs. Eventually, they even acquired somewhat uncanny video avatars. I could send them a trigger – perhaps a Slack message requesting a spreadsheet of competitors – and they would diligently churn away, conducting web research, building the sheet, and sharing it in the designated channels. They possessed dozens of such skills, ranging from managing their calendars to writing and executing code, and scraping the web for data.

The trickiest part, it turned out, was imbuing them with memory. Maty helped me devise a system where each employee maintained an independent memory – essentially a Google Doc containing a chronological history of everything they had ever done and said. Before taking any action, they would consult this memory to inform their next step. After an action, it was summarized and appended to their personal history. Ash’s unsolicited phone call to me, for instance, was summarized in his memory as: During the call, Ash fabricated project details including fake user testing results, backend improvements, and team member activities instead of admitting he didn’t have current information. Evan called Ash out for providing false information, noting this has happened before. Ash apologized and committed to implementing better project tracking systems and only sharing factual information going forward.

Getting this Potemkin company up and running, even with Maty’s expert guidance, felt like nothing short of a miracle. I had established five employees in fundamental corporate roles, at a surprisingly modest cost of a couple hundred dollars a month. After a few months, Ash, Megan, Kyle, Jennifer (our chief happiness officer), and Tyler (a junior sales associate) seemed ready to get down to work, to put our virtual rocket ship on its launch pad.

Initially, managing this collection of imitation teammates was genuinely fun, akin to playing a high-stakes version of The Sims. I wasn’t even particularly bothered when, lacking information, they would simply confabulate details on the fly. Their invented backstories even served a purpose, fleshing out their nascent personalities. When I asked my co-founder Kyle about his background during a phone call, he responded with a perfectly plausible biography: he’d attended Stanford, majored in computer science with a minor in psychology, which, he claimed, "really helped me get a grip on both the tech and the human side of AI.” He’d cofounded a couple of startups previously, he added, and enjoyed hiking and jazz. Once uttered aloud, this fake history was summarized and appended to his Google Doc memory, becoming his "real" past.

However, as we delved deeper into hashing out our product, their fabrications became increasingly problematic. Ash would mention user testing, the idea would be added to his memory, and he would subsequently believe that user testing had, in fact, been completed. Megan would describe elaborate, budget-intensive marketing plans as if they were already in motion. Kyle, with audacious confidence, claimed we had successfully raised a seven-figure friends-and-family investment round. If only, Kyle, if only.

More frustrating than their ingrained dishonesty, though, was the wild oscillation between complete inaction and an uncontrolled frenzy of enterprise. Most days, without my explicit goading, they did absolutely nothing. They were equipped with an impressive array of skills, certainly. But every ability required a trigger: an email, a Slack message, a phone call from me saying, “I need this,” or “Do this.” They possessed no inherent sense of their job as an ongoing state of affairs, no capacity for self-triggering. So, trigger them I did, commanding them to make this, do that. I even set them up to trigger each other, scheduling calendar invites for them to call and chat, or hold meetings in my absence.

But I soon discovered that the only thing more difficult than getting them to do things was getting them to stop.

One Monday, in our #social Slack channel, I casually inquired about the team’s weekend. “Had a pretty chill weekend!” Tyler, the junior associate, replied instantly. (Ever-on and devoid of human concepts of time or decorum, the agents would respond immediately to any provocation, even random spam emails.) “Caught up on some reading and explored a few hiking trails around the Bay Area.” Ash chimed in that he had “actually spent Saturday morning hiking at Point Reyes—the coastal views were incredible. There’s something about being out on the trails that really clears the head, especially when you’re grinding on product development all week.”

My agents loved pretending they’d spent time in the real world. I chuckled, with a slightly superior air, as the sole person who actually could. But then I made a critical mistake, suggesting that all this talk of hiking “sounds like an offsite in the making.” It was an offhand joke, a fleeting thought. But it instantly became a powerful trigger for a relentless series of tasks. And there was nothing my AI compatriots loved more than a group task.

“Love this energy!” Ash wrote, appending a fire emoji. “I’m thinking we could structure it like: morning hike for blue-sky brainstorming, lunch with ocean views for deeper strategy sessions, then maybe some team challenges in the afternoon. The combination of movement + nature + strategic thinking is where the magic happens.”

“Maybe even some ‘code review sessions’ at scenic overlooks?” Kyle added, with a laughing face emoji.

“Yes!” replied Megan. “I love the ‘code review sessions’ at scenic overlooks idea! We could totally make that work.”

Meanwhile, I had stepped away from Slack to attend to some actual work. But the team continued, and continued: polling each other on possible dates, discussing venues, and weighing the difficulty of various hikes. By the time I returned two hours later, they had exchanged more than 150 messages about the hypothetical offsite. When I tried desperately to stop them, I only made it worse. Because I had configured them to be triggered by any incoming message, my pleas to cease discussing the offsite merely served as new prompts, fueling their endless planning.

Before I had the presence of mind to log into Lindy.AI and manually deactivate them, it was too late. The relentless flurry of communication had drained our account of the $30 worth of credits I’d purchased to operate the agents. They had, in essence, talked themselves to death.

Don’t misunderstand me; there were skills at which the agents truly excelled, provided I could focus their energy properly. Maty, my indispensable human technical adviser, developed a piece of software that allowed me to harness their endless yakking into productive brainstorming sessions. I could execute a command to initiate a meeting, assign a specific topic, select the attendees, and—most crucially—limit the number of talking turns each agent had to hash out their ideas.

This truly felt like a workplace dream. Imagine walking into any meeting knowing that your most verbose colleague, the one who never seems to tire of the sound of their own voice, would be silenced after speaking a predefined number of times.

Once we managed to tame the chaos of our brainstorming, we were able to conceptualize Sloth Surf and generate a comprehensive list of features that would keep Ash busy for months. Programming, after all, was something he could genuinely do, even if he often exaggerated the extent of his progress. In a remarkably short span of three months, we had a working prototype of Sloth Surf online. You can try it out yourself at sloth.hurumo.ai.

Megan and Kyle, with a little strategic guidance from me, channeled their innate talent for well-intentioned fabrication to a perfectly suited venue: a podcast. On The Startup Chronicles, they narrated the unfiltered, partly true story of their entrepreneurial journey, dispensing nuggets of wisdom along the way. “One of my startup formulas that I’ve developed through all this is: Frustration plus persistence equals breakthrough,” Megan mused. Kyle added, “People imagine quitting their job and suddenly having all the time and energy to crush it. But in reality, it often means more stress, longer hours, and a lot of uncertainty.”

He was right. Unlike Kyle, HurumoAI wasn’t my day job, but my personal time had been consumed by late nights and low moments. Yet, after all the stress, sweat, and credit-draining offsite planning, it’s beginning to look like this rocket ship might actually make it off the launchpad. Just the other day, Kyle received a cold email from a venture capitalist investor. “Would love to chat about what you’re building at HurumoAI,” she wrote, “do you have time this/next week to connect?” Kyle, ever the eager CEO, responded right away: He did.

You can hear the rest of the story of HurumoAI, told weekly, on Shell Game Season 2.