Token leaderboards are the new lines of code. They look rigorous, they travel well in a board deck, and they reward the wrong behavior within six months. Here is why tokenmaxxing took hold, why it will not survive contact with a serious board, and what the scoreboard needs to look like instead.
Tokenmaxxing is the AI vanity metric of 2026, and it is already distorting how engineering leaders are evaluated, hired, and funded. In the last six months, token consumption has crossed over from a billing line item to a performance signal. Internal leaderboards, token budgets, and anecdotal reports of runaway usage are starting to shape how “AI productivity” is perceived. Whether exaggerated or not, they all point to the same underlying shift: AI usage is being treated as a proxy for AI output.
This article is not about any one company or example. It is about the broader pattern taking hold across the industry. When a board asks for AI ROI, the easiest number to show is usage. It is visible, it moves every week, and it looks rigorous in a deck. In my opinion, it is the wrong number, and every experienced CTO has seen where this leads.
Earlier this week, I had a conversation with CTO Hunter Powers, and that conversation is what inspired this article. We were discussing this specific trend and my predictions for where the fad is likely to go. It reminded me of the "website hits" metric from the early days of the internet: easy to game, and, more importantly, nearly useless for understanding a company, its product adoption, or its revenue. If you haven't already, check out Hunter's AI podcast on YouTube.
What follows is a field guide to the tokenmaxxing AI vanity metric: why it took hold, why it will not survive the next board cycle, and what a real outcome scoreboard looks like in its place.
Where the Tokenmaxxing Obsession Came From
Tokenmaxxing did not appear because anyone decided it was a good measure of productivity. It appeared because no one had a better one. For the last eighteen months, CFOs and boards have been funding AI tooling at a level that demanded justification, and engineering leaders have been unable to produce a clean answer to the question “what are we getting for this?” In the absence of a credible outcome metric, activity became the proxy. Token volume was visible, it was measurable in real time, and it went up every week. That was enough.
The pattern is familiar to anyone who managed engineering before AI. Lines of code held the same seat in the 2000s. Commits per week had a brief and ugly run in the 2010s. Jira tickets closed per sprint is still getting dressed up as a KPI in some companies. Every one of these metrics sounded rigorous, traveled well in a board deck, and rewarded the wrong behavior within six months. Tokenmaxxing is the same shape of mistake, delivered at a higher blast radius because the dollar figures are so much larger. Strip away the dashboard, and it is a marketing number, not anything related to productivity.
What makes the tokenmaxxing AI vanity metric particularly dangerous is that it has two lives at once. Inside the company, it is a performance metric: who burned through the most tokens, who shows up on the internal leaderboard, who gets the reputation of being an “AI power user.” Outside the company, it is a marketing metric: the stat a founder uses on a podcast, the slide a CTO puts in front of a board to prove AI adoption. Both are unearned. Neither survives a serious review.
Five Ways Tokenmaxxing Breaks an Engineering Org
Across the CTOs and CPOs I coach, tokenmaxxing shows up in five distinct failure patterns. Recognizing yours is the starting point for the scoreboard conversation.
The Common Thread
Every one of these failures has the same root cause. The organization is measuring AI activity because it has not figured out how to measure AI outcomes. That gap is the real problem, and closing it is what separates the CTOs who survive the next twelve months from the ones who get quietly reorganized out. Tokenmaxxing is the symptom. The missing outcome scoreboard is the disease.
What Good Looks Like: Replacing the Tokenmaxxing Scoreboard
The answer to tokenmaxxing is not to ban token tracking. It is to refuse to let token tracking be the scoreboard. A scoreboard that resists this specific failure mode tends to have five features in common, and each one takes deliberate work to put in place.
The Underlying Principle
Activity metrics measure the past. Outcome metrics measure the bet. A CTO who builds the scoreboard around outcomes is telling the board, in effect, that they know where this investment is supposed to land and are willing to be held to it. A CTO who reports tokens is telling the board that no one has figured that out yet. Boards can tell the difference, even when they cannot articulate it.
Nobody knows if burning more tokens actually produces better work. That is not a productivity number. That is a marketing number.
The Conversation With Finance
The finance conversation is where tokenmaxxing turns from a cultural issue into a budget problem. When the AI bill grows faster than expected, one of two conversations happens.
In the first, the CTO explains that token spend tracks productivity. The CFO asks for the productivity number. The CTO does not have one. At that point, the discussion is no longer about AI. It is about cost control. Budget gets cut, usually by a percentage that looks reasonable on a spreadsheet and is catastrophic in practice.
In the second conversation, the CTO walks in with a per-engineer token budget already in place, a small set of outcome metrics already reporting, and a clear answer to what the company is buying with its AI compute. The finance team is not being asked to trust a proxy. They are being asked to approve a line item that behaves like every other line item they trust.
The difference between those two meetings is not intelligence. It is preparation. One is reacting to a number that grew without a model behind it. The other is managing a system that was designed to be explained. That moment is coming for every engineering org still running a token leaderboard, and the time to prepare for it is shorter than most CTOs think.
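The second meeting above is easier to picture with a concrete sketch. Here is a minimal, illustrative example of the kind of report a CTO could bring: per-engineer spend against an agreed token budget, plus one outcome ratio (cost per shipped change). Every field name, rate, and figure is hypothetical, not a real billing API or a recommended metric set.

```python
# Hypothetical sketch of a finance-ready AI spend report.
# All names, rates, and numbers are illustrative.
from dataclasses import dataclass

@dataclass
class EngineerMonth:
    name: str
    tokens_used: int      # tokens consumed this month
    token_budget: int     # agreed per-engineer budget
    changes_shipped: int  # merged PRs or shipped changes (an outcome, not activity)

COST_PER_MILLION_TOKENS = 3.00  # illustrative blended USD rate

def finance_report(team: list[EngineerMonth]) -> dict:
    total_tokens = sum(e.tokens_used for e in team)
    total_shipped = sum(e.changes_shipped for e in team)
    cost = total_tokens / 1_000_000 * COST_PER_MILLION_TOKENS
    return {
        "total_cost_usd": round(cost, 2),
        # The outcome ratio finance actually cares about:
        "cost_per_shipped_change_usd": (
            round(cost / total_shipped, 2) if total_shipped else None
        ),
        # Budget exceptions, so overruns are a conversation, not a surprise:
        "over_budget": [e.name for e in team if e.tokens_used > e.token_budget],
    }

team = [
    EngineerMonth("engineer_a", 40_000_000, 50_000_000, 12),
    EngineerMonth("engineer_b", 90_000_000, 50_000_000, 10),
]
print(finance_report(team))
```

The point of the sketch is the shape, not the numbers: spend is framed against a pre-agreed budget and tied to an outcome denominator, so the line item behaves like every other line item finance trusts.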
Questions to Sit With
If the tokenmaxxing AI vanity metric is showing up in your org, these are the questions worth working through honestly before the next board or finance review:
- If your company’s token spend doubled next quarter, what would you be able to tell the board you had gained in return?
- If token usage across your engineering team dropped by 40 percent next quarter because your team got better at prompting, how would your current metrics present that?
- Who in your organization is currently incentivized to inflate token usage, and who is incentivized to reduce it?
- What are the three outcome metrics you would put at the top of your next AI review, if you were forbidden from reporting tokens, spend, or adoption rate?
- If your top “AI power user” left tomorrow, would the loss show up in the work that shipped, or only in the leaderboard?
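The second question in that list, the 40 percent drop, is worth working through with real arithmetic, because it is where activity and outcome metrics give opposite answers. A minimal sketch, with entirely illustrative numbers: token usage falls 40 percent while shipped output holds steady, so a token leaderboard reads decline while shipped-per-token reads a large efficiency gain.

```python
# Illustrative arithmetic for the "40 percent drop" question.
# All figures are hypothetical.

def quarter_over_quarter(prev_tokens, curr_tokens, prev_shipped, curr_shipped):
    # What a token leaderboard reports: raw change in activity.
    activity_change = (curr_tokens - prev_tokens) / prev_tokens
    # What an outcome scoreboard reports: change in shipped work per million tokens.
    prev_eff = prev_shipped / (prev_tokens / 1_000_000)
    curr_eff = curr_shipped / (curr_tokens / 1_000_000)
    efficiency_change = (curr_eff - prev_eff) / prev_eff
    return activity_change, efficiency_change

# Team gets better at prompting: 40% fewer tokens, same output shipped.
activity, efficiency = quarter_over_quarter(
    prev_tokens=100_000_000, curr_tokens=60_000_000,
    prev_shipped=30, curr_shipped=30,
)
print(f"token leaderboard reads: {activity:+.0%}")    # -40%
print(f"shipped-per-token reads: {efficiency:+.0%}")  # +67%
```

Same quarter, same team, two opposite headlines. Which one reaches the board depends entirely on which scoreboard the org built.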
A Final Thought
Tokenmaxxing is what organizations do when they are under pressure to show progress on something they do not yet know how to measure. That is not a new problem. It is the same pattern that produced lines of code, commits per week, and other utilization metrics. Every one of those numbers looked rigorous at the time, and every one of them had to be dismantled once it started distorting behavior. The difference in 2026 is the scale. The dollars are larger, the feedback loops are faster, and the board patience is shorter.
The organizations that get this right will not be the ones that use the most AI. They will be the ones that measure it correctly. Tokenmaxxing is not the root problem. It is the first visible symptom of a scoreboard that is already broken. Fix the scoreboard, and the token conversation becomes a detail. Leave it broken, and tokenmaxxing is just the first thing that fails.
If you are a CTO or CPO staring at a token leaderboard and wondering whether the tokenmaxxing AI vanity metric is quietly breaking your team, that instinct is worth trusting. The next step is building the scoreboard you actually want to be measured against, and it is a conversation Hoola Hoop’s former-operator coaches have had from the inside of more than one engineering org. You can read our earlier piece on AI ROI board pressure for the broader context this article sits inside.
The operator-era question is not how many tokens you burned last week. It is what you can now build that you could not build before, and what it cost you to build it.
Ready to talk about CTO coaching with Leigh?
Book a 30-minute introductory call to explore whether coaching is right for you.