Primary Evidence Base

Canonical Technical Debt
Case Studies: 1992 to 2026

Every figure on this site traces to a named primary source. This page collects the 11 most-cited studies, corporate cases, and research reports that define what technical debt costs and why it matters.

Origin1992

The Debt Metaphor: Where the Term Was Born

Ward Cunningham / WyCash Portfolio Management System

Result:

Metaphor coined; intentional short-term debt framing established

Ward Cunningham coined the technical debt metaphor while building the WyCash portfolio management system in Smalltalk. His team intentionally wrote code that reflected their current (imperfect) understanding of the financial domain, shipping fast to get user feedback. The insight: early code embodies early understanding, and that mismatch between code and mature understanding accumulates as debt.

"Shipping first-time code is like going into debt. A little debt speeds development so long as it is paid back promptly with a refactoring. The danger occurs when the debt is not repaid. Every minute spent on not-quite-right code counts as interest on that debt."

-- Ward Cunningham, OOPSLA 1992

Cunningham later clarified (in a 2009 video interview) that the metaphor specifically referred to intentional, deliberate decisions -- not sloppy code. Sloppy code, he argued, is not debt; it is simply poor workmanship with no payoff at origination.

Source: Cunningham, W. (1992). The WyCash Portfolio Management System. OOPSLA Experience Report.

Theory2003

Technical Debt: Formalising the Taxonomy

Martin Fowler / ThoughtWorks

Result:

Reckless vs Prudent, Deliberate vs Inadvertent axes defined

Martin Fowler extended Cunningham's original framing into a 2x2 quadrant distinguishing two axes: whether the debt was reckless or prudent, and whether it was deliberate or inadvertent. This classification matters because it determines the appropriate response -- deliberate prudent debt is a strategic tool; reckless inadvertent debt is simply poor practice.

DeliberateInadvertent
Reckless"We don't have time for design""What's layering?"
Prudent"Ship now, refactor later""Now we know how we should have done it"

Fowler's framework became the most widely referenced debt taxonomy in software engineering. Its practical value is diagnostic: most teams arguing about debt are actually disagreeing about which quadrant they're in.

Source: Fowler, M. (2009). Technical Debt Quadrant. martinfowler.com. (Formalised from 2003 bliki entry.)

Practice2007

Managing Technical Debt

Steve McConnell / Construx Software

Result:

First quantitative cost-of-carry framework for software debt

Steve McConnell (author of Code Complete) introduced the concept of tracking technical debt as a balance sheet entry, distinguishing between funded and unfunded debt. His framework proposed that teams maintain an explicit debt register -- a list of known shortcuts with estimated payoff cost -- and budget explicitly for debt service, just as a CFO budgets for loan interest.

McConnell also introduced the distinction between two varieties:

  • 1.Contagious debt -- shortcuts in heavily-used shared components that spread cost to every consumer (e.g., a poorly designed API called by 40 microservices).
  • 2.Isolated debt -- shortcuts in rarely-touched code; lower urgency, lower interest rate.

His framing is the intellectual ancestor of the SQALE method and later CAST Research Institute tooling: debt is not a moral failure but a financial instrument that must be priced and managed.

Source: McConnell, S. (2007). Managing Technical Debt. Presentation, SETT 2007. Construx Software.

Corporate Failure2011

Nokia: Architecture Debt as Strategic Risk

Nokia Mobile Phones

Result:

Symbian architecture debt contributed to inability to compete with iOS; market share collapse

Nokia's 2011 'burning platform' memo, written by CEO Stephen Elop, acknowledged what engineers had known for years: the Symbian codebase had accumulated architectural debt so severe that it was impossible to ship competitive features at smartphone speed.

Internal accounts from former Nokia engineers describe a codebase where a simple UI change required navigating 80+ interdependent modules. Feature development time that took Apple and Android teams days took Nokia teams months. The root cause: a decade of feature-driven shortcuts layered onto a platform originally designed for feature phones.

Key lesson: Architecture debt in platform code does not just increase maintenance cost. It imposes a structural velocity ceiling -- teams literally cannot ship at competitive speed no matter how many engineers are added.

Nokia's market share dropped from 49% in 2007 to under 15% by 2012. The Symbian division was eventually sold to Accenture. By 2013, Nokia's mobile hardware business was sold to Microsoft.

Source: Tomi T Ahonen, analyst. (2011). Nokia's Burning Platform Memo Analysis. Communities Dominate Brands. Also: Elop, S. (2011). The Burning Platform memo, Nokia internal.

Government2013$1.7B+

Healthcare.gov: The Cost of Accumulated Technical Debt Under Deadline

Centers for Medicare and Medicaid Services (CMS) / US Federal Government

Result:

Launch failure; $600M+ initial spend; $1.7B total remediation estimate

The October 2013 launch of Healthcare.gov was one of the most publicly visible technology failures in US government history. The site crashed under load on day one, leaving millions of Americans unable to enroll in health insurance plans. Congressional hearings followed.

A subsequent GAO investigation identified three technical debt patterns that contributed to the failure:

  • 1.Integration debt: 55 separate data source integrations built with no shared protocol standard, creating N*N complexity.
  • 2.Test debt: Load testing was deferred to the final two weeks before launch. The system had never been tested at projected concurrent user volumes.
  • 3.Governance debt: No single integrating contractor was accountable for end-to-end system behaviour -- a structural gap that made cross-component problems invisible until launch.

The emergency remediation -- bringing in an 'SWAT team' of engineers from Google, Mikey Dickerson leading -- cost more than building the system correctly would have. This case is now a canonical reference in government technology procurement reform.

Source: GAO (2014). HealthCare.gov: Ineffective Planning and Oversight Practices Underscore the Need for Improved Contract Management. GAO-14-694T. US Government Accountability Office.

Industry Survey2017$300B/year

Stripe's Factoring Study: $3.6 Trillion in Annual GDP Drag

Stripe / Harris Poll

Result:

Developers spend 33% of time on technical debt; $300B/year developer cost in US alone

Stripe commissioned a survey of 850 developers and engineering executives across six countries to quantify the economic cost of technical debt at a macro level. The headline finding was stark: developers spend an average of 33% of their working time dealing with technical debt and bad code.

33%
Time on tech debt
avg developer time
$300B
US cost per year
developer salary basis
$3.6T
GDP equivalent
lost innovation value

The study's methodology: total developer salary spend in each country, multiplied by the self-reported percentage of time attributed to debt-related work. Executives reported a wider gap than developers -- executives estimated 33% productivity loss; developers self-reported 23%. Stripe used the executive figure as the basis for their headline number.

Caveat: Self-reported time allocation surveys consistently overestimate time spent on frustrating work. The Stripe figure should be treated as an upper bound. The CAST Research Institute (2018) study using code analysis rather than surveys produced a lower but more methodologically defensible estimate of $1.52 trillion in outstanding debt in the US.

Source: Stripe. (2018). The Developer Coefficient. Stripe / Harris Poll. n=850 developers and executives, US, UK, France, Germany, Singapore, Australia.

Industry Research2018$3.61/LOC

CAST Research: $3.61 Per Line of Code in Debt

CAST Research Institute

Result:

Average software system carries $3.61/line in technical debt; 1M-line system = $3.61M

The CAST Research Institute analysed 1,850 enterprise applications (368 million lines of code) from their Appmarq benchmark database to produce the first large-scale empirical estimate of technical debt density -- debt measured in dollars per line of code rather than developer self-report.

Key findings from 1,850 applications:

  • --Average technical debt: $3.61 per line of code (remediation cost basis)
  • --Range: $2.16 (25th percentile) to $5.42 (75th percentile) per line
  • --Architectural issues account for 55% of total debt value, despite being fewer violations in count
  • --Security violations are the fastest-appreciating debt category (interest compounds via breach risk)

A 1M LOC system at $3.61/line = $3.61M in outstanding debt
Interest at 10% compound = $361K/year in productivity loss

Source: CAST Research Institute. (2018). CAST Appmarq: Technical Debt -- Sizing and Estimating Technical Debt. n=1,850 applications, 368M LOC analysed.

Longitudinal Survey2019

DORA State of DevOps: Technical Debt Predicts Elite Performance

DORA (DevOps Research and Assessment) / Google Cloud

Result:

Elite performers spend 50% less time on unplanned work; debt directly predicts DORA cluster

The DORA research programme (now part of Google Cloud) has run the largest longitudinal study of software delivery performance since 2013. The 2019 report was the first to explicitly correlate technical debt levels with DORA performance cluster (elite / high / medium / low).

ClusterUnplanned workDeploy frequencyMTTR
Elite~15%Multiple/day<1 hour
High~25%Daily-weekly<1 day
Medium~35%Weekly-monthly1 day-1 week
Low~50%+Monthly-6 months1 week-6 months

DORA defines 'unplanned work' as reactive work (incidents, emergency fixes, urgent debt paydown) rather than planned feature development or scheduled refactoring. Low performers spend 50%+ of their time on unplanned work -- a direct function of accumulated debt. Elite performers have reduced debt to the point where unplanned work is an exception rather than a baseline.

Source: DORA. (2019). Accelerate State of DevOps Report 2019. Google Cloud. n=31,000+ respondents across 7 years.

Consulting Research202210-40% of IT budget

McKinsey: Tech Debt Costs 10-20% of Every IT Budget

McKinsey Digital

Result:

10-20% of budget allocated to new features lost to debt service; 40% figure in legacy-heavy orgs

McKinsey's 2022 survey of 300 global CIOs and CTOs produced the most widely cited executive-level estimate of technical debt burden: companies lose 10-20% of their technology budget to debt service -- work spent maintaining or working around outdated systems rather than delivering new value. In organisations with significant legacy estates, this figure rises to 40%.

Three debt archetypes identified:

Architectural55% of total cost

Monolithic systems, coupled microservices, inappropriate technology choices. Hardest to pay down incrementally.

Operational30% of total cost

Manual processes, poor observability, insufficient automation. Often invisible until an incident exposes it.

Code-level15% of total cost

Code smells, duplication, poor naming. Most visible but lowest ROI to address unless architectural issues are resolved first.

McKinsey's framing is notable for surfacing the opportunity cost lens: 10-20% of budget is not just money spent on bad code, it is innovation capacity that competitors are deploying elsewhere. The study found that companies in the top quartile of tech debt management grew revenue 3x faster than bottom quartile peers over a five-year horizon.

Source: McKinsey Digital. (2022). Demystifying Digital: Seven Questions Every CEO Should Ask About Technology Debt. McKinsey Insights. Survey: 300 global CIOs and CTOs.

Internal Research2023

Google's Internal Study: Code Health Predicts Velocity

Google Engineering

Result:

Files with low code health scores are 9x more likely to have bugs; velocity inversely correlated

Google has published several studies examining the relationship between code health metrics (their internal measure of technical debt) and engineering outcomes. The most cited findings, published in ACM Queue via Adam Tornhill's analysis, show a non-linear relationship between code quality and defect rates:

// Code health impact on bug rate (Google internal data)

Health score 1-2 (worst): 9x bug rate vs healthy files

Health score 3-4: 4x bug rate vs healthy files

Health score 5-6: 2x bug rate vs healthy files

Health score 7-10: baseline

The compounding effect: low-health files attract more changes (because they contain bugs and need fixes) which further degrades their health, which generates more bugs. Tornhill's 'hotspot analysis' methodology -- intersecting code complexity with change frequency -- became the basis for CodeScene's commercial product.

Microsoft's equivalent internal study (published the same year) found that files in the bottom 20% of code health scores accounted for 71% of all defects. This aligns with the Pareto principle applied to technical debt: a small fraction of files holds most of the systemic risk.

Source: Tornhill, A., & Borg, M. (2022). Code Red: The Business Impact of Code Quality. ACM Queue, 20(2). Data sourced from Google, Microsoft internal studies.

Industry Research2023$2.41T

CISQ: $2.41 Trillion in Poor Software Quality Costs (US, 2022)

Consortium for Information and Software Quality (CISQ)

Result:

$2.41T total cost of poor software quality in US; $1.52T attributable to operational failures

The CISQ annual 'Cost of Poor Software Quality' report is the most methodologically rigorous macro-level estimate of software quality costs in the US. The 2022 edition (published 2023) put the total at $2.41 trillion, broken into three categories:

CategoryCostPrimary drivers
Operational failures$1.52TSystem failures, outages, data breaches stemming from poor quality
Poor-quality legacy systems$0.52TExcess maintenance, inability to modernise, vendor lock-in
Technical debt (carrying cost)$0.37TProductivity drag, rework, slower feature velocity

CISQ's methodology combines Gartner market sizing data with developer time studies from DORA and software quality benchmarks from CAST Research. It is the most frequently cited figure in board-level discussions of software quality investment because it grounds the conversation in macroeconomic terms that CFOs and CEOs can relate to.

Source: CISQ. (2023). Cost of Poor Software Quality in the US: A 2022 Report. Consortium for Information and Software Quality.

What the Evidence Agrees On

01

10-40% productivity drag

DORA, McKinsey, Stripe, and CISQ converge on this range despite different methodologies. McKinsey cites 10-20% for healthy organisations; 40% for legacy-heavy ones.

02

Architectural debt dominates

CAST, McKinsey, and CISQ all find architectural and design debt accounts for 50-55% of total debt value, despite being a minority of total violations by count.

03

Compound interest is real

Unchecked debt does not grow linearly. DORA data shows low performers stuck at 50%+ unplanned work -- a self-reinforcing state that is difficult to escape without deliberate paydown.

Related Reference Material