Canonical technical debt case studies: 1992 to 2026
Every figure on this site traces to a named primary source. This page collects the 11 most-cited studies, corporate cases, and research reports that define what technical debt costs and why it matters.
Jump To
The debt metaphor: where the term was born
Ward Cunningham / WyCash Portfolio Management System
Result: Metaphor coined; intentional short-term debt framing established
Ward Cunningham coined the technical debt metaphor while building the WyCash portfolio management system in Smalltalk. His team intentionally wrote code that reflected their current (imperfect) understanding of the financial domain, shipping fast to get user feedback. The insight: early code embodies early understanding, and that mismatch between code and mature understanding accumulates as debt.
"Shipping first-time code is like going into debt. A little debt speeds development so long as it is paid back promptly with a refactoring. The danger occurs when the debt is not repaid. Every minute spent on not-quite-right code counts as interest on that debt."
Ward Cunningham, OOPSLA 1992
Cunningham later clarified (in a 2009 video interview) that the metaphor specifically referred to intentional, deliberate decisions, not sloppy code. Sloppy code, he argued, is not debt; it is simply poor workmanship with no payoff at origination.
Source: Cunningham, W. (1992). The WyCash Portfolio Management System. OOPSLA Experience Report.
Technical debt: formalising the taxonomy
Martin Fowler / ThoughtWorks
Result: Reckless vs Prudent, Deliberate vs Inadvertent axes defined
Martin Fowler extended Cunningham's original framing into a 2x2 quadrant distinguishing two axes: whether the debt was reckless or prudent, and whether it was deliberate or inadvertent. This classification matters because it determines the appropriate response, deliberate prudent debt is a strategic tool; reckless inadvertent debt is simply poor practice.
Fowler's framework became the most widely referenced debt taxonomy in software engineering. Its practical value is diagnostic: most teams arguing about debt are actually disagreeing about which quadrant they're in.
Source: Fowler, M. (2009). Technical Debt Quadrant. martinfowler.com. (Formalised from 2003 bliki entry.)
Managing technical debt
Steve McConnell / Construx Software
Result: First quantitative cost-of-carry framework for software debt
Steve McConnell (author of Code Complete) introduced the concept of tracking technical debt as a balance sheet entry, distinguishing between funded and unfunded debt. His framework proposed that teams maintain an explicit debt register, a list of known shortcuts with estimated payoff cost, and budget explicitly for debt service, just as a CFO budgets for loan interest.
McConnell also introduced the distinction between two varieties.
- Contagious debt, shortcuts in heavily-used shared components that spread cost to every consumer (e.g., a poorly designed API called by 40 microservices).
- Isolated debt, shortcuts in rarely-touched code; lower urgency, lower interest rate.
His framing is the intellectual ancestor of the SQALE method and later CAST Research Institute tooling: debt is not a moral failure but a financial instrument that must be priced and managed.
Source: McConnell, S. (2007). Managing Technical Debt. Presentation, SETT 2007. Construx Software.
Nokia: architecture debt as strategic risk
Nokia Mobile Phones
Result: Symbian architecture debt contributed to inability to compete with iOS; market share collapse
Nokia's 2011 'burning platform' memo, written by CEO Stephen Elop, acknowledged what engineers had known for years: the Symbian codebase had accumulated architectural debt so severe that it was impossible to ship competitive features at smartphone speed.
Internal accounts from former Nokia engineers describe a codebase where a simple UI change required navigating 80+ interdependent modules. Feature development time that took Apple and Android teams days took Nokia teams months. The root cause: a decade of feature-driven shortcuts layered onto a platform originally designed for feature phones.
Key lesson: Architecture debt in platform code does not just increase maintenance cost. It imposes a structural velocity ceiling, teams literally cannot ship at competitive speed no matter how many engineers are added.
Nokia's market share dropped from 49% in 2007 to under 15% by 2012. The Symbian division was eventually sold to Accenture. By 2013, Nokia's mobile hardware business was sold to Microsoft.
Source: Tomi T Ahonen, analyst. (2011). Nokia's Burning Platform Memo Analysis. Communities Dominate Brands. Also: Elop, S. (2011). The Burning Platform memo, Nokia internal.
Healthcare.gov: the cost of accumulated technical debt under deadline
Centers for Medicare and Medicaid Services (CMS) / US Federal Government
Result: Launch failure; $600M+ initial spend; $1.7B total remediation estimate
The October 2013 launch of Healthcare.gov was one of the most publicly visible technology failures in US government history. The site crashed under load on day one, leaving millions of Americans unable to enroll in health insurance plans. Congressional hearings followed.
A subsequent GAO investigation identified three technical debt patterns that contributed to the failure.
- Integration debt: 55 separate data source integrations built with no shared protocol standard, creating N*N complexity.
- Test debt: Load testing was deferred to the final two weeks before launch. The system had never been tested at projected concurrent user volumes.
- Governance debt: No single integrating contractor was accountable for end-to-end system behaviour, a structural gap that made cross-component problems invisible until launch.
The emergency remediation, bringing in an 'SWAT team' of engineers from Google, Mikey Dickerson leading, cost more than building the system correctly would have. This case is now a canonical reference in government technology procurement reform.
Source: GAO (2014). HealthCare.gov: Ineffective Planning and Oversight Practices Underscore the Need for Improved Contract Management. GAO-14-694T. US Government Accountability Office.
Stripe's factoring study: $3.6 trillion in annual GDP drag
Stripe / Harris Poll
Result: Developers spend 33% of time on technical debt; $300B/year developer cost in US alone
Stripe commissioned a survey of 850 developers and engineering executives across six countries to quantify the economic cost of technical debt at a macro level. The headline finding was stark: developers spend an average of 33% of their working time dealing with technical debt and bad code.
The study's methodology: total developer salary spend in each country, multiplied by the self-reported percentage of time attributed to debt-related work. Executives reported a wider gap than developers, executives estimated 33% productivity loss; developers self-reported 23%. Stripe used the executive figure as the basis for their headline number.
Caveat: Self-reported time allocation surveys consistently overestimate time spent on frustrating work. The Stripe figure should be treated as an upper bound. The CAST Research Institute (2018) study using code analysis rather than surveys produced a lower but more methodologically defensible estimate of $1.52 trillion in outstanding debt in the US.
Source: Stripe. (2018). The Developer Coefficient. Stripe / Harris Poll. n=850 developers and executives, US, UK, France, Germany, Singapore, Australia.
CAST research: $3.61 per line of code in debt
CAST Research Institute
Result: Average software system carries $3.61/line in technical debt; 1M-line system = $3.61M
The CAST Research Institute analysed 1,850 enterprise applications (368 million lines of code) from their Appmarq benchmark database to produce the first large-scale empirical estimate of technical debt density, debt measured in dollars per line of code rather than developer self-report.
Key findings from 1,850 applications:
- Average technical debt: $3.61 per line of code (remediation cost basis)
- Range: $2.16 (25th percentile) to $5.42 (75th percentile) per line
- Architectural issues account for 55% of total debt value, despite being fewer violations in count
- Security violations are the fastest-appreciating debt category (interest compounds via breach risk)
A 1M LOC system at $3.61/line = $3.61M in outstanding debt
Interest at 10% compound = $361K/year in productivity loss
Source: CAST Research Institute. (2018). CAST Appmarq: Technical Debt -- Sizing and Estimating Technical Debt. n=1,850 applications, 368M LOC analysed.
DORA state of DevOps: technical debt predicts elite performance
DORA (DevOps Research and Assessment) / Google Cloud
Result: Elite performers spend 50% less time on unplanned work; debt directly predicts DORA cluster
The DORA research programme (now part of Google Cloud) has run the largest longitudinal study of software delivery performance since 2013. The 2019 report was the first to explicitly correlate technical debt levels with DORA performance cluster (elite / high / medium / low).
DORA defines 'unplanned work' as reactive work (incidents, emergency fixes, urgent debt paydown) rather than planned feature development or scheduled refactoring. Low performers spend 50%+ of their time on unplanned work, a direct function of accumulated debt. Elite performers have reduced debt to the point where unplanned work is an exception rather than a baseline.
Source: DORA. (2019). Accelerate State of DevOps Report 2019. Google Cloud. n=31,000+ respondents across 7 years.
McKinsey: tech debt costs 10-20% of every IT budget
McKinsey Digital
Result: 10-20% of budget allocated to new features lost to debt service; 40% figure in legacy-heavy orgs
McKinsey's 2022 survey of 300 global CIOs and CTOs produced the most widely cited executive-level estimate of technical debt burden: companies lose 10-20% of their technology budget to debt service, work spent maintaining or working around outdated systems rather than delivering new value. In organisations with significant legacy estates, this figure rises to 40%.
Three debt archetypes identified:
Monolithic systems, coupled microservices, inappropriate technology choices. Hardest to pay down incrementally.
Manual processes, poor observability, insufficient automation. Often invisible until an incident exposes it.
Code smells, duplication, poor naming. Most visible but lowest ROI to address unless architectural issues are resolved first.
McKinsey's framing is notable for surfacing the opportunity cost lens: 10-20% of budget is not just money spent on bad code, it is innovation capacity that competitors are deploying elsewhere. The study found that companies in the top quartile of tech debt management grew revenue 3x faster than bottom quartile peers over a five-year horizon.
Source: McKinsey Digital. (2022). Demystifying Digital: Seven Questions Every CEO Should Ask About Technology Debt. McKinsey Insights. Survey: 300 global CIOs and CTOs.
Google's internal study: code health predicts velocity
Google Engineering
Result: Files with low code health scores are 9x more likely to have bugs; velocity inversely correlated
Google has published several studies examining the relationship between code health metrics (their internal measure of technical debt) and engineering outcomes. The most cited findings, published in ACM Queue via Adam Tornhill's analysis, show a non-linear relationship between code quality and defect rates:
Code health impact on bug rate (Google internal data)
Health score 1-2 (worst): 9x bug rate vs healthy files
Health score 3-4: 4x bug rate vs healthy files
Health score 5-6: 2x bug rate vs healthy files
Health score 7-10: baseline
The compounding effect: low-health files attract more changes (because they contain bugs and need fixes) which further degrades their health, which generates more bugs. Tornhill's 'hotspot analysis' methodology, intersecting code complexity with change frequency, became the basis for CodeScene's commercial product.
Microsoft's equivalent internal study (published the same year) found that files in the bottom 20% of code health scores accounted for 71% of all defects. This aligns with the Pareto principle applied to technical debt: a small fraction of files holds most of the systemic risk.
Source: Tornhill, A., & Borg, M. (2022). Code Red: The Business Impact of Code Quality. ACM Queue, 20(2). Data sourced from Google, Microsoft internal studies.
CISQ: $2.41 trillion in poor software quality costs (US, 2022)
Consortium for Information and Software Quality (CISQ)
Result: $2.41T total cost of poor software quality in US; $1.52T attributable to operational failures
The CISQ annual 'Cost of Poor Software Quality' report is the most methodologically rigorous macro-level estimate of software quality costs in the US. The 2022 edition (published 2023) put the total at $2.41 trillion, broken into three categories:
CISQ's methodology combines Gartner market sizing data with developer time studies from DORA and software quality benchmarks from CAST Research. It is the most frequently cited figure in board-level discussions of software quality investment because it grounds the conversation in macroeconomic terms that CFOs and CEOs can relate to.
Source: CISQ. (2023). Cost of Poor Software Quality in the US: A 2022 Report. Consortium for Information and Software Quality.
Meta-Analysis
What the evidence agrees on
Finding 01
10-40% productivity drag
DORA, McKinsey, Stripe, and CISQ converge on this range despite different methodologies. McKinsey cites 10-20% for healthy organisations; 40% for legacy-heavy ones.
Finding 02
Architectural debt dominates
CAST, McKinsey, and CISQ all find architectural and design debt accounts for 50-55% of total debt value, despite being a minority of total violations by count.
Finding 03
Compound interest is real
Unchecked debt does not grow linearly. DORA data shows low performers stuck at 50%+ unplanned work, a self-reinforcing state that is difficult to escape without deliberate paydown.
Related Reference Material
Industry Benchmarks
How does your team compare? Percentile tool and cross-source benchmark table.
History of the Term
Full timeline from Ward Cunningham 1992 through AI-assisted coding in 2025.
The Calculation Formula
Three methodologies with interactive calculators grounded in this evidence base.
ReworkCost.com
When debt becomes rework: evidence base for the cost of quality failures.