Architecture

This chapter explains how debtmap’s analysis pipeline works, from discovering files to producing prioritized technical debt signals.

Analysis Pipeline Overview

Debtmap’s analysis follows a multi-stage pipeline that transforms source code into structured signals:

┌─────────────────┐
│ File Discovery  │
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│Language Detection│
└────────┬────────┘
         │
         ▼
    ┌────────┐
    │ Parser │
    └────┬───┘
         │
    ┌────┼────────────┐
    │    │            │
    ▼    ▼            ▼
┌─────┐ ┌──────────┐ ┌───────────┐
│ syn │ │rustpython│ │tree-sitter│
│ AST │ │   AST    │ │    AST    │
└──┬──┘ └────┬─────┘ └─────┬─────┘
   │         │             │
   └─────────┼─────────────┘
             │
             ▼
  ┌──────────────────┐
  │ Metric Extraction │
  └─────────┬────────┘
            │
    ┌───────┼───────┐
    │       │       │
    ▼       ▼       ▼
┌────────┐ ┌─────┐ ┌─────────┐
│Complexity│ │Call │ │ Pattern │
│  Calc   │ │Graph│ │Detection│
└────┬───┘ └──┬──┘ └────┬────┘
     │        │         │
     ▼        │         │
┌─────────┐   │         │
│ Entropy │   │         │
│ Analysis│   │         │
└────┬────┘   │         │
     │        │         │
     ▼        ▼         ▼
┌─────────┐ ┌────────┐ ┌──────┐    ┌──────────┐
│Effective│ │Dependency│ │ Debt │    │   LCOV   │
│Complexity│ │Analysis│ │Class │    │ Coverage │
└────┬────┘ └────┬───┘ └──┬───┘    └────┬─────┘
     │           │        │             │
     └───────────┼────────┼─────────────┘
                 │        │
                 ▼        ▼
           ┌─────────────────┐
           │  Risk Scoring   │
           └────────┬────────┘
                    │
                    ▼
         ┌───────────────────┐
         │Tiered Prioritization│
         └─────────┬─────────┘
                   │
                   ▼
        ┌──────────────────────┐
        │ Context Suggestion   │
        │    Generation        │
        └─────────┬────────────┘
                  │
                  ▼
          ┌────────────────┐
          │Output Formatting│
          └────────┬───────┘
                   │
       ┌───────────┼───────────┐
       │           │           │
       ▼           ▼           ▼
   ┌──────┐   ┌────────┐   ┌─────┐
   │ JSON │   │LLM-MD  │   │Term │
   └──────┘   └────────┘   └─────┘

Key Components

1. File Discovery and Language Detection

Purpose: Identify source files to analyze and determine their language.

How it works:

Walks the project directory tree (respecting .gitignore and .debtmapignore)
Detects language based on file extension (.rs, .py, .js, .ts)
Filters out test files, build artifacts, and vendored dependencies
Groups files by language for parallel processing
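
The extension-based detection and grouping steps can be sketched as follows. This is a minimal illustration, not debtmap's actual code: the `Language` enum and both function names are assumptions made for the example, and ignore-file handling is left out.

```rust
use std::collections::BTreeMap;
use std::path::{Path, PathBuf};

/// Languages named in this chapter (illustrative enum, not debtmap's type).
#[derive(Debug, Clone, Copy, PartialEq, Eq, PartialOrd, Ord)]
pub enum Language {
    Rust,
    Python,
    JavaScript,
    TypeScript,
}

/// Map a file extension to a language; unknown extensions yield `None`.
pub fn detect_language(path: &Path) -> Option<Language> {
    match path.extension()?.to_str()? {
        "rs" => Some(Language::Rust),
        "py" => Some(Language::Python),
        "js" => Some(Language::JavaScript),
        "ts" => Some(Language::TypeScript),
        _ => None,
    }
}

/// Group paths by detected language so each group can be processed together.
/// Files with unrecognized extensions are skipped.
pub fn group_by_language(paths: &[PathBuf]) -> BTreeMap<Language, Vec<PathBuf>> {
    let mut groups: BTreeMap<Language, Vec<PathBuf>> = BTreeMap::new();
    for path in paths {
        if let Some(lang) = detect_language(path) {
            groups.entry(lang).or_default().push(path.clone());
        }
    }
    groups
}
```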

Configuration:

[analysis]
exclude_patterns = ["**/tests/**", "**/target/**", "**/node_modules/**"]
include_patterns = ["src/**/*.rs", "lib/**/*.py"]

2. Parser Layer

Purpose: Convert source code into Abstract Syntax Trees (ASTs) for analysis.

Language-Specific Parsers:

Rust (syn):

Uses the syn crate for full Rust syntax support
Extracts: functions, structs, impls, traits, macros
Handles: async/await, generic types, lifetime annotations
Performance: ~10-20ms per file

Python (rustpython):

Uses rustpython’s parser for Python 3.x syntax
Extracts: functions, classes, methods, decorators
Handles: comprehensions, async/await, type hints
Performance: ~5-15ms per file

JavaScript/TypeScript (tree-sitter):

Uses tree-sitter for JS/TS parsing
Extracts: functions, classes, arrow functions, hooks
Handles: JSX/TSX, decorators, generics
Performance: ~8-18ms per file

Error Handling:

Syntax errors logged but don’t stop analysis
Partial ASTs used when possible
Files with parse errors excluded from final report
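
The log-and-continue behavior described above might look like the sketch below. The `ParseOutcome` type and `parse_all` function are hypothetical names introduced for illustration, and the parser is passed in as a closure so the example stays independent of any particular parsing crate.

```rust
/// Result of parsing a batch of files (hypothetical type for this sketch).
pub struct ParseOutcome<A> {
    /// Successfully parsed files: (path, AST).
    pub parsed: Vec<(String, A)>,
    /// Files that failed to parse: (path, error message).
    pub failed: Vec<(String, String)>,
}

/// Parse every (path, source) pair, logging failures instead of aborting.
pub fn parse_all<A, E: std::fmt::Display>(
    files: &[(String, String)],
    parse: impl Fn(&str) -> Result<A, E>,
) -> ParseOutcome<A> {
    let mut outcome = ParseOutcome { parsed: Vec::new(), failed: Vec::new() };
    for (path, source) in files {
        match parse(source.as_str()) {
            Ok(ast) => outcome.parsed.push((path.clone(), ast)),
            Err(err) => {
                // A syntax error is reported but does not stop the run.
                eprintln!("warning: failed to parse {path}: {err}");
                outcome.failed.push((path.clone(), err.to_string()));
            }
        }
    }
    outcome
}
```

Only the entries in `parsed` would flow on to metric extraction; the `failed` list is kept so the failures can be reported separately.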

3. Metric Extraction

Purpose: Extract raw metrics from ASTs.

Metrics Computed:

Function-Level:

Lines of code (LOC)
Cyclomatic complexity (branch count)
Nesting depth (max indentation level)
Parameter count
Return path count
Comment ratio

File-Level:

Total LOC
Number of functions/classes
Dependency count (imports)
Documentation coverage

Implementation:

pub struct FunctionMetrics {
    pub name: String,
    pub location: Location,
    pub loc: u32,
    pub cyclomatic_complexity: u32,
    pub nesting_depth: u32,
    pub parameter_count: u32,
    pub return_paths: u32,
}

4. Complexity Calculation and Entropy Analysis

Purpose: Compute effective complexity using entropy-adjusted metrics.

Traditional Cyclomatic Complexity:

Count decision points (if, match, loop, etc.)
Each branch adds +1 to complexity
Does not distinguish between repetitive and varied logic

Entropy-Based Adjustment:

Debtmap calculates pattern entropy to adjust cyclomatic complexity:

1. Extract patterns: identify branch structures (e.g., all if/return patterns)
2. Calculate variety: measure the information entropy of the patterns
3. Adjust complexity: reduce the score for low-entropy (repetitive) code

Formula:

Entropy = -Σ(p_i * log2(p_i))

where p_i = frequency of pattern i

Effective Complexity = Cyclomatic * (1 - (1 - Entropy/Max_Entropy) * 0.75)

Example:

// 20 similar if/return statements
// Cyclomatic: 20, Entropy: 0.3, Max_Entropy: 4.32 (log2 of 20)
// Effective: 20 * (1 - (1 - 0.3/4.32) * 0.75) ≈ 6.0

This approach reduces false positives from validation/configuration code while still flagging genuinely complex logic.
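
The two formulas in this section translate directly into code. The sketch below is an illustration under stated assumptions rather than debtmap's implementation: patterns are represented as plain occurrence counts, `max_entropy` is supplied by the caller, and the fallback for a non-positive `max_entropy` is a choice made for the example.

```rust
/// Shannon entropy (in bits) of a list of pattern occurrence counts:
/// Entropy = -Σ(p_i * log2(p_i)), where p_i is the frequency of pattern i.
pub fn pattern_entropy(counts: &[u32]) -> f64 {
    let total: u32 = counts.iter().sum();
    if total == 0 {
        return 0.0;
    }
    counts
        .iter()
        .filter(|&&c| c > 0)
        .map(|&c| {
            let p = f64::from(c) / f64::from(total);
            -p * p.log2()
        })
        .sum()
}

/// Effective Complexity = Cyclomatic * (1 - (1 - Entropy/Max_Entropy) * 0.75).
/// Assumption: with no usable `max_entropy`, the raw cyclomatic value is returned.
pub fn effective_complexity(cyclomatic: u32, entropy: f64, max_entropy: f64) -> f64 {
    if max_entropy <= 0.0 {
        return f64::from(cyclomatic);
    }
    let normalized = (entropy / max_entropy).clamp(0.0, 1.0);
    f64::from(cyclomatic) * (1.0 - (1.0 - normalized) * 0.75)
}
```

With the numbers from the example (cyclomatic 20, entropy 0.3, maximum entropy 4.32), `effective_complexity` returns roughly 6.0. Because the dampening factor is capped at 0.75, even fully repetitive code keeps at least a quarter of its cyclomatic score.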

5. Call Graph Construction

Purpose: Understand function dependencies and identify critical paths.

What’s Tracked:

Function calls within the same file
Cross-file calls (when possible to resolve)
Method calls on structs/classes
Trait/interface implementations

Analysis:

Fan-in: How many functions call this function
Fan-out: How many functions this function calls
Depth: Distance from entry points (main, handlers)
Cycles: Detect recursive calls
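
The fan-in, fan-out, and depth measures can be sketched on a simple adjacency-list graph. The `CallGraph` alias and function names below are assumptions for illustration; debtmap's real call graph types are not shown in this chapter.

```rust
use std::collections::{HashMap, HashSet, VecDeque};

/// Caller name -> names of the functions it calls (illustrative representation).
pub type CallGraph = HashMap<String, Vec<String>>;

/// Number of call edges leaving `function`.
pub fn fan_out(graph: &CallGraph, function: &str) -> usize {
    graph.get(function).map_or(0, |callees| callees.len())
}

/// Number of distinct callers that call `function`.
pub fn fan_in(graph: &CallGraph, function: &str) -> usize {
    graph
        .values()
        .filter(|callees| callees.iter().any(|c| c.as_str() == function))
        .count()
}

/// Breadth-first distance from `entry` to `target`; `None` if unreachable.
/// The `seen` set ensures recursive calls (cycles) cannot loop forever.
pub fn depth_from<'a>(graph: &'a CallGraph, entry: &'a str, target: &str) -> Option<usize> {
    let mut seen: HashSet<&'a str> = HashSet::from([entry]);
    let mut queue: VecDeque<(&'a str, usize)> = VecDeque::from([(entry, 0)]);
    while let Some((current, depth)) = queue.pop_front() {
        if current == target {
            return Some(depth);
        }
        for callee in graph.get(current).into_iter().flatten() {
            if seen.insert(callee.as_str()) {
                queue.push_back((callee.as_str(), depth + 1));
            }
        }
    }
    None
}
```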

Usage:

Prioritize functions called from many untested paths
Identify central functions (high fan-in/fan-out)
Detect test coverage gaps in critical paths

Limitations:

Dynamic dispatch not fully resolved
Cross-crate calls require additional analysis
Closures and function pointers approximated

6. Pattern Detection and Debt Classification

Purpose: Identify specific technical debt patterns.

Debt Categories:

Test Gaps:

Functions with 0% coverage and high complexity
Untested error paths
Missing edge case tests

Complexity Issues:

Functions exceeding the complexity threshold (default: 10)
Deep nesting (3+ levels)
Long functions (200+ LOC)

Design Smells:

God functions (high fan-out)
Unused code (fan-in = 0)
Circular dependencies

Implementation:

pub enum DebtType {
    TestingGap { coverage: f64, cyclomatic: u32, cognitive: u32 },
    ComplexityHotspot { cyclomatic: u32, cognitive: u32 },
    DeadCode { visibility: FunctionVisibility, cyclomatic: u32, cognitive: u32, usage_hints: Vec<String> },
    GodObject { methods: u32, fields: Option<u32>, responsibilities: u32, god_object_score: f64, lines: u32 },
    // ... 35 total variants
}
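
As a rough illustration of how the complexity thresholds listed earlier in this section could be applied, the sketch below checks one function's metrics against them. The `ComplexityIssue` enum and `complexity_issues` function are hypothetical and much simpler than the `DebtType` variants above; the boundary conventions ("exceeding" as `>`, "3+" and "200+" as `>=`) are read from the wording of the list.

```rust
/// Thresholds quoted in this section.
pub const COMPLEXITY_THRESHOLD: u32 = 10; // "exceeding" => strictly greater
pub const NESTING_THRESHOLD: u32 = 3; // "3+ levels"
pub const LENGTH_THRESHOLD: u32 = 200; // "200+ LOC"

/// Simplified issue kinds (hypothetical, for illustration only).
#[derive(Debug, PartialEq, Eq)]
pub enum ComplexityIssue {
    HighComplexity,
    DeepNesting,
    LongFunction,
}

/// Return every complexity issue that applies to one function's metrics.
pub fn complexity_issues(cyclomatic: u32, nesting_depth: u32, loc: u32) -> Vec<ComplexityIssue> {
    let mut issues = Vec::new();
    if cyclomatic > COMPLEXITY_THRESHOLD {
        issues.push(ComplexityIssue::HighComplexity);
    }
    if nesting_depth >= NESTING_THRESHOLD {
        issues.push(ComplexityIssue::DeepNesting);
    }
    if loc >= LENGTH_THRESHOLD {
        issues.push(ComplexityIssue::LongFunction);
    }
    issues
}
```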

7. Coverage Integration

Purpose: Map test coverage data to complexity metrics for risk scoring.

Coverage Data Flow:

1. Read LCOV file: parse the coverage report produced by test runners
2. Map to source: match coverage lines to functions/branches
3. Calculate coverage %: for each function, compute:
   - Line coverage: % of lines executed
   - Branch coverage: % of branches taken
4. Identify gaps: find untested branches in complex functions
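
The "map to source" and "calculate coverage" steps can be illustrated for line coverage. The sketch below reads the standard LCOV `SF:`, `DA:<line>,<hits>`, and `end_of_record` records; the function name and the idea of passing a function's line range directly are assumptions made for the example, and branch (`BRDA:`) records are ignored.

```rust
/// Line coverage (0-100) for the inclusive range `start..=end` of `file`,
/// or `None` if the report has no line data for that range.
pub fn line_coverage(lcov: &str, file: &str, start: u32, end: u32) -> Option<f64> {
    let mut in_file = false;
    let (mut covered, mut total) = (0u32, 0u32);
    for line in lcov.lines().map(str::trim) {
        if let Some(path) = line.strip_prefix("SF:") {
            // Start of a record for one source file.
            in_file = path == file;
        } else if line == "end_of_record" {
            in_file = false;
        } else if in_file {
            if let Some(rest) = line.strip_prefix("DA:") {
                // DA:<line number>,<execution count>
                let mut parts = rest.split(',');
                let line_no: Option<u32> = parts.next().and_then(|s| s.parse().ok());
                let hits: Option<u64> = parts.next().and_then(|s| s.parse().ok());
                if let (Some(n), Some(h)) = (line_no, hits) {
                    if (start..=end).contains(&n) {
                        total += 1;
                        if h > 0 {
                            covered += 1;
                        }
                    }
                }
            }
        }
    }
    if total == 0 {
        None
    } else {
        Some(100.0 * f64::from(covered) / f64::from(total))
    }
}
```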

Coverage Scoring:

pub struct CoverageMetrics {
    pub lines_covered: u32,
    pub lines_total: u32,
    pub branches_covered: u32,
    pub branches_total: u32,
    pub coverage_percent: f64,
}

Special Cases:

Entry points (main, handlers) expect integration test coverage
Generated code excluded from coverage requirements
Test files themselves not analyzed for coverage

8. Risk Scoring

Purpose: Combine complexity and coverage into a unified risk score.

Risk Formula:

Risk Score = (Effective Complexity * Coverage Gap Weight) + (Call Graph Depth * Path Weight)

where:
- Effective Complexity: Entropy-adjusted cyclomatic complexity
- Coverage Gap Weight: 1.0 for 0% coverage, decreasing to 0.1 for 95%+
- Call Graph Depth: Distance from entry points
- Path Weight: Number of untested paths leading to this function

Example Calculation:

Function: calculate_risk_score()
  Effective Complexity: 8.5
  Coverage: 30%
  Coverage Gap Weight: 0.7
  Call Graph Depth: 3
  Untested Paths: 2

  Risk = (8.5 * 0.7) + (3 * 2 * 0.3) = 5.95 + 1.8 = 7.75
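
The calculation can be sketched in code. Two points are assumptions rather than documented behavior: the coverage gap weight is modeled as the uncovered fraction with a floor of 0.1 (which gives 0.7 at 30% coverage), and the extra factor applied to the path term (0.3 in the example above) is exposed as a `path_scale` parameter because the formula does not define it.

```rust
/// Coverage gap weight for a coverage fraction in 0.0..=1.0.
/// Assumption: linear in the uncovered fraction, never below 0.1.
pub fn coverage_gap_weight(coverage: f64) -> f64 {
    (1.0 - coverage.clamp(0.0, 1.0)).max(0.1)
}

/// Risk = (effective complexity * gap weight) + (depth * untested paths * scale).
/// `path_scale` is a hypothetical parameter standing in for the unexplained
/// factor used in the worked example.
pub fn risk_score(
    effective_complexity: f64,
    coverage: f64,
    call_graph_depth: u32,
    untested_paths: u32,
    path_scale: f64,
) -> f64 {
    let complexity_term = effective_complexity * coverage_gap_weight(coverage);
    let path_term = f64::from(call_graph_depth) * f64::from(untested_paths) * path_scale;
    complexity_term + path_term
}
```

Calling `risk_score(8.5, 0.30, 3, 2, 0.3)` reproduces the 7.75 from the example calculation.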

Risk Tiers (Unified Score 0-10):

Critical (9.0-10.0): Severe risk requiring immediate attention
High (7.0-8.9): Significant risk, address this sprint
Medium (5.0-6.9): Moderate risk, plan for next sprint
Low (3.0-4.9): Minor risk, monitor
Minimal (0.0-2.9): Well-managed code

9. Tiered Prioritization

Purpose: Classify and rank technical debt items by severity.

Prioritization Algorithm:

1. Calculate base risk score (from Risk Scoring step)
2. Apply context adjustments:
   - Entry points: -2.0 score (integration test coverage expected)
   - Core business logic: +1.5 score (higher priority)
   - Frequently changed files: +1.0 score (git history analysis)
   - Critical paths: +0.5 score per untested caller
3. Classify into tiers:
   - Critical: score >= 9.0
   - High: score >= 7.0
   - Medium: score >= 5.0
   - Low: score >= 3.0
   - Minimal: score < 3.0
4. Sort within tiers by:
   - Severity score
   - Coupling impact
   - File location (group related items)
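
The adjustment and classification steps can be sketched as below. The `Context` struct and its field names are hypothetical, and clamping the adjusted score to the 0-10 range is an assumption based on the "Unified Score 0-10" scale; the sort step is omitted.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Tier {
    Critical,
    High,
    Medium,
    Low,
    Minimal,
}

/// Context flags for one debt item (hypothetical type for this sketch).
pub struct Context {
    pub is_entry_point: bool,
    pub is_core_logic: bool,
    pub frequently_changed: bool,
    pub untested_callers: u32,
}

/// Apply the context adjustments listed above to a base risk score.
pub fn adjusted_score(base: f64, ctx: &Context) -> f64 {
    let mut score = base;
    if ctx.is_entry_point {
        score -= 2.0;
    }
    if ctx.is_core_logic {
        score += 1.5;
    }
    if ctx.frequently_changed {
        score += 1.0;
    }
    score += 0.5 * f64::from(ctx.untested_callers);
    // Assumption: keep the result on the 0-10 unified scale.
    score.clamp(0.0, 10.0)
}

/// Map a score to its tier using the thresholds listed above.
pub fn classify_tier(score: f64) -> Tier {
    if score >= 9.0 {
        Tier::Critical
    } else if score >= 7.0 {
        Tier::High
    } else if score >= 5.0 {
        Tier::Medium
    } else if score >= 3.0 {
        Tier::Low
    } else {
        Tier::Minimal
    }
}
```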

Output:

pub struct PrioritizedDebtItem {
    pub rank: u32,
    pub score: f64,
    pub tier: Tier,
    pub location: Location,
    pub debt_type: DebtType,
    pub metrics: ComplexityMetrics,
    pub coverage: Option<CoverageMetrics>,
    pub context: ContextSuggestion,
}

See Tiered Prioritization for a detailed explanation of the ranking algorithm.

10. Context Suggestion Generation

Purpose: Provide AI agents with specific file ranges to read for understanding the debt item.

Context Types:

Primary Context:

The function/struct where debt is located
Start and end line numbers
File path

Related Context:

Callers: Functions that call this function
Callees: Functions this function calls
Tests: Existing test files that cover related code
Types: Struct/enum definitions used by this function

Selection Algorithm:

1. Include primary location (always)
2. Add top 3 callers by call frequency
3. Add callees that are untested
4. Add test files with matching function names
5. Limit total context to ~500 lines (configurable)
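
One way to read the selection steps is sketched below. All type and function names are hypothetical, and the budget policy (skip any range that would push the total over the limit, but always keep the primary range) is an assumption; the chapter does not say how the limit is enforced.

```rust
/// An inclusive line range in a file (hypothetical type; assumes start <= end).
#[derive(Debug, Clone, PartialEq)]
pub struct LineRange {
    pub file: String,
    pub start: u32,
    pub end: u32,
}

impl LineRange {
    pub fn new(file: &str, start: u32, end: u32) -> Self {
        LineRange { file: file.to_string(), start, end }
    }

    /// Number of lines covered by the range.
    pub fn len(&self) -> u32 {
        self.end.saturating_sub(self.start) + 1
    }
}

#[derive(Debug, Clone)]
pub struct Caller {
    pub range: LineRange,
    pub call_count: u32,
}

#[derive(Debug, Clone)]
pub struct Callee {
    pub range: LineRange,
    pub tested: bool,
}

/// Primary range first, then the top 3 callers by call count, untested
/// callees, and matching tests, as long as each fits in `max_lines`.
pub fn select_context(
    primary: LineRange,
    mut callers: Vec<Caller>,
    callees: Vec<Callee>,
    tests: Vec<LineRange>,
    max_lines: u32,
) -> Vec<LineRange> {
    callers.sort_by(|a, b| b.call_count.cmp(&a.call_count));
    let candidates = callers
        .into_iter()
        .take(3)
        .map(|c| c.range)
        .chain(callees.into_iter().filter(|c| !c.tested).map(|c| c.range))
        .chain(tests);

    let mut used = primary.len();
    let mut selected = vec![primary];
    for range in candidates {
        if used + range.len() <= max_lines {
            used += range.len();
            selected.push(range);
        }
    }
    selected
}
```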

Output Format:

CONTEXT:
├─ Primary: src/parser.rs:38-85
├─ Caller: src/handler.rs:100-120 (calls 12x)
├─ Caller: src/api.rs:45-60 (calls 8x)
├─ Callee: src/tokenizer.rs:15-40 (untested)
└─ Test: tests/parser_test.rs:50-120

11. Output Formatting

Purpose: Present analysis results in formats optimized for different consumers.

Output Formats:

LLM Markdown (--format markdown):

Structured for minimal token usage
Context suggestions included inline
Metrics in consistent tabular format
Designed for piping to AI assistants

JSON (--format json):

Machine-readable for CI/CD integration
Full metadata for each debt item
Stable schema for programmatic consumption
Schema-versioned for compatibility

Terminal (--format terminal):

Color-coded by tier (red=critical, yellow=high, etc.)
Hierarchical tree view with unicode box characters
Progress bars for analysis phases
Summary statistics at top

Markdown (--format markdown):

Rendered in GitHub/GitLab for PR comments
Embedded code blocks with syntax highlighting
Collapsible details sections
Linked to source code locations

See Output Formats for examples and configuration options.

Data Flow Example

Let’s trace a single function through the entire pipeline:

Input: Source File

// src/handlers.rs
pub fn process_request(req: Request) -> Result<Response> {
    validate_auth(&req)?;
    let data = parse_payload(&req.body)?;
    let result = apply_business_logic(data)?;
    format_response(result)
}

Stage 1: Parsing

FunctionAst {
    name: "process_request",
    location: Location { file: "src/handlers.rs", line: 2 },
    calls: ["validate_auth", "parse_payload", "apply_business_logic", "format_response"],
    ...
}

Stage 2: Metric Extraction

FunctionMetrics {
    name: "process_request",
    cyclomatic_complexity: 4,  // 3 ?-operators + base
    nesting_depth: 1,
    loc: 5,
    ...
}

Stage 3: Entropy Analysis

// Pattern: repetitive ?-operator error handling
Entropy: 0.4 (low variety)
Effective Complexity: 4 * 0.85 = 3.4

Stage 4: Call Graph

CallGraphNode {
    function: "process_request",
    fan_in: 3,  // called from 3 handlers
    fan_out: 4,  // calls 4 functions
    depth: 1,  // direct handler (entry point)
}

Stage 5: Coverage (from LCOV)

CoverageMetrics {
    lines_covered: 5,
    lines_total: 5,
    branches_covered: 3,
    branches_total: 4,  // Missing one error path
    coverage_percent: 75%,
}

Stage 6: Risk Scoring

Risk = (3.4 * 0.25) + (1 * 1 * 0.2) = 0.85 + 0.2 = 1.05
Tier: MINIMAL (entry point with decent coverage)

Stage 7: Context Suggestion

CONTEXT:
├─ Primary: src/handlers.rs:2-6
├─ Callee: src/auth.rs:15-30 (validate_auth)
└─ Test: tests/integration/handlers_test.rs:10-25

Stage 8: Output

#23 SCORE: 1.1 [MINIMAL]
├─ src/handlers.rs:2 process_request()
├─ COMPLEXITY: cyclomatic=4, cognitive=3, nesting=1
├─ COVERAGE: 75% (1 branch untested)
└─ CONTEXT: Primary + 1 callee + 1 test file

Performance Characteristics

Analysis Speed:

Small project (< 10k LOC): 1-3 seconds
Medium project (10-50k LOC): 5-15 seconds
Large project (50-200k LOC): 20-60 seconds
Very large project (200k+ LOC): 1-5 minutes

Parallelization:

File parsing: Parallel across all available cores
Metric extraction: Parallel per-file
Call graph construction: Parallel with work stealing
Risk scoring: Parallel per-function
Output formatting: Sequential

Memory Usage:

Approximately 100-200 KB per file analyzed
Peak memory for large projects: 500 MB - 1 GB
Streaming mode available for very large codebases

Optimization Strategies:

Skip unchanged files (git diff integration)
Parallel processing with rayon
Efficient AST traversal (visitor pattern)
Memory-efficient streaming for large codebases

Extension Points

Custom Analyzers: Implement the Analyzer trait to add language support:

pub trait Analyzer {
    fn parse(&self, content: &str) -> Result<Ast>;
    fn extract_metrics(&self, ast: &Ast) -> Vec<FunctionMetrics>;
    fn detect_patterns(&self, ast: &Ast) -> Vec<DebtPattern>;
}

Custom Scoring: Implement the RiskScorer trait to adjust scoring logic:

pub trait RiskScorer {
    fn calculate_risk(&self, metrics: &FunctionMetrics, coverage: &CoverageMetrics) -> f64;
    fn classify_tier(&self, score: f64) -> Tier;
}
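
A custom scorer might be implemented along these lines. To keep the sketch self-contained, the trait is repeated and `FunctionMetrics`, `CoverageMetrics`, and `Tier` are reduced to simplified stand-ins; `CoverageFirstScorer` and its scoring rule are hypothetical, and `coverage_percent` is assumed to be on a 0-100 scale.

```rust
// Simplified stand-ins for the types shown in this chapter.
pub struct FunctionMetrics {
    pub cyclomatic_complexity: u32,
}

pub struct CoverageMetrics {
    pub coverage_percent: f64, // assumed to be 0-100
}

#[derive(Debug, PartialEq, Eq)]
pub enum Tier {
    Critical,
    High,
    Medium,
    Low,
    Minimal,
}

pub trait RiskScorer {
    fn calculate_risk(&self, metrics: &FunctionMetrics, coverage: &CoverageMetrics) -> f64;
    fn classify_tier(&self, score: f64) -> Tier;
}

/// Hypothetical scorer: complexity weighted by the uncovered fraction,
/// capped at 10 to stay on the unified scale.
pub struct CoverageFirstScorer;

impl RiskScorer for CoverageFirstScorer {
    fn calculate_risk(&self, metrics: &FunctionMetrics, coverage: &CoverageMetrics) -> f64 {
        let uncovered = 1.0 - (coverage.coverage_percent / 100.0).clamp(0.0, 1.0);
        (f64::from(metrics.cyclomatic_complexity) * uncovered).min(10.0)
    }

    fn classify_tier(&self, score: f64) -> Tier {
        match score {
            s if s >= 9.0 => Tier::Critical,
            s if s >= 7.0 => Tier::High,
            s if s >= 5.0 => Tier::Medium,
            s if s >= 3.0 => Tier::Low,
            _ => Tier::Minimal,
        }
    }
}
```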

Custom Output: Implement the OutputFormatter trait for new formats:

pub trait OutputFormatter {
    fn format(&self, items: &[PrioritizedDebtItem]) -> Result<String>;
}
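
As an illustration, a formatter that emits tab-separated rows could look like the sketch below. The `PrioritizedDebtItem` here is a cut-down stand-in with only the fields the example needs, `TsvFormatter` is a hypothetical name, and `std::fmt::Error` is used as the error type because the chapter does not say which `Result` alias the trait uses.

```rust
use std::fmt::Write;

/// Cut-down stand-in for the struct shown earlier in this chapter.
pub struct PrioritizedDebtItem {
    pub rank: u32,
    pub score: f64,
    pub file: String,
    pub line: u32,
}

pub trait OutputFormatter {
    fn format(&self, items: &[PrioritizedDebtItem]) -> Result<String, std::fmt::Error>;
}

/// Hypothetical formatter: one header row, then one tab-separated row per item.
pub struct TsvFormatter;

impl OutputFormatter for TsvFormatter {
    fn format(&self, items: &[PrioritizedDebtItem]) -> Result<String, std::fmt::Error> {
        let mut out = String::from("rank\tscore\tlocation\n");
        for item in items {
            // Scores are printed with one decimal place.
            writeln!(out, "{}\t{:.1}\t{}:{}", item.rank, item.score, item.file, item.line)?;
        }
        Ok(out)
    }
}
```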

Next Steps

Understand prioritization: See Tiered Prioritization
Learn scoring strategies: See Scoring Strategies
Configure analysis: See Configuration
Integrate with AI: See LLM Integration
