MapReduce Troubleshooting Guide¶

Commit Validation Failures¶

Overview¶

Prodigy enforces commit validation for MapReduce agent commands marked with commit_required: true. This prevents silent data loss from agents that complete without creating the expected commits.
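
Conceptually, a check of this kind amounts to comparing the worktree's HEAD before and after the command runs. The sketch below illustrates that idea in plain shell. It is not Prodigy's implementation, and the function name `run_with_commit_check` is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: run a command and fail if it did not move HEAD.
# Illustrates the idea behind commit validation; not Prodigy's actual code.
run_with_commit_check() {
  # HEAD does not exist yet in a brand-new repository, hence the fallback value.
  before=$(git rev-parse --verify -q HEAD || echo "none")
  "$@" || return 1
  after=$(git rev-parse --verify -q HEAD || echo "none")
  if [ "$before" = "$after" ]; then
    echo "Commit required but no commit was created" >&2
    return 2
  fi
}
```

Called as `run_with_commit_check sh -c 'echo "test" > file.txt'`, the helper returns a non-zero status because the command writes a file but never commits it.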

Common Symptoms¶

Agent Failure Message:

Agent execution failed: Commit required but no commit was created

Worktree: /path/to/worktree/agent-1
Expected behavior: Command should create at least one git commit
Command: shell: echo "test" > file.txt

DLQ Entry:

{
  "error_type": "CommitValidationFailed",
  "manual_review_required": true,
  "failure_history": [{
    "error_message": "Commit validation failed",
    "json_log_location": "/path/to/claude/logs/session-xyz.json"
  }]
}

Root Causes¶

1. Missing `git add` or `git commit` Commands¶

Problem: Agent creates/modifies files but doesn't commit them.

Example:

agent_template:
  - shell: |
      echo "content" > file.txt
      # Missing: git add file.txt
      # Missing: git commit -m "message"
    commit_required: true

Solution:

agent_template:
  - shell: |
      echo "content" > file.txt
      git add file.txt
      git commit -m "Add file.txt"
    commit_required: true
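
One way to catch this case early is to end the shell step with a guard that fails when the worktree still holds uncommitted changes. This is a minimal sketch: the function name `assert_clean_worktree` is hypothetical, while `git status --porcelain` is standard git.

```shell
#!/bin/sh
# Hypothetical guard: fail when files were created or modified but not committed.
assert_clean_worktree() {
  # --porcelain prints one line per changed or untracked path; empty output means clean.
  pending=$(git status --porcelain)
  if [ -n "$pending" ]; then
    echo "Uncommitted changes remain:" >&2
    echo "$pending" >&2
    return 1
  fi
}
```

Ignored files do not appear in `git status --porcelain`, so output that is deliberately excluded by `.gitignore` will not trip the guard.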

2. Conditional Logic That Skips Commits¶

Problem: Agent has conditional logic that sometimes skips commit creation.

Example:

agent_template:
  - shell: |
      if [ "${item.type}" = "process" ]; then
        echo "content" > file.txt
        git add file.txt
        git commit -m "Process ${item.id}"
      else
        echo "Skipping item ${item.id}"
        # No commit created for non-process items
      fi
    commit_required: true

Solution: Choose one of the following approaches.

Remove the commit_required flag when commits are conditional:

agent_template:
  - shell: |
      if [ "${item.type}" = "process" ]; then
        echo "content" > file.txt
        git add file.txt
        git commit -m "Process ${item.id}"
      fi
    # No commit_required flag

Filter items before map phase to ensure all agents create commits:

map:
  filter: "item.type == 'process'"  # Only process items that need commits
  agent_template:
    - shell: |
        echo "content" > file.txt
        git add file.txt
        git commit -m "Process ${item.id}"
      commit_required: true

3. Command Fails Before Reaching Commit¶

Problem: Command fails early, never reaching the commit statement.

Example:

agent_template:
  - shell: |
      some-command-that-fails
      git add file.txt
      git commit -m "message"  # Never reached
    commit_required: true

Solution: Use on_failure handlers or fix the failing command:

agent_template:
  - shell: |
      some-command || exit 1
    on_failure:
      shell: |
        echo "Command failed, creating fallback commit"
        echo "error" > error.log
        git add error.log
        git commit -m "Failed to process ${item.id}"
  - shell: |
      git add file.txt
      git commit -m "Process complete"
    commit_required: true
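
The same fallback idea can also be kept inside a single shell step by branching on the command's exit status instead of using an on_failure handler. The sketch below assumes that approach is acceptable for your workflow. The function name `process_or_record_failure` and the commit messages are illustrative only.

```shell
#!/bin/sh
# Hypothetical helper: run the real work and commit its result; if the work fails,
# record the failure in error.log and commit that instead.
process_or_record_failure() {
  item_id=$1
  shift
  if "$@"; then
    git add -A
    git commit -q -m "Process $item_id"
  else
    # In the else branch, $? still holds the exit status of the failed command.
    status=$?
    echo "command failed with exit code $status" > error.log
    git add error.log
    git commit -q -m "Failed to process $item_id"
  fi
}
```

If the command succeeds without changing any files, `git commit` itself fails with nothing to commit, which is the situation covered under root cause 4.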

4. Empty Commits (Nothing to Commit)¶

Problem: Agent tries to commit but has no changes staged.

Example:

agent_template:
  - shell: |
      # This might not modify anything
      echo "test" > file.txt
      rm file.txt  # Undo the change
      git add .
      git commit -m "message"  # Fails: nothing to commit
    commit_required: true

Solution: Use --allow-empty or ensure changes exist:

agent_template:
  - shell: |
      echo "test ${item.id}" > "file-${item.id}.txt"
      git add .
      if git diff --cached --quiet; then
        # No changes, create empty commit
        git commit --allow-empty -m "No changes for ${item.id}"
      else
        git commit -m "Process ${item.id}"
      fi
    commit_required: true

Debugging Steps¶

1. Check Agent Worktree State¶

# Navigate to the agent's worktree (from error message)
cd /path/to/worktree/agent-1

# Check git status
git status

# Check commit history
git log --oneline

# Check for uncommitted changes
git diff
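
To see at a glance whether the agent added anything on top of its starting point, it can help to count the commits that are reachable from HEAD but not from the ref the worktree was created from. This is a sketch under that assumption: you need to know the base ref yourself, and the function name `commits_since` is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: count commits on HEAD that are not reachable from a base ref.
# A result of 0 means no new commits were created since that ref.
commits_since() {
  base_ref=$1
  git rev-list --count "${base_ref}..HEAD"
}
```

For example, `commits_since main` prints `0` in a worktree branched from `main` where the agent never committed.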

2. Review Claude JSON Log¶

The json_log_location field of the DLQ entry gives the path to the Claude JSON log file, which contains the complete execution trace:

# View the log
jq . /path/to/claude/logs/session-xyz.json

# Extract messages that contain at least one tool_use entry
jq '.messages[] | select(any(.content[]?; .type == "tool_use"))' /path/to/claude/logs/session-xyz.json

3. Check DLQ for Pattern Analysis¶

# View DLQ items for the job
prodigy dlq show <job-id>

# Look for commit validation failures
prodigy dlq show <job-id> | jq '.items[] | select(.failure_history[].error_type == "CommitValidationFailed")'

# Analyze failure patterns
prodigy dlq analyze <job-id>

4. Test Agent Command Manually¶

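# Create a throwaway repository to stand in for the agent worktree
mkdir test-agent
cd test-agent
git init

# Set up test item data. Shell variable names cannot contain dots, so
# substitute these by hand wherever the template uses ${item.id} or ${item.type}
ITEM_ID=1
ITEM_TYPE=process

# Run the body of the agent's shell command manually (without the YAML wrapper)
echo "content" > file.txt
git add file.txt
git commit -m "Test commit"

# Check if commit was created
git log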

Prevention Best Practices¶

Use commit_required Sparingly

Only mark commands as commit_required when you genuinely expect a commit.
For optional commits, use on_failure handlers instead.

Test Workflows with Dry-Run Mode

```
prodigy run workflow.yml --dry-run
```

Use Filters to Ensure Commit Eligibility

map:
  filter: "item.needs_commit == true"
  agent_template:
    - shell: |
        process-and-commit.sh
      commit_required: true

Add Explicit Validation

agent_template:
  - shell: |
      process-item.sh
      git add .
      git commit -m "Process ${item.id}"
    commit_required: true
  - shell: |
      # Verify the latest commit is the one this agent was expected to create
      if ! git log -1 --oneline | grep -q "Process ${item.id}"; then
        echo "ERROR: Commit validation failed"
        exit 1
      fi

Related Documentation¶

Command-Level Options - Details on the commit_required flag
MapReduce Event Tracking - Understanding event streams
Dead Letter Queue - Managing failed items
