Bayesian forgiver strategy #1478

hodgesmr · 2025-12-10T09:00:54Z

Note: This strategy was ideated and developed with the assistance of Claude Code. I did not see an LLM policy in the repository, but I felt it was important to note that, given open source sensitivities to those tools. I did manual review, edits, and iterations on the implementation and testing.

Summary

This PR introduces BayesianForgiver, a strategy that uses Bayesian inference to model opponent behavior and make adaptive forgiveness decisions based on uncertainty.

Motivation

While the Axelrod library contains many strategies, none use Bayesian opponent modeling to inform decision-making. Existing forgiveness strategies use fixed thresholds or probabilities, regardless of confidence in their assessment of the opponent.

BayesianForgiver introduces uncertainty-aware decision making to the IPD strategy landscape.

Key Features

BayesianForgiver maintains a Beta distribution representing beliefs about the opponent's cooperation probability. It uses both the mean (expected cooperation rate) and variance (uncertainty) to make decisions:

More cautious when uncertain about the opponent (requires stronger evidence of cooperation to forgive a defection)
More punishing when certain the opponent is hostile (confident exploitation)
Adaptive threshold that adjusts forgiveness based on confidence in opponent model

This approach provides noise resilience and adaptive behavior compared to fixed-threshold strategies.

Algorithm

Maintain a Beta(α, β) distribution for opponent's cooperation probability
Start with Beta(1, 1) - neutral/uniform prior
Update after each round:
- If opponent cooperates: α += 1
- If opponent defects: β += 1
Calculate statistics:
- Mean cooperation probability: μ = α / (α + β)
- Uncertainty (std dev): σ = √(αβ / ((α+β)²(α+β+1)))
Adaptive forgiveness threshold:
- threshold = base_threshold + uncertainty_factor × σ
Decision:
- If opponent cooperated last round: Cooperate (reciprocate)
- If opponent defected last round:
  - If μ ≥ threshold: Cooperate (forgive)
  - Otherwise: Defect (punish)

Implementation Details

Computational complexity: O(1) per round (constant time Beta updates)
Memory depth: Infinite (uses full history via Bayesian updates)
Stochastic: No (deterministic given opponent history)
Parameters (optimized via grid search over 144 configurations):

prior_alpha=1.0
prior_beta=1.0
base_forgiveness_threshold=0.45 # Base component of forgiveness threshold
uncertainty_factor=2.5 # Increases caution under uncertainty (raises threshold early)


## Testing

All tests pass:
```bash
python -m pytest axelrod/tests/strategies/test_bayesian_forgiver.py
# 32 passed in 4.35s

Tests cover:

Initial strategy behavior
Behavior against classic strategies (TFT, Grudger, Cooperator, Defector, etc.)
Response to various opponent patterns (alternating, mixed, single defection)
Parameter customization
Clone and reset functionality

drvinceknight · 2025-12-10T10:23:10Z

Thanks for this.

The CI is failing because line 140 of this file is not hit by a test.

 axelrod/strategies/bayesian_forgiver.py                             34      1    97%   140

…all cooperation rate is high enough

hodgesmr · 2025-12-10T18:30:03Z

Thanks @drvinceknight - added the necessary test and also ran the required black formatter.

drvinceknight · 2025-12-11T09:48:37Z

axelrod/data/all_classifiers.yml

@@ -1,12 +1,12 @@
-$\\phi$:
+$\phi$:


Can you help me understand what is happening with this file @hodgesmr ? Was this done by running https://github.com/Axelrod-Python/Axelrod/blob/dev/axelrod/classifier.py#L91 ?

This diff is from the commit: c2619eb

I was following the documentation for adding a new strategy: "To classify the new strategy, run rebuild_classifier_table: python rebuild_classifier_table.py"

hodgesmr · 2025-12-15T10:13:14Z

I've decided to close this PR, especially as #1479 is being considered by the maintainers. I want to make sure contributions are accurate and productive, and I don't want this to distract or interfere with that. Thank you @drvinceknight and the other maintainers for the thoughtful time spent on this.

hodgesmr added 6 commits December 10, 2025 02:05

create Bayesian Forgiver strategy

3439f0b

added BayesianForgiver to _strategies

afe61cf

ran rebuild_classifier_table.py to update all_classifiers.yml

c2619eb

added Bayesian Forgiver to strategy index

b341f8c

bayesian forgiver tests

3ec848a

increment in index.rst

15a8656

drvinceknight mentioned this pull request Dec 10, 2025

LLM Policy document? #1479

Closed

hodgesmr added 2 commits December 10, 2025 10:55

Add test where the opponent defects but we forgive because their over…

2436797

…all cooperation rate is high enough

ran black formatter per repo guidelines

097d501

drvinceknight reviewed Dec 11, 2025

View reviewed changes

comment and docstring cleanup

3a4e569

hodgesmr closed this Dec 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bayesian forgiver strategy #1478

Bayesian forgiver strategy #1478

Uh oh!

hodgesmr commented Dec 10, 2025 •

edited

Loading

Uh oh!

drvinceknight commented Dec 10, 2025

Uh oh!

hodgesmr commented Dec 10, 2025

Uh oh!

drvinceknight Dec 11, 2025

Uh oh!

hodgesmr Dec 11, 2025

Uh oh!

drvinceknight Dec 11, 2025

Uh oh!

hodgesmr commented Dec 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		@@ -1,12 +1,12 @@
		$\\phi$:
		$\phi$:

Bayesian forgiver strategy #1478

Bayesian forgiver strategy #1478

Uh oh!

Conversation

hodgesmr commented Dec 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Motivation

Key Features

Algorithm

Implementation Details

Uh oh!

drvinceknight commented Dec 10, 2025

Uh oh!

hodgesmr commented Dec 10, 2025

Uh oh!

drvinceknight Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

hodgesmr Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

drvinceknight Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

hodgesmr commented Dec 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

hodgesmr commented Dec 10, 2025 •

edited

Loading