Introduction

This third edition is presented as a focused interlude to formally introduce an essential building block of Bayesian modeling: the Beta-Binomial distribution. This distribution is not only elegant from a mathematical perspective, but also plays a central role in the Bayesian monitoring of binomial outcomes, such as interim analyses in clinical trials.

In the previous edition, we explored how prior knowledge and observed data combine in a Bayesian framework to yield posterior beliefs. One common setting involved binary outcomes, success/failure, event/no event. In many such applications, Beta distributions serve as priors and Binomial distributions as likelihoods, resulting in a Beta posterior.

In this edition, we go one step further by showing how this leads naturally to the Beta-Binomial distribution, which captures the distribution of future events when the probability of success itself is uncertain.


1. Motivation

Imagine we are drawing balls from an urn, each time with replacement. The probability ( \theta ) of drawing a red ball is not known exactly, but based on a preliminary study (or domain expertise), we believe it follows a Beta distribution:

\(\theta \sim \text{Beta}(\alpha, \beta)\)

Now, suppose we plan to perform \(n\) future draws from the urn and count how often a red ball is observed.

Then the distribution of the number of red balls \(X \in \{0,1,\dots,n\}\) follows a Beta-Binomial distribution.


2. Definition

We say that:

\(X \sim \text{Beta-Binomial}(n, \alpha, \beta)\)

if the probability mass function is given by:

\(P(X = x) = \displaystyle {n \choose k} \cdot \displaystyle \frac{B(\alpha + x, \beta + n – x)}{B(\alpha, \beta)}\)

where \(B(\alpha, \beta) = \displaystyle \frac{\Gamma(\alpha)\Gamma(\beta)}{\Gamma(\alpha + \beta)}\) is the Beta function, and \(\Gamma(\cdot)\) is the Gamma function.

This is a discrete distribution that combines:

  • Binomial randomness: \(\displaystyle {n \choose k}\)

  • Uncertainty in success probability \(\theta\) via the Beta distribution


3. Interpretation and Use Case

In practice, this model arises in the following type of scenario:

You observe outcomes from an initial study of size \(n_1\), and your posterior belief about the event probability is a Beta distribution.

Then, to predict how many events will occur in a future sample of size \(n_2\), the posterior predictive distribution of the count of events is Beta-Binomial.

This is particularly useful in interim analyses in clinical trials:

  • You analyze an initial sample (say 20 patients), and your posterior is \(\theta \mid y \sim \text{Beta}(\alpha^*, \beta^*)\)

  • Then you ask: what is the distribution of the number of events in the next 30 patients?

\(\tilde{X} \sim \text{Beta-Binomial}(n = 30, \alpha^*, \beta^*)\)

The Beta-Binomial naturally accounts for parameter uncertainty, unlike the classic Binomial distribution, which assumes \(\theta\) is fixed.


4. Relation to Bayesian Clinical Trial Monitoring

In Edition 4, we will revisit this concept when analyzing vaccine efficacy in a Phase 3 trial. There, the number of observed cases in the vaccine group among all cases is modeled with:

\(X \sim \text{Beta-Binomial}(n, \alpha, \beta)\)

where the prior \(\text{Beta}(\alpha, \beta)\) reflects skepticism or conservatism, and is updated using observed data.

This allows us to compute:

  • Posterior distributions for the event probability

  • Credible intervals for vaccine efficacy

  • Probabilities for exceeding efficacy thresholds


5. Summary

The Beta-Binomial distribution emerges when modeling binary outcomes with uncertain success probabilities. It is central to many Bayesian applications, including:

  • Posterior predictive modeling

  • Clinical trial design and interim analysis

  • Sequential decision-making

Its key advantage lies in properly accounting for uncertainty in \(\theta\), offering more robust probabilistic forecasts than plug-in frequentist methods.


Closing

This edition served as a conceptual bridge to our next article, which applies these principles in a real-world vaccine trial. If you’re comfortable with this distribution, you’re ready to appreciate the full strength of Bayesian monitoring and inference.


Continue learning with us at 3 D Statistical Learning as we make Bayesian ideas accessible, intuitive, and practical.

With gratitude to Dr. Dany Djeudeu for expert inspiration and guidance.