Skip to main content

Discrete Probability Distributions

Overview

This section covers discrete random variables and their probability distributions.

Key Definitions

Def. (Random Variable)

A random variable is a function that maps outcomes in the sample space to real numbers. Formally, X:SRX: S \to \mathbb{R}.

Def. (Discrete Random Variable)

A random variable XX is discrete if it takes on a countable number of distinct values.

Def. (Probability Mass Function (PMF))

For a discrete random variable XX, the PMF is a function pX(x)=P(X=x)p_X(x) = P(X = x) that gives the probability that XX takes the value xx.

Def. (Cumulative Distribution Function (CDF))

The CDF of a random variable XX is defined as FX(x)=P(Xx)F_X(x) = P(X \leq x) for all xRx \in \mathbb{R}.

Def. (Expected Value)

The expected value (or mean) of a discrete random variable XX is defined as E[X]=xxP(X=x)E[X] = \sum_{x} x \cdot P(X = x), provided the sum converges.

Def. (Variance)

The variance of a random variable XX is defined as Var(X)=E[(XE[X])2]=E[X2](E[X])2\text{Var}(X) = E[(X - E[X])^2] = E[X^2] - (E[X])^2.

The variance measures the spread or dispersion of the distribution around the mean.

Def. (Standard Deviation)

The standard deviation of a random variable XX is the square root of the variance:

σX=Var(X)\sigma_X = \sqrt{\text{Var}(X)}

The standard deviation has the same units as XX and provides a measure of spread in the original units.

Def. (Independence of Random Variables)

Random variables XX and YY are independent if for all values xx and yy:

P(X=x,Y=y)=P(X=x)P(Y=y)P(X = x, Y = y) = P(X = x) \cdot P(Y = y)

Equivalently, knowing the value of XX provides no information about YY.

Properties:

  • If XX and YY are independent, then E[XY]=E[X]E[Y]E[XY] = E[X]E[Y]
  • If XX and YY are independent, then Var(X+Y)=Var(X)+Var(Y)\text{Var}(X + Y) = \text{Var}(X) + \text{Var}(Y)

Multiple random variables: X1,X2,,XnX_1, X_2, \ldots, X_n are mutually independent if for all values x1,x2,,xnx_1, x_2, \ldots, x_n: P(X1=x1,X2=x2,,Xn=xn)=P(X1=x1)P(X2=x2)P(Xn=xn)P(X_1 = x_1, X_2 = x_2, \ldots, X_n = x_n) = P(X_1 = x_1) \cdot P(X_2 = x_2) \cdots P(X_n = x_n)

Common Discrete Distributions

Bernoulli Distribution

Def. (Bernoulli Distribution)

A random variable XX follows a Bernoulli distribution with parameter p[0,1]p \in [0,1], denoted XBernoulli(p)X \sim \text{Bernoulli}(p), if P(X=1)=pP(X = 1) = p and P(X=0)=1pP(X = 0) = 1-p.

  • Parameter: pp = probability of success
  • Support: {0,1}\{0, 1\}
  • Applications: Binary outcomes in CS (success/failure, true/false)

Binomial Distribution

Def. (Binomial Distribution)

A random variable XX follows a Binomial distribution with parameters nn and pp, denoted XBinomial(n,p)X \sim \text{Binomial}(n, p), if it represents the number of successes in nn independent Bernoulli(p)(p) trials. The PMF is:

P(X=k)=(nk)pk(1p)nk for k=0,1,,nP(X = k) = \binom{n}{k} p^k (1-p)^{n-k} \text{ for } k = 0, 1, \ldots, n

  • Parameters: nn = number of trials, pp = probability of success per trial
  • Support: {0,1,2,,n}\{0, 1, 2, \ldots, n\}
  • Relationship: Sum of nn independent Bernoulli(p)(p) random variables

Geometric Distribution

Def. (Geometric Distribution)

A random variable XX follows a Geometric distribution with parameter pp, denoted XGeometric(p)X \sim \text{Geometric}(p), if it represents the number of trials needed to get the first success in a sequence of independent Bernoulli(p)(p) trials. The PMF is:

P(X=k)=(1p)k1p for k=1,2,3,P(X = k) = (1-p)^{k-1} p \text{ for } k = 1, 2, 3, \ldots

  • Parameter: pp = probability of success per trial
  • Support: {1,2,3,}\{1, 2, 3, \ldots\}
  • Memoryless Property: P(X>n+mX>n)=P(X>m)P(X > n+m | X > n) = P(X > m)

Poisson Distribution

Def. (Poisson Distribution)

A random variable XX follows a Poisson distribution with parameter λ>0\lambda > 0, denoted XPoisson(λ)X \sim \text{Poisson}(\lambda), if its PMF is:

P(X=k)=λkeλk! for k=0,1,2,P(X = k) = \frac{\lambda^k e^{-\lambda}}{k!} \text{ for } k = 0, 1, 2, \ldots

  • Parameter: λ\lambda = average rate of occurrence
  • Support: {0,1,2,}\{0, 1, 2, \ldots\}
  • Applications: Network traffic, system failures, rare events

Key Formulas

Expected Value

For a discrete random variable XX:

E[X]=xxP(X=x)E[X] = \sum_{x} x \cdot P(X = x)

Variance

Var(X)=E[X2](E[X])2\text{Var}(X) = E[X^2] - (E[X])^2