Skip to content

S3 Chapter 2: Combinations of Random Variables

Definition: Normal Distribution A random variable XX is normally distributed with mean μ\mu and variance σ2\sigma^2, written XN(μ,σ2).X \sim N(\mu,\sigma^2).

  • μ\mu: center/location (expected value),
  • σ\sigma: spread/typical deviation,
  • σ2\sigma^2: variance.

Theorem: Linearity of Expectation For any random variables X1,,XnX_1,\ldots,X_n and constants a1,,an,ba_1,\ldots,a_n,b, E ⁣(i=1naiXi+b)=i=1naiE(Xi)+b.E\!\left(\sum_{i=1}^{n} a_iX_i + b\right)=\sum_{i=1}^{n}a_iE(X_i)+b. No independence assumption is needed.

Theorem: Variance Rules Under Independence If XX and YY are independent, then Var(X+Y)=Var(X)+Var(Y).\text{Var}(X+Y)=\text{Var}(X)+\text{Var}(Y). For independent X1,,XnX_1,\ldots,X_n, Var ⁣(i=1naiXi)=i=1nai2Var(Xi).\text{Var}\!\left(\sum_{i=1}^{n}a_iX_i\right)=\sum_{i=1}^{n}a_i^2\text{Var}(X_i).

Theorem: Linear Combination of Independent Normals If X1,,XnX_1,\ldots,X_n are independent and XiN(μi,σi2),X_i\sim N(\mu_i,\sigma_i^2), then for constants a1,,ana_1,\ldots,a_n, L=i=1naiXiL=\sum_{i=1}^{n}a_iX_i is normally distributed with E(L)=i=1naiμi,Var(L)=i=1nai2σi2.E(L)=\sum_{i=1}^{n}a_i\mu_i,\quad \text{Var}(L)=\sum_{i=1}^{n}a_i^2\sigma_i^2.

Normal Distribution: The Model and Intuition

Section titled “Normal Distribution: The Model and Intuition”
  • Read and interpret XN(μ,σ2)X \sim N(\mu,\sigma^2).
  • Connect mean and variance to realistic contexts.
  • Standardize to the ZZ-distribution for probability calculation.

If XN(μ,σ2)X \sim N(\mu,\sigma^2), define Z=XμσN(0,1).Z=\frac{X-\mu}{\sigma}\sim N(0,1).

Then P(Xx)=P ⁣(Zxμσ).P(X\le x)=P\!\left(Z\le \frac{x-\mu}{\sigma}\right).

Example: Battery Weight

Suppose battery weight WN(75,32)W \sim N(75,\,3^2) grams. P(W>80)=P ⁣(Z>80753)=P(Z>1.667).P(W>80) = P\!\left(Z>\frac{80-75}{3}\right) = P(Z>1.667). So the problem is converted to standard normal table use.

Example: Hedging Trade: Meituan and Alibaba Expected Return

Let

  • MM: one-day return (%) of Meituan stock
  • AA: one-day return (%) of Alibaba stock

A trader builds a hedge portfolio: P=M0.7AP=M-0.7A (long Meituan, short 0.70.7 units of Alibaba).

Suppose E(M)=0.40,E(A)=0.25.E(M)=0.40,\quad E(A)=0.25. Then E(P)=E(M)0.7E(A)=0.400.7(0.25)=0.225.E(P)=E(M)-0.7E(A)=0.40-0.7(0.25)=0.225. So the expected one-day return of the hedged portfolio is 0.225%0.225\%.

Example: Hedged Portfolio Variance: Independence Given

Continue with P=M0.7A,P=M-0.7A, and assume Var(M)=2.25,Var(A)=1.44.\text{Var}(M)=2.25,\quad \text{Var}(A)=1.44. If the question states that MM and AA are independent, then Var(P)=Var(M)+(0.7)2Var(A)=2.25+0.49(1.44)=2.9556.\text{Var}(P)=\text{Var}(M)+(-0.7)^2\text{Var}(A)=2.25+0.49(1.44)=2.9556. Therefore SD(P)=2.95561.72.\text{SD}(P)=\sqrt{2.9556}\approx 1.72. Key point: coefficients must be squared in variance calculations.

Example: Hedged Portfolio Variance: Independence Not Given

If the question only gives Var(M)=2.25,Var(A)=1.44,\text{Var}(M)=2.25,\quad \text{Var}(A)=1.44, but does not state whether Meituan and Alibaba returns are independent, then for P=M0.7AP=M-0.7A you cannot directly write Var(P)=2.25+0.49(1.44).\text{Var}(P)=2.25+0.49(1.44).

If the returns are negatively correlated, then the variance of PP becomes lower which reduces the risk of the hedge portfolio.

Example: Difference of Blood Pressure Readings

Morning and evening blood pressures: MN(120,25),EN(115,36),M\sim N(120,25),\quad E\sim N(115,36), with independence. Define D=MED=M-E.

Then E(D)=120115=5,Var(D)=25+36=61.E(D)=120-115=5,\quad \text{Var}(D)=25+36=61. So DN(5,61).D\sim N(5,61).

Example: Weighted Combination

Suppose XN(60,4)X\sim N(60,4), YN(45,9)Y\sim N(45,9), independent. Let T=2X3Y.T=2X-3Y. Then E(T)=2(60)3(45)=15,E(T)=2(60)-3(45)=-15, Var(T)=22(4)+(3)2(9)=16+81=97.\text{Var}(T)=2^2(4)+(-3)^2(9)=16+81=97. Hence TN(15,97).T\sim N(-15,97).

If X1,,XnX_1,\ldots,X_n are independent N(μ,σ2)N(\mu,\sigma^2), then Xˉ=1ni=1nXi\bar{X}=\frac{1}{n}\sum_{i=1}^{n}X_i is also normal, and XˉN ⁣(μ,σ2n).\bar{X}\sim N\!\left(\mu,\frac{\sigma^2}{n}\right).

This single result powers much of later confidence interval work.