Typical set

cosmos 22nd March 2019 at 8:45pm
Information theory

An ϵ\epsilon-typical set, Aϵ(n)A_\epsilon^{(n)} Set of strings of length nn which have a typical probability in an Stochastic process. "Typical" refers to a probability close (ϵ\epsilon) to that defined in the Asymptotic equipartition property. This set is typical because most strings belong to this set, asymptotically all of them.

It includes all strings XX such that:

2n(H(X)+ϵ)p(X)2n(H(X)ϵ)2^{-n(H(X)+\epsilon)} \leq p(X) \leq 2^{-n(H(X)-\epsilon)}

We can show that:

  • Pr{Aϵ(n)}>1ϵPr\{A_\epsilon^{(n)}\} > 1 - \epsilon, for nn sufficiently large. This is just from the AEP theorem and the definition of Convergence in probability. However, this bound isn't tight. I think using Hoeffding's inequality one obtains an exponential decrease in the probability of not being in the typical set.
  • Aϵ(n)2n(H(X)+ϵ)|A_\epsilon^{(n)}| \leq 2^{n(H(X)+\epsilon)}
  • Aϵ(n)(1ϵ)2n(H(X)ϵ)|A_\epsilon^{(n)}| \geq (1-\epsilon)2^{n(H(X)-\epsilon)}, for nn sufficiently large.

See equations 3.7-16 in Els. of info theory book for proofs.

High-probability set

A high-probability set is defined as the smallest set of strings BB with Pr{B}1δPr\{B\} \geq 1-\delta.

One can show that Aϵ(n)A_\epsilon^{(n)} and BB are about the same size.


video