Asymptotic Equipartition Property
Sources:
- EE 376A: Information Theory. Winter 2018. Lecture 4. - Stanford University
- EE 376A: Information Theory. Winter 2017. Lecture 4. - Stanford University
- Elements of Information Theory
Intro
In information theory, the analog of the law of large numbers(LLN) is the asymptotic equipartition property (AEP).
It is a direct consequence of the weak law of large numbers.
The law of large numbers states that for iid random variables,
The AEP states that
This enables us to divide the set of all sequences into two sets, the typical set, where the sample entropy is close to the true entropy, and the nontypical set, which contains the other sequences.
Most of our attention will be on the typical sequences. Any property that is proved for the typical sequences will then be true with high probability and will determine the average behavior of a large sample.
Notation
Memoryless source:
iid , where denotes one PMF of arbitory . Note that "memoryless" is used here because samples are drawn iid and have no dependence on past realizations.- Note: All i.i.d. r.v. have the same probability distribution and the same entropy.
Alphabet:
specifies the possible values that each symbol can take on. The size of is denoted .Source sequence:
denotes the -tuple that specifies a sequence of source symbols. In other words, is a stochastic process. Further note that indicates the set of all possible source sequences of length .- Since the source symbols are memoryless here.
is an iid stochastic process. It's also denoted as in some literatures.
- Since the source symbols are memoryless here.
Probability of
where are i.i.d. : The probability assigned to a source sequence is given by . Since we implicitly evaluate the probabilities over the alphabet , we may also omit amd write
The -typical Set
Definition: For some
Let
即:
Furthermore, note the following equivalent ways of defining
We have
From
Theorem:
Theorem:
Proof
Note that
<==> is an -typical set <==> . So we haveAlso, we know that
. Then the right term can be reformed to By , it then can be reformed to As a result, we haveNote that since
are iid, are iid. Then by the weak law of large numbers (LLN), Here we just treat as iid r.v, then apply the LLN.Recalling that by the definition of entrypy,
, so To conclude,
Note: Since
AEP
From Theorem:
From
As a result,
for
Thus, AEP enables us to partition the sequence
- Typical set: Containing sequences with probability close to
. - Atypical set: Containing the other sequences.

Note: the notation
Theorem: Bound of
Theorem:
Proof
Upper bound:
See following equation:
According to
Lower bound:
See following equation:
The starting equation
Since
According to
Theorem: is an exponentially smaller than
The space of all possible source sequences
In fact,
Despite the small size of
Theorem: is minimal
Theorem: Fix
Proof
$$
Explaination:
is derived from a basic theorem of sets. That is, given arbitary sets , The proof is in appendix.Then, we have
is because given arbitary sets , Then we have And we also know .Then
is because all are independent? //TODO is due to . is a common computation. is because given arbitary sets , Then we have is due to the prerequisite .The limit in
is easy to understand. Because Then we have
From the theorems proven above, we understand
Appendix
Proof of
"Given arbitary sets , ":Inclusion (
):Let
be an arbitrary element in . We want to show that is also in the right-hand side of the equation.- If
: is in the intersection of and . - If
: is in the intersection of and the complement of .
Therefore,
.- If
Inclusion (
):Let
be an arbitrary element in . We want to show that is also in .- If
: is in the intersection of and , so yy is in . - If
: is in the intersection of and the complement of , so is in .
Since
could be in either or , we can conclude that is in .Therefore,
.Combining both inclusions, we have
.- If