

Markov Chains


A Markov chain is a mathematical system that undergoes transitions from one state to another according to probabilistic rules. Its defining feature is that the future state depends only on the current state, not on the sequence of events that preceded it.

Introduction to Markov chains

Consider a simple example like the weather: if you know today's weather, you can predict tomorrow's without needing to know what the weather was yesterday or any earlier day. This is the classic illustration of the Markov property, which says the system is "memoryless". Markov chains model exactly the situations where this property holds.

Basic concepts

Before delving deeper into Markov chains, let's define some basic terms:

  • State: One of the possible configurations the system can be in at any given time.
  • Transition: The process of moving from one state to another in a Markov chain.
  • Transition Probability: The probability of moving from one state to another.

Mathematical representation

A Markov chain is represented by a matrix called the transition matrix. The element in row i and column j gives the probability of moving from state i to state j. Let's look at an example:

P = | 0.7  0.3 |
    | 0.4  0.6 |

In this matrix, 0.7 is the probability of staying in state 1, 0.3 is the probability of moving from state 1 to state 2, 0.4 is the probability of moving from state 2 to state 1, and 0.6 is the probability of staying in state 2.
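
To make this concrete, here is a minimal Python sketch (using NumPy; the library choice is ours, not prescribed by the text) that stores the matrix above and samples a short trajectory from the chain:

import numpy as np

# Transition matrix from the example above: row i holds the
# probabilities of moving from state i+1 to each state.
P = np.array([[0.7, 0.3],
              [0.4, 0.6]])

# Each row must be a probability distribution (sum to 1).
assert np.allclose(P.sum(axis=1), 1.0)

# Sample a short trajectory: at each step, draw the next state
# from the row of P belonging to the current state.
rng = np.random.default_rng(seed=0)
state = 0  # start in state 1 (index 0)
trajectory = [state + 1]
for _ in range(10):
    state = rng.choice(2, p=P[state])
    trajectory.append(state + 1)
print(trajectory)  # a random path through states 1 and 2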

Visual example

The following diagram shows a simple Markov chain with two states:

[Diagram: two states, S1 and S2, with an arrow from S1 to S2 labeled 0.3 and an arrow from S2 to S1 labeled 0.4]

In this example, the arrow from S1 to S2 carries a transition probability of 0.3, meaning there is a 30% chance of moving from state 1 to state 2.

Types of Markov chains

There are mainly two types of Markov chains:

  • Discrete-time Markov chain: The process moves in discrete time steps; at each step, the chain either stays put or jumps to another state.
  • Continuous-time Markov chain: Transitions can happen at any moment in time, with random holding times between jumps (a short simulation sketch follows this list).
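
For contrast, a continuous-time chain can be sketched by drawing exponentially distributed holding times; the exit rates below are made up purely for illustration:

import numpy as np

# Hypothetical exit rates (per unit time) for a two-state chain;
# with only two states, every jump goes to the other state.
rates = [1.0, 0.5]
rng = np.random.default_rng(seed=1)

t, state = 0.0, 0
for _ in range(5):
    t += rng.exponential(1.0 / rates[state])  # exponential holding time
    state = 1 - state                          # jump to the other state
    print(f"t = {t:.2f}: jumped to state {state + 1}")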

Stationary distribution

A Markov chain can reach a point where the probability of being in each state no longer changes from step to step. This is called the stationary distribution, and it can be found by solving the following equation:

πP = π

Here, π denotes the stationary distribution vector, and P is the transition matrix.
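
Since πP = π says that π is a left eigenvector of P with eigenvalue 1, one way to compute it is via an eigendecomposition. A sketch, reusing the two-state matrix from earlier:

import numpy as np

P = np.array([[0.7, 0.3],
              [0.4, 0.6]])

# pi P = pi means pi is a left eigenvector of P with eigenvalue 1,
# equivalently a right eigenvector of P transposed.
eigvals, eigvecs = np.linalg.eig(P.T)
k = np.argmin(np.abs(eigvals - 1.0))  # the eigenvalue closest to 1
pi = np.real(eigvecs[:, k])
pi = pi / pi.sum()                     # normalize to sum to 1
print(pi)  # [0.5714..., 0.4285...], i.e. (4/7, 3/7)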

Worked example

Suppose you are a machine learning engineer working on a recommendation system for a streaming platform. You model two states representing user activity: "watching" and "browsing." The user switches between these states with certain probabilities, which you capture in a transition matrix:

P = | 0.8  0.2 |
    | 0.5  0.5 |

After several iterations of multiplying the distribution by P, you find that it no longer changes. This stationary distribution tells you the long-run fraction of time the user spends in each state.
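
That convergence can be reproduced in a few lines of Python; the starting distribution below (the user begins in "watching") is an assumption of ours:

import numpy as np

P = np.array([[0.8, 0.2],    # row 1: from "watching"
              [0.5, 0.5]])   # row 2: from "browsing"

dist = np.array([1.0, 0.0])  # assume the user starts out watching
for _ in range(20):
    dist = dist @ P
print(dist)  # approaches [0.714, 0.286], i.e. (5/7, 2/7)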

Applications of Markov chains

Markov chains are used in a variety of fields, including:

  • Economics: Modeling stock prices and economic trends.
  • Biology: Simulating population dynamics.
  • Queueing theory: Modeling systems with waiting lines.
  • Computational algorithms: such as Google's PageRank algorithm.

Examples in queueing theory

Imagine a single-server queue where customers arrive randomly. The system can be modeled using a Markov chain, where states represent the number of customers in the system. Transition probabilities are based on arrival and service rates.

For example, consider a supermarket with a checkout counter. Customers arrive at random intervals, and the time taken to serve each customer varies. The arrival and service times form the basis of transition probabilities, and the queue length represents the different states of the Markov chain.
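
Here is a rough discrete-time simulation of such a queue; the per-slot arrival probability p and service probability q are invented values for illustration only:

import numpy as np

# Single-server queue, discretized: in each time slot a customer
# arrives with probability p, and if anyone is present, one customer
# finishes service with probability q. The state is the queue length.
p, q = 0.3, 0.5  # illustrative values, not from the text
rng = np.random.default_rng(seed=2)

length, history = 0, []
for _ in range(10_000):
    if rng.random() < p:
        length += 1                      # arrival
    if length > 0 and rng.random() < q:
        length -= 1                      # service completion
    history.append(length)

print(f"average queue length: {np.mean(history):.2f}")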

Solving Markov chains

Solving a Markov chain typically involves finding the stationary distribution. This is usually done in one of two ways:

  • Iteration: Repeatedly multiply the initial distribution by the transition matrix until it stops changing.
  • Direct solution: Use linear algebra to solve πP = π together with the constraint that the entries of π sum to 1 (see the sketch just below this list).
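
The direct method can be sketched as follows: the system (Pᵀ − I)π = 0 is singular on its own, so one of its equations is replaced by the normalization constraint. Again using the earlier two-state matrix:

import numpy as np

P = np.array([[0.7, 0.3],
              [0.4, 0.6]])
n = P.shape[0]

# pi P = pi  <=>  (P.T - I) pi = 0. This system is singular, so we
# replace its last equation with the normalization sum(pi) = 1.
A = P.T - np.eye(n)
A[-1, :] = 1.0
b = np.zeros(n)
b[-1] = 1.0
pi = np.linalg.solve(A, b)
print(pi)  # [0.5714..., 0.4285...], matching the eigenvector result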

Example calculation

Let's solve a Markov chain using iteration. Suppose we have the initial state distribution [1, 0] and the transition matrix:

P = | 0.9  0.1 |
    | 0.2  0.8 |

Our initial distribution says that we start in state 1 with certainty. After a transition, the distribution becomes:

[1, 0] * | 0.9  0.1 | = [0.9, 0.1]
         | 0.2  0.8 |

Repeating these multiplications eventually yields the stationary distribution, in this case [2/3, 1/3].
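
Carrying that iteration out in code (a small sketch; the stopping tolerance is our choice) shows the distribution settling at (2/3, 1/3):

import numpy as np

P = np.array([[0.9, 0.1],
              [0.2, 0.8]])

dist = np.array([1.0, 0.0])  # start in state 1 with certainty
for step in range(1, 200):
    new = dist @ P
    if np.allclose(new, dist, rtol=0, atol=1e-12):
        print(f"converged after {step} steps: {new}")
        break
    dist = new
# prints approximately [0.6667, 0.3333], i.e. (2/3, 1/3)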

Conclusion

Markov chains are powerful tools in probability and statistics for modeling random processes. They are particularly useful when a system exhibits the Markov property: the next state depends only on the current state, which therefore summarizes everything you need to know about the past. Understanding the basics of Markov chains will help you tackle a wide variety of problems in science and engineering.

The topics under Markov chains, such as types, applications, and solution methods, are very broad, and continued study lays a strong foundation for advanced topics in stochastic processes.

