16 minute read

A handbook for understanding quantum math in articlesPermalink

We all agree that quantum mechanics is weird and fascinating: it can teleport information, connect particles across space and time, and even speed up computations that would take centuries. Our mission at The Quantumist is to make this field a bit more accessible by walking you through the steps and reasonings in a colloquial manner. However, one cannot fully grasp what is going on under the hood without understanding some of the math. We believe math can be seen as a game, and the “rules” of this game determine the symbols you can write and how you can manupulate them. The purpose of this article is to serve as a cheat-sheet for the basic rules that apply when dealing with the math of quantum computing, and is intended for people that want to get their hands dirty. We will expand it in the future with additional notions and tricks we deem useful.

Matrix multiplication refresherPermalink

This section is intended as a light refresh on one basic linear algebra concept that is fundamental for understanding the rest of the article. Feel free to skip it if you feel you already know these notions.

Matrix multiplication is a basic operation useful to understand how quantum states evolve and how quantum gates combine with each other. To perform a matrix multiplication we of course need two matrices to multiply, A and B, and we call the resulting matrix C. We define A to be of generic size (m×n) (m rows, n columns), while B is of generic size (k×l). A and B can contain any kind of numbers, in our example we choose real numbers for simplicity (R) but quantum computing utilizes complex values (C). For now, not much is lost in limiting ourselves to real numbers. Back to our matrices, the size we choose for them is important, as we can only perform a matrix product AB if the inner dimensions of the product match. In other words, we can only multiply matrices of size (m×n) and (k×l) if n=k.

Now, let’s see how we can actually compute the matrix product. The generic element cij of C in row i and column j is computed as the scalar product of the i-th row of A with the j-th column of B. This is also a mnemonic trick: “To compute the element in the third row, second column of C, you must use the third row of A and the second column of B”.

AB=[a11a1nam1amn][b11b1lbk1bkl]=[c11c1lcm1cml]=C cij:=x=0naixbjx

As an example, let’s take a look at the following, with m=n=k=3 and l=2, in particular let’s show how to obtain element c32:

AB=[131431212][104133]=[166196127]=C c32=20+11+23=7

Note that n=k was important in order to have vectors of the same length in the computation for cij.

Matrix notationPermalink

A recap of quantum gatesPermalink

Just as classical computers have logical gates to manipulate information, quantum computers have quantum gates. They serve the purpose of manipulating quantum states - effectively changing the shape of the probability distribution - in order to express some logical function. All quantum gates - and all quantum operators at large - act in a linear fashion. This means we can represent them in a matrix form. Moreover, every generic gate U is unitary, meaning it preserves the total probability sum of the state to 1 (UU=UU=I). All quantum gates are also reversible, meaning that the input can be unambiguously reconstructed from its output. By contrast, a classical XOR gate is irreversible since its output 0 can be obtained by both 11 and 00. Here is a table summarizing the main single and multi-qubit quantum gates:

Gate Symbol Matrix form
Identity I [1001]
Pauli-X X [0110]
Pauli-Y Y [0ii0]
Pauli-Z Z [1001]
Hadamard H 12[1111]
φ Phase Shift P(φ) [100eiφ]
CNOT CNOT [1000010000010010]
SWAP SWAP [1000001001000001]
Controlled-Z CZ [1000010000100001]
Toffoli CCX [1000000001000000001000000001000000001000000001000000000100000010]

Applying quantum gates in matrix formPermalink

The result of applying a quantum gate to a state can be calculated simply as a matrix multiplication of the gate with the state. The effect of a gate can be easily understood once one visualizes the problem using linear algebra. To this regard, we recommend an excellent and intuitive explanation of these concepts: 3Blue1Brown’s Essence of linear algebra.

An intuition coming from linear algebra is that gates in matrix formulation are a map to where the base states end up. Let’s take a look at the following example with a Pauli-Y gate on the |0=[1,0]T state:

Y|0=[0ii0][10]=[0i]

Observe how the final output corresponds to the first column of Y. This is trivial, since state |0 had a single 1 in the first position. This, however, is a very useful detail to notice, as it’s valid for gates and states of any size. Moreover, if your state is not a pure state (e.g. not |0 or |1), you can still compute the output as a sum of base states. The values of the state tells you “how much” of every column to use for composing your state. For example, if we apply the Pauly-Y gate to the superposed state |+=12[1,1]T, we obtain:

Y|+=[0ii0][11]12=12[0i]+12[i0]=12[ii]

Let’s make a final example with a CNOT gate applied to a larger, 2-qubit state:

CNOT(|00+|11)=[1000010000010010]([1000]+[0001])=[1000]+[0010]=|00+|10

Dirac notationPermalink

Quantum states in Dirac notationPermalink

Doing the math in matrix notation can get out of hand very quickly. Let’s take the case of a qubit system made of three |0 states. We represent this in matrix notation as a tensor product of three 2-dimensional vectors:

|0|0|0=[10][10][10]=[10000000]T

As you can see, this notation is very cumbersome, especially when dealing with large qubit systems whose vector size grows exponentially. For this reason, quantum scientists developed a shorthand form - Dirac notation - that concisely represents the constituent qubits of a system. In this notation, the 3-qubit system seen before is written as |000. Much more compact! Another reason for wanting to work in Dirac notation is that quantum gates often operate on a small part of the whole set of qubits, so it’s useful to represent them as separate systems.

There is more! Scientists came up with many different ways of shortening and re-writing the notation while meaning the same thing. Here is a quick reference of alternative ways you can use to refer to a quantum system (in this case we take |000 as an example, but it works with any other multiqubit state). Remember, these are all different ways of writing the same |000=[1,0,0,0,0,0,0,0]T state:

  • |0|0|0: Mathematical definition of stacking together multiple qubits in the same ket.
  • |0|0|0: Shorthand for the Kronecker product.
  • |0|00: The way qubits are split in different kets can be moved around.
  • |03: Three concatenated |0 states.

Let’s now take a different case. Suppose that we want to write a full superposition of all states in an n-qubit system. It would look somewhat like (|000++|111)/n, which is again very long. A shorthand way to represent the same state is the following:

1nx{0,1}n|x

For example, the 2-qubit superposition (|00+|01+|10+|11)/2 becomes 12x{0,1}2|x. This notation often pops up when dealing with quantum algorithms.

Quantum gates in Dirac notationPermalink

When dealing with quantum computations, you may deal with long and cumbersome operations that cannot be further reduced. A way of speeding up your calculations is to work directly in Dirac notation. To do so, we first need to lay down some rules to make sure we are playing the game correctly.

Rule of behavioursPermalink

All gates act in easy to describe ways. The outcome of passing a superposition through a gate can be obtained by applying the gate separately on each basis state and summing the results. Moreover, you can deduce this behavior by looking at the columns of the matrix form of the gate, in the same way we did for the matrix prodct. Let’s take a look at an example that shows how to transition from matrix notation to the Dirac one.

Let’s consider gate H applied on |+=(|0+|1)/2: H|+=[1111]12[11]12=(H|0+H|1)/2 Remember how all kets actually represent column vectors? We use that same rule to convert back from vectors to ket, and rewrite the application of H in a different way. Now, we can read the columns of H (in red and blue) to understand how they will change the states they’re applied to:

(H|0+H|1)/2=(|0+|1+|0|1)/2=|0

Indeed, the first column of H is [1,1]T/2 and |0 is transformed in a state with the same meaning: |0+|1=[1,1]T/2.

Here is a table summarizing the behaviours of the main gates:

Gate Symbol Mapping Description
Identity I |0|0
|1|1
Does nothing
Pauli-X X |0|1
|1|0
Quantum NOT. Turns 0 to 1 and viceversa
Pauli-Y Y |0i|1 |1i|0 Similar to X, but multiplies by i or i
Pauli-Z Z |0|0 |1|1 Similar to I, but flips the sign of nonzero inputs
Hadamard H |0(|0+|1)/2 |1(|0|1)/2 Creates a superposition
φ Phase Shift P(φ) |0|0 |1eiφ|1 Shifts |1 inputs by a specific phase
CNOT CNOT |00|00 |01|01 |10|11 |11|10 If the first qubit is 1, it flips the second qubit
SWAP SWAP |00|00 |01|10 |10|01 |11|11 Swaps the two qubits
Controlled-Z CZ |00|00 |01|01 |10|10 |11|11 Flips the sign of the target qubit if both qubits are in the 1 state
Toffoli CCX |010|010 |110|111 |111|110 Flips the third qubit if both first and second qubits are 1 (only some examples shown)

Rule of parallel gatesPermalink

Gates stack like states do. In other words, one can represent three parallel H gates with a tensor product: HHH. A shorthand notation is H3. The resulting gate can handle 3-qubit systems, so H3|000 would be a valid operation.

Rule of matching sizesPermalink

In Dirac notation, a gate is always applied to the state to its right. States and the gates we pass them through must match in size. For example, the operation CNOT|000 doesn’t make sense because the CNOT gate expects 2 qubits as input, while we are feeding it a 3-qubit system. A correct way to apply a CNOT to only the first two qubits would be (CNOTI)|000 or CNOT|00|0.

Let’s now see how one can use these shortcuts to work through the math without using the lengthy matrix notation.

Example 1 - Single-qubit gates in Dirac notationPermalink

Write the result of applying the gate YHX on state |1

YHX|1=YH|0Start by flipping the qubit=Y(|0+|1)/2Use H to build the superposition=(Y|0+Y|1)/2Carry the Y inside =(i|1i|0)/2Apply Y separately to each basis state

Example 2 - Multiple-qubit gates in Dirac notationPermalink

Write the result of applying the gate (X3)(XCNOT)(CNOTI) on state |101

(X3)(XCNOT)(CNOTI)|101=(X3)(XCNOT)|111Controlled flip operation on second qubit=(X3)|010Flip of first qubit and controlled flip of third qubit=|101Flip of all qubits Notice how the gate used in the example respects the rule of matching sizes, and notice how by applying the rule of behaviours we solved an otherwise very large matrix multiplication in just a few steps!

Reading quantum circuitsPermalink

Oftentimes, papers and books present quantum algorithms using the circuit that describes them. Going from the diagram to the math and viceversa can be confusing, therefore we summarized here some useful rules to keep in mind in order to correctly interpret the notation.

  • Gates in series correspond to consecutive matrix products
|s=TXH|s
Figure 1: Three gates in series on a single qubit.
  • Gates in parallel correspond to a tensor product
|s=(CNOTI)(ICNOT)(HXI)|s
Figure 2: Multiple gates in parallel on 3 qubits. Note how s is actually a 3-qubit system.

As a sanity check, remember that the formula you obtain must satisfy the “rule of matching sizes”. Translating a diagram and finding something like CNOT|0 is a sign that something went wrong.

ConclusionPermalink

If you got this far, congratulations! This article is very dense and tries to summarise many concepts in the simplest way possible. Don’t worry if you couldn’t memorize all notions, it’s normal to feel this way. The purpose of this article is to be a quick guide to return to in case you stumble upon some confusing math later on. Remember to come back from time to time, as new tips and tricks may be added later on. See you soon!

Quantum gate summary tablePermalink

Gate Symbol Mapping Matrix form Description
Identity I |0|0
|1|1
[1001] Does nothing
Pauli-X X |0|1
|1|0
[0110] Quantum NOT. Turns 0 to 1 and viceversa
Pauli-Y Y |0i|1 |1i|0 [0ii0] Similar to X, but multiplies by i or i
Pauli-Z Z |0|0 |1|1 [1001] Similar to I, but flips the sign of nonzero inputs
Hadamard H |0(|0+|1)/2 |1(|0|1)/2 12[1111] Creates a superposition
φ Phase Shift P(φ) |0|0 |1eiφ|1 [100eiφ] Shifts |1 inputs by a specific phase
CNOT CNOT |00|00 |01|01 |10|11 |11|10 [1000010000010010] If the first qubit is 1, it flips the second qubit
SWAP SWAP |00|00 |01|10 |10|01 |11|11 [1000001001000001] Swaps the two qubits
Controlled-Z CZ |00|00 |01|01 |10|10 |11|11 [1000010000100001] Flips the sign of the target qubit if both qubits are in the 1 state
Toffoli CCX |010|010 |110|111 |111|110 (Omitted for readability) Flips the third qubit if both first and second qubits are 1 (only some examples shown)

SourcesPermalink

Nielsen, M., & Chuang, I. (2010). Quantum Computation and Quantum Information: 10th Anniversary Edition. Cambridge: Cambridge University Press. doi:10.1017/CBO9780511976667

Updated: