Mode of Operations

Block ciphers

A block cipher works on fixed-length blocks. The most famous block cipher is AES.

Specifically, a block cipher \(F\) is a deterministic algorithm that works as follows:

Input: \( K, M \)
- \(K\): A fixed-length encryption key
- \(M\): A fixed-length plaintext block.
Output: \(C\)
- \(C\): A fixed-length ciphertext block.
Requirement
- The length of M is fixed, and it's the same as the length of C.
- We call the length of M (= the length of C) as the block size.
For a block cipher \(F\), we will use the following notation: \[ C \leftarrow F(K, M) \]

Sometimes we will also use the following short-handed notation when discussing the key is less important: \[ C \leftarrow F_K(M) \]

Block cipher = Pseudorandom permutation

A block cipher is also called a pseudorandom permutation.

For every key \(K\), it hold that \( F_K: \{0,1\}^\ell \rightarrow \{0,1\}^\ell \) is a permutation.
The permutation is pseudorandom, which means that it looks random even to the computers although it is not truly random in the strict mathematical sense. Therefore, unless you have a key, given a ciphertext block, which looks random, you cannot know what the corresponding the plaintext block is.

Decryption

If you have a key, then block cipher allows you to decrypt a ciphertext. Of course, the decyption is not possible if you don't have the correct key.

	AES
Key length (in bits)	128, 192 or 256
Block size (in bits)	128

AES (Advanced Encryption Standard)

NIST issued a call for proposals for a new Advanced Encryption Standard (AES), which should have a security strength equal to or better than 3DES and significantly improved efficiency. NIST selected three members of the Rijndael family, each with a block size of 128 bits, but three different key lengths: 128, 192 and 256 bits. AES was announced by the NIST as U.S. FIPS PUB 197 (FIPS 197) on November 26, 2001.

AES became effective as a federal government standard on May 26, 2002 after approval by the Secretary of Commerce. AES is included in the ISO/IEC 18033-3 standard. AES is available in many different encryption packages, and is the first publicly accessible and open cipher approved by the National Security Agency (NSA) for top secret information when used in an NSA approved cryptographic module . See the following for the key length, and the block size.

Mode of Operations: Encrypting a long message

When encryption a long message, we need to construct an encryption scheme that can encrypt an arbitrary message (possibly consisting of multiple blocks) multiple times.There are several ways suggested and we will call them modes of operation.

OpenSSL

In this lecture, we will use openssl commands in the terminal. Openssl can be installed as follows:

sudo apt install openssl

Documenation

The documentation for openssl encryption commands can be found here.

Electronic Code Book (ECB) Mode

Fig 1. Electronic Code Book (ECB) mode

This is the simplest mode of operation: directly apply the block cipher to each plaintext block.

\( {\sf Gen} \): Choose a random key \( K \) uniformly at random from \( \{ 0, 1 \}^n \) .
\( {\sf Enc}_K(M_1, M_2, \ldots, M_m) \): Output \( ( F_K(M_1), F_K(M_2), \ldots, F_K(M_m) ) \).
\( {\sf Dec}_K(C_1, C_2, \ldots, C_m) \): Output \( (F_K^{-1}(C_1), F_K^{-1}(C_2), \ldots, F_K^{-1}(C_m)) \).

Insecurity of ECB Mode

If you encrypt messages using the ECB mode, the ciphertext will leak information of the underlying plaintext.

In the ECB mode, if \( M_i = M_j \), then \( C_i = C_j \)

The above statement is true, since we apply the same permutation \( F_K \) to each message. Since \( F_K \) is a deterministic function, if the inputs are the same, the outputs will always be the same.

Example

Follow the procedure in your VM.

Download pic_original.bmp. It's a picture:
Execute the following command in the terminal to encrypt the file with the ECB mode:
```
openssl enc -aes-128-ecb -in pic_original.bmp -out output_ecb.bin -K 00112233445566778899aabbccddeeff 
```
In the above,
- -aes-128-ecb means encryption with AES-ECB mode, where the length of the key is 128 bits.
- -in option specifies the input file.
- -out option specifies the output file.
- -K option specifies the encryption key. The values should be used the hex string form. That is, the toy key we used here is 0x00112233...ff (16 bytes = 128 bits).

Check the first 128 bytes of the first two files:

$ hexdump -v -n 128 pic_original.bmp
0000000 4d42 d28e 0002 0000 0000 0036 0000 0028
0000010 0000 01cc 0000 0086 0000 0001 0018 0000
0000020 0000 d258 0002 0000 0000 0000 0000 0000
0000030 0000 0000 0000 ffff ffff ffff ffff ffff
0000040 ffff ffff ffff ffff ffff ffff ffff ffff
0000050 ffff ffff ffff ffff ffff ffff ffff ffff
0000060 ffff ffff ffff ffff ffff ffff ffff ffff
0000070 ffff ffff ffff ffff ffff ffff ffff ffff
0000080

$ hexdump -v -n 128 output_ecb.bin
0000000 84f5 37a4 1698 d97a c529 7360 7038 6a86
0000010 af54 a084 3127 1d87 adae b5c5 c3f2 e1b0
0000020 5906 6599 c30d 0ed9 3a1c bafc 2477 d7e5
0000030 368d 9ff9 86e2 bd31 3388 2321 17f8 515d
0000040 b164 1493 1ac3 5af4 dfcc 3c7e b74d 0d9f
0000050 b164 1493 1ac3 5af4 dfcc 3c7e b74d 0d9f
0000060 b164 1493 1ac3 5af4 dfcc 3c7e b74d 0d9f
0000070 b164 1493 1ac3 5af4 dfcc 3c7e b74d 0d9f
0000080

Note each line of hexdump is 16 bytes = 128 bits, which is the size of one block of AES. The above actually verifies the following:

In the ECB mode, if \( M_i = M_j \), then \( C_i = C_j \)

Q: Why so many ffff ffff ... ffff in the bmp file?

Answer: R=0xff=255, G=0xff=255, B=0xff=255. White pixel. Check out the picture.

The first 54 bytes of this bitmap file should contain the legitimate header information. If you're interested in the bitmap format, see wikipedia. Since encryption messes up the header information, we recover the header information by simply overwrite the first 54 bytes of output_ecb.bin with that of pic_original.bmp.
```
>>> header = open("pic_original.bmp", "rb").read(54)    # read the bmp header from the bmp file  
>>> data = open("output_ecb.bin", "rb").read()[54:]     # read the ciphertext data part
>>> open("output_ecb.bmp", "wb").write( header + data ) # write header+data into output_ecb.bmp
```
Open output_ecb.bmp file from the file browser. It will look as follows:

Obviously, the ECB encrypted file roughly shows what the plaintext is -- not secure!

Cipher Block Chaining (CBC) Mode

Fig 2. Cipher Block Chaining (CBC) mode

To encrypt in this mode, a random initialization vector (IV) is chosen, which is set as the first cipher block \(C_0\). Then, ciphertext blocks are generated by applying the block cipher to the XOR of the current plaintext block and previous ciphertext block.

\( {\sf Gen} \): Choose a random key \( K \) uniformly at random from \( \{ 0, 1 \}^n \) .
\( {\sf Enc}_K(M_1, M_2, \ldots, M_m) \): Output \( (C_0, C_1, \ldots, C_m) \), where \( C_0 = IV \), where \( IV \) is chosen uniformly at random from \( \{ 0, 1 \}^\ell \) and for \( i = 1, \ldots, m:\) \[C_i = F_K(C_{i-1} \oplus M_i ). \]
\( {\sf Dec}_K(C_0, C_1, C_2, \ldots, C_m) \): Output \( (M_1, \ldots, M_m) \), where for \( i = 1, \ldots, m:\) \[M_i = F^{-1}_K(C_i) \oplus C_{i-1}. \]

Security of CBC Mode

If IV is chosen uniformly random, the encryption scheme with CBC mode is
indistinguishable under the chosen plaintext attack.

It is important to choose IV randomly. Otherwise, for example, if the same message is encrypted again in some future, the adversary will note it.

Example

Let's compare the CBC mode from the ECB mode:

Execute the following command to encrypt the file with the CBC mode:

openssl enc -aes-128-cbc -in pic_original.bmp -out output_cbc.bin -K 00112233445566778899aabbccddeeff -iv 000102030405060708090a0b0c0d0e0f

As we did for the ECB mode, copy the header of the bmp file into the output_cbc.bmp.
The file output_cbc.bmp look as follows:

Parallelization

The encryption of the CBC cannot be parallelized well, since block \(C_i\) affects block \(C_{i+1}\). However, the decryption of the CBC mode can be parallelized. Look at the diagram and think about how to parallelize the decryption.

Counter (CTR) Mode

Fig 3. Counter (CTR) mode

To encrypt in this mode, a uniform value (ctr) of length \( \ell \) (i.e., the block length) is first chosen. Then, a pseudorandom stream is generated from ctr.

\( {\sf Gen} \): Choose a random key \( K \) uniformly at random from \( \{ 0, 1 \}^n \) .
\( {\sf Enc}_K(M_1, M_2, \ldots, M_m) \):
1. Choose \( ctr \) uniformly random from \( \{ 0, 1 \}^\ell \).
2. Output \( (C_0, \ldots, C_m) \), where \( C_0 = ctr \) and for \( i = 1, \ldots, m:\) \[ C_i = F_K(ctr+i) \oplus M_i.\]
\( {\sf Dec}_K(C_0, C_1, C_2, \ldots, C_m) \):
1. Set \( ctr = C_0 \).
2. Output \( (M_1, \ldots, M_m) \), where for \( i = 1, \ldots, m:\) \[M_i = F_K(ctr+i) \oplus C_i.\]

Parallelization

In contrast to the CBC mode, the CTR mode has the advantage that encryption and decryption can be fully parallelized , since all the blocks of the pseudorandom stream can be computed independently of each other. It is also possible to decrypt the \(i\)th block of the ciphertext using only a single evaluation of \( F \). These feature makes CTR mode an attractive choice.

Security of CTR Mode

If ctr is chosen uniformly random, the encryption scheme with CTR mode is
indistinguishable under the chosen plaintext attack.

It is important to choose ctr randomly. If the same ctr is ever used again, the same pseudorandom stream will be generated!

Example

Execute the following command to encrypt the file with the CTR mode:

openssl enc -aes-128-ctr -in pic_original.bmp -out output_ctr.bin -K 00112233445566778899aabbccddeeff -iv 000102030405060708090a0b0c0d0e0f

As with the other modes, modify the header.
The output_ctr.bmp file will look as follows:

Padding

One issue we ignored so far is that the message length may not be a multiple of 16 bytes (128 bits) .

$ ls -alF *.bin *.bmp
-rw------- 1 choi choi 184976 Jun 20 03:16 output_cbc.bin
-rw------- 1 choi choi 184974 Jun 20 03:17 output_ctr.bin
-rw------- 1 choi choi 184976 Jun 20 02:10 output_ecb.bin
-rwxrwxrwx 1 choi choi 184974 Jan  5  2016 pic_original.bmp

Since AES only takes a block, if the message is not the case, we sometimes need to pad it so that the total length is a multiple of 16. Note also that the IV or ctr is not stored in the ciphertext file.

ECB and CBC needs padding. Since the size of pic_original.bmp is 184974 = 11560*16 + 14. You need pad two bytes to make the remaining 14 bytes to 16 bytes. So, the output ciphertext will have 184976 bytes.
The reason that padding is necessary is because \(F\) is a block cipher; the input always has to be a block. (See the picture and understand what this means).
CTR mode don't need padding. This is because the input to \(F\) is ctr+i, which has nothing to do with the ciphertext or the plaintext.
The output \(F\) can be easily trimmed to fit into the plaintext/ciphertext length. (See the picture and understand what this means).

Comparison

In summary, the following table compares the modes.

	padding	IV or ctr	randomized encryption	parallel encryption	parallel decryption	IND-CPA	IND-CCA
ECB	yes	no	no	yes	yes	no	no
CBC	yes	yes	yes	no	yes	yes	no
CTR	no	yes	yes	yes	yes	yes	no