BRAND: Introduction to Cryptograpy and Symmetric Encryption

Learning Outcomes

After completing these activities, you should be able to:

Define cryptography and identify the types of cryptography we discuss in this class.
Define symmetric encryption and be able to discuss what it is used for.
Describe how symmetric encryption is used in the context of the Cyber Pillars discussed earlier in this course.
Be able to decrypt ciphertext given an example that was encrypted using the Caesar Cipher.
Be able to decrypt ciphertext given an example that was encrypted using the Vigenere Cipher.
Be able to understand, describe, and execute a real-world example of symmetric encryption, Advanced Encryption Standard (AES).

In this section we will learn how cryptography can be used to defend our networks and secure our data. In an earlier lesson we learned about the tenets of cybersecurity (AKA CIA-triad), to include, confidentiality, integrity, and availability. Over this and the next several lessons we will learn about several cryptographic techniques that can be used to be used to support the cybersecurity tenets of confidentiality and integrity: symmetric encryption and asymmetric encryption. We will learn through the use of simple examples and then see encryption in action through the use of real-world tools. Finally, we will discuss and explore several attacks on these techniques.

What is Cryptography

Cryptography is the study and practice of hiding information. Cryptography has been around for a long time - Julius Caesar is known to have used it. In fact, Cryptos comes from the Greek word hidden. Today, cryptography is a core component of how we secure modern communication networks and the data critical to our daily lives. To make this happen mathematicians work tirelessly to craft new cryptographic algorithms and ensure that the current algorithms are still secure. At this very moment there are mathematicians dealing with the new challenges of quantum computers and other technological advances. In this course we will cover just a small fraction of the encryption algorithms that are in use today. The two fundamental areas of cryptography that we'll look at in this course include:

Encryption - With encryption, the idea is to scramble a message in such a way that only the intended recipient of the message can unscramble it, so that only the two of you know the message. The fundamental areas of cryptography involve two encryption methods. A method of encryption is called a cipher. A cipher is a method of transforming a message to conceal its original meaning. It should be immediately obvious to you that encryption is used to help provide confidentiality, what we'll see later is that it also helps with the other tenets of cybersecurity. The fundamental areas of cryptography are:
- Symmetric Encryption is also known as secret-key encryption, for reasons that we will learn shortly, is one of the oldest and simplest forms of encryption. Symmetric means the same, so when we talk about symmetric encryption, we are talking about using the SAME key for both encrypting and decrypting information. It is for this reason that we must keep the key SECRET. This key can be a word, or a string of random letters, and is applied to the information (message) in a particular way (algorithm or CIPHER). One of the biggest issues with symmetric encryption is how to distribute the SECRET key to everyone you want to communicate with.
- Asymmetric Encryption, also known as public-key cryptography, allows us to establish secure communications even when we have no opportunity to agree on a secret key ahead of time. Public-key cryptography does this by creating a pair of keys with unique properties. This key pair consists of a public key and a private key. The public key is meant to be shared and used for encryption, while the private key is meant to be kept SECRET and be used for decryption. If you encrypt a message with the key-pair public key, it can only be decrypted by using the key-pair's private key and vice-versa.

Symmetric Encryption:

key distribution

Simple Symmetric Encryption Example: Caesar Cipher

Caesar Shift Cipher

plaintext

ciphertext

encrypting

decrypting

The Caesar Shift Cipher assumes your message is all capital letters, and replaces each letter in the plaintext with a new letter to produce the ciphertext. The replacement scheme is based on a secret key that Alice and Bob have agreed upon ahead of time — a number in the range 1-25 called the shift value. The replacement scheme is simple: if the shift value is s, the kth letter in the alphabet is replaced by letter k+s in the alphabet, circling back around to the front of the alphabet if necessary. So with a shift value of 3, the letter B (the 2nd letter in the alphabet) is replaced with the letter E (the letter number 2+3 = 5 in the alphabet). You can use the little applet below to help encrypt a message once you've chosen a shift value. The applet should also more clearly show you how the shift has the effect of mapping the original plaintext value to a new letter which when complete will become the new ciphertext value.

Important: When encrypting you ADD the shift value and when decrypting you SUBTRACT the shift value. You might also notice that a shift value of 26 is equivalent to a shift value of 0. Let's follow this process through from start to finish:

At some earlier time, Alice and Bob agree to a secret key/shift-value k = 11.
Alice decides to send Bob the secret message "MEET ME AT NOON", i.e. plaintext = MEET ME AT NOON.
Alice encrypts the plaintext (using something like the table above) with key k=11 to get: ciphertext = XPPE XP LE YZZY.
Alice sends Bob the ciphertext.
Eve manages to read the message in transit, but since she reads the ciphertext, XPPE XP LE YZZY, she can't make sense of it. Not knowing the key, she can't decrypt it to recover the plaintext.
Bob receives the ciphertext and decrypts it using something like the table above, in reverse, with key k=11, recovering the plaintext MEET ME AT NOON.

Symmetric Encryption Diagram — https://www.ssl2buy.com/wiki/symmetric-vs-asymmetric-encryption-what-are-differences

cryptosystem

decryption

symmetric encryption

the secret key is the shift value

Encryption Key Management

Much of military communications are encrypted today for obvious reasons. What is not so obvious is how the Navy and Marine Corps manage all of the encryption keys used for encrypted communications. The system used throughout the military is called the Electronic Key Management System (EKMS) and is centrally controlled by the National Security Agency (NSA). EKMS is in place to provide communications security (COMSEC) material (i.e. encryption keys) and support tools for tracking and managing encryption key material, generation, distribution, and accounting.

Sound like an important job? It is and you might be the one doing it at your command as a junior officer. Every Naval or Marine unit that uses secure communications has at least two EKMS managers and it is common practice to have a junior officer act as one of them.

Cryptanalysis

Brute Force Attack: This involves trying to use every possible key with the aim of discovering the secret key in use. This is obviously a very time consuming process and in most cases not a practical technique. Is this a useful technique to attack the Caesar cipher? The answer is yes ... you only have to try 25 keys which shouldn't take too long.
Frequency Analysis: This involves analyzing the frequency of letters used in a given ciphertext. The most common ciphertext letters are assumed to be directly related to the most common plaintext letters in the language being attacked. We will give you an in-depth look at using frequency analysis to target the Caesar cipher below.
Known Plaintext Attack: In this attack the cryptanalyst has access to both the ciphertext and either some or all of the plaintext. The more ciphertext and plaintext pairs the cryptanalyst has access to will increase their chances of success. Success is defined by recovering any information previously unknown about the encryption key and the encryption scheme in use.
Chosen Plaintext Attack (CPA) and Chosen Ciphertext Attack (CCA): These both involve the cryptanalyst having the ability to interact with the encryption device or system. In this case the cryptanalyst can provide the system with plaintext or ciphertext and look at the resulting output.

Frequency Analysis Example — breaking the Caesar cipher

cryptographic attack

Let's suppose you are Eve, and you've intercepted the message(ciphertext) XPPE XP LE YZZY. There are more P's than anything else, so you might guess (correctly in this case) that a P in the ciphertext came from an E in the plaintext. This would lead you to guess the key/shift-value k = 11.

It's not always going to be that easy of course. The ciphertext RNCP KU QHH has more H's than anything else. If we assume that H's in the ciphertext came from E's in the plaintext, we would deduce a key/shift-value of 3. Decrypting assuming k = 3 gives OKZM HR NEE... which is probably not the secret message. In fact, the plaintext that produced this message was PLAN IS OFF.

The problem with this approach is that we only considered one letter — the most common appearing in the ciphertext. Assuming H's came from E's gave us lots of E's in our "cracked" message, but it also gave us Z's and K's, which are pretty uncommon. To do frequency analysis properly, we should consider all the letters in the message. This is tedious, of course, but when something is tedious, it just means that we ought to write a program and let the computer do it for us. Try out this page which features a JavaScript program for cracking Caesar shift encryption via frequency analysis. It functions by calculating for each shift value the likelihood of that shift value being correct based on the frequencies of the letters that result from decrypting the given ciphertext with that shift value. It's very interesting to see how few characters of ciphertext are required to recover the key with a high degree of certainty.

So we see that the Caesar Shift Cipher is not very secure. In particular, it's quite vulnerable to attack via frequency analysis. Its problems are a) there are only 26 key values, so trying them all is a viable option, and b) since a given character in the plaintext is always replaced with the same character in the ciphertext, letter frequencies carry over from plaintext to ciphertext.

More Sophisticated Symmetric Encryption: The Vigenere Cipher

The key is a string of letters like JOE. To encrypt, you take your plaintext (we'll reuse MEET ME AT NOON) and write it down. Then you write down the key string over the plaintext, with letters matching up. If the plaintext is longer than the key, you simply repeat the key. Like this:

JOEJ OE JO EJOE ← key (repeated as needed)
MEET ME AT NOON ← plaintext

table

this demo

Think about how the Vigenere Cipher addresses the flaws in the Caesar Shift. The key is a string of characters, and since there are roughly 6 trillion strings of length less than 10, for instance, the problem of too few keys has been addressed. The same letter at different positions in the plaintext generally does not get mapped to the same character in the ciphertext, since the key-character written above plays a role in the encryption. So letter frequencies in the plaintext do not get carried over to the ciphertext.

Frequency Analysis Attacks on the Vigenere Cipher & the One Time Pad

The Venona Project: Poor Practice Defeats Perfect Security
One-time pads provide provably perfect security ... but at a price. Managing keys is really difficult! After all, you have to have as many bytes of key as you have bytes of plaintext to communicate. During WWII, the British and US intercepted a large amount of Soviet Russian communication that was encrypted with one-time pad encryption. However, cryptanalysis revealed that some of the one-time pad keys had been reused ... which is the big no-no with one-time pad encryption. This misuse of the system allowed small parts of the communication to be decrypted. NSA's effort to exploit this misuse of one-time pad keys to decrypt as much as possible of the traffic was code-named VENONA. Over the years (Venona lasted until 1980), this code-breaking effort revealed Soviet espionage campaigns and spies at places like Los Alamos National Labs, the State Department and the White House. It identified the Rosenbergs and Alger Hiss as spies.

With any cryptographic protocol, even a small deviation from the protocol can compromise security. This fact should be a major take-away from the story of the Venona project!

Cyber Pigeons?
Before there was the Internet, there were pigeons. In late 2012, a British man found a dead carrier pigeon in his chimney. Turns out it was from WWII and it carried an encrypted message tied to its leg. CNN has a nice story, UK spies unable to crack coded message from WWII carrier pigeon, about it and the fact that nobody's been able to decrypt the message. Turns out, the sender used a one-time pad.

Finding the key length can be a problem, but one easy way given what we already know is this: for each possible key length n, form the string consisting of every nth character starting from the first, give that as a ciphertext input to our Caesar Shift Frequency Analysis page, and make a note of the probability of the shift index it gave you for that n. Whichever n value gave us the highest score is probably the actual length of the key. In class, we will actually have performed this exercise.

This kind of attack requires enough text that our Caesar Shift frequency analysis of every nth character finds the proper shift index with high probability. If the message length is L, and we assume we need about 20 characters to be assured of having a high probability with our Caesar Shift frequency analysis, we'd like to have L/n > 20. If L is short or n is long, our attack will fail. So, in general, a longer key gives you more security from frequency analysis. If you have a key that is a completely random sequence of letters, and which is as long or longer than the message, the Vigenere Cipher is unbreakable — provided you never use the key again. In this situation, the system becomes what is called a one-time pad. The problem with such a system is that arranging to have this one huge key is difficult.

Chosen Plaintext Attack & the Vigenere Cipher

A kind of chosen plaintext attack was done by the US during WWII. We knew a Japanese attack was imminent because we had cracked a code, but we didn't know whether the string designating the target was referring to Hawaii or Midway. So we leaked a story about a water shortage on Midway, and discovered that same symbol in a message that was, we were sure, relaying that leaked information.

The thing about the Vigenere and Caesar Shift Ciphers is that there are three strings — the key, the plaintext and the ciphertext — and knowing any two is enough to get the third. As an attacker, i.e. as "Eve", you know the ciphertext. If you haven't got that, you've got nothing. What if you knew the message? What if you could induce Alice to send a specific message to Bob? Or at least a message that contains some text you know. For instance, if Bob is a spy, and I leak to him that I'm planning an attack on Albuquerque, I can guess that the next message he sends will contain the word "Albuquerque" somewhere. So, suppose Bob goes and sends the message:

JZFDEYNFUDS MB KLNFI CVIH KMUZ ECHELY

???????????
ALBUQUERQUE
↓↓↓↓↓↓↓↓↓↓↓
JZFDEYNFUDS

Real-World Symmetric (Secret Key) Encryption: AES (Advanced Encryption Standard)

A stick figure guide to AES
Check out the stick figure guide to AES which tells you as much or as little as you want to know about AES, how it came to be and how it works. Plus, it's pretty funny.

Encrypting Bitcoin

This Wired article is about a man’s quest to decrypt an encrypted zip file with $300,000 worth of bitcoin. This file was not encrypted with AES but a somewhat outdated encryption standard and was ultimately able to be decrypted.

block cipher

Usually, AES is not a stand-alone tool. Instead, it is embedded in systems that use it for security. For example, the archiving tool PKZIP uses it to provide the option of creating a password protected .zip file. Windows offers BitLocker, which allows you to encrypt your hard disk. That way, if your computer, or disk, or backups are stolen, your data is still secure. Bitlocker uses AES. SSL traffic may use AES. IPSec, which is a standard for secure IP (Internet Protocol) traffic uses AES. Suffice it to say, there are many more examples.

Instructor AES Demonstration

In this activity your Professors will demonstrate how to encrypt and decrypt a file using AES (with CBC mode). Note an in depth understanding of the different modes used with AES are beyond the scope of this course, but it is worth noting that not all implementations of AES are equally secure.

Generate a 256-bit key

$ openssl enc -aes-256-cbc -md sha256 -pbkdf2 -iter 100000 -P -k MySecretWord 

salt=0C8E8E821AB80FAB
key=AF1F7C30DCDF8B5691817EE5B1A3E11CB77B61CEF993772D73C2A55E2CABDCF6
iv =EE286E2CD7D56FE55874EC6E1BAA159C

Encrypt

$ openssl enc -aes-256-cbc -pass pass:MySecretWord -p -in WhatIWantToEncrypt.txt -out TheNameOfMyEncryptedOutput.txt

salt=DF60C340D63467F1
key=D3325D264A3C61913A3D575CB8179675298D94929FE982A6FCAF872D8EFFCF25
iv =D59C03D3C4D0C42BD4983B06CA34B35B

Decrypt

$ openssl enc -d -aes-256-cbc -pass pass:MySecretWord -p -in TheNameOfMyEncryptedOutput.txt -out DecryptedFile.txt

I Love SY110!!!

Student AES Activity

shell

aes

After running the commands think back to your Professor's demo - what is different about the encrypted output? Is this version of AES secure? What might be some potential issue(s) with your version of AES?

Generate a 128-bit key

$ aes -k
adccbb42e647d10370992205ddc268e6

Encrypt

$ aes -e adccbb42e647d10370992205ddc268e6 "Who's afraid of the big bad wolf?"
d31cfef5689d1cf37b725934c8d314f9e57a26bbbeb528c0364ab4edd0819b70829cc4f22add97df1a866a4ef176c2c4

Decrypt

$ aes -d adccbb42e647d10370992205ddc268e6 d31cfef5689d1cf37b725934c8d314f9e57a26bbbeb528c0364ab4edd0819b70829cc4f22add97df1a866a4ef176c2c4
Who's afraid of the big bad wolf?

Identify

aes

aes -h

References

Schneier, Bruce. 1996. "Applied Cryptography : Protocols, Algorithms, and Source Code in C". Wiley

Singh, Simon. 1999. "The Code Book: The Science of Secrecy from ancient Egypt to Quantum Cryptograph". New York, Anchor Books

Dworkin, M. , Barker, E. , Nechvatal, J. , Foti, J. , Bassham, L. , Roback, E. and Dray, J. 2001. "Federal Inf. Process. Stds. (NIST FIPS)-197, National Institute of Standards and Technology". Gaithersburg, Maryland, NIST Pubs

Committee on National Security Systems. 2015. Committee on National Security Systems (CNSS) Glossary. Ft. Meade, MD: April 6, 2015.

Martinez, Michael. 2012. "UK Spies Unable to Crack Coded Message from WWII Carrier Pigeon" CNN, November 24, 2012.
https://www.cnn.com/2012/11/23/world/europe/uk-wwii-pigeon-mystery/index.html

National Security Agency. VENONA Washington: National Security Agency and Central Security Service.
https://www.nsa.gov/news-features/declassified-documents/venona/

U.S. Code. 2012. Title 44 - Public Printing and Documents Section 3542. Washington: Government Printing Office, January 3, 2012.

Introduction to Cryptography and Symmetric Encryption