"Cyber Space" is a name for this global information system consisting of many familiar pieces — like web sites, distributed video games, email — and many less familiar pieces as well, that work behind the scenes. All of these systems do little more than process, store and retrieve digital data. So to even begin to understand Cyber Space, we need to get a handle on what "digital data" is all about. That's what this lesson will do.

Submarine Radio Communications

Ballistic missile submarines remain undetected beneath the ocean's surface, awaiting the order to launch their missiles. The launch order must be sent via radio transmission, but seawater blocks the high-frequency radio waves typically used with satellites or for long-range radio. For submarines, very low frequency (VLF) radio waves (3-30 kHz) must be used to penetrate the ocean and reach the submarine's VLF antenna.

Communicating with submarines while they are completely submerged comes at a cost. VLF radio waves have a severely limited capacity for carrying data: VLF data transmission rates are around 300 bps. Compare that with a data transmission rate of 10 Mbps for a 4G wireless phone. Your smartphone is about 33,000 times faster than VLF! In other words, it would take 2 hours and 47 minutes to download one MP3 song using submarine VLF communications, whereas it would take only 0.3 seconds using your 4G phone.
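
The arithmetic behind that comparison can be checked with a few lines of Python. This is only a sketch: the two data rates come from the text, but the song size (3,000,000 bits) is an assumption, back-computed from the "0.3 seconds at 10 Mbps" figure. The function names are made up for illustration.

```python
VLF_BPS = 300             # VLF rate quoted in the text, in bits per second
PHONE_BPS = 10_000_000    # 4G rate quoted in the text (10 Mbps)
SONG_BITS = 3_000_000     # ASSUMPTION: song size implied by 0.3 s at 10 Mbps


def transfer_seconds(size_bits, rate_bps):
    """Time in seconds to move size_bits at rate_bps bits per second."""
    return size_bits / rate_bps


def as_hms(seconds):
    """Split a number of seconds into (hours, minutes, seconds)."""
    seconds = round(seconds)
    hours, rest = divmod(seconds, 3600)
    minutes, secs = divmod(rest, 60)
    return hours, minutes, secs


speedup = PHONE_BPS / VLF_BPS
print(f"4G is about {speedup:,.0f} times faster than VLF")
print("VLF download (h, m, s):", as_hms(transfer_seconds(SONG_BITS, VLF_BPS)))
print("4G download in seconds:", transfer_seconds(SONG_BITS, PHONE_BPS))
```

Under that assumed song size, the VLF download comes out to 2 hours, 46 minutes and 40 seconds, which rounds to the 2 hours and 47 minutes quoted above.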

(Image courtesy of Jim Hawkins)
This is a picture of the VLF antenna array that used to be at Greenbury Point. The three small antennas you see today are all that's left. The rest were pulled down in the late 1990s.

Bits and Bytes

A computer is typically capable of storing and processing an immense number of bits and bytes. So we often speak of kilo, mega, giga and tera bytes or bits. What do those mean? Normally kilo means thousand, mega means million, giga means billion, and tera means trillion, and that's approximately true in the context of digital data, but not exactly. In the context of digital data:
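
As a rough illustration of "approximately true, but not exactly", the sketch below assumes the usual computing convention in which these prefixes stand for powers of two (kilo = 2^10, mega = 2^20, giga = 2^30, tera = 2^40) and compares each with the everyday power of ten. The list and function names are made up for this example.

```python
# (prefix, value under the powers-of-two convention, everyday decimal value)
PREFIXES = [
    ("kilo", 2**10, 10**3),
    ("mega", 2**20, 10**6),
    ("giga", 2**30, 10**9),
    ("tera", 2**40, 10**12),
]


def percent_over(binary, decimal):
    """How much larger (in percent) the binary value is than the decimal one."""
    return 100 * (binary - decimal) / decimal


for name, binary, decimal in PREFIXES:
    print(f"{name}: {binary:,} vs {decimal:,} "
          f"({percent_over(binary, decimal):.1f}% larger)")
```

The gap grows with each step: a "kilo" is off by a couple of percent, while a "tera" is off by almost ten percent.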

Binary Numbers

In fact, it's easy to compute how many bits you need to represent a number of a specific size. With k bits, you can represent any number from 0 up to and including 2^k - 1. To represent a positive integer N, you need 1 + ⌊log₂N⌋ bits, where ⌊log₂N⌋ means log₂N rounded down to a whole number. In a byte, i.e. eight bits, we can represent numbers up to 2⁸ - 1 = 256 - 1 = 255.
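
Here is a small Python sketch of that rule. It leans on Python's built-in int.bit_length(), and compares it against "one plus log base 2, rounded down" for small values. The function names bits_needed and max_value are just illustrative.

```python
import math


def bits_needed(n):
    """Number of bits needed to write the positive integer n in binary."""
    if n <= 0:
        raise ValueError("n must be a positive integer")
    return n.bit_length()


def max_value(k):
    """Largest number representable with k bits."""
    return 2**k - 1


for n in (1, 5, 255, 256, 1000):
    by_formula = 1 + math.floor(math.log2(n))
    print(n, bin(n), bits_needed(n), by_formula)

print("largest value in one byte:", max_value(8))
```

Notice that 255 fits in 8 bits but 256 needs 9, which is exactly the boundary the formula predicts.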

Hexadecimal

ASCII Encoding and Plain Text

You can actually enter ASCII values into the address bar in your browser, although you have to write them in hexadecimal notation rather than decimal or binary. (Hexadecimal is a base 16 (rather than 10 or 2) number system, whose digits are 0,1,2,3,4,5,6,7,8,9,a,b,c,d,e,f.) For example, c has ASCII value 99, which is 63 in hex, so a c can be written in the address bar as %63. Thus, entering %63nn.com in your browser's address bar gets you to cnn.com! BTW: Firefox might have this turned off by default, since there are actually security implications with this.
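
The %63 trick can be tried outside the browser too. The sketch below uses unquote from Python's standard urllib.parse module to decode the percent notation; the helper percent_encode_char is a made-up name for this example, and how any particular browser treats such input is not something this code demonstrates.

```python
from urllib.parse import unquote


def percent_encode_char(ch):
    """Write one character as % followed by its ASCII value in two hex digits."""
    return "%{:02x}".format(ord(ch))


print(ord("c"))                  # ASCII value of c in decimal
print(format(ord("c"), "x"))     # the same value in hexadecimal

encoded = percent_encode_char("c") + "nn.com"
print(encoded)
print(unquote(encoded))
```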

A sequence of characters is called a string, and what we've just seen is that ASCII gives us a way to encode strings as sequences of bits (or, if you prefer, bytes).
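
To make that concrete, here is a minimal Python sketch that turns a string into its ASCII bits and back again. The function names to_bits and from_bits are illustrative, and the sketch assumes the string contains only ASCII characters.

```python
def to_bits(text):
    """Encode an ASCII string as 8-bit patterns, one per character."""
    return " ".join(format(byte, "08b") for byte in text.encode("ascii"))


def from_bits(bits):
    """Decode space-separated 8-bit patterns back into an ASCII string."""
    return bytes(int(chunk, 2) for chunk in bits.split()).decode("ascii")


bits = to_bits("cnn")
print(bits)
print(from_bits(bits))
```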

Files

The rules that define how the bytes of a particular file are supposed to be interpreted are called a file format. We described the format of a plain text file in the previous paragraph. You might have heard of .jpg files (JPEG files). JPEG is a file format for images, and any file whose bytes conform to the JPEG rules can be viewed as an image with the proper program. So to use a file, you usually need to know what kind it is (i.e. what format it follows) and what program(s) to use to operate on that kind of file. Here are some common formats:

One of the most important file types is one you might not have thought of: a program. A program is a regular old file whose bytes can be interpreted by the physical computer as instructions to be executed.

The Role of File Extensions

Windows by default hides the file extension from you. For this course (and life) you really want to see this information. Follow these instructions to turn off the extension hiding, so you actually know the true names of files.

Under the Windows Start button, click on Computer.
Under the 'Organize' menu, select 'Folder and search options'.
Click on the 'View' tab.
Uncheck the box for 'Hide extensions for known file types'.
Click 'Apply' at the bottom of the window. Done!

Why is hiding file extensions bad?
Well, aside from making it hard for you to know what a file's real name is, hidden file extensions can be used to trick people. Check out #1 on this list of ways to trick users into executing malicious programs.

Here's a common example of playing games with file extensions. The mail server here at USNA won't let you send a zip file. Any .zip attachment just mysteriously disappears. In fact, the server only looks at the file name, not at the bytes that make up the file. So you can simply rename the file, say changing foo.zip into foo.piz, and then attach it. The file will be sent, no problem, and the recipient merely needs to change the extension back to .zip when saving it. So: don't believe what file extensions tell you!
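
The difference between trusting the name and looking at the bytes can be sketched in Python. Everything here is illustrative: blocked_by_name is a hypothetical stand-in for a filter that checks only the file name (it is not the actual mail server's logic), while looks_like_zip uses the standard zipfile.is_zipfile function to inspect the contents.

```python
import io
import zipfile


def make_zip_bytes():
    """Build a tiny zip archive in memory and return its bytes."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("hello.txt", "hello")
    return buffer.getvalue()


def blocked_by_name(filename):
    """Hypothetical filter that looks only at the extension in the name."""
    return filename.lower().endswith(".zip")


def looks_like_zip(data):
    """Content check: do these bytes have the structure of a zip archive?"""
    return zipfile.is_zipfile(io.BytesIO(data))


data = make_zip_bytes()
for name in ("foo.zip", "foo.piz"):
    print(name, "blocked by name:", blocked_by_name(name),
          "| really a zip:", looks_like_zip(data))
```

Renaming changes what the name-based filter says, but the bytes, and therefore the content check, stay exactly the same.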

How many bytes does it take ...

to store the complete works of Shakespeare in plain text? 5,590,193 bytes
to store the complete Harry Potter series in plain text? 6,272,550 bytes
to store Beethoven's 9th Symphony? 62,948,072 bytes
to store a two-hour long High-Def movie? 4,294,967,296 bytes
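
To put those byte counts in more familiar units, here is a small Python sketch. It assumes the powers-of-two convention (1 KB = 1024 bytes, 1 MB = 1024 KB, and so on); the function name human and the dictionary keys are made up for this example.

```python
SIZES = {
    "Complete works of Shakespeare": 5_590_193,
    "Complete Harry Potter series": 6_272_550,
    "Beethoven's 9th Symphony": 62_948_072,
    "Two-hour High-Def movie": 4_294_967_296,
}


def human(num_bytes):
    """Express a byte count in the largest unit that keeps it below 1024."""
    value = num_bytes
    for unit in ("bytes", "KB", "MB", "GB"):
        if value < 1024:
            return f"{value:.1f} {unit}"
        value /= 1024
    return f"{value:.1f} TB"


for title, size in SIZES.items():
    print(f"{title}: {size:,} bytes is about {human(size)}")
```

The movie figure is exactly 2^32 bytes, which is why it comes out as a round 4 GB under this convention.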

File headers

"GIFAR" Files

You might ask: "what's the point?" Java jar files can instruct the Java program to do seriously bad things to your computer; they can really be evil. JPEG image files, on the other hand, are pretty benign. So websites that allow users to post content will often allow JPEG image files to be posted, but definitely not Java jar files. What the bad guys figured out is that by posting a gifar, they could post files that the websites thought were innocuous JPEG image files (and so would allow to be posted), but which were also malicious Java jar files.
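
How can one file pass as two formats at once? The Python sketch below is a harmless illustration under some stated assumptions: jar files use the zip archive format, zip readers locate the archive's index at the end of the file, and image formats are recognized by the bytes at the start. The "image" part here is just the JPEG starting bytes followed by padding, not a viewable picture, and the payload is an ordinary text file. All names are made up for the example.

```python
import io
import zipfile

JPEG_MAGIC = b"\xff\xd8\xff"   # bytes that JPEG files begin with


def make_archive():
    """Build a tiny zip archive (the format jar files use) in memory."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("note.txt", "harmless payload")
    return buffer.getvalue()


def make_polyglot():
    """Placeholder image-like bytes first, then the archive bytes after them."""
    fake_image_part = JPEG_MAGIC + b"\xe0" + b"\x00" * 32   # NOT a real image
    return fake_image_part + make_archive()


data = make_polyglot()
print("starts like a JPEG:", data.startswith(JPEG_MAGIC))
print("readable as a zip: ", zipfile.is_zipfile(io.BytesIO(data)))
with zipfile.ZipFile(io.BytesIO(data)) as archive:
    print("archive contents:  ", archive.read("note.txt"))
```

A check that looks only at the first few bytes sees an image, while a program that reads the file as an archive finds the archive. That is the kind of double life the text describes.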

hex digit	0	1	2	3	4	5	6	7	8	9	a	b	c	d	e	f
4-bit pattern	0000	0001	0010	0011	0100	0101	0110	0111	1000	1001	1010	1011	1100	1101	1110	1111
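
The table above can be reproduced, and used for conversions, with a short Python sketch. Each hex digit stands for exactly four bits, so converting is a digit-by-digit lookup. The function names hex_to_bits and bits_to_hex are illustrative.

```python
HEX_DIGITS = "0123456789abcdef"


def hex_to_bits(hex_string):
    """Replace each hex digit with its 4-bit pattern."""
    return " ".join(format(int(digit, 16), "04b") for digit in hex_string)


def bits_to_hex(bits):
    """Replace each space-separated 4-bit pattern with its hex digit."""
    return "".join(HEX_DIGITS[int(group, 2)] for group in bits.split())


# Regenerate the table: one row per hex digit.
for digit in HEX_DIGITS:
    print(digit, hex_to_bits(digit))

print(hex_to_bits("63"))         # the hex value 63 as two 4-bit patterns
print(bits_to_hex("0110 0011"))  # and back again
```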

Videos showing how to convert from binary to decimal and back