Data scientists work with a lot of data! Want to know how to store and manage large amounts of data, and how to make
sense of the ever-increasing amount of information?
This course provides students with an introduction to data storage methods and
systems. We will focus on how to build and employ relational
databases and how to write applications that use data from databases.
We'll also discuss different types of hardware and software used for efficiently
storing and managing large amounts of data.
The course is divided into several modules:
After the course, you should be able to: explain the main advantages of modern database management systems over file systems; design and create relational databases that satisfy user requirements; write SQL queries for data loading, data cleaning, data exploration, and data analysis; interact with a relational database from Python and R; and explain the main data storage hardware systems and their cost-benefit tradeoffs.
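As a small illustration of what interacting with a relational database from Python can look like, here is a minimal sketch. It assumes SQLite through Python's built-in sqlite3 module, which may not be the database system used in the course, and the scores table, its columns, and the sample rows are all hypothetical. It loads a few rows, removes rows with a missing score (data cleaning), and computes an average per lab (data analysis).

```python
import sqlite3


def average_score_by_lab(rows):
    """Load (student, lab, score) rows into an in-memory SQLite database
    and return a list of (lab, average score) pairs ordered by lab name."""
    conn = sqlite3.connect(":memory:")  # temporary database, nothing written to disk
    try:
        # Hypothetical table, made up for this illustration
        conn.execute(
            "CREATE TABLE scores (student TEXT NOT NULL, lab TEXT NOT NULL, score REAL)"
        )
        # Data loading: parameterized inserts, one per input row
        conn.executemany("INSERT INTO scores VALUES (?, ?, ?)", rows)
        # Data cleaning: drop rows where the score is missing
        conn.execute("DELETE FROM scores WHERE score IS NULL")
        # Data analysis: average score for each lab
        cursor = conn.execute(
            "SELECT lab, AVG(score) FROM scores GROUP BY lab ORDER BY lab"
        )
        return cursor.fetchall()
    finally:
        conn.close()


# Made-up sample data; None stands for a missing score
sample_rows = [
    ("alice", "lab1", 90.0),
    ("bob", "lab1", 80.0),
    ("alice", "lab2", None),
    ("bob", "lab2", 70.0),
]

print(average_score_by_lab(sample_rows))
```

The same three steps (load, clean, analyze) apply with other database systems; mainly the connection call and the SQL dialect would change.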
Expect weekly labs and quizzes, plus one large group project that culminates in a final presentation at the end of the semester. Take a look at the course policy to understand the course expectations and how each grading period is graded.
Prof. Adina Crainiceanu (course coordinator)
adina at usna.edu
Hopper 442
Mr. Jeff Kenney
kenney at usna.edu
Hopper 456
Prof. Gavin Taylor
taylor at usna.edu
Hopper 455