We will have teams of 2 students each. Each team will have 50 minutes to present its topic.

Suggested topics for team presentations (others are possible; talk to me if you have suggestions):
- data mining - more algorithms, or more in-depth algorithms/analysis
- information retrieval (suggested reading: Sections 27.1-27.3 in the Ramakrishnan and Gehrke textbook)
- Google File System, HDFS (suggested reading: GFS paper - http://labs.google.com/papers/gfs.html, Chapter 3 - Tom White book)
- BigTable, HBase, more on NoSQL stores (suggested reading: BigTable paper - http://labs.google.com/papers/bigtable.html, Chapter 13 - Tom White book)
- Pig (suggested reading: Chapter 11 - Tom White book)
- Hive (suggested reading: Chapter 12 - Tom White book)
- other cloud computing topics (but not the same thing as in advanced web)
- file sharing and BitTorrent
- object-oriented databases
- object-relational databases
- RDF and SPARQL
- security in databases (either something interesting for centralized systems, or general security in distributed databases)