Databases and Advanced Data Techniques (CM3010)
This module aims to show you how to work with data in your computer programs. You will learn how to use SQL and NoSQL databases to store tabular data and documents. You will learn about the ethics of gathering and processing data and why it is important to consider issues around data security. You will learn about open data resources, and how you can access them from your computer programs. You will learn about audio and video data, and the challenges of working with this kind of data.
Professor(s)
- Dr. David Lewis
Topics covered
- Open data sources: different data formats
- Gathering and cleaning data
- The ethics of working with data
- SQL: advanced data representation
- SQL: advanced queries and linking
- NoSQL and document stores
- NoSQL and data pipelines
- Speeding up queries with indexes
- Audio and video data
- Data security
Assessment
One two-hour unseen written examination and coursework (Type I)
Module specification
Past exams
Syllabus
Resources
Notes
Reading list
Reading list for topics 1 to 5.
Document/JSON
- Introduction to MongoDB. - MongoDB University
- MongoDB Data Modeling Path. - MongoDB University
- Introduction to MongoDB. - JetBrains Academy on Hyperskill
Semantic Web
- Linked Data Engineering (Semantic Web) OpenHPI. - Prof. Dr. Harald Sack
- Working with RDF Graphs in Graph Studio. - Oracle LiveLabs
Relational databases and SQL
- CS 6400: Database Systems Concepts and Design. - Georgia Tech
- Enroll for free here: https://edstem.org/us/join/uWdxj3
- The Art of PostgreSQL. - “The book that teaches SQL to developers: Learn to replace thousands of lines of code with simple queries!”
- SQL for Backend Developers. - JetBrains Academy on Hyperskill
XML
- Databases: Semistructured Data. - Stanford Online