This module provides an introduction to the management and analysis of big data, focusing specifically on the analysis of genome sequence data. Lectures first introduce relevant data types and data handling skills. They subsequently cover the bioinformatics methods, algorithms and resources used for tasks such as read cleaning, genome assembly, gene finding, variant calling and population genomics, as well as the caveats of such analyses and the quality control approaches used to address them. Practical exercises provide hands-on experience of the Unix command line and high-performance computing, and of using these technologies to run computationally intensive genome analysis tools.

