• Genome Data Science

    We develop methods and tools to work with tens of thousands of genomes and analyze and integrate the corresponding data.

    Model of DNA double helix in front of a student.
    © Universität Bielefeld

Learning in Big Data Analytics


392232 Schönhuth Winter 2021/22 Wed 16-18 ONLINE (S) in Zoom

Contents

The recent surge of machine learning (ML) has opened up various opportunities when analyzing big datasets. Beyond basic, non-ML supported techniques of big data analytics, such as identifying similar items in big datasets, or arranging how to distribute jobs on large compute clusters, for example, the ML supported techniques enable to extract knowledge from large datasets at utmost diversity and accuracy.

The seminar will start with a mini lecture. First, lectures will explain how to cluster datasets. Clustering is an 'unsupervised' machine learning technique by which to mine social network graphs, for example. Second, 'supervised' machine learning techniques (where 'deep learning' likely is the most prominent recent technique) and their use in analyzing big data will be discussed. The mini lecture will be followed by seminar presentations, to be presented in small groups of 1-2 students.

LernraumPlus
How to write Scientific

Time table

Date Topic
20.10.2021 no seminar
27.10.2021 no seminar
03.11.2021 Introduction / How to present (Video)
10.11.2021 Machine Learning (Video)
17.11.2021 SVMs (Video)
24.11.2021 Web Advertisements I (Video)
01.12.2021 no seminar
08.12.2021 no seminar
15.12.2021 Web Advertisements II (Video)
22.12.2021
29.12.2021 Christmas Break
05.01.2022 Christmas Break
12.01.2022
19.01.2022 Wide and Deep Recommender Systems
26.01.2022 Adaptive random forests for evolving data stream classification
02.02.2022 An ensemble of cluster-based classifiers for semi-supervised classification of non-stationary data streams