Dr. Stefan Kühn: "Manifold Learning"

28. Nov.

Im Rahmen des Forschungskolloquiums Wirtschaftsinformatik und Data Science referiert Herr Dr. Stefan Kühn von der TOM TAILOR Group über "Manifold Learning".

Datum und Ort: 28. November 2019 12.15 Uhr Raum C 40.255

Abstract (deutsch):

Dimensionality Reduction ist nicht nur wichtig für Denoising von Daten oder um Daten kompakter und besser zugänglich zu machen, es ist außerdem wichtig für die Visualisierung hochdimensionaler Daten im Rahmen der Explorativen Datenanalyse. Manifold Learning umfasst eine Klasse moderner Methoden des Unsupervised Learnings, die unterschiedliche Aspekte der hochdimensionalen Daten in einer niedrig-dimensionalen Struktur einfangen wollen. Jedes dieser Verfahren versucht, bestimmte Eigenschaften möglichst gut zu erhalten - Abstände zwischen Punkten, Varianzen, statistische Eigenschaften und Verteilungen. Die Gesamtheit dieser Methoden bietet daher auch neue Möglichkeiten für (automatisches) Feature Engineering und die ultimative Herausforderung für Machine Learning und AI - "Learning from Data".

Abstract (english):
Dimensionality Reduction is not only useful for de-noising purposes or making data better accessible, it is also very important for Exploratory Data Analysis, especially with respect to Data Visualization. Manifold Learning subsumes a collection of advanced methods from the field of Unsupervised Learning that capture different aspects of the given high-dimensional data in a low-dimensional manifold. Each method tries to preserve an important quantity – distances between points, variance, statistical or distributional properties. The variety of these methods offers some new and interesting options for Automated Feature Engineering and the ultimate task of Machine Learning and AI – “Learning from Data”.

Algorithmen, die behandelt werden / Algorithms that are covered:

Classical Dimensionality Reduction

PCA - Principal Component Analysis
GRP - Gaussian Random Projections
SRP - Sparse Random Projections

Standard Manifold Learning

LLE - Locally Linear Embedding (multiple variants)
Isomap - Isometric Feature Mapping
MDS - Multi-Dimensional Scaling
Spectral Embedding (Laplacian Eigenmaps)
LTSA - Local Tangent Space Alignment

Advanced Manifold Learning

tSNE - t-Distributed Stochastic Neighbor Embedding
UMAP - Uniform Manifold Approximation and Projection