Data Science Trainings

I have been teaching statistics since 2015, initially with R and Minitab, then also with Python and in combination with MS Azure / Fabric or the Oracle database.

A data science course with R or Python teaches the basics and advanced techniques for analyzing and interpreting data. Such a course is ideal for data analysts, data scientists or anyone who wants to make data-driven decisions.

My seminars cover the following topics:

Basics of the Programming Language

  • Syntax and data structures (lists, dictionaries, data frames)
  • Functions and control structures (loops, conditions)
  • Packages and libraries (e.g. pandas, numpy, ggplot2, dplyr)

Data Preparation and Manipulation

  • Loading, cleaning and transforming data
  • Dealing with missing values ​​and anomalies
  • Aggregations and pivot tables

Data Visualization

  • Create charts and graphs (Matplotlib, Seaborn, ggplot2)
  • Interactive dashboards (Shiny for R, Plotly for Python)

Statistical Analysis

  • Descriptive statistics (means, dispersion, correlations)
  • Hypothesis tests and significance analyses
  • Probability distributions

Machine Learning

  • Supervised learning (regression, classification)
  • Unsupervised learning (clustering, dimensionality reduction)
  • Evaluation of models (train-test-split, cross-validation)

Big Data and Cloud-Technologies (optional)

  • Processing large amounts of data with Spark (PySpark, SparkR)
  • Use of cloud services (e.g. Azure ML, AWS SageMaker)

R

Complex data analysis requires support from a statistical environment. Here I chose R because it is open source and comes with an impressive range of analysis packages.

  • R Basics
  • R Data Mining
  • R Multivariate Methods Course I and Course II
  • R Time Series Analysis
  • R Regression Analysis

Python

If you want to integrate statistical analyses comprehensively into larger software, Python is the right choice.

  • Python Basics Statistics
  • Python Data Mining
  • Python Multivariate Methods
  • Python Time Series Analysis

Minitab

There are special statistical procedures for production and quality.

I conduct the seminars using Minitab, the leading statistical product in the field of engineering statistics.

  • Design of Experiments (DOE)
  • Statistical Process Control (DOE)
  • Engineering Statistics with Minitab