Skip to main content

An Introduction to

Data Science

.

Explain the Past and Predict the Future

Data Science (a.k.a. Data Mining) is about explaining the past and predicting the future by means of data analysis. Data science is a multi-disciplinary field which combines statistics, machine learning, artificial intelligence and database technology. The value of data science applications is often estimated to be very high. Many businesses have stored large amounts of data over years of operation, and data science is able to extract very valuable knowledge from this data. The businesses are then able to leverage the extracted knowledge into more clients, more sales, and greater profits. This is also true in the engineering and medical fields.

Course Syllabus

Part I. Problem Definition

  1. Data, Database, Data Science
  2. Data Science 6-Step

Part II. Data Preparation

  1. Extraction, Loading and Transformation (ETL)
  2. Data Cleaning and Wrangling
  3. Data Labeling

Part III. Data Exploration

  1. Univariate Statistics and Visualization
  2. Bivariate Statistics and Visualization
  3. Principal Component Analysis (PCA)

Part IV. Predictive Modeling

  1. Classification Models
  2. Regression Models
  3. Clustering Models
  4. Association Rules

Part V. Model Evaluation

  1. Evaluating Classification Models
  2. Evaluating Regression Models
  3. Evaluating Clustering Models

Part VI. Model Deployment

  1. A/B Testing

Credits

3

Textbooks

  1. An Introduction to Data Science, Saed Sayad, online book, 2010-2023.
  2. Introduction to Data Science, Rafael A. Irizarry, Chapman and Hall/CRC, 2019.

Expected Work

  1. Quizzes (10%)
  2. Projects (40%)
  3. Midterm Exam (20%)
  4. Final Exam (30%)

Learning Goals

This course will increase your marketability in the fast-paced data science industry. With an extensive theorithical knowledge and practoical experience of these in-demand technical skills, as well as the soft skills (e.g., project management) employers seek, you will be prepared to apply your data science top positions.

Onsite

€100 / hour

€500 / day

Online

€60 / hour

€300 / day