Data Analytics with KNIME: A hands-on approach Summer School

Mais Informações

Director:
Maria da Conceição Andrade e Silva | csilva@ucp.pt

Structure:
Start Date: 29th July 2024
End Date: 2nd August 2024

Applications:
Application deadline: 30th April 2024
Enrolment deadline: 6th May 2024

ECTS:
Participants can obtain 3 ECTS.

Investment:
School - Investment:
CPBS - €300
Partner University - €315
Other - €350
Apply here

Contacts:
For more information, please contact
Raquel Menezes Correia
International Programmes Manager
@ | summerschools.cpbs@ucp.pt

 

Descrição do Programa

This Training academy aims to provide students and professionals from different fields with the technical and scientific skills to analyse, model and extract knowledge from data in a variety of business contexts, using the KNIME Analytics platform. With a strong practical approach, the program is designed to present, discuss, and constructively apply classical and state-of-the-art approaches for descriptive and predictive analytics as well as for time series analysis and forecasting. Throughout the program, the participants are exposed to real world case studies, with varying degrees of complexity, for which they are challenged to develop, test, and evaluate analytical models able to extract useful knowledge and actionable insights from the data.

The main learning outcome is to enable students to gain proficiency in using KNIME as a white-box tool for modelling and implementing transparent and reproducible data workflows in an automatic fashion, towards proactive, data-driven decision-making.

Overview

The program starts by introducing the fundamentals of Analytics and its applications, as well as the KNIME Analytics platform and its main functionalities, so as to provide context and motivation for the students. The remainder of the program is structured in three main parts. Firstly, the students are introduced to descriptive analytics, in which they are exposed to and apply exploratory data analysis techniques, including pre-processing and transformation of variables in a data frame. Unsupervised learning techniques are also explored and applied, namely in the context of dimensionality reduction, clustering, and association. Secondly, the program covers the area of predictive analytics, with particular emphasis on understanding, implementing, and evaluating regression and classification models. Finally, some topics on the analysis, modelling and forecasting of uni/multivariate time series are covered, using statistical and machine learning techniques.

The teaching methodology follows a strategy with a strong practical component, in which the introduction of the various contents is supported by the discussion and resolution of case studies. Here, students are expected to develop, test, and critically analyse data pipelines based on real datasets of different size and structure.

Who can attend?

This Training academy is targeted for university students or Executive Education students and professionals in quantitative scientific fields that are not familiar with programming languages but are interested in developing technical skills in data mining, particularly:

  1. Students and professionals from STEM and non-STEM areas, with a basic analytical background, who are interested in data mining.
  2. Data/Business analysts interested in consolidating their knowledge in business analytics;
  3. Software engineers interested in the development of machine learning models and pipelines.

As entry requirements, it is assumed that the student has basic knowledge of manipulating and statistically exploring datasets using conventional spreadsheets.

Programme Outline

The syllabus has been designed and structured to be gradually more complex, allowing the students to better understand and connect the concepts learned.

  1. Foundations of Analytics
    1. Fundamental concepts and applications
    2. Problem solving with analytics
  2. Introduction to KNIME Analytics platform
    1. Basic terminology, settings and user interface
    2. Workflow control
    3. Data workflows in KNIME workbench
  3. Hands-on Descriptive Analytics with KNIME
    1. Exploratory data analysis: descriptive statistics and data visualization
    2. Data pre-processing and data transformation
    3. Introduction to unsupervised learning
  4. Hands-on Predictive Analytics with KNIME
    1. Introduction to supervised learning
    2. Regression and classification models
    3. Classical and robust evaluation of predictive models
  5. Time series analysis and forecasting with KNIME
    1. Data preparation
    2. Time series analysis and visualization
    3. Time series forecasting as a supervised learning problem
    4. Uni/multivariate time series forecasting
    5. Forecasting evaluation

Teaching

João Gonçalves holds a PhD in Industrial and Systems Engineering, a MSc in Systems Engineering and a BSc in Mathematics from the University of Minho (UMinho). He is a Lecturer in Data Science and Coordinator of the BSc in Applied Data Science at the Catholic University of Portugal (UCP). He also works as an Invited Assistant Professor at UMinho in the areas of Operations Research and Supply Chain Management. He has worked as Principal Data Scientist at Bosch Car Multimedia and Technical Coordinator of the Data Science & Machine Learning Group of the Collaborative Laboratory on Digital Transformation (DTx). His research interests lie in the areas of Machine Learning, Forecasting and Business Analytics.

Mais Informações

Director:
Maria da Conceição Andrade e Silva | csilva@ucp.pt

Structure:
Start Date: 29th July 2024
End Date: 2nd August 2024

Applications:
Application deadline: 30th April 2024
Enrolment deadline: 6th May 2024

ECTS:
Participants can obtain 3 ECTS.

Investment:
School - Investment:
CPBS - €300
Partner University - €315
Other - €350
Apply here

Contacts:
For more information, please contact
Raquel Menezes Correia
International Programmes Manager
@ | summerschools.cpbs@ucp.pt