Programme

MADS’ first year lays foundations in several disciplines. The second year offers a more open curriculum, with access to various courses and workshops.
The last semester is dedicated to an internship – either in industry or in research – chosen in line with professional goals.
MADS offers refresher courses at the beginning of the Master to help students fill gaps in linear algebra and analysis.
To graduate from the Master in Data Science, one must have acquired 120 ECTS by successfully completing each of the 9 modules in the program. Each module consists of courses that, upon passing the exams, award ECTS. A module is considered validated if either the courses within it are passed or if the grade for each of the exams is above 5, and the weighted average of these grades is at least 10.
The program offers a wide variety of pedagogical approaches. Depending on the courses, students may be assessed through written assignments, oral presentations, individual or group projects.
The course format is 30, 45, 60 or 90 Teaching Units (TU), with 1 TU equivalent to a duration of 45 minutes.
Download the full programme of the academic year 2023-2024 here
Academic Contents
Course offer for Semestre 1
-
Details
- Number of ECTS: 5
- Course number: MA_DS-31
- Module(s): Module 1 Mathematics for Data Science
- Language: EN
- Mandatory: Yes
-
Lecturer
Coming soon -
Course learning outcomes
At the end of the course, a student should be familiar with the basic concepts of probability theory (event, random variables, distributions) and master the tools that allow him or her to make calculations (calculations of expectations, variances, distributions etc). The student should also be able to master the law of the large numbers as well as the central limit theorem. In particular, he or she must be able to calculate limits of random variables. Given an integral, the student should also be able to provide a probabilistic approximation of it and assess the probability of error. -
Description
What is an event, the probability of an event, a random variable?What are the main notations associated to probability theory?What is a law or a distribution? The main probability distributions: Bernoulli, binomial, Poisson, uniform, exponential, Cauchy, Gaussian. What random phenomena do they model? What is a density with respect to the Lebesgue measure? What is a density with respect to the counting measure? What is the distribution of a random variable?What is the expectation (or mean) of a random variable? How can we interpret it? What is the variance of a random variable?How to calculate the probability that a random variable belongs to a set from its distribution? How to calculate the expectation of a function of a given random variable from its distribution? What is a distribution function? What are its properties? What link densities and distribution functions? What are the quantiles of a real-valued random variable? What are the median and quartiles of a random variable?The main inequalities: the Markov inequality, the Bienaymé-Tchebychev inequality, the Jensen inequality.What does it mean that two random variables are independent? What is the density of a pair of independent random variables? How to generalize to n independent random variables? What is a random vector? What are the marginal distributions? How to calculate them?How to calculate the distributions of random variables (from distribution functions, change of variables)What does it mean that a sequence of random variables converges almost surely? In probability? In distribution? What are the connections between these convergence modes?What is Law of large numbers? What is the Monte Carlo method for calculating integrals?What is the central limit theorem? How can it be used to evaluate the error in the Monte Carlo method.
-
Details
- Number of ECTS: 5
- Course number: MA_DS-2
- Module(s): Module 1 Mathematics for Data Science
- Language: EN
- Mandatory: Yes
-
Lecturer
Coming soon -
Objectives
Functional analysis in finite-dimensional A successful student should be able to: Prove that a function defined on Rd is differentiable and compute its derivative. Determine the gradient, Hessian matrix of a function twice differentiable on Rd. Prove that a function is convex. Apply the Hilbert projection theorem. Optimization in Rd A successful student should be able to: Apply necessary and sufficient criteria to solve an unconstrained optimization problem. In an optimization problem with constraints of the form h(x)=0 and g(x)≤0: define the associated Lagrangian function, conjugate function, dual problem. Give the necessary Karush-Kuhn-Tucker conditions. Sufficient conditions in convex problems. Solve basic linear programming, quadratic programming, convex problems. Know some algorithms that allow to numerically determine the minimizer of a convex function of a convex domain: Gradient Descent (GD) method, Newton’s method, projected GD. Numerical probability A successful student should be able to: Know and apply some procedures to simulate a random variable (RV): inverse-transform methods, accept-reject method. Design a Monte-Carlo procedure to compute the expectation of a bounded function of a RV directly, by using importance sampling. Basically study simple Markov chains: determine transient/recurrent states, compute probabilities, determine (if applicable) invariant measure(s), apply the ergodic theorem. Design a MCMC method (Metropolis-Hasting algorithm) to compute the expectation of a bounded function of a RV.
-
Description
1. Elements of functional analysis in finite-dimensional normed vector spacesBasics of functional analysisDifferential calculusConvex sets and convex functionsHilbert projection theorem 2. Optimization in RdUnconstrained optimizationOptimization with constraints – convex optimizationAlgorithms for optimization3. Numerical probabilitySimulation of random variablesMonte Carlo methods (principle, MCMC, Metropolis algorithm) -
Assessment
First session Written exam Retake exam Written exam -
Note
Note / Literature / Bibliography Functional Analysis, Calculus of Variations and Optimal Control, by Francis Clarke. Springer Science & Business Media, 2013. Numerical Probability, by Gilles Pagès. Springer Cham, Universitext, 2018.
-
Details
- Number of ECTS: 3
- Course number: MA_DS-3
- Module(s): Module 1 Mathematics for Data Science
- Language: EN
- Mandatory: Yes
-
Lecturer
Coming soon -
Objectives
Being able to understand and manipulate modern signal processing tools; Being able to choose between these techniques for efficient data representation in various domains
-
Description
This course will introduce signal processing tools like Fourier transform, time frequency analyses, wavelet transforms. These tools will then be used for efficient, e.g. sparse, data representation, data compression (e.g. JPEG 2000 image compression standard), approximation, denoising, … -
Assessment
Written problem solving, maybe completed with hands on in Octave. -
Note
Note / Literature / Bibliography * A wavelet tour of signal processing: the sparse way, Stéphane MALLAT,Academic Press, 2009* Fourier analysis and applications. Filtering, numerical computation, wavelets. Claude Gasquet, Patrick Witomski. Springer, 30, pp.442, 1999, Texts in Applied Mathematics,
-
Details
- Number of ECTS: 5
- Course number: MA_DS-5
- Module(s): Module 2 Programming, Data Management and Visualization
- Language: EN
- Mandatory: Yes
-
Lecturer
Coming soon -
Objectives
On successful completion of this course, students are capable to: explain both the theoretical foundations and the practical application of current NoSQL and cloud-computing architectures; describe how different concepts concerning the modeling and management of large data collections are implemented on top of these architectures; develop and evaluate various use-case applications based on the above platforms.
-
Description
The course provides an introduction to both the theoretical foundations and practical applications concerning the broad area of "NoSQL Databases & Cloud Computing". We specifically focus on current tools and respective application-programming interfaces (APIs) in the context of the Apache Hadoop and Spark ecosystems. The course starts by reviewing the functionality of a classical SQL database system (PostgreSQL) and then moves forward to distributed file systems, including the Google (GFS) and Hadoop (HDFS) distributed file systems, which is followed by a detailed discussion of the MapReduce distributed computing principle with different extensions. We then move on to a number of recent NoSQL engines and key-value stores, including Apache Pig, HBase, Hive, Spark and MongoDB, which provide a variety of options for processing different data formats such as text, CSV, XML and JSON. All of the practical examples discussed during the course will be interactively deployed on top of the Amazon Web Services (AWS) platform and/or the University’s HPC infrastructure.The course covers the following topics:Usage of classical data-modeling languages such as E/R diagramsData management in SQL using the PostgreSQL open-source DBMSDistributed file systems (GFS & HDFS), session semantics vs. transaction semantics, CAP theoremApache Hadoop: distributed computing principles (MapReduce), replication, fault tolerance, backup tasks, custom combiners and partitioners, local aggregation, linear scalabilityApache Pig: first dataflow language (Pig Latin), translation into MapReduce and optimizationsApache HBase: distributed key-value store for very large tabular data, columns and column families, indexing and lookupsApache Hive: SQL-like query language on top of Hadoop, translation into MapReduceMongoDB: API overview, JSON processing, user-defined functionsApache Spark: distributed resilient data objects (RDDs) and dataframes, basic overview of streaming and machine-learning extensions -
Assessment
First session Practical exercises (group solutions): 50% Final written or oral exam (individual): 50% -
Note
Background literature announced at the beginning of each course.
-
Details
- Number of ECTS: 3
- Course number: MA_DS-6
- Module(s): Module 2 Programming, Data Management and Visualization
- Language: EN
- Mandatory: Yes
-
Lecturer
Coming soon -
Objectives
Know the Tufte principles such as “data to ink ratio”, “show the data” Explain the principles of good charts – labels, use of area, angles and lengths, etc. Create publication-ready plots Know all basic chart types and their use as well as some advanced plots Formatting tables and figures for ease of reading Know about data transformation for display purposes (normalization, log-transformation, standard error vs standard deviation) Set-up interactive data analysis (using shiny or plotly)
-
Description
The course introduces visualization using common software packages for exploratory data analysis and communication of data-driven findings. Data visualization principles such as data-to-ink ratio and weaknesses and strengths of types of data display will be introduced. It will teach best practices for figures, such as labeling of axis, titles, captions, red-green awareness as well as tables. The concepts for the use of ggplot2, seaborn, plotly and matplotlib packages will be discussed. Interactive visualizations will be introduced and discussed. -
Assessment
Assignments with a final practical work on an individual data set. -
Note
Note / Literature / Bibliography Fundamentals of Data Visualization https://clauswilke.com/dataviz/
-
Details
- Number of ECTS: 5
- Course number: MA_DS-4
- Module(s): Module 2 Programming, Data Management and Visualization
- Language: EN
- Mandatory: Yes
-
Lecturer
Coming soon -
Objectives
Perform data manipulations in R and Python Understand the principles of reproducible analyses Apply descriptive statistics to data Knowledge of clustering algorithms and regression Principles of machine learning methods and their implementation
-
Description
The course introduces data science with R and Python and prepares students for more specialized courses. The course introduces basic programming concepts as data types, how to load, store and manipulate data sets, first in R using the tidyverse concepts for data manipulation, then in Python. The course then covers descriptive statistics, confidence intervals and hypothesis testing after an introduction to R and RStudio as developing environment to prepare for basic machine learning using standard libraries. Functional programming is used as the paradigm for data analysis.The last part of the course introduces standard machine techniques and algorithms to solve classification, clustering and regression problems. The course is delivered as a series of lectures and practical exercises, familiarizing students with version control systems. -
Assessment
written exam (40%) 30% project 30% exercises -
Note
Note / Literature / Bibliography R for data science (https://r4ds.had.co.nz) Python for data science (https://wesmckinney.com/book/) Modern Statistics for Modern Biology (https://web.stanford.edu/class/bios221/)
-
Details
- Number of ECTS: 3
- Course number: MA_DS-7
- Module(s): Module 3 Transversal courses
- Language: EN
- Mandatory: Yes
-
Lecturer
Coming soon -
Course learning outcomes
Get familiar with the scientific goals and methods. Learn the most common data science and visualization misconduct problems. Critically evaluate ethical issues and method choices. Part 1: Scientific goals, methods and knowledge Scientific Goals Methodology Scientific Knowledge What is Philosophy of Science? The scientific method Methodology Part 2: Scientific Inference Scientific inferences. Deduction and Induction Hume’s problem of induction.Ref. https://stanford.library.sydney.edu.au/entries/induction-problem/ The Hypothetico-deductive Method. Falsification. Confirmation. Scientific Explanation Historical cases.Neptune and VulcanoMichelson & Morley experimentCovid19 ML issues can X-Ray images be used to diagnose Covid? Part 3: Empirical practices and models What is an experiment? Observational studies Field, laboratory and simulation experiments Observability, Indicators and Evidence Testing methods. How to evaluate experiment success. Repetition, Reproduction and Replication What is a model? Models as analogies; as isolations; as mirrors Differences between Models, Theories and Experiments. The problems with Machine LearningRef. Chapter “A blueprint of reality” from The book of Why. Part 4: Experimental Control and Statistical Abuse. Experimental Control Bias and Confounders.Ref. Chapter Confounding and deconfounding, from The Book of Why.Ref. Judgment under Uncertainty. (Amos Tversky and Daniel Kahneman).Ref. John Snow and cholera.Ref. https://catalogofbias.org/ Imbalanced datasets; surrogates, proxies… “Data alone is not enough”. Randomised Control Trials Origins of RCTs Validity Randomisation Cross-validation in ML “Visualizations can lie”.Ref. How charts lie from Alberto Cairo. Part 5: Ethics and responsibility Morality and ethics. Data ethics Ethical Frameworks: Consequentialism, deontology and virtue. Informed consent and its limitations, how does it affect data scientists? Data ownership; data destruction Undesired consequences. – Case studies Privacy, anonymity, de-identification and re-identification GDPR Data ethics, reproducibility and FAIR data.Ref. Chapter 3 Data Ethics of Deep Learning for Coders with Fastai and PyTorch.Ref. https://www.fast.ai/2020/08/19/data-ethics/ and https://ethics.fast.ai/ from Jeremy Howard and Rachel ThomasEthics and Data Science, O'Reilly. Dilution of responsibility. How progress and technology raise new ethical questions. -
Description
This course aim is to provide the students with guidelines and methodologies to identify epistemic and ethical issues present in data science. We expect the students to develop a critical eye that helps them mitigate such problems in their daily work as data scientists.During this course, students will learn by example different layers of the scientific method and how they relate to data science and data ethics. In particular, they will learn how the mechanisms behind the data affect the data analysis, and how the different types of scientific inference condition the application of data science solutions and conclusions to other contexts. In this sense, examples of statistical abuse, misconduct and bad visualization will be shown together with their, sometimes catastrophic, collateral consequences.PlagiarisimQuizzes, assignments and any other pieces of work produced during the course by the students should be written in their own words. Any attempt to copy from internet sources or between the students may be detected and the corresponding disciplinary procedure will be open. I encourage the students to read the document on academic misconduct from UL for further details. Note that the potential sanctions for plagiarism may include:The cancellation of all grades obtained in examinations for the module or the entire examination session of the respective semester;A ban for up to five years on taking any examinations leading to the award of a degree, diploma or certificate by the University;The retroactive withdrawal of the degree, diploma or certificate awarded by the University. -
Assessment
First session Students' evaluations will be based on their performance in different types of individual and group exercises as well as test exams during the course weeks. Written essays/reports regarding the course content. Open discussion, test exercises and quiz tests; Depending on time, there may be presentation exercises (slides exposition). The final exam will be on paper, without the help of a computer or notes. It will consist of single/multiple-choice questions as well as essay questions. The student is thereby responsible for producing intelligible answers employing readable handwriting and correct English writing. Importantly, the final exam may be more challenging than the individual quizzes and assignments as it mixes chapters and does not allow any help from notes. Therefore, it is strongly recommended to study in-depth the content of the course (slides, questions, extra material, etc.) before the exam. Only those students with a final grade less than 50% (from the quizzes and assignments of the course) must attend the final exam. This grade can be checked in the grade book in Moodle. If in doubt, do not hesitate to contact the teacher by e-mail. Retake exam Further retake exams will follow the same conditions as the first final exam. The questions of the exam are subject to change for each retake exam.
-
Details
- Number of ECTS: 3
- Course number: MA_DS-8
- Module(s): Module 3 Transversal courses
- Language: EN
- Mandatory: Yes
-
Lecturer
Coming soon -
Objectives
On successful completion of the course the students should be able to understand the relevance of the topics covered in the course for their applications, master the proofs of the main results of the course, solve problems using the toolkit developed in the course be autonomous in learning in the field of Graph Theory.
-
Description
Through a presentation of selected topics, the course aims to be an introduction to graph theory, its applications and its algorithmic aspects. It is designed as a self-contained course and focused on problems pertaining to Data Science. Possible topics for the course include, but are not limited toGraphs and digraphs, degree and the degree sequence algorithmConnectedness, distance, shortest paths and connected components algorithmsGraph matching problems and algorithmsElements of algebraic graph theory and PageRank algorithmGraph traversal algorithmsTrees and applicationsMinimum spanning tree algorithmsNetwork flow, min cut – max flow theorem and Ford–Fulkerson algorithmCentrality and betweness measuresCluster analysisRandom Graphs -
Assessment
First session Written exam and homework, and possibly algorithm implementation project during the semester. Retake exam Writen exam -
Note
Note / Literature / Bibliography R. Diestel, Graph Theory, Springer, 2017 D. Jungnickel, Graphs, Networks and algorithms, Springer 2017
Course offer for Semestre 2
-
Details
- Number of ECTS: 5
- Course number: MA_DS-9
- Module(s): Module 4 Mathematics for Statistical Learning
- Language: EN
- Mandatory: Yes
-
Lecturer
Coming soon -
Objectives
A successful student should be able to: Understand the purpose of statistical learning Be familiar with the probabilistic tools that enables to evaluate the risk and the excess risk of a predictor. Be able to evaluate the Rademacher complexity of some sets of predictors Be able to calibrate hyperparameters and benchmark learning methods.
-
Description
ProgrammeEmpirical risk minimization. Bounding the prediction error. Rademacher Complexity. Vapnik-Chervonenkis classes. Case of supervised binary classification.Prediction in bounded regression.Regularization approaches (cross-validation, unbiased risk estimation, Label muddling).PAC Bayesian theory.Active learning.Nonparametric statistics.Learning methods: plug-in, penalised ERM, kernel methods, perceptron, exponential weights, deep learning.Unsupervised learning: density estimation, principal component analysis, clustering. -
Assessment
First session Final grade will be based on class participation and a written exam. Retake exam Oral exam -
Note
Bibliography Hastie, Tibshirani, and Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer, Second Edition, pdf available for download. Boucheron, Bousquet, Lugosi, Theory of classification: a survey of some recent advances, ESAIM Probab. Stat. pdf available for download. Tsybakov, an introduction to nonparametric statistics, Springer.
-
Details
- Number of ECTS: 5
- Course number: MA_DS-11
- Module(s): Module 4 Mathematics for Statistical Learning
- Language: EN
- Mandatory: Yes
-
Lecturer
Coming soon -
Objectives
The goal of the course is a detailed introduction to modern techniques in high dimensional estimation of linear models and covariance matrices. After successful completion the student should be able to establish the main properties of Lasso and related estimators in the setting of linear regression models, asymptotic results for large random matrices and non-asymptotic properties for high dimensional estimators of covariance matrices.
-
Description
High dimensional regression model Parameter estimation under constraints: Lasso and relates estimation procedures Variable selection Introduction to principal component analysis Basic elements of random matrix theory: semi-circle law, Marcenko-Pastur distribution, Tracy-Widom distribution Estimation of high dimensional covariance and precision matrices; hard and soft thresholding -
Assessment
There will be a written exam at the end of the course. -
Note
Bibliography P. Bühlmann and S. van de Geer „Statistics for High-Dimensional Data“ C. Giraud “Introduction to High-Dimensional Statistics” G.W. Anderson, A. Guoinnet and O. Zeitouni „An Introduction to Random Matrices“
-
Details
- Number of ECTS: 5
- Course number: MA_DS-10
- Module(s): Module 4 Mathematics for Statistical Learning
- Language:
- Mandatory: Yes
-
Lecturer
Coming soon -
Objectives
Students will be familiar with the most popular model selection methods and their known theoretical properties.
-
Description
In machine learning and statistics, there is generally a tradeoff between accuracy of a model and difficulty of estimating it. This gives rise to the problem of model selection. This course will provide an overview of the model selection problem with specific emphasis on resampling methods, including cross-validation, the hold-out and the bootstrap. -
Assessment
Written exams, at mid-term and end of semester (2h and 4h long, respectively). They will take place on campus if allowed. Letting G1 and G2 represent the grades of the mid-term and end-of-semester exams, the final grade will be max (G2, (G1 + G2)/2). -
Note
Bibliography The Elements of Statistical Learning, Chapter 7 – Hastie, Tibshirani, Friedman An introduction to the bootstrap – Bradley Efron, R.J. Tibshirani Concentration inequalities and Model Selection, Chapter 8 – Pascal Massart A survey of cross-validation procedures for model selection – Arlot, Celisse. Resampling and model selection / Rééchantillonage et sélection de modèles – Sylvain Arlot, thèse
-
Details
- Number of ECTS: 5
- Course number: MICS2-41
- Module(s): Module 5 Big Data Analytics
- Language: EN
- Mandatory: Yes
-
Lecturer
Coming soon -
Objectives
The lecture provides an entry point to large-scale data management and distributed computing principles in recent NoSQL architectures. We start with an overview of distributed file systems and MapReduce in Apache Hadoop and then move on to more advanced analytical tasks based on the machine-learning libraries in Apache Spark. The lecture serves as an ideal basis for further topics in this area (such as Master seminars, projects and theses).
-
Course learning outcomes
– Students become familiar with the usage of recent Big Data platforms such as Apache Hadoop and Spark- Student obtain an overview of both the theoretical foundations and practical applications of various Big Data and Machine Learning algorithms- Students learn how to approach and solve different data-analysis tasks by a number of programming exercises with real-world datasets -
Description
The course consists of a combination of theory-oriented lectures and practical exercises, through which the students are guided by a series of real-world use cases and hands-on examples. Specifically, we focus on the following topics:- Distributed File Systems (DFS) and MapReduce in Apache Hadoop- Resilient Distributed Data (RDD) objects and DataFrames in Apache Spark- Implementation of complex DataFlow programs in Spark using Scala- Performing advanced analytical tasks in Spark’s MLlib: o Distributed clustering and classification of objects o Decision trees and random forests o Recommender systems via matrix factorization o Text analysis via latent semantic indexing o Geospatial data analysis o Social-network analysis -
Assessment
Practical exercises: 50%Final written exam: 50%
-
Details
- Number of ECTS: 5
- Course number: MA_DS-13
- Module(s): Module 6 Introduction to Machine Learning Methods and Data Mining
- Language: EN
- Mandatory: Yes
-
Lecturer
Coming soon -
Objectives
After successfully finishing the course, a student will become familiar with the basics of supervised and unsupervised ML methods, understand their theoretical background, advantages, limitations, and get practical experience in solving real problems employing ML techniques. Particular attention will be focused on the interdisciplinary aspect of ML applications and advantages, which provides an understanding of the nature of the data. Within the course, the students will learn how to implement the basic elements of ML models by themselves, as well as how to use state-of-the-art ML software packages.
-
Description
The main chapters arePreprocessing of collected data, understanding their structure, visualization. (1 hour)Introduction into Scikit-Learn and TensorFlow. (7 hours)Unsupervised methods: clustering, nearest neighbor task, association rules mining; rule- and tree-based classifications. (12 hours)(Kernel) ridge regression. (4 hours)Support vector machines. (4 hours)Artificial neural networks. (12 hours)Advanced topics: model evaluation and selection, anomaly detection, conformal learning (prediction with guarantees of accuracy), causal inference (identification of causal relationships). (4 hours)Combining different machine learning methods for solving actual problems in natural sciences. (4 hours)Presentation of personal projects. (8 hours)The course will be split into series of lectures with following practical exercises. The ideal schedule will be one day per week in a computer class, where the lecture is directly followed by practical exercises. At the end of the course each student will have to present his individual project. -
Assessment
The evaluation will be based on the presence on the lectures and practical exercises (25%), the individual project (50%), and the answers on the questions following the presentation (25%). -
Note
Bibliography Géron, Aurélien. Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems. O'Reilly Media, 2019. Andriy Burkov. The Hundred-Page Machine Learning Book, 2019
-
Details
- Number of ECTS: 5
- Course number: MA_DS-15
- Module(s): Module 7 Optional courses
- Language: EN
- Mandatory: No
-
Lecturer
Coming soon -
Objectives
The students will get an in-depth understanding of statistical modelling, and be able to apply the learnt methods on (complex) datasets.
-
Description
This course will deal with two cultures of statistical modelling: top-down via a parametric model, and bottom-up via data-adaptive methods. More concretely, it will cover flexible distributions allowing to model complex modern datasets, survival analysis, classification and regression trees as well as random forests. These topics will be treated both from a theoretical and practical aspect. -
Assessment
First session Continuous Assessment and final exam with in-person assessment Retake exam Continuous Assessment and final exam with in-person assessment -
Note
Literatur Ley, C., Babic, S. and Craens, D. (2021) Flexible models for complex data with applications. Annual Review of Statistics and Its Application 8, 18.1-18.23. Genuer, R. and Poggi, J.-M. (2020) Random Forests with R, Springer. Kleinbaum, D.G. and Klein, M. (2012) Survival Analysis – A Self-learning Text, Springer.
-
Details
- Number of ECTS: 5
- Course number: MA_DS-14
- Module(s): Module 7 Optional courses
- Language: EN
- Mandatory: No
-
Lecturer
Coming soon -
Objectives
structure and function of the cell basics in biochemistry basics of genetics basics of evolution introduction to plant biology introduction to animal biology introduction to ecology
-
Description
We will follow the Campbell Biology. The course includes fundamental principles of biochemistry, genetics, molecular biology and cell biology. -
Assessment
Written exam
-
Details
- Number of ECTS: 5
- Course number: MA_DS-16
- Module(s): Module 7 Optional courses
- Language: EN
- Mandatory: No
-
Lecturer
Coming soon -
Objectives
The fundamental learning outcomes of the course are the following: Understanding of DL models and their applications. Students do not need to memorize any formulae but have to know how algorithms work in the context of the course. Knowledge of foundational building blocks in DL and popular network architectures. Students should be able to combine these building blocks and create their own DL applications. Ability to formulate and solve problems using DL models and methods. Students should be able to identify key parts or components of a larger problem and solve them incrementally (design thinking philosophy). Then, each lecture focuses on particular learning outcomes, aimed at developing more specific knowledge and abilities. The first slide of each lecture describes the particular learning outcomes.
-
Course learning outcomes
Due to the type of teaching and materials, participation in lectures is mandatory. The lack of attendance in some classes may be supplemented with an extra assignment, though you should not skip more than two classes.Python is the mandatory programming language for the course. Basic knowledge in math (algebra, calculus) and statistics is required. A previous course on machine learning is recommended but not mandatory. -
Description
This course provides students with a high-quality and informed understanding of deep learning models for developing AI-based applications. The course promotes problem solving via design thinking philosophy. The course helps students to develop state-of-the-art competencies to solve many different real-world problems. The course will provide students with a competitive advantage to solve challenging research problems that require dealing with complex search spaces, non-linear relationships within the data, and flexible models that can scale up to thousands or millions of observations. These competencies are also of key importance to many industrial and technological companies. Finally, the course also promotes development of soft skills such as written and verbal communication skills, via final project presentations. -
Assessment
This course uses project-based learning as a formative assessment method. The final grade is comprised in a 0-20 scale. A minimum number of 10 points is required to pass the course.The final grade is the sum of the points earned with the project report (max 16 points) and the project presentation (max 4 points). -
Note
Bibliograpy M. Nielsen. Neural Networks and Deep Learning, 2015. I. Goodfellow, Y. Bengio, A. Courville. Deep Learning, 2016. F. Chollet. Deep Learning with Python, 2017. A. Gibson, J. Patterson. Deep Learning: A Practitioners Approach, 2017. R. O. Duda, P. E. Hart, D. G. Stork. Pattern Classification, 2000.
Course offer for Semestre 3
-
Details
- Number of ECTS: 4
- Course number: MCMP-21
- Module(s): Module 8 Advanced courses
- Language: EN
- Mandatory: No
-
Lecturer
Coming soon -
Objectives
The main idea of the course is to provide knowledge and practical experience of the numerical techniques that constitute the basis of Computational Physics and Chemistry. Some emphasis will be put on the analysis of the outcome of the variation of physical parameters and numerical hyperparameters for the given problem.
-
Course learning outcomes
•For a given physical problem, students will be capable of writing a program to find its numerical solution and analyze the resulting data.•Students will have an overview of standard numerical algorithms adopted in computational Physics and Chemistry as well as their limitations. -
Description
The main idea of the course is to provide knowledge and practical experience of the numerical techniques that constitute the basis of Computational Physics and Chemistry. Some emphasis will be put on the analysis of the outcome of the variation of physical parameters and numerical hyperparameters for the given problem.The first part of the course will consolidate the basics of Python3 programming and cover the basic algorithms necessary to solve simple equations. The second part will introduce more advanced methods with applications to physical problems. Specifically, we will treat• Introduction to Python and relevant packages• Numerical differentiation and integration• Linear algebra solvers• Root finding and minimization• Ordinary differential equations• Partial differential equations• Monte Carlo methods• Molecular dynamics• Basics of machine learningEach lecture will be comprised of an introduction to the theory behind a given technique, followed by a practical session centered on its implementation and application to well-known problems. -
Assessment
The evaluation will consist of two midterm assignments (20% and 30% of the final grade) and one final project (50% of the final grade), each involving a homework problem and a short exam after it has been handed in. -
Note
Support / Literature: The most recommended textbooks•Numerical Methods in Physics with Python, Alex Gezerlis (Cambridge University Press, 2020)•Numerical Analysis 9th ed., Richard L. Burden and J. Douglas Faires (Brooks/Cole, 2011)•Computational Methods for Physicists, Simon Sirca and Martin Harvat (Springer, 2012)Other textbooks of note•Numerical Methods for Scientists and Engineers, Richard W. Hamming (Dover publications)•Numerical Recipes series, William H. Press, Saul A. Teukolsky, William T. Vetterling and Brian P. Flannery (Cambridge university press)•Numerical Methods, E. A. Volkov (Hemisphere publishing corporation)General resources on Python•https://www.codecademy.com/learn/learn-python-3•https://www.learnpython.org/en/•https://lectures.scientific-python.org
-
Details
- Number of ECTS: 6
- Course number: F1_MA_MAT_MMCS2-2
- Module(s): Module 8 Advanced courses
- Language: EN
- Mandatory: No
-
Lecturer
Coming soon -
Objectives
The aim of the course is to provide students with efficient tools to study linear operators between infinite dimensional vector spaces. In particular, this class will deal with normed vector spaces, Banach and Hilbert spaces, with bounded linear operators on normed vector spaces, fundamental principles such as Fourier analysis, Lebesgue integral, Han-Banach Theorem, Uniform Boundedness Principle or Closed Graph Theorem, and with spectral theory of compact (self-adjoint) linear operators.
-
Course learning outcomes
Students will acquire a solid understanding of functional analysis, its fundamental results and basic techniques. In particular, students will understand applications to measure theory, Fourier theory, and the spectral theorem for (unbounded) operators. Students will know the relevance of a theorem, its underlying motivation and a precise idea of its proof. Hopefully, students will demonstrate capacity for mathematical reasoning through analyzing, proving and explaining concepts from functional analysis. -
Description
Functional analysis aims to study infinite-dimensional spaces of functions and features of linear operators on these spaces. Though, from the point of view of Mathematics, functional analysis has its own interest, it plays a crucial role in many related areas, in particular in Physics, Engineering or Finance. Roughly speaking, most “real-life problems” involve non-linear partial differential equations with infinite-dimensional solution spaces of functions (or distributions). This course provides Master students with the basic tools and the fundamental results to develop skills to solve such problems. -
Assessment
There will be an exam after the first half of the semester, and a final exam at the end of semester. Attendance is mandatory. Homework assignments will be posted as recommended problems, but not graded. Selected problems will be discussed in class. If you miss a lecture, you are responsible for obtaining lecture notes and for determining if any announcements were made. -
Note
Note / Literature / Bibliography Introductory Functional Analysis with Applications, by Erwin Kreyszig.Functional Analysis, Spectral Theory and Applications, by Manfred Einsiedler and Thomas Ward.An Introduction to Fourier Analysis, by Russell L. Herman
-
Details
- Number of ECTS: 5
- Course number: MA_DS-19
- Module(s): Module 8 Advanced courses
- Language: EN
- Mandatory: No
-
Lecturer
Coming soon -
Course learning outcomes
Knowing important image formats and their specific constraints Knowing and being able to apply basic image processing operations Aware of the problems that can arise from different spatial alignments and resolution issues Knows an overview about state-of-the art deep learning network types and their principle application domains Hands on experience in employing feed forward networks for image classification and segmentation Hands on experience in training deep neural networks utilizing a high-performance computing environment -
Description
A first part of the course will introduce basic skills in image analysis and the related field of computer vision and pattern recognition (images, transfer functions, image formats, basic operations incl. mathematical morphology and basic filters, spatial alignment and coordinate systems), this part of the course will mainly rely on MATLAB.Practical examples will focus on medical imaging as example application domain.In the second part the basic concepts of convolutional neural networks will be introduced and solutions for the most frequent problem domains – image classification and image segmentation – will be teached, mainly using deep feed forward network architectures. An overview of related problems – like object detection, de-noising, .. – as well as of other network types (autoencoders, GANs) that cannot be covered in detail will be given.Practical hands-on session will include MATLAB and Python based experiments including utilization of the HPC environment using GPUs for training of large state of the art neural network. -
Assessment
Final Grade: 50% Part a) + 50% Part b), both parts have to be graded with at least "passed". I.e. we will combine a written final exam on the theory, with a practical hands project where i will give an assignment on which the students can work at home and present the solution. -
Note
Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning. MIT press. Sonka, M., & Fitzpatrick, J. M. (2000). Handbook of medical imaging. Volume 2, Medical image processing and analysis. University of Iowa. Coursera – Course “AI for Medical Diagnosis”
-
Details
- Number of ECTS: 5
- Course number: MA_DS-20
- Module(s): Module 8 Advanced courses
- Language: EN
- Mandatory: No
-
Lecturer
Coming soon -
Objectives
Understanding data structure of biological Omics-Data DNA analysis Single-cell RNA sequencing analysis Heterogeneous data integration
-
Course learning outcomes
Blast approaches for sequence alignments. Single-cell RNA sequencing workflow Biological network analyses -
Description
The course will give an overview of current challenges in biomedical data management and analysis and introduce bioinformatic approaches and tools to analyze biomedical data. A particular focus will be on sequence alignment for DNA and RNA analyses and on single-cell sequencing approaches to biomedical data including the subsequent representation in networks. -
Assessment
First session tbaRetake exam tba -
Note
Literatur E. Klipp: Systems Biology U. Alon: Introduction to Systems Biology A Lesk: Introduction to Bioinformatics S. Strogatz: Non-linear Dynamics and Chaos
-
Details
- Number of ECTS: 5
- Course number: MA_DS-21
- Module(s): Module 8 Advanced courses
- Language: EN
- Mandatory: No
-
Lecturer
Coming soon -
Objectives
The course aims to provide the students an overview of (1) important concepts, theories and computational methods in network science, and (2) recent developments in machine learning for graph data.
-
Course learning outcomes
After the course, the student should develop an intuition on how to model systems as networks and perform reasoning and analysis on real-world network data. The student should achieve a good understanding of complex networks (i.e., network metrics, structural properties, types of networks, and network models). The student should have a good overview on different community detection algorithms. The student should be able to describe different type of epidemic spreading over complex networks and establish a formal background for information diffusion and influence maximization in online social networks. It is more desirable that by the end of the course the student also has a basic understanding of machine learning on graphs. -
Description
Networks are a fundamental concept for modelling complex physical, technological, social, and biological systems. The course will cover the fundamental aspects of networks: network models, methods for describing network structure and measuring networks, community detection, and information diffusion in complex networks. More advanced topics, such as network embedding and graph neural networks (GNNs) and their applications, will be also introduced and discussed. With the course, students will learn how to explore computational algorithms and machine learning techniques to reveal insights of real-world networks. -
Assessment
First session Students will be evaluated via course projects.Retake examOral exam. -
Note
Book “Networks: An Introduction”, by Mark Newman, 2010 Book “Network Science”, by Albter Laszlo Barabasi, 2016. Book “Graph Representation Learning”, by William L. Hamilton, 2020.
-
Details
- Number of ECTS: 4
- Course number: MICS-COMMSYST-024
- Module(s): Module 8 Advanced courses
- Language: EN
- Mandatory: No
-
Lecturer
Coming soon -
Objectives
Today, parallel computing is omnipresent across a large spectrum of computing platforms. At the microscopic level, processor cores have used multiple functional units in concurrent and pipelined fashions for years, and multiple-core chips are now commonplace with a trend toward rapidly increasing numbers of cores per chip. At this level, GPU also starts to enter the area. At a macroscopic level, one can now build clusters of hundreds to thousands of individual (multi-core) computers. Such distributed-memory systems have become mainstream and affordable in the form of commodity clusters. Furthermore, advances in network technology and infrastructures have made it possible to aggregate parallel computing platforms across wide-area networks in so-called grids. An efficient exploitation of parallel and distributed platforms requires a deep understanding of architecture, software and infrastructure mechanisms and advanced algorithmic principles. The aim of this course is thus twofold. It aims at introducing the main trends and principles in the area of high performance computing infrastructures, illustrated by examples of the current state of the art. It intends to provide a rigorous yet accessible treatment of parallel algorithms, including theoretical models of parallel computation, parallel algorithm design for homogeneous and heterogeneous platforms, complexity and performance analysis, and fundamental notions of scheduling and work-sharing.
-
Course learning outcomes
* Identify the key concepts related to parallel computing and parallel computer architecture * Solve problems (e.g., mathematics, physics and engineering) using parallel programming techniques. * Use parallel programming platforms, models and frameworks for science and engineering applications * Able to use performance analysis tools and methodology for efficient parallel programming implementation * Learn to parallelize basic linear algebra routines (BLAS operations) in MPI and OpenMP environment. -
Description
Parallel computing architecturesIntroduction to parallel computing using MPI and OpenMPPerformance analysis and tools for MPI and OpenMPProgramming labs are based on MPI and OpenMPComputing resource: ULHPC (https://hpc.uni.lu/) -
Assessment
Project: 100% -
Note
OpenMP – The Next Step by Ruud van der Pas, Eric Stotzer, and Christian Terboven Parallel Programming for Multicore and Cluster Systems , Thomas Rauber, Gudula Rünger (2nd edition, 2013) B. Wilkinson, M. Allen: Parallel Programming, Second Edition. Prentice Hall, 2005. ISBN 0-13-140563-2 Online resources : Introduction to Parallel Computing Tutorial: https://hpc.llnl.gov/documentation/tutorials/introduction-parallel-computing-tutorial MPI online tutorial: https://hpc-tutorials.llnl.gov/mpi/ OpenMP online tutorial: https://hpc-tutorials.llnl.gov/openmp/
-
Details
- Number of ECTS: 5
- Course number: MA_DS-22
- Module(s): Module 8 Advanced courses
- Language: EN
- Mandatory: No
-
Lecturer
Coming soon -
Objectives
Provide students with the necessary knowledge of Bayesian statistics, both from a theoretical as well as practical viewpoint.
-
Course learning outcomes
Handle of the concepts from Bayesian statistics, Bayesian inference, Bayesian modelling, practical handle of Bayesian techniques on Python. -
Description
This course will provide a thorough introduction and overview on the most important concepts from Bayesian inference, starting with the Bayesian philosophy in contrast to frequentist statistics. We will discuss various choices of prior distributions, Bayesian inference, Bayesian modelling, model checking and comparison (in particular introducing the concept of Bayes factor), and advanced computation. The latter aspect will especially be dealt with in the practicals, where Markov Chain Monte Carlo methods shall be treated. -
Assessment
First session Written exam with open questions, both theory and exercises, in particular there will be computational exercisesRetake examWritten exam with open questions, both theory and exercises, in particular there will be computational exercises -
Note
Note / Literature / Bibliography Box, GEP and Tiao, GC (1992) Bayesian Inference in Statistical Analysis. John Wiley and Sons Gelman, A, Carlin, JB, Stern, HS, Dunson, DB, Vehtari, A and Rubin, DB (2013) Bayesian Data Analysis 3rd edition. Chapman & Hall/CRC. Brooks, S, Gelman, A, Jones, GL, Meng, X-L (2011) Handbook of Markov Chain Monte Carlo. Chapman & Hall/CRC
-
Details
- Number of ECTS: 5
- Course number: MA_DS-23
- Module(s): Module 8 Advanced courses
- Language: EN
- Mandatory: No
-
Lecturer
Coming soon -
Objectives
After successfully finishing the course, a student will become familiar with applied machine-learning and robustification of ML-based systems, understand the key methods and learning algorithms, their advantages and limitations. Students will also get practical experience in setting up robust and secure ML systems.Within the course, the students will learn the fundamentals of metaheuristic search algorithms, which will be used to improve the robustness and security of ML systems. Additionally, students will learn to implement state-of-the-art techniques capable of attacking and defending ML systems.
-
Description
The course aims at introducing clear-eyed and principled algorithms to engineer robust and secure ML-systems. Meta-heuristics will be presented as they are key algorithms to optimize the search of a solution that applies to the robustification of ML-systems (data augmentation, model generalization). Adversarial testing and learning in realistic settings will also be studied. A variety of ML algorithms (CART, random forest, NN, DNN), approaches (active learning, multi-task learning) and applications (fintech, industry 4.0, energy optimization) will be considered. The students should be able to understand, synthetize and present a research paper in relation to the studied topics. Introduction (1 lecture)Engineering for Machine Learning Systems (1 lecture)Meta-heuristics search (2 lectures)Genetic Programming (1 lecture)Adversarial attacks and robustification (2 lectures)Project presentations (1 lecture)Selected topics on applied machine learning (3 lectures) Student presentations (2 lectures) -
Assessment
The evaluation will be based on practical individual projects (30%), presentations of two scientific articles (40%) and written critical appraisal of a scientific article (30%). -
Note
https://cs.gmu.edu/~sean/book/metaheuristics/ http://www.genetic-programming.org/ https://en.wikipedia.org/wiki/Evolutionary_computation Jie M. Zhang, Mark Harman, Lei Ma, Yang Liu: Machine Learning Testing: Survey, Landscapes and Horizons. IEEE Trans. Software Eng. 48(2): 1-36 (2022)
-
Details
- Number of ECTS: 5
- Course number: MA_DS-29
- Module(s): Module 8 Advanced courses
- Language: EN
- Mandatory: No
-
Lecturer
Coming soon -
Objectives
To introduce students the fundamentals of Natural Language Processing. Learn the techniques in natural language processing. Be familiar with the natural language generation. Be exposed to Text Mining. Understand major applications of NLP
-
Course learning outcomes
On completion of the course, students will have the ability to Understand and implement word and syntactic level analysis Extract relation in text. Implement the Python and NLTK libraries Implementing Sematic analysis Implement real-time based analysis -
Description
Chapter 1: Basics in NLP 6 hoursOverview: Origins and challenges of NLP-Need of NLP, python and NLTK for NLP, Text Wrangling and cleansing- Text cleansing, sentence splitter, tokenization, stemming, lemmatization, stop word removal, rare word removal, spell correction.Chapter 2: Text Preprocessing and Morphology 12 HoursCharacter Encoding, Word Segmentation, Sentence Segmentation, Introduction to Corpora, Corpora Analysis. Inflectional and Derivation Morphology, Morphological analysis and generation using Finite State Automata and Finite State transducer.Chapter 3: Language Modelling 12 HoursWords: Collocations- Frequency-Mean and Variance –Hypothesis testing: The t test, Hypothesis testing of differences, Pearson’s chi-square test, Likelihood ratios. Statistical Inference: n -gram Models over Sparse Data: Bins: Forming Equivalence Classes- N gram model – Statistical Estimators- Combining EstimatorsChapter 4: POS Tagging and Text Classification 12 HoursParts of Speech Tagging – Tagging in NLP, Sequential tagger, N-gram tagger, Regex tagger, Brill tagger, Machine learning taggers-MEC, HMM, CRF, NER tagger, Types of learning techniques, Text Classification-Sampling, Naïve Bayes, Decision trees, Stochastic gradient descent, Support vector machine, Text clusteringChapter 5: Syntax and Semantics 12 HoursShallow Parsing and Chunking, Shallow Parsing with Conditional Random Fields (CRF), Lexical Semantics, WordNet, Thematic Roles, Semantic Role Labelling with CRFs. Statistical Alignment and Machine Translation, Text alignment, Word alignment, Information extraction, Text mining, Information Retrieval, NL interfaces, Sentimental Analysis, Question Answering Systems, Social network analysis.Chapter 6: Recent Trends and Applications of NLP 6 hoursRecent trends in NLP, Applications of NLP: Transforming text, Sentiment Analysis, Information retrieval, text summarization, Question and Answering, Automatic Summarization -
Assessment
Continuous evaluation -
Note
Text Books Nitin Hardeniya, Jacob Perkins, Deepti Chopra, Nisheeth Joshi, Iti Mathur, “Natural Language Processing: Python and NLTK”, Packt publisher, 2016. Christopher D. Manning and Hinrich Schutze, “Foundations of Natural Language Processing” , 6 th Edition, The MIT Press Cambridge, Massachusetts London, England, 2003
-
Details
- Number of ECTS: 5
- Course number: MA_DS-18
- Module(s): Module 8 Advanced courses
- Language: EN
- Mandatory: No
-
Lecturer
Coming soon -
Objectives
Upon completion a successful student should be able to Construct classical estimators of unknown probability distribution Derive asymptotic properties of empirical distribution functions Perform tests for unknown distribution functions Construct non-parametric estimators of the density Estimate components of a non-linear regression model Understand the concept of minimax estimation
-
Course learning outcomes
Understanding the concepts of empirical distribution functions, kernel estimation and minimax theory Performing derivation of standard estimation and testing methods in nonparametric statistics -
Description
Estimation of probability measuresWeak limit theorems for empirical measuresKolmogorov-Smirnov and Cramer-von Mises testsEstimation of the densityNadaraja-Watson estimator and general kernel estimatorsNon-linear regression modelsMinimax theory -
Assessment
First session Solving bi-weekly exercises (40%) and a final exam (60%) will build the overall grade.Retake examWritten exam. -
Note
Literatur A.B. Tsybakov (2009): “Introduction to Nonparametric Estimation”, Springer. A. Van der Vaart and J. Wellner (1996): “Weak Convergence and Empirical Processes”, Springer.
-
Details
- Number of ECTS: 5
- Course number: MA_DS-24
- Module(s): Module 9 Workshops
- Language: EN
- Mandatory: Yes
-
Lecturer
Coming soon -
Objectives
Understand the main key topics in modern insurance, and how recent data science techniques may be used (or not) in this field.
-
Course learning outcomes
To know the key results in Risk Theory Implement machine learning models applied in insurance Define and calibrate models adapted to risk evaluation in insurance Identify the advantages and limits of machine learning techniques in the insurance field Apply Extreme Value Theory to analyze the tail of the distribution Determine methodologies adapted to emerging risks -
Description
This course aims at providing an overview of modern actuarial theory, and how recent data science techniques may help risk quantification in the context of insurance. Starting from standard risk theory, modern tools, for example from machine learning, will be introduced to show how new practices are currently introduced in the insurance industry, and what their limits are. Applications will cover non life and life insurance, reinsurance, and climate risk via natural disasters. An introduction to emerging risks, via the question of the rising cyber insurance market, will complete the course.Outline:I. Introduction to insuranceII. Basics of Risk TheoryIII. Standard regression models for insurance and extension to high dimensionIV. Machine learning: general principles and impact on actuarial risk evaluationV. Survival analysis for life and non-life insurance: application to individual reservingVI. Extreme value theory: introduction and tail index regressionVII. An example of an emerging risk: Cyber insurance -
Assessment
First session Project Retake exam Oral exam -
Note
Albrecher, H., Beirlant, J., & Teugels, J. L. (2017). Reinsurance: actuarial and statistical aspects. John Wiley & Sons. Charpentier, A. (Ed.). (2014). Computational actuarial science with R. CRC press. Kaas, R., Goovaerts, M., Dhaene, J., & Denuit, M. (2008). Modern actuarial risk theory: using R (Vol. 128). Springer Science & Business Media.
-
Details
- Number of ECTS: 5
- Course number: MA_DS-25
- Module(s): Module 9 Workshops
- Language: EN
- Mandatory: Yes
-
Lecturer
Coming soon -
Description
Part 1/2 – 20 hours, Vasja Sivec 1. Introduction to Forecasting Time Series (1 hours = 1 lecture) a. Introduction 2. Traditional Models for Forecasting Time Series (10 hours = 5 lectures + 5 exercise) a. Basic time series concepts – stationarity and integrated processes b. Forecasting Univariate time series – ARIMA c. Forecasting Multivariate time series – VAR d. Forecasting data with seasonal patterns – SARIMA and ETS i. Simple Example with Code ii. Theory iii. Exercise(s) 3. Neural Networks for Forecasting Time Series (6h = 3 lectures + 3 exercise) a. Introduction b. NN Representation c. Estimation: Feedforward & Backpropagation d. Forecasting i. Simple example with Code ii. Theory iii. Exercise(s) 4. Latest Findings & advanced models (4h = 4 lectures) a. Latest Findings (Traditional vs. Machine Learning Approach) b. Advanced models (ARCH, Markow Switching, MIDAS, Time Varying Parameter Models,..; if time)Part 2/2 – 20 hours, Bruno Rodrigues 5. Introduction to R and functional programming (4 hours) a. Pure functions b. Higher-order functions 6. Git and Github (3 hours) a. Intro to Git and Github b. Cloning repos c. Collaborating d. Branching e. Pull requests 7. Package development and unit testing (3 hours) a. Adding functions b. Documenting functions using roxygen c. Unit testing package d. Test coverage 8. Build automation and data products (6 hours) a. Build automation essentials b. Recording package versions c. Building a pipeline d. Literate programming with Quarto e. Interactive web apps using Shiny 9. Self-contained pipelines with Docker and Github Actions (4 hours) a. Docker essentials b. Building a fully reproducible pipeline c. Short introduction to Github Actions -
Assessment
Written examination on computer, possibly with testing of coding skills -
Note
Main textbook (students are not required to purchase these textbooks) Part 1 Ghysels, E. and Marcellino, M., 2018. Applied economic forecasting using time series methods. Oxford University Press. Lütkepohl, H., 2013. Introduction to multiple time series analysis. Springer Science & Business Media. Part 2 Textbook for Part 2 is available here: https://rap4mads.eu/Programming environment for exercises: Python or R, depending on what the students already know best.
Course offer for Semestre 4
-
Details
- Number of ECTS: 30
- Course number: MA_DS-30
- Module(s): Module 10 – Internship or Master Thesis
- Language: EN
- Mandatory: Yes
-
Lecturer
Coming soon