04 - 07 June 2018
Cancellation deadline:
25 May 2018
Linda Dib, Frédéric Schütz, Philippe Jacquet
Academic: 240 CHF
For-profit: 0 CHF
1 ECTS credits

No future instance of this course is planned yet

**The course if full therefore closed. **


In this course, biologists, computational biologists and bioinformatics scientists will acquire the key competencies that they need when dealing with big data from the health and life sciences fields. We will discuss the visualisation of large data sets, machine-leaning algorithms, linear models, classification algorithms as well introduce neural networks. Since many R functions cannot handle large datasets, we will also explore alternative solutions for several widely used statistical functions.


Biologists and bioinformaticians who need to handle large datasets, in particular those that R standard functions cannot deal with. The participants must already be confortable with R (i.e use linear models, calculate a correlation, and visualization of data using R).

Learning objectives

At the end of the course, the participants will be able to:

  1. Identify the challenges behind the analysis of big datasets
  2. Explore big datasets using visualisation and classification tools
  3. Apply decisional machine learning and clustering methods on large datasets
  4. Apply linear models functions on big data
  5. Classify big datasets using neural network methods


(subject to changes)

The course will cover the following topics:

Day 1.

  • Overview: big data case studies in health domain
  • Identify the general challenges behind big data analysis (model and overfitting)
  • Big data visualisation

Day 2. Linear models

Day 3.

  • Big data exploration and classifications
  • Machine learning and decisional algorithms – unsupervised learning

Day 4. Introduction to: Decision Tree, Random forest, Neural Networks, Deep learning


Knowledge / Competencies

  • Statistics beginner level (T-test, multiple testing methods, linear models)
  • R beginner level (Rstudio, ggplot, plots, install a library, matrix manipulation, read files)


  • You are required to bring your own laptop, with a working Wifi connection, and the latest versions of R and RStudio installed.


The course if full, therefore closed.

The registration fees for academics are 240 CHF for academics and 1200 CHF for for-profit companies. This includes course content material, coffee breaks and social dinner on Tuesday 5 June (Save the date, more info to follow).

Deadline for registration and free-of-charge cancellation is set is set to 25/05/2018. Cancellation after this date will not be reimbursed. Please note that participation to SIB courses is subject to our general conditions.

You will be informed by email of your registration confirmation.

Venue and Time

University of Lausanne

  • Monday, Tuesday, Wednesday: classroom 2020 - Génopode Building (Metro M1 line, Sorge station).
  • Thursday: classroom 321 - Amphipôle Building (Metro M1 line, Sorge station).

The course will start at 9:00 and end around 17:00. Precise information will be provided to the participants on due time.

Additional information

Coordination: Diana Marek

We will recommend 1 ECTS credits for this course (given a passed exam at the end of the course).

You are welcome to register to the SIB courses mailing list to be informed of all future courses and workshops, as well as all important deadlines using the form here.

For more information, please contact training@sib.swiss.