Protein sequence databases and sequence annotation at UniProtKB - streamed

Date 28 April 2022
Speaker(s) Elisabeth Gasteiger, Marie-Claude Blatter, Ivo Pedruzzi
ECTS 0.25
Fees *academic: 60 CHF   -   for-profit: 300 CHF
Cancellation deadline 21 Apr 2022
City Streamed from Geneva
*The academic fee also applies to non-profit organisations and to participants who are unemployed at the time of application.

This course will be streamed only for registered participants. Registered participants will receive information from the course organizer in due time.

Overview

Using high-throughput technologies, you can identify long lists of candidate genes that differ between two experimental conditions. In order to interpret these gene lists and to discover fundamental properties like gene function and disease relevance, you need to use the annotation linked to a given gene or protein sequence.

This course and the practical exercises that follow aim to provide basic theoretical and practical knowledge of protein sequence databases, with a focus on UniProtKB, on the different manual and automated annotation pipelines (such as HAMAP) and, in particular, on the optimal use of UniProtKB. UniProtKB and HAMAP are SIB resources; they are listed in Expasy, the Swiss Bioinformatics Portal.

During the theory and practical sessions, we will discuss questions such as:

  • Where do the protein sequences come from?
  • What are the differences between the major protein sequence databases?
  • What are the manual and automated gene / protein annotation pipelines?
  • What are the Gene Ontology (GO) annotation pipelines?
  • How to assess protein sequence accuracy and annotation quality?
  • How to extract biological knowledge from a BLAST result or gene list?

Audience

This course targets biologists and bioinformaticians who seek to analyze protein data. It will also be useful to programmers and data scientists, be they from academia or industry, who programmatically access protein sequence databases and need to understand the data.
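
For readers in the second group, here is a small illustration of the kind of data involved: a minimal Python sketch that splits a UniProtKB-style FASTA header into its fields. The function name `parse_uniprot_header` and the example header are illustrative assumptions, not course material. The sketch assumes the usual header layout, in which `sp` marks a reviewed (Swiss-Prot) entry, `tr` an unreviewed (TrEMBL) entry, and `PE` gives the protein existence level from 1 (evidence at protein level) to 5 (uncertain).

```python
import re

# Assumed header layout (UniProtKB FASTA convention):
# >db|Accession|EntryName ProteinName OS=Organism OX=TaxonID [GN=Gene] PE=Level SV=Version
HEADER_RE = re.compile(
    r"^>(?P<db>sp|tr)\|(?P<accession>[^|]+)\|(?P<entry_name>\S+)\s+"
    r"(?P<protein_name>.*?)\s+OS=(?P<organism>.*?)\s+OX=(?P<taxon_id>\d+)"
    r"(?:\s+GN=(?P<gene>\S+))?\s+PE=(?P<protein_existence>\d)\s+SV=(?P<sequence_version>\d+)\s*$"
)


def parse_uniprot_header(line):
    """Hypothetical helper: split one UniProtKB-style FASTA header into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"Not a recognised UniProtKB FASTA header: {line!r}")
    fields = match.groupdict()
    # 'sp' = reviewed (Swiss-Prot), 'tr' = unreviewed (TrEMBL)
    fields["reviewed"] = fields.pop("db") == "sp"
    # Convert the numeric fields from strings to integers
    for key in ("taxon_id", "protein_existence", "sequence_version"):
        fields[key] = int(fields[key])
    return fields


if __name__ == "__main__":
    # Illustrative header in the UniProtKB style
    example = (
        ">sp|P05067|A4_HUMAN Amyloid-beta precursor protein "
        "OS=Homo sapiens OX=9606 GN=APP PE=1 SV=3"
    )
    for name, value in parse_uniprot_header(example).items():
        print(f"{name}: {value}")
```

Even these few fields bear on the questions above: the reviewed flag separates manually annotated entries from automatically annotated ones, and the protein existence level indicates how well supported the existence of the protein is.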

Learning outcomes

At the end of the course, participants are expected to be able to:

  • list the differences between the major protein sequence databases
  • describe the major protein sequence and GO annotation pipelines
  • assess the accuracy of a protein sequence and the quality of annotation

Prerequisites

Knowledge / competencies

This course is designed for beginners and has no prerequisites.

Technical

This course will be streamed; you will therefore need your own computer with an Internet connection.

Application

The registration fee is 60 CHF for academics and 300 CHF for for-profit companies.

You will be informed by email of your registration confirmation. Upon receipt of the confirmation email, participants will be asked to confirm attendance by paying the fees within 5 days.

The deadline for free-of-charge cancellation is 21 April 2022. Fees will not be reimbursed for cancellations after this date. Please note that participation in SIB courses is subject to our general conditions.

Venue and Time

This course will be streamed using Zoom.

The course will start at 9:00 CEST and end around 17:00 CEST. Precise information will be provided to the participants in due time.

Additional information

Coordination: Monique Zahn, SIB training group.

We recommend 0.25 ECTS credits for this course, subject to passing an exam at the end of the course.

You are welcome to subscribe to the SIB courses mailing list using the form here; subscribers are informed of all future courses and workshops, as well as all important deadlines.

SIB abides by the ELIXIR Code of Conduct. Participants in SIB courses are also required to abide by this code.

For more information, please contact training@sib.swiss.