We will dive into the fascinating world of bioinformatics to unravel the secrets of proteins, one of the fundamental building blocks of molecular biology. Through a series of interactive exercises, we will investigate how proteins are made, how they look and how they have evolved across species. Join us on this fictional scientific adventure that is intertwined with real-life research activities at EMBL, and gain insights into cutting-edge research methodologies.
The aim of this activity is to deepen our understanding of proteins and their evolution, while also providing valuable insights into the practical applications of bioinformatics in scientific research.
During the TREC (Traversing European Coastlines) expedition, researchers gather a wide array of biological samples from the European coastline to investigate the coastal ecosystems. These samples are obtained from soil, sediments, aerosol, shallow water and sea, and are meticulously analysed using scientific techniques such as microscopy or DNA sequencing.
Recently, the researchers stumbled upon a perplexing discovery within one of their samples from the shallow water — an unidentified DNA sequence that does not match any known sequence. Intrigued by this finding, the TREC researchers have a challenge for you: Can you help them to identify the protein encoded by this sequence and decipher its biological function? Furthermore, can you assist in identifying the species of origin and exploring its evolutionary relationships with other organisms?
Most of the tools we will employ during this activity have been developed and made freely available to scientists all over the world by EMBL-EBI.
NCBI BLAST+ (Basic Local Alignment Search Tool)
A tool used to identify and compare biological sequence information, such as amino acid sequences. It enables researchers to compare a subject protein (referred to as a query) with a database of sequences, making it useful in identifying unknown proteins.
A tool designed for aligning and comparing multiple sequences, specifically suited for amino acid sequences.
A tool designed to perform basic phylogenetic analysis on a multiple sequence alignment.
UniProt (Universal Protein Resource) database
A comprehensive catalogue of protein information, including protein sequences and their functions.
A database for protein structure predictions. AlphaFold is an artificial intelligence (AI) system that can predict a protein’s 3D structure based on its amino acid sequence. The AlphaFold Database is also integrated into UniProt.
Note: These tools require some time for the calculations. Therefore, a little patience may be necessary.
Topic area: Bioinformatics, Evolutionary biology, Structural & Computational biology, Biochemistry
Type of resource: Online resource
Age group: 16-19
Contact: SEPE team