{"id":19361,"date":"2015-10-05T13:46:40","date_gmt":"2015-10-05T11:46:40","guid":{"rendered":"http:\/\/emblog.embl.de\/ells\/?post_type=teachingbase&#038;p=19361"},"modified":"2021-06-16T13:57:10","modified_gmt":"2021-06-16T13:57:10","slug":"part-3-phylogenetic-analysis-of-aligned-protein-sequences","status":"publish","type":"teachingbase","link":"https:\/\/www.embl.org\/ells\/teachingbase\/opsins-bioinformatics\/part-3-phylogenetic-analysis-of-aligned-protein-sequences\/","title":{"rendered":"Part 3: Phylogenetic analysis of aligned protein sequences"},"content":{"rendered":"\n<div class=\"vf-tabs\"><ul class=\"vf-tabs__list\" data-vf-js-tabs=\"true\"><li class=\"vf-tabs__item\"><a class=\"vf-tabs__link\" href=\"#vf-tabs__section-3687f4f2-ec44-4a8b-b9d9-892ccf767415\">Overview<\/a><\/li><li class=\"vf-tabs__item\"><a class=\"vf-tabs__link\" href=\"#vf-tabs__section-021ae0ac-4eb7-45e0-a5a3-34b30f9bad18\">Your task<\/a><\/li><li class=\"vf-tabs__item\"><a class=\"vf-tabs__link\" href=\"#vf-tabs__section-adf6fadf-79f2-42ed-a503-272a803893c1\">MUSCLE<\/a><\/li><li class=\"vf-tabs__item\"><a class=\"vf-tabs__link\" href=\"#vf-tabs__section-c6293ee4-0b5e-40e0-a9c1-6cc5cf1cd33e\">Questions<\/a><\/li><li class=\"vf-tabs__item\"><a class=\"vf-tabs__link\" href=\"#vf-tabs__section-62917c2a-aa30-4312-ad3c-464533410cc0\">Optional tasks<\/a><\/li><li class=\"vf-tabs__item\"><a class=\"vf-tabs__link\" href=\"#vf-tabs__section-90482065-e743-4d9f-8b75-fb7b30a95db5\">Activity navigation<\/a><\/li><\/ul><div class=\"vf-tabs-content\" data-vf-js-tabs-content=\"true\">\n<section class=\"vf-tabs__section\" id=\"vf-tabs__section-3687f4f2-ec44-4a8b-b9d9-892ccf767415\"><h2>Overview<\/h2>\n<p>In this part of the activity, we will construct a phylogenetic tree of the 24 opsin proteins. 
To do this, we will first have to remove the non-conserved amino acid residues which we identified in the multiple sequence alignment during Part 2 of the activity.<\/p>\n<\/section>\n\n\n\n<section class=\"vf-tabs__section\" id=\"vf-tabs__section-021ae0ac-4eb7-45e0-a5a3-34b30f9bad18\"><h2>Your task<\/h2>\n<p><strong>Proceed as described below:<\/strong><\/p>\n\n\n\n<p><strong>1.&nbsp;&nbsp;&nbsp;<\/strong>Edit the multiple sequence alignment from Part 2 to remove the non-conserved residues.<\/p>\n\n\n\n<p><strong>i.&nbsp;<\/strong>&nbsp; Go back to the JalView window which contains the multiple sequence alignment from Part 2.<\/p>\n\n\n\n<p><strong>ii.<\/strong>&nbsp;&nbsp; In JalView, delete any gaps or residues which appear as non-conserved: with your cursor, select the uppermost area above the relevant stretch of residues (a red box with a solid red top will appear), then press the Backspace key.<\/p>\n\n\n\n<p><strong>iii.<\/strong>&nbsp;Save the edited alignment as a FASTA file: File &gt; Output to Textbox &gt; FASTA. Open a new text document on the desktop of your computer by right-clicking on your desktop: New &gt; Text Document. Copy and paste the sequence data from the open window into the text document and leave the text document open for later use.<\/p>\n\n\n\n<p><strong>Note:<\/strong>&nbsp;if, for any reason, you do not manage to edit the sequences or to save them, it does not matter too much for the purpose of this exercise. We selected the sequences in such a way that they produce a clean phylogenetic tree without the need for extensive editing. (Please note, however, that in real life you should always edit the alignment to remove non-aligned regions before creating a phylogenetic tree!)<\/p>\n\n\n\n<p><strong>2. &nbsp;<\/strong>&nbsp;To create a phylogenetic tree file of the edited sequences, enter the edited alignment into MUSCLE (output format \u201cClustalW\u201d). 
To do this, go to the \u201cMUSCLE\u201d tab and follow the instructions there.<\/p>\n\n\n\n<p><strong>3.<\/strong>&nbsp;&nbsp; Try to answer some of the task questions.<\/p>\n<\/section>\n\n\n\n<section class=\"vf-tabs__section\" id=\"vf-tabs__section-adf6fadf-79f2-42ed-a503-272a803893c1\"><h2>MUSCLE<\/h2>\n<p><strong>1. &nbsp;<\/strong>&nbsp;To create a phylogenetic tree file of the edited sequences, copy the FASTA data of your edited alignment into the MUSCLE input box (before hitting \u201cSubmit\u201d, ensure that&nbsp; \u201cClustalW\u201d is selected as the output format).<\/p>\n\n\n\n<p><strong>2.&nbsp;&nbsp;&nbsp;<\/strong>On the results page, click \u201cSend to ClustalW_Phylogeny\u201d and in \u201cStep 2\u201d on the ClustalW phylogeny page select the following:<br>\u201cTree format\u201d: Default; \u201cDistance correction\u201d: ON; \u201cExclude gaps\u201d: ON (This is the most important step. By editing your sequences, you will have removed non-aligned gaps. However, in case you left any gaps in the alignment, selecting this option ensures that the algorithm uses only sequence positions that contain no gaps.); \u201cClustering method\u201d: UPGMA; \u201cP.I.M.\u201d: ON<\/p>\n\n\n\n<p><strong>3.<\/strong>&nbsp;&nbsp; Click \u201cSubmit\u201d.<\/p>\n\n\n\n<p><strong>4.<\/strong>&nbsp;&nbsp; At the bottom of the results window, beneath \u201cPhylogram\u201d, you will find an image of your phylogenetic tree. You can select the branch length as \u201cReal\u201d to see how fast your sequences have evolved. However, for the next part of the activity, the \u201cCladogram\u201d view is more convenient. 
Looking at the tree structure, try to answer some of the task questions.<\/p>\n\n\n<div\n  class=\"vf-embed vf-embed--custom-ratio\"\n\n  style=\"--vf-embed-max-width: 100%;\n    --vf-embed-custom-ratio-x: 640;\n    --vf-embed-custom-ratio-y: 360;\"\n><iframe loading=\"lazy\" width=\"640\" height=\"360\" src=\"https:\/\/www.ebi.ac.uk\/Tools\/msa\/muscle\/\" frameborder=\"0\" allow=\"accelerometer; autoplay; encrypted-media; gyroscope; picture-in-picture\" allowfullscreen><\/iframe><\/div>\n<\/section>\n\n\n\n<section class=\"vf-tabs__section\" id=\"vf-tabs__section-c6293ee4-0b5e-40e0-a9c1-6cc5cf1cd33e\"><h2>Questions<\/h2>\n<p><strong>1. &nbsp;<\/strong>&nbsp;Have a look at the structure of the tree. You should see one sequence which is an outgroup to all the others. Which one is it? Why do you think it is that one?<br><strong>2.&nbsp;<\/strong>&nbsp; The rest of the tree splits the sequences into two major groups. Does this split generally also reflect the evolutionary relationships between the species? Do you see any exceptions?<\/p>\n\n\n\n<p>You can now either analyse opsin evolution further in a short optional exercise (see the \u201cOptional tasks\u201d tab) or proceed to Part 4 of the activity by clicking on the link below.<\/p>\n<\/section>\n\n\n\n<section class=\"vf-tabs__section\" id=\"vf-tabs__section-62917c2a-aa30-4312-ad3c-464533410cc0\"><h2>Optional tasks<\/h2>\n<h3 class=\"wp-block-heading\"><strong>Further analysis of the multiple sequence alignment<\/strong><\/h3>\n\n\n\n<p>We have now successfully constructed a phylogenetic tree of the opsin proteins. 
In this part of the activity, we will use the multiple sequence alignment to further analyse the evolution of opsins.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Your Task<\/strong><\/h3>\n\n\n\n<p>Go back to the multiple sequence alignment in JalView and arrange the sequences according to the groupings you noticed in the phylogenetic tree in Part 3 (\u201cCladogram\u201d view). Sequences can be moved around by clicking on the sequence name and using the up and down arrows on your keyboard. Note: ignore the Danio_mel_rec1A sequence at the top\/bottom of the alignment in your analysis or remove the sequence from JalView completely (you can remove it by clicking on the sequence name and pressing the Backspace key).<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Questions<\/strong><\/h3>\n\n\n\n<p><strong>1.&nbsp;<\/strong>In G protein-coupled receptors, a tripeptide motif just after transmembrane domain VII (the final one) is important for G-alpha binding. 
Can you find a tripeptide which also reflects the grouping of the sequences?<br><strong>2.<\/strong>&nbsp;What do you think is the reason for this tripeptide to be so conserved?<\/p>\n<\/section>\n\n\n\n<section class=\"vf-tabs__section\" id=\"vf-tabs__section-90482065-e743-4d9f-8b75-fb7b30a95db5\"><h2>Activity navigation<\/h2>\n<ul class=\"wp-block-list\"><li><a href=\"https:\/\/www.embl.org\/ells\/teachingbase\/opsins-bioinformatics\/\" data-type=\"teachingbase\" data-id=\"3942\">Introductory page<\/a><\/li><li><a href=\"https:\/\/www.embl.org\/ells\/teachingbase\/opsins-bioinformatics\/part-1-search-for-protein-identity\/\" data-type=\"teachingbase\" data-id=\"19357\">Part 1: Search for protein identity<\/a><\/li><li><a href=\"https:\/\/www.embl.org\/ells\/teachingbase\/opsins-bioinformatics\/part-2-multiple-sequence-alignment-of-protein-sequences\/\" data-type=\"teachingbase\" data-id=\"19359\">Part 2: Multiple sequence alignment of protein sequences<\/a><\/li><li><a href=\"https:\/\/www.embl.org\/ells\/teachingbase\/opsins-bioinformatics\/part-3-phylogenetic-analysis-of-aligned-protein-sequences\/\" data-type=\"teachingbase\" data-id=\"19361\">Part 3: Phylogenetic analysis of aligned protein sequences<\/a><\/li><li><a href=\"https:\/\/www.embl.org\/ells\/teachingbase\/opsins-bioinformatics\/part-4-topological-and-structural-analysis-of-proteins\/\" data-type=\"teachingbase\" data-id=\"19363\"><strong>Part 4: Topological and structural analysis of 
proteins<\/strong><\/a><\/li><\/ul>\n<\/section>\n<\/div><\/div>\n","protected":false},"excerpt":{"rendered":"","protected":false},"featured_media":1201,"parent":3942,"menu_order":2,"template":"","class_list":["post-19361","teachingbase","type-teachingbase","status-publish","has-post-thumbnail","hentry"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.embl.org\/ells\/wp-json\/wp\/v2\/teachingbase\/19361","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.embl.org\/ells\/wp-json\/wp\/v2\/teachingbase"}],"about":[{"href":"https:\/\/www.embl.org\/ells\/wp-json\/wp\/v2\/types\/teachingbase"}],"up":[{"embeddable":true,"href":"https:\/\/www.embl.org\/ells\/wp-json\/wp\/v2\/teachingbase\/3942"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.embl.org\/ells\/wp-json\/wp\/v2\/media\/1201"}],"wp:attachment":[{"href":"https:\/\/www.embl.org\/ells\/wp-json\/wp\/v2\/media?parent=19361"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}