{"id":25568,"date":"2021-04-28T18:42:40","date_gmt":"2021-04-28T16:42:40","guid":{"rendered":"http:\/\/emblog.embl.de\/ells\/?post_type=teachingbase&#038;p=25568"},"modified":"2023-02-07T09:00:06","modified_gmt":"2023-02-07T09:00:06","slug":"bioinformatics-tools-and-sample-data","status":"publish","type":"teachingbase","link":"https:\/\/www.embl.org\/ells\/teachingbase\/dna-barcoding-resource\/bioinformatics-tools-and-sample-data\/","title":{"rendered":"Bioinformatics tools and sample data"},"content":{"rendered":"\n<div class=\"vf-tabs\"><ul class=\"vf-tabs__list\" data-vf-js-tabs=\"true\"><li class=\"vf-tabs__item\"><a class=\"vf-tabs__link\" href=\"#vf-tabs__section-29e5e60e-4f29-4387-8f40-7dab28d4be39\" data-vf-js-location-nearest-activation-target=\"\">Overview<\/a><\/li><li class=\"vf-tabs__item\"><a class=\"vf-tabs__link\" href=\"#vf-tabs__section-9c83087e-2c23-4dae-b71d-00be7d6d002a\" data-vf-js-location-nearest-activation-target=\"\">Example data<\/a><\/li><li class=\"vf-tabs__item\"><a class=\"vf-tabs__link\" href=\"#vf-tabs__section-705d3c74-42cc-4a2d-b94c-23e55887a9dd\" data-vf-js-location-nearest-activation-target=\"\">EMBOSS Seqret<\/a><\/li><li class=\"vf-tabs__item\"><a class=\"vf-tabs__link\" href=\"#vf-tabs__section-aa9a2c8e-0d1f-4d55-8a00-f50779be43f8\" data-vf-js-location-nearest-activation-target=\"\">EMBOSS Needle<\/a><\/li><li class=\"vf-tabs__item\"><a class=\"vf-tabs__link\" href=\"#vf-tabs__section-f2a280cb-f46c-4dab-bf17-807a01abe71d\" data-vf-js-location-nearest-activation-target=\"\">European Nucleotide Archive<\/a><\/li><li class=\"vf-tabs__item\"><a class=\"vf-tabs__link\" href=\"#vf-tabs__section-ee25bba9-dee0-4534-8345-22fd2da685ea\" data-vf-js-location-nearest-activation-target=\"\">Chromatogram viewer<\/a><\/li><li class=\"vf-tabs__item\"><a class=\"vf-tabs__link\" href=\"#vf-tabs__section-6d091066-2367-42b8-a912-72686ae181f3\" data-vf-js-location-nearest-activation-target=\"\">Activity navigation<\/a><\/li><\/ul><div class=\"vf-tabs-content\" data-vf-js-tabs-content=\"true\">\n<section class=\"vf-tabs__section\" id=\"vf-tabs__section-29e5e60e-4f29-4387-8f40-7dab28d4be39\"><h2>Overview<\/h2>\n<p>On this page, we provide a summary of the bioinformatics tools to use for the analysis of your DNA sequencing data. We also provide example sequence data which you can use in case you did not collect your own plant samples and wet-lab experimentation.<\/p>\n<\/section>\n\n\n\n<section class=\"vf-tabs__section\" id=\"vf-tabs__section-9c83087e-2c23-4dae-b71d-00be7d6d002a\"><h2>Example data<\/h2>\n<p>The example sequence data provide you with the opportunity to use barcode sequences without the need to carry out any experiments yourself. The sequence folder contains the .seq (text information) and .abi (chromatogram information) files for. You may use them to assemble contigs and then search the ENA database for matches. In case you would like to skip contig assembly, we have provided text files which contain the contig sequences of the respective examples.<\/p>\n\n\n\n<p>Following sequence preparation and search using the example data, the identity and an image of the plant in question can be viewed by clicking on &#8220;Example 1&#8221;, &#8220;Example 2&#8221;, etc. below, respectively. Please enter the genus name of the identified plant (e.g. Olea) to access the image.<\/p>\n\n\n\n<p><strong>A folder containing all example data files (.seq and .abi files) can be accessed<\/strong> <a href=\"https:\/\/drive.google.com\/drive\/folders\/16WRHbqR-S2c7V7h92NGjnp9P4J1ilmiS?usp=share_link\" data-type=\"URL\" data-id=\"https:\/\/drive.google.com\/drive\/folders\/16WRHbqR-S2c7V7h92NGjnp9P4J1ilmiS?usp=share_link\" target=\"_blank\" rel=\"noreferrer noopener\"><strong>here<\/strong><\/a>.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Example 1<\/strong> <\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Forward and reverse sequencing reads for <em>rbcL<\/em> barcode<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li>This sequence is identical to &#8220;TutorialExample&#8221;<a title=\"Click to reveal but have your password ready;)\" href=\"\/ells\/dnabarcoding\/?page_id=5482\"><\/a><\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Example 2<\/strong><\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Forward and reverse sequencing reads for <em>rbcL<\/em> barcode<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Example 3<\/strong><\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Forward and reverse sequencing reads for <em>rbcL<\/em> barcode<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Example 4<\/strong><\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Forward and reverse sequencing reads for <em>rbcL<\/em> barcode<\/li>\n\n\n\n<li>Contig for <em>rbcL<\/em> read<\/li>\n\n\n\n<li>Reverse sequencing reads for <em>matK<\/em> barcode (please read note doc in folder)<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Example 5<\/strong><\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Forward and reverse sequencing reads for <em>rbcL<\/em> barcode<\/li>\n\n\n\n<li>Contig for <em>rbcL<\/em> read<\/li>\n\n\n\n<li>Forward and reverse sequencing reads for <em>matK<\/em> barcode<\/li>\n\n\n\n<li>Contig for <em>matK<\/em> read<\/li>\n<\/ul>\n<\/section>\n\n\n\n<section class=\"vf-tabs__section\" id=\"vf-tabs__section-705d3c74-42cc-4a2d-b94c-23e55887a9dd\"><h2>EMBOSS Seqret<\/h2>\n<p>Tool which can be used to reverse complement a nucleotide sequence.<\/p>\n\n\n\n<p><a href=\"https:\/\/www.ebi.ac.uk\/Tools\/sfc\/emboss_seqret\/\">https:\/\/www.ebi.ac.uk\/Tools\/sfc\/emboss_seqret\/<\/a><\/p>\n\n\n<div\n  class=\"vf-embed vf-embed--custom-ratio\"\n\n  style=\"--vf-embed-max-width: 100%;\n    --vf-embed-custom-ratio-x: 640;\n    --vf-embed-custom-ratio-y: 360;\"\n><iframe loading=\"lazy\" width=\"640\" height=\"360\" src=\"https:\/\/www.ebi.ac.uk\/Tools\/sfc\/emboss_seqret\/\" frameborder=\"0\" allow=\"accelerometer; autoplay; encrypted-media; gyroscope; picture-in-picture\" allowfullscreen><\/iframe><\/div>\n\n\n\n<p class=\"has-text-align-right\"><\/p>\n<\/section>\n\n\n\n<section class=\"vf-tabs__section\" id=\"vf-tabs__section-aa9a2c8e-0d1f-4d55-8a00-f50779be43f8\"><h2>EMBOSS Needle<\/h2>\n<p>Tool to align two sequences. To align nucleotide sequences select &#8220;nucleotide&#8221;.<\/p>\n\n\n\n<p><a href=\"https:\/\/www.ebi.ac.uk\/Tools\/psa\/emboss_needle\/\">https:\/\/www.ebi.ac.uk\/Tools\/psa\/emboss_needle\/<\/a><\/p>\n\n\n<div\n  class=\"vf-embed vf-embed--custom-ratio\"\n\n  style=\"--vf-embed-max-width: 100%;\n    --vf-embed-custom-ratio-x: 640;\n    --vf-embed-custom-ratio-y: 360;\"\n><iframe loading=\"lazy\" width=\"640\" height=\"360\" src=\"https:\/\/www.ebi.ac.uk\/Tools\/psa\/emboss_needle\/\" frameborder=\"0\" allow=\"accelerometer; autoplay; encrypted-media; gyroscope; picture-in-picture\" allowfullscreen><\/iframe><\/div>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>Guide to EMBOSS Needle nucleotide sequence alignment result<\/strong><\/h4>\n\n\n\n<p><strong>Vertical lines<\/strong>&nbsp;indicate identical nucleotides (matches).<\/p>\n\n\n\n<p><strong>Dots<\/strong>&nbsp;or<strong>&nbsp;empty spaces<\/strong>&nbsp;indicate mismatches (i.e. conflicting nucleotide reads in forward and reverse sequence).<\/p>\n\n\n\n<p><strong>Empty spaces<\/strong>&nbsp;indicate gaps (i.e. a nucleotide read versus no nucleotide read in the two sequences).<\/p>\n\n\n\n<p><strong>Horizontal dashes within a sequence<\/strong>&nbsp;indicate gaps.<\/p>\n\n\n\n<p><strong>Horizontal dashes at either side of the alignment<\/strong>&nbsp;indicate that the two sequences did not produce an overlapping read in these areas (no alignment).<\/p>\n\n\n\n<p>The&nbsp;<strong>letter \u201cN\u201d<\/strong>&nbsp;within a sequence is used to denote an unknown nucleotide.<\/p>\n<\/section>\n\n\n\n<section class=\"vf-tabs__section\" id=\"vf-tabs__section-f2a280cb-f46c-4dab-bf17-807a01abe71d\"><h2>European Nucleotide Archive<\/h2>\n<p>The ENA database provides a comprehensive record of the world\u2019s nucleotide sequencing information, covering raw sequencing data, sequence assembly information and functional annotation. ENA is made up of a number of distinct databases that includes EMBL-Bank, the Sequence Read Archive (SRA) and the Trace Archive.<\/p>\n\n\n\n<p><a href=\"https:\/\/www.ebi.ac.uk\/ena\/browser\/sequence-search\">https:\/\/www.ebi.ac.uk\/ena\/browser\/sequence-search<\/a><\/p>\n\n\n<div\n  class=\"vf-embed vf-embed--custom-ratio\"\n\n  style=\"--vf-embed-max-width: 100%;\n    --vf-embed-custom-ratio-x: 640;\n    --vf-embed-custom-ratio-y: 360;\"\n><iframe loading=\"lazy\" width=\"640\" height=\"360\" src=\"https:\/\/www.ebi.ac.uk\/ena\/browser\/sequence-search\" frameborder=\"0\" allow=\"accelerometer; autoplay; encrypted-media; gyroscope; picture-in-picture\" allowfullscreen><\/iframe><\/div>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>ENA result columns in NCBI BLAST+ (definitions)<\/strong><\/h4>\n\n\n\n<p><strong>&#8220;Align.&#8221;<\/strong> reports the number which has been assigned to the search result according to the best alignment (i.e. the result with the best hit is assigned number 1).<\/p>\n\n\n\n<p><strong>&#8220;DB:ID&#8221;<\/strong> gives the database identification code of the database entry\/search result.<\/p>\n\n\n\n<p><strong>&#8220;Source&#8221;<\/strong> provides information on the source of the DNA, i.e. organism name and sequence type (e.g. rbcL gene).<\/p>\n\n\n\n<p><strong>&#8220;Length&#8221;<\/strong> reports the length of the database sequence.<\/p>\n\n\n\n<p><strong>&#8220;Score&#8221;<\/strong> describes the quality of the alignment. The higher the score, the higher the similarity between query and database sequence.<\/p>\n\n\n\n<p><strong>&#8220;Identities&#8221;<\/strong> reports the number of identical residues that are found aligned between the query and database sequence.<\/p>\n\n\n\n<p><strong>&#8220;Positives&#8221;<\/strong> reports the number of aligned residues that score positively in the substitution matrix (i.e. similar types of residues).<\/p>\n\n\n\n<p><strong>&#8220;E()&#8221;<\/strong> gives the expect value for the alignment \u2013 this is a measure of how likely this particular alignment occurs by chance when searching a database of a particular size. An E-value of 0.0 is the lowest possible value and indicates that you would not expect to see this alignment to occur by chance alone. For example, an E-value of 10 means, in a database of current size, one might expect to see 10 matches with a similar or better score, simply by chance alone. Exponential numbers will appear when the value is very small above 0.<\/p>\n<\/section>\n\n\n\n<section class=\"vf-tabs__section\" id=\"vf-tabs__section-ee25bba9-dee0-4534-8345-22fd2da685ea\"><h2>Chromatogram viewer<\/h2>\n<p>To study your chromatograms, we recommend you use one of the chromatogram viewers below and open the .ab1 files of the forward and reverse sequence.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>Suggested freeware chromatogram viewers<\/strong><\/h4>\n\n\n\n<p>Windows users: <strong><a title=\"Chromas Lite\" rel=\"noopener noreferrer\" href=\"http:\/\/technelysium.com.au\/?page_id=13\" target=\"_blank\">Chromas Lite<\/a><\/strong> <\/p>\n\n\n\n<p>Mac users: <strong><a title=\"4Peaks\" rel=\"noopener noreferrer\" href=\"http:\/\/nucleobytes.com\/index.php\/4peaks\" target=\"_blank\">4Peaks<\/a><\/strong> <\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>Suggested web browser-based alternative<\/strong><\/h4>\n\n\n\n<p>For the purpose of sequence analysis and contig assembly it is recommended to use downloadable chromatogram viewer software. In case downloading software is not an option, a web browser-based viewer (see link below) could be used. Please note that the web browser-based alternative is less convenient for sequence analysis than software versions as it exports the chromatogram as images. Software functions such as nucleotide search are thus not available and need to be replaced manually has thus to be done visually rather than with the help of the software&#8217;s search function.<\/p>\n\n\n\n<p>Web browser-based chromatogram viewer: <strong><a title=\"abiview\" rel=\"noopener noreferrer\" href=\"http:\/\/emboss.bioinformatics.nl\/cgi-bin\/emboss\/abiview\" target=\"_blank\">abiview chromatogram viewer<\/a><\/strong> <\/p>\n<\/section>\n\n\n\n<section class=\"vf-tabs__section\" id=\"vf-tabs__section-6d091066-2367-42b8-a912-72686ae181f3\"><h2>Activity navigation<\/h2>\n<ul class=\"wp-block-list\"><li><a href=\"https:\/\/www.embl.org\/ells\/teachingbase\/dna-barcoding-resource\/\" data-type=\"teachingbase\" data-id=\"22912\">Introductory page <\/a><\/li><\/ul>\n\n\n\n<h3 class=\"wp-block-heading\">Sample collection and wet-lab<\/h3>\n\n\n\n<ul class=\"wp-block-list\"><li><a href=\"https:\/\/www.embl.org\/ells\/teachingbase\/dna-barcoding-resource\/wet-lab-protocols\/\" data-type=\"teachingbase\" data-id=\"25575\">Protocols for sample collection and wet-lab activity<\/a><\/li><\/ul>\n\n\n\n<h3 class=\"wp-block-heading\">Bioinformatics<\/h3>\n\n\n\n<ul class=\"wp-block-list\"><li><a href=\"https:\/\/www.embl.org\/ells\/teachingbase\/dna-barcoding-resource\/bioinformatics-tutorials\/\" data-type=\"teachingbase\" data-id=\"25570\">Bioinformatics tutorial<\/a><\/li><li><a href=\"https:\/\/www.embl.org\/ells\/teachingbase\/dna-barcoding-resource\/species-identification\/\" data-type=\"teachingbase\" data-id=\"25573\">Species identification using bioinformatics<\/a><\/li><li><a href=\"https:\/\/www.embl.org\/ells\/teachingbase\/dna-barcoding-resource\/bioinformatics-tools-and-sample-data\/\" data-type=\"teachingbase\" data-id=\"25568\">Bioinformatics tools and sample data<\/a><\/li><li><a href=\"https:\/\/www.embl.org\/ells\/teachingbase\/dna-barcoding-resource\/troubleshooting-glossary\/\" data-type=\"teachingbase\" data-id=\"25566\">DNA barcoding troubleshooting &amp; glossary<\/a><\/li><\/ul>\n<\/section>\n<\/div><\/div>\n","protected":false},"excerpt":{"rendered":"","protected":false},"featured_media":26870,"parent":22912,"menu_order":0,"template":"","class_list":["post-25568","teachingbase","type-teachingbase","status-publish","has-post-thumbnail","hentry"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.embl.org\/ells\/wp-json\/wp\/v2\/teachingbase\/25568","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.embl.org\/ells\/wp-json\/wp\/v2\/teachingbase"}],"about":[{"href":"https:\/\/www.embl.org\/ells\/wp-json\/wp\/v2\/types\/teachingbase"}],"up":[{"embeddable":true,"href":"https:\/\/www.embl.org\/ells\/wp-json\/wp\/v2\/teachingbase\/22912"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.embl.org\/ells\/wp-json\/wp\/v2\/media\/26870"}],"wp:attachment":[{"href":"https:\/\/www.embl.org\/ells\/wp-json\/wp\/v2\/media?parent=25568"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}