{"id":64001,"date":"2023-11-15T10:00:00","date_gmt":"2023-11-15T09:00:00","guid":{"rendered":"https:\/\/www.embl.org\/news\/?post_type=embletc&#038;p=64001"},"modified":"2023-11-15T10:09:02","modified_gmt":"2023-11-15T09:09:02","slug":"deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation","status":"publish","type":"embletc","link":"https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/","title":{"rendered":"Deciphering the data deluge: how large language models are transforming scientific data curation"},"content":{"rendered":"\n<p>In a world inundated with data, curating valuable information has never been more challenging, or more important. From academic papers to scientific databases, the deluge of new information can be overwhelming, leaving researchers in a constant struggle to keep up. However, a groundbreaking innovation in artificial intelligence is helping to transform the data curation landscape: large language models (LLMs) such as those behind ChatGPT. Powered by sophisticated deep-learning algorithms, these models are revolutionising how we streamline and curate massive volumes of data.<\/p>\n\n\n\n<p>Here we look at some of the ways researchers at EMBL\u2019s European Bioinformatics Institute (EMBL-EBI) are taking advantage of LLMs to aid their data curation processes. From automating the summary and annotation of academic papers to assisting with ontology mapping, LLMs are not just aiding human curators but also have the potential to enhance the quality of the data EMBL-EBI provides to its users.&nbsp;<\/p>\n\n\n\n<article class=\"vf-card vf-card--brand vf-card--bordered vf-u-margin__bottom--800\" default>\n  <div class=\"vf-card__content | vf-stack vf-stack--400\">\n      <h3 class=\"vf-card__heading\">\n      What are large language models (LLMs)?\u00a0    <\/h3>\n                <p class=\"vf-card__text\"><span style=\"font-weight: 400;\">LLMs are a type of artificial intelligence system trained on vast amounts of textual data. By processing and learning from this data, these models can generate coherent and contextually relevant text across a wide range of topics. LLMs can understand and produce human-like text, making them valuable tools for tasks such as content creation, answering questions, and natural language understanding. <\/span><\/p>\n      <\/div>\n<\/article>\n\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Transforming data curation&nbsp;<\/strong><\/h2>\n\n\n\n<p>Academic papers are currently being published at an unprecedented pace, and the challenge of pulling out relevant information has never been greater. <a href=\"https:\/\/www.ebi.ac.uk\/people\/person\/andrew-green\/\">Andrew Green, ARISE Fellow at EMBL-EBI<\/a> has been using LLMs to streamline the data curation for EMBL-EBI\u2019s database for non-coding RNAs, <a href=\"https:\/\/rnacentral.org\/\">RNAcentral<\/a>.&nbsp;<\/p>\n\n\n\n<p>To do this, Green has successfully developed a tool to scrape scientific articles that mention specific RNA identifiers. These sentences are then fed into GPT-3.5, which generates concise, coherent summaries about the RNA of interest. These summaries describe key details such as the RNA&#8217;s functions, its involvement in diseases, and the organisms in which it has been studied.<\/p>\n\n\n\n<p>&#8220;One of the intriguing features of using LLMs is the accuracy and contextual understanding they bring into the summarisation process,&#8221; said Green. &#8220;We&#8217;ve seen the model accurately decipher acronyms in a given context and even self-correct its errors when asked to fact-check its summaries.&#8221;<\/p>\n\n\n\n<p>To ensure the summaries generated are robust, they go through multiple rounds of validation, and are then rated for quality before appearing in the RNAcentral database. The summaries serve as quick references for scientists to better understand a particular RNA, and also include clickable citations to the original articles on <a href=\"https:\/\/europepmc.org\/\">Europe PMC<\/a>.<\/p>\n\n\n\n<p>&#8220;It&#8217;s crucial to remember that LLMs don&#8217;t inherently know the difference between what&#8217;s real and what&#8217;s fabricated,\u201d added Green. \u201cIn the scientific community, where factual accuracy is paramount, this could be a major concern. Models can sometimes &#8216;hallucinate&#8217; details that aren&#8217;t in the original text. To mitigate this, we have put multiple validation rounds in place. This, combined with constant human oversight, ensures that the information presented is both accurate and reliable.&#8221;<\/p>\n\n\n\n<p>At the heart of this approach is an automated method for extracting and summarising valuable information from a multitude of academic articles. This means that this work can also be applied to many other EMBL-EBI resources. Once fully developed and implemented, this automated process for curation serves to aid the work of many of EMBL-EBI\u2019s curators, acting as a first filter in the lengthy process of data collection and interpretation.&nbsp;<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Accelerating annotation&nbsp;<\/strong><\/h2>\n\n\n\n<p>Another aspect of the EMBL-EBI database pipelines that can benefit from LLMs is data annotation. <a href=\"https:\/\/www.ebi.ac.uk\/people\/person\/melanie-vollmar\/\">Melanie Vollmar is an ARISE Fellow at EMBL-EBI<\/a> with a strong background in structural biology and a growing expertise in machine learning. As part of her fellowship, she is looking at how to fully automate the extraction of functional information about proteins from academic papers using LLMs.&nbsp;<\/p>\n\n\n\n<p>Her project focuses on gathering structural information from the Protein Data Bank in Europe (PDBe) and supplementing it with related academic publications from Europe PMC. This curated information is then mined for specific functional details, which are mapped back onto the protein sequences listed in <a href=\"https:\/\/www.uniprot.org\/\">UniProt<\/a>.&nbsp;<\/p>\n\n\n\n<p>Until now, curating literature for functional annotations followed a purely manual approach supplemented by traditional text mining methods. LLMs, designed to grasp the intricacies of human language, can parse through vast amounts of scientific literature, weigh contrasting opinions, and generate complex text-based outputs.&nbsp;<\/p>\n\n\n\n<p>This capability can bring in a new era of data enrichment, as these models help to extract more detailed and contextually rich information from existing biological literature at an accelerated pace. At no point is such a model intended to replace the human biocurator who is required to provide a critical view on the produced output.<\/p>\n\n\n\n<p>\u201cWith automation, not only do we increase the pace at which we can annotate data, but we also enrich the quality of that data, offering a more comprehensive resource for our users,\u201d said Vollmar. \u201cOur focus now is on protein structures, but the beauty of our approach is its adaptability: the methods we&#8217;re developing could easily be transplanted onto other types of biological data, elevating the annotation process across the board.&#8221;<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Fine-tuning existing LLMs&nbsp;<\/strong><\/h2>\n\n\n\n<p>Europe PMC is EMBL-EBI\u2019s home of scientific literature, and after many years of serving the scientific community, the resource remains an intuitive and powerful search tool to help users stay on the cutting edge of science. Many of the database&#8217;s functionalities rely on literature curation, which involves scanning through dense academic material to extract essential information.&nbsp;<\/p>\n\n\n\n<p><a href=\"https:\/\/www.ebi.ac.uk\/people\/person\/santosh-tirunagari\/\">Santosh Tirunagari, Senior Machine Learning Developer<\/a> at EMBL-EBI is leveraging the capabilities of LLMs to accelerate the curation of scientific literature within Europe PMC. He and others in the team have developed specialised named entity recognition models, which are fine-tuned versions of existing LLMs. These sophisticated tools are designed to automatically identify critical scientific entities such as genes, proteins, diseases, and chemicals in research papers and patents.<\/p>\n\n\n\n<p>Using this approach helps to side-step the high computational costs of developing a language model from scratch, which could require dozens of GPUs and extensive training time. By concentrating on the fine-tuning phase, Tirunagari has been able to adapt these powerful language models to specific tasks relevant to scientific curation. This maximises efficiency while achieving high levels of accuracy.<\/p>\n\n\n\n<p>In one of his models, Tirunagari also uses an innovative \u2018human-in-the-loop&#8217; methodology for model training. Beginning with a limited dataset, the fine-tuned models undergo further adjustments using additional scientific papers. Human curators then verify the model&#8217;s findings, enabling an iterative feedback loop that continually improves the model\u2019s accuracy.<\/p>\n\n\n\n<p>&#8220;Large language models have been a game-changer in our efforts to automate the complex task of scientific curation. By fine-tuning these models, we&#8217;ve been able to develop highly specialised tools that can sift through vast amounts of scientific literature and patents to identify key entities such as genes, organisms, proteins, and diseases with impressive accuracy,\u201d said Tirunagari. \u201cThis not only accelerates our work but also opens up new possibilities for collaborations, like our ongoing partnership with Open Targets to use these models to aid drug discovery.&#8221;<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>A novel approach to ontology mapping&nbsp;<\/strong><\/h2>\n\n\n\n<p>Ontologies are structured, hierarchical classifications that are widely used for standardising diseases. Current practices for ontology mapping rely heavily on manual curation, making it a time-consuming and error-prone task. To address these issues, <a href=\"https:\/\/www.ebi.ac.uk\/people\/person\/kirill-tsukanov\/\">Kirill Tsukanov, Senior Bioinformatician at EMBL-EBI<\/a>, has developed a new method for ontology mapping using openly-available, GPT-based language models.&nbsp;<\/p>\n\n\n\n<article class=\"vf-card vf-card--brand vf-card--bordered vf-u-margin__bottom--800\" default>\n  <div class=\"vf-card__content | vf-stack vf-stack--400\">\n      <h3 class=\"vf-card__heading\">\n      What is ontology mapping?\u00a0    <\/h3>\n                <p class=\"vf-card__text\">Ontology mapping is when you have data in one format and you want to convert it to another standard format so that it can be combined with other data. For example, if one database uses &#8220;heart disease&#8221; and another uses &#8220;cardiovascular disorder,&#8221; ontology mapping would help align these terms so the databases can work together.<\/p>\n      <\/div>\n<\/article>\n\n\n\n\n<p>The new method integrates EMBL-EBI\u2019s <a href=\"https:\/\/www.ebi.ac.uk\/ols\/index\">Ontology Lookup Service (OLS)<\/a> with GPT-3.5 to evaluate the relevancy of ontology terms provided by OLS. Rather than generating ontology identifiers from scratch, the GPT model is tasked with grading existing mappings. This new workflow enables the system to map about 20% more terms compared to existing methods while retaining the same accuracy.<\/p>\n\n\n\n<p>&#8220;Our prototype already shows immense promise,\u201d said Tsukanov. \u201cThe integration of GPT models helps us overcome the limitations of existing systems, increasing the speed of ontology mapping. The application of LLMs in our research is not just innovative; it&#8217;s transformative. These models are helping us bridge the gap between raw, unstructured information and actionable, standardised data.&#8221;<\/p>\n\n\n\n<p>&#8220;While LLMs like GPT-3.5 have proven to be invaluable in tasks like ontology mapping, they present an intriguing challenge,\u201d continues Tsukanov. \u201cThese models don&#8217;t inherently know the difference between fact and fiction. Recognising this, we&#8217;ve been careful to integrate additional layers of validation and are exploring the use of open-source, stable models that can be fine-tuned specifically for our ontological needs. The goal is to have a tool that not only understands human language but aligns that understanding with the precise, standardised terms in our ontologies.&#8221;<\/p>\n\n\n\n<p>The project is currently in its developmental phase but Tsukanov plans to test the stability of other LLMs to further improve this new system. The ultimate goal is to create a universally applicable library, serving as a foundation for ontology mapping for different EMBL-EBI initiatives.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Large language models: a catalyst for change<\/strong><\/h2>\n\n\n\n<p>The advent of LLMs such as GPT represents a pivotal moment not only in the field of artificial intelligence but also in how we handle, curate, and understand enormous volumes of data. The success stories above show that, while not without their challenges, LLMs hold immense promise for making our data-rich world more understandable, accessible, and usable.<\/p>\n\n\n\n<p>There are obstacles to overcome, one of the foremost concerns is data integrity and trustworthiness: as LLMs are trained on massive datasets, there&#8217;s a risk of perpetuating inaccuracies or biases present in the source data. This is particularly critical in scientific applications where incorrect or biased information could have far-reaching implications. Additionally, the automated nature of LLMs could lead to unintended consequences, such as the omission of nuanced insights that human experts might catch, thereby impacting the quality and reliability of curated data.&nbsp;<\/p>\n\n\n\n<p>Given these complexities, it&#8217;s crucial to integrate ethical considerations into the design, implementation, and ongoing management of LLMs in scientific data curation.To address these challenges, our researchers discuss how they have implemented multi-layered verification frameworks for data curated by LLMs. Regular updates to the LLMs themselves, coupled with continuous feedback loops with human curators, allow for ongoing refinement of the models, reducing the likelihood of errors over time.<\/p>\n\n\n\n<p>As LLMs become an increasingly integral part of scientific research, vigilance in maintaining data quality remains a top priority. As the technology matures and as we get better at integrating human expertise with machine capabilities, these challenges are likely to diminish. Ultimately, LLMs have the potential to act as powerful catalysts in the evolution of data curation and scientific research, propelling us into an era where data can not only inform but also enhance our pursuit of understanding.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Large language models are changing the way we carry out scientific data curation, annotation, and research, setting the stage for a more efficient understanding of scientific literature<\/p>\n","protected":false},"author":77,"featured_media":64039,"parent":0,"menu_order":0,"template":"","tags":[28,36,42,315],"class_list":["post-64001","embletc","type-embletc","status-publish","has-post-thumbnail","hentry","tag-bioinformatics","tag-embl-ebi","tag-genomics","tag-open-data"],"acf":{"featured":true,"show_featured_image":false,"field_target_display":"embl","field_article_language":{"value":"english","label":"English"},"article_intro":"<p>Large language models are changing the way we carry out scientific data curation, annotation, and research, setting the stage for a more efficient understanding of scientific literature<\/p>\n","related_links":false,"source_article":false,"in_this_article":false,"press_contact":"None","article_translations":false,"languages":"","embletc_issue":[{"ID":63969,"post_author":"72","post_date":"2023-11-15 10:00:00","post_date_gmt":"2023-11-15 09:00:00","post_content":"","post_title":"Issue 101","post_excerpt":"","post_status":"publish","comment_status":"closed","ping_status":"closed","post_password":"","post_name":"issue-101","to_ping":"","pinged":"","post_modified":"2024-05-29 12:02:08","post_modified_gmt":"2024-05-29 10:02:08","post_content_filtered":"","post_parent":0,"guid":"https:\/\/www.embl.org\/news\/?post_type=embletc-issue&#038;p=63969","menu_order":0,"post_type":"embletc-issue","post_mime_type":"","comment_count":"0","filter":"raw"}],"embletc_in_this_issue":[{"ID":64003,"post_author":"124","post_date":"2023-11-15 10:00:00","post_date_gmt":"2023-11-15 09:00:00","post_content":"<!-- wp:paragraph -->\n<p>The Kristineberg Center for Marine Research and Innovation sits near the mouth of the Gullmar fjord in Sweden. One day in early August this year, a large truck rolled up the narrow streets of the small settlement, coming to a halt only a few meters from the water. One of those most eagerly awaiting the truck\u2019s safe arrival was biologist Niko Leisch.&nbsp;&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>\u201cThere were a couple of areas where I was not sure if the truck will actually manage to make the narrow turns,\u201d said Leisch, who is currently the Operational Manager of EMBL\u2019s mobile services. \u201cI was very relieved when it finally made it.\u201d&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>With a sturdy frame, expandable walls, and scientific equipment worth several million euros, the truck that arrived at Kristineberg \u2013 the EMBL Advanced Mobile Laboratory (AML) \u2013 is a unique undertaking in the history of European life science research. The AML brings cutting-edge technology directly to the field, helping researchers process biological samples immediately after collecting them, using a variety of advanced methodologies.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:acf\/vfwp-embed {\"name\":\"acf\/vfwp-embed\",\"data\":{\"url\":\"https:\/\/www.youtube.com\/embed\/jepbtufJrIg?si=anh8uW1zSQbirlJ-\",\"_url\":\"field_5ecbb005d2d93\",\"caption\":\"\",\"_caption\":\"field_5ecbb0e7d2d94\",\"ratio\":\"16 x 9\",\"_ratio\":\"field_5ecbb0f3d2d95\"},\"mode\":\"preview\"} \/-->\n\n<!-- wp:heading -->\n<h2 class=\"wp-block-heading\"><strong>Exploring symbiosis in the wild<\/strong><\/h2>\n<!-- \/wp:heading -->\n\n<!-- wp:paragraph -->\n<p>One of the first users of the AML at Kristineberg was Flora Vincent, Group Leader at EMBL Heidelberg. Vincent, a marine biologist and no stranger to fieldwork, is interested in exploring the complex interactions within microbial communities, especially those involving symbiosis in single-celled microorganisms.&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>As part of the Traversing European Coastlines (TREC expedition) \u2013 a flagship project of EMBL\u2019s Planetary Biology Transversal Theme \u2013 Vincent has been collecting species across land-water interfaces along the European coast. A particular point of interest for Vincent\u2019s group is diatoms, small microscopic organisms that live in the ocean and produce a large proportion of the planet\u2019s oxygen supply.&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>\u201cI use a lot of single-cell approaches, particularly single-cell sequencing,\u201d said Vincent. \u201cIn the past, I have sometimes needed to collect water samples and then drive two hours to a hospital with a flow cytometry unit to do my single-cell sorting. Here, with the cell sorter in the AML, it makes it much easier to begin the process on the spot, with a much lower risk of sample damage.\u201d<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:acf\/vfwp-card {\"name\":\"acf\/vfwp-card\",\"data\":{\"image_source\":\"media_library\",\"_image_source\":\"field_602ba83883d48\",\"image\":\"\",\"_image\":\"field_5ebd3243ee8ec\",\"title\":\"What is TREC?\",\"_title\":\"field_5ebd3243ee989\",\"subheading\":\"\",\"_subheading\":\"field_603b4403d386e\",\"text\":\"With the ambitious aim of sampling soil and water at 120 sites in 46 regions across Europe within two years, the TREC expedition aims to study life across all biological scales and help provide solutions to some of the most pressing global challenges of the day. Launched in March 2023, TREC brings EMBL researchers together with many partners and collaborators to investigate life at land-sea transects all along the European coast.\",\"_text\":\"field_5ebd3243eea26\",\"link\":\"\",\"_link\":\"field_5ebd3243eeac3\",\"style\":\"bordered\",\"_style\":\"field_5ebd3244102b1\",\"image_ratio\":\"default\",\"_image_ratio\":\"field_61d41d4bcbbee\"},\"mode\":\"preview\"} \/-->\n\n<!-- wp:paragraph -->\n<p>Also interested in symbiosis is Johan Decelle, Group Leader at CNRS in the Cell and Plant Physiology Laboratory, Grenoble and one of the collaborators for TREC. \u201cMy experience with the AML was like a scientific dream coming true,\u201d he said. \u201cNot only was it a unique scientific experience with cutting-edge instruments on the field, but also a memorable human adventure with experts and colleagues in the field.\u201d<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:image {\"align\":\"left\",\"id\":64043,\"width\":\"440px\",\"height\":\"auto\",\"aspectRatio\":\"1.3368983957219251\",\"sizeSlug\":\"large\",\"linkDestination\":\"none\"} -->\n<figure class=\"wp-block-image alignleft size-large is-resized\"><img src=\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2023\/11\/TREC-Drone-1-1024x767.jpg\" alt=\"An overhead drone image showing the AML making its way through a narrow hilly road in Sweden. \" class=\"wp-image-64043\" style=\"aspect-ratio:1.3368983957219251;width:440px;height:auto\"\/><figcaption class=\"wp-element-caption\">The AML arriving in Kristineberg, Sweden. Credit: EMBL<\/figcaption><\/figure>\n<!-- \/wp:image -->\n\n<!-- wp:paragraph -->\n<p>Decelle\u2019s team also participated in previous TREC pilot expeditions in Iceland, France, and Italy, and this work was significant in helping shape the current version of the AML.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>\u201cWe know little about the structural organisation of these systems, or how the host cell integrates with the photosynthesising cell or machinery at the subcellular level,\u201d said Decelle, \u201cThese organisms can\u2019t be easily cultured in the lab, and we wanted to explore these symbiotic relationships in their natural environment and physiological state, freezing them for 3D electron microscopy \u2013 which we can now do with some of the instruments in the AML.\u201d<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Decelle\u2019s team is also studying symbiosis in plankton \u2013 small unicellular ocean-living organisms, especially focusing on known symbiotic associations between photosynthetic and non-photosynthetic eukaryotic cells. Similar symbiotic interactions are believed to have taken place several times during evolution, leading to the acquisition of chloroplast \u2013 a key cell organelle responsible for photosynthesis \u2013 in lineages of plants and photosynthetic algae.&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:image {\"align\":\"right\",\"id\":64529,\"width\":\"504px\",\"height\":\"auto\",\"aspectRatio\":\"1.6722408026755853\",\"sizeSlug\":\"full\",\"linkDestination\":\"none\"} -->\n<figure class=\"wp-block-image alignright size-full is-resized\"><img src=\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2023\/11\/IMG_9453_EDIT_1000x600px-1.jpg\" alt=\"\" class=\"wp-image-64529\" style=\"aspect-ratio:1.6722408026755853;width:504px;height:auto\"\/><figcaption class=\"wp-element-caption\">The AML at Kristineberg. Credit: Tina Wiegand\/EMBL<\/figcaption><\/figure>\n<!-- \/wp:image -->\n\n<!-- wp:paragraph -->\n<p>According to Vincent, the AML also provides researchers an opportunity to think creatively, with tools now being available on the field that were never possible before. \u201cIt's pushing us to think about a field sampling in a different way,\u201d she said. \u201cIt really enables me to draw new paths in my brain, ones that I could have never thought of as possible in the natural environment. It also gives us the opportunity to combine the best of two worlds by using the AML tools on samples collected by the Tara schooner. In a single day, we can get a full picture of the marine microbiome thanks to the Tara holistic sampling strategy, and then leverage the AML to conduct state-of-the-art molecular and cellular analyses on targeted species\u201d<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>This is particularly suited to Vincent\u2019s team, who use an adaptive sampling approach where the research strategy is tailored to the collection of living organisms that each sample contains, rather than vice versa.&nbsp;&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>\u201cThe challenge is that we are really building the plane as we fly it,\u201d said Vincent. \u201cA lot of the machines are on the field for the first time. Each of those machines can be used in a standalone manner, but where I have a lot of fun is building connections between those tools.\u201d<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Using these tools and creating such new symbiotic relationships between methodologies and scientific approaches, the researchers hope to arrive at new insights that could help address some of the most pressing global challenges we face today \u2013 climate change, environmental degradation, and biodiversity loss.&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:acf\/vfwp-embed {\"name\":\"acf\/vfwp-embed\",\"data\":{\"url\":\"https:\/\/www.youtube.com\/embed\/ryPPHPxuB2k?si=muncAPAo04MiT6aK\",\"_url\":\"field_5ecbb005d2d93\",\"caption\":\"\",\"_caption\":\"field_5ecbb0e7d2d94\",\"ratio\":\"16 x 9\",\"_ratio\":\"field_5ecbb0f3d2d95\"},\"mode\":\"preview\"} \/-->\n\n<!-- wp:heading -->\n<h2 class=\"wp-block-heading\"><strong>Peering deeper inside plankton<\/strong><\/h2>\n<!-- \/wp:heading -->\n\n<!-- wp:paragraph -->\n<p>Another researcher interested in exploring plankton communities is Omaya Dudin, group leader at the Swiss Federal Institute of Technology Lausanne (EPFL). Dudin\u2019s lab explores the evolutionary origins of animal development and is part of the expansion microscopy (PlanExM) team that, as part of TREC, is trying to create an atlas of plankton biodiversity along European coasts. They hope to create a snapshot that would be crucial in assessing the impact of climate change on these populations in the future.&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>\u201cThe biggest problem that happens in environmental sampling is that the moment you go beyond an hour or two from the time of sampling, you\u2019re not really sure what you\u2019re looking at any more,\u201d said Dudin.&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>This is because many of the plankton living in coastal waters tend to die quickly when taken out of their native environment, greatly altering the species composition of the sample depending on how much time has passed since collection.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>The ability of the AML to travel very close to sampling locations allows Dudin and his team to expedite these crucial sample preservation steps and image plankton at the sub-cellular resolution using expansion and electron microscopy. It is also critical for the second part of the team\u2019s work, which focuses on culturing some of the species collected from the environment in order to potentially bring them back to the lab to study.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>\u201cEMBL is very well known for having high-end techniques in microscopy and molecular biology, and to bring them here, to a place where the samples are fresh, is something very unique,\u201d said Rainer Pepperkok, Director of Scientific Core Facilities and Services at EMBL. \u201cThis is something that I think will set the basis, for many decades, of functional and mechanistic research that integrates these different domains \u2013 the sea and the land \u2013 and to understand how these molecular ecosystems are functioning.\u201d<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:image {\"align\":\"right\",\"id\":64045,\"width\":\"442px\",\"height\":\"auto\",\"aspectRatio\":\"1.3350717079530638\",\"sizeSlug\":\"large\",\"linkDestination\":\"none\"} -->\n<figure class=\"wp-block-image alignright size-large is-resized\"><img src=\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2023\/11\/DJI_0205-13-1024x767.jpg\" alt=\"A drone image showing an overhead view of the AML truck parked close to the shore at Kristineberg, Sweden. \" class=\"wp-image-64045\" style=\"aspect-ratio:1.3350717079530638;width:442px;height:auto\"\/><figcaption class=\"wp-element-caption\">The AML at Kristineberg, Sweden. Credit: EMBL<\/figcaption><\/figure>\n<!-- \/wp:image -->\n\n<!-- wp:paragraph -->\n<p>Arriving in Kristineberg just a day or two after the AML, Dudin was excited to be able to finally use the mobile laboratories. \u201cIt's almost like an extension of my own lab,\u201d he said. \u201cThe quality of the equipment is beyond imagining and it\u2019s so well-organised, it just feels like home.\u201d<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>With the help of the AML, the PlanExM team can collect samples using specially designed \u2018plankton nets\u2019, bring them to the laboratory on wheels, and \u2018fix\u2019 them in three different ways in less than an hour. These methods include using chemical fixatives, high-pressure freezing, and plunge freezing, the latter two allowing researchers to use these samples for electron microscopy. Similarly, for culturing new species, the team is going to be able to combine the speed at which the samples are brought to the mobile lab with the single-cell sorter present on board to potentially get single-cell cultures, something still uncommon in plankton research.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>\u201cI think it\u2019s going to change the perspective on how we do certain kinds of research,\u201d said Dudin. \u201cRight now, almost everyone wants to work on model systems because there are clear protocols, images, and tools. And for many of these wild species, no one\u2019s been working on them simply because they are so difficult to catch \u2013 by the time they reach the lab, they are dead. With the AML and with this mission, we are going to keep getting better at collecting data on these understudied organisms and potentially give hope to researchers who want to work on these. In that sense, I think it\u2019s going to be a game changer.\u201d<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:heading -->\n<h2 class=\"wp-block-heading\"><strong>Zooming in on species<\/strong><\/h2>\n<!-- \/wp:heading -->\n\n<!-- wp:paragraph -->\n<p>One of the key challenges in working with environmental samples when compared with model organisms in the laboratory is the sheer heterogeneity \u2013 samples can contain thousands of diverse species and finding the ones you are particularly interested in can be a herculean task.&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>The TREC team led by Yannick Schwab, Team Leader and Head of the Electron Microscopy Core Facility at EMBL, is trying to solve this problem. \u201cMy team is interested in developing methods that enable researchers to target electron microscope imaging to specific cells in complex and highly heterogenous specimens,\u201d said Schwab.&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>One of the projects Schwab is leading focuses on dinoflagellates, a very rich and diverse group of plankton. When trying to zoom in on specific dinoflagellate groups in environmental samples, researchers need to perform multimodal correlative imaging \u2013 where the same sample is studied via different advanced imaging methods and the results are combined to yield a \u2018big picture\u2019 view. However, to preserve the ultrastructure of these organisms, state-of-the-art cryo-fixation methods are needed, and these sample preparation steps can only be performed with dedicated machines like a high-pressure freezer or a plunge freezer. As Dudin also noted, it\u2019s very unusual for such machines to be available at or near field locations.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>\u201cWhen we are collecting non-cultivable dinoflagellates in the field, we need to rush to fix them before they start being denatured. Therefore, we must deploy those instruments for cryo-fixation as close as possible to the sampling; by doing so, we literally freeze our samples on the beach,\u201d said Schwab. \u201cThis would simply not be possible without the AML.\u201d<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:image {\"id\":64069,\"width\":\"780px\",\"height\":\"auto\",\"aspectRatio\":\"1.6666666666666667\"} -->\n<figure class=\"wp-block-image is-resized\"><img src=\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2023\/11\/image.png\" alt=\"A composite image of four 3D micrographs showing the cellular organelles of a phytoplankton marked in different colours.\" class=\"wp-image-64069\" style=\"aspect-ratio:1.6666666666666667;width:780px;height:auto\"\/><figcaption class=\"wp-element-caption\">The new vCLEM method reveals ultrastructural details of the phytoplankton <em>Ensiculifera tyrrhenica<\/em>. Shown are the theca (metallic purple), mitochondria (green), chloroplasts (red), nucleus (blue), Golgi complex (yellow), mucocysts (orange) and trichocysts (pink and magenta). Credit: Karel Mocaer and Isabel Romero Calvo\/EMBL<\/figcaption><\/figure>\n<!-- \/wp:image -->\n\n<!-- wp:paragraph -->\n<p>In a <a href=\"https:\/\/journals.biologists.com\/jcs\/article\/136\/15\/jcs261355\/325830\/Targeted-volume-correlative-light-and-electron\">recent publication<\/a>, a team led by Schwab and Paolo Ronchi from EMBL Heidelberg <a href=\"https:\/\/www.embl.org\/news\/science\/spotlight-seeing-into-the-seas\/\">demonstrates a new method<\/a> that uses correlative light and electron microscopy to help us accurately identify plankton species collected in the field. The researchers plan to apply this technique to samples from the TREC expedition, with the help of the AML.&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:heading -->\n<h2 class=\"wp-block-heading\"><strong>Bringing technologies to the field<\/strong><\/h2>\n<!-- \/wp:heading -->\n\n<!-- wp:acf\/vfwp-embed {\"name\":\"acf\/vfwp-embed\",\"data\":{\"url\":\"https:\/\/www.youtube.com\/embed\/tSgRkyx-9wM?si=KME-IkxkaDk6T8mU\",\"_url\":\"field_5ecbb005d2d93\",\"caption\":\"\",\"_caption\":\"field_5ecbb0e7d2d94\",\"ratio\":\"16 x 9\",\"_ratio\":\"field_5ecbb0f3d2d95\"},\"mode\":\"preview\"} \/-->\n\n<!-- wp:paragraph -->\n<p>However, as Schwab points out, it is not only the advanced instruments that are important here, but also the dedicated team that enables their use by offering their expertise. In addition to Leisch, the EMBL mobile services team currently includes Michael Bonadonna \u2013 specialist in flow cytometry and cell sorting, Tina Wiegand \u2013 specialist in fluorescence microscopy, and Paulina Cherek \u2013 specialist in electron microscopy sample preparation.&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Along with the AML, the mobile services team also helped outfit vehicles that can access different terrains of the sampling sites to support on-site sample collection and a sampling van equipped for sample processing and storage.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Leisch, along with Schwab, Pepperkok, and Paola Bertucci (Head of EMBL Scientific Expeditions), has spent over a year helping develop and deploy the advanced mobile lab, working with the Toutenkamion Group, a French industrial mobility company. \u201cThe process was surprisingly smooth,\u201d said Leisch. \u201cFor every single machine, we came up with a way of making sure we can install it in the truck in such a way that it can be used to its full potential on site, but can also be easily secured during moving. This ensures we can quickly get to work once we arrive at a site. We also brainstormed potential challenges that may arise once on the road and tried to find solutions.\u201d<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>The process involved many custom-made setups and coordination with multiple scientific instrumentation providers. However, the effort paid off and in August 2023, the AML saw its first full deployment in Kristineberg.&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>\u201cOn a personal note, it was a great pleasure to work with Franck Neveu and Melanie Asselin from the Toutenkamion Group,\u201d added Leisch. \u201cThe two of them were our contacts on the industry side and halfway through the project, they became as invested as we were in seeing the unit delivered in the best possible way.\"<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:acf\/vfwp-embed {\"name\":\"acf\/vfwp-embed\",\"data\":{\"url\":\"https:\/\/www.youtube.com\/embed\/cq3X3KAOkgY?si=D0laFsaq4CfqEqd1\",\"_url\":\"field_5ecbb005d2d93\",\"caption\":\"\",\"_caption\":\"field_5ecbb0e7d2d94\",\"ratio\":\"16 x 9\",\"_ratio\":\"field_5ecbb0f3d2d95\"},\"mode\":\"preview\"} \/-->\n\n<!-- wp:paragraph -->\n<p>The TREC expedition is going to continue till mid-2024, but AML\u2019s story does not end there. \u201cWe are right now working on making the AML available as a service unit,\u201d said Leisch. \u201cEveryone from individual researchers to consortia or even nations can then ask for and request the services, and then we together with the team would come to provide those on site.\u201d<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Researchers from EMBL member states agree. \u201cBringing a truck like this on the international scene will allow us to go to places that no one has gone before,\u201d said Dudin. \u201cAnd that opens up possibilities that never existed before.\u201d<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>In the meantime, the AML will make its way to the next few TREC supersites, where it will continue to inspire intellectual curiosity and collaboration.&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p><strong>\u201c<\/strong>One evening, we observed the unicellular organism <em>Noctiluca<\/em> in our samples. Like its name suggests (\u2018Noctiluca\u2019 translates to \u2018light at night\u2019) it is often responsible for the well-known phenomenon of bioluminescence.<em>\u201d<\/em> recalled Leisch. <strong>\u201c<\/strong>Everyone got excited and we set up an impromptu midnight sampling in order to collect more of these microbes in the night. It was past midnight and the whole place was buzzing with enthusiasm and excitement and scientific curiosity. And it was just so beautiful to see that and be able to experience that with my colleagues.\u201d<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:acf\/vfwp-card {\"name\":\"acf\/vfwp-card\",\"data\":{\"image_source\":\"media_library\",\"_image_source\":\"field_602ba83883d48\",\"image\":\"\",\"_image\":\"field_5ebd3243ee8ec\",\"title\":\"\",\"_title\":\"field_5ebd3243ee989\",\"subheading\":\"\",\"_subheading\":\"field_603b4403d386e\",\"text\":\"EMBL acknowledges the generous support of many institutions, donors and sponsors, in particular the Manfred Lautenschl\u00e4ger Foundation, as well as Eppendorf SE, and Carl Zeiss Microscopy, in helping make the AML possible. We also thank Thermo Fisher Scientific and Ted Pella, Inc. for donating scientific instruments for the AML.\",\"_text\":\"field_5ebd3243eea26\",\"link\":\"\",\"_link\":\"field_5ebd3243eeac3\",\"style\":\"striped\",\"_style\":\"field_5ebd3244102b1\",\"image_ratio\":\"default\",\"_image_ratio\":\"field_61d41d4bcbbee\"},\"mode\":\"preview\"} \/-->","post_title":"Taking science on the road","post_excerpt":"With the new advanced mobile laboratory, EMBL is taking its service offerings to new heights, bringing cutting-edge life science technologies to the field in a way never seen before. ","post_status":"publish","comment_status":"closed","ping_status":"closed","post_password":"","post_name":"taking-science-on-the-road","to_ping":"","pinged":"","post_modified":"2023-11-15 10:02:50","post_modified_gmt":"2023-11-15 09:02:50","post_content_filtered":"","post_parent":0,"guid":"https:\/\/www.embl.org\/news\/?post_type=embletc&#038;p=64003","menu_order":0,"post_type":"embletc","post_mime_type":"","comment_count":"0","filter":"raw"},{"ID":64005,"post_author":"16","post_date":"2023-11-15 10:00:00","post_date_gmt":"2023-11-15 09:00:00","post_content":"<!-- wp:paragraph -->\n<p><em>By Maria Papanikolaou and Anna Stanescu, EMBL<\/em><\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Although the idea of a European laboratory for life sciences was conceived in 1962, it would take 11 more years, involving conversations between Nobel prize winners, an unusual endorsement from two female UK government officials, and the signatures of 10 member state officials, to authorise the concept.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>\u201cThere was resistance at the Royal Society, but persuading governments was the biggest problem. A lot of people felt that it was hopeless,\u201d said Sir John Kendrew, EMBL's first Director General and Nobel Prize-winning structural biologist <a href=\"https:\/\/archive.embl.org\/uploads\/r\/archive-of-european-molecular-biology-laboratory\/5\/2\/7\/527f5bacfba61700710e3872918b270ac75a07ab1f7dcac8ee629522006c67cb\/DE_2324_C-PUBS-A-20.pdf\">in a 1994 interview<\/a>. \u201cIn the end, the British political support came from two women, Shirley Williams and Margaret Thatcher who were both ministers in the Department of Education and Science in successive British governments. Long before she became Prime Minister, Margaret Thatcher had read chemistry at Oxford\u2026as the new Conservative Secretary of State for Education and Science, she apparently took the EMBL file home one weekend \u2013 it was about a foot thick \u2013 and came back on Monday morning and said, \u2018We join!\u2019\u201d<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:image {\"align\":\"right\",\"id\":64093,\"width\":\"437px\",\"height\":\"auto\",\"aspectRatio\":\"1.3365921787709498\"} -->\n<figure class=\"wp-block-image alignright is-resized\"><img src=\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2023\/11\/image-1.png\" alt=\"\" class=\"wp-image-64093\" style=\"aspect-ratio:1.3365921787709498;width:437px;height:auto\"\/><figcaption class=\"wp-element-caption\">Photo: EMBL Archive, Grenoble Anniversary material (unprocessed collection)&nbsp;<\/figcaption><\/figure>\n<!-- \/wp:image -->\n\n<!-- wp:paragraph -->\n<p>The idea for the creation of a European molecular biology laboratory was first proposed when two Nobel Prize-winning biologists, Kendrew and <a href=\"https:\/\/en.wikipedia.org\/wiki\/James_Watson\">James Watson<\/a>, met with<a href=\"https:\/\/en.wikipedia.org\/wiki\/Leo_Szilard\"> Leo Szilard<\/a> \u2013 Hungarian-German-American physicist and inventor \u2013&nbsp; and the Director General of CERN <a href=\"https:\/\/en.wikipedia.org\/wiki\/Victor_Weisskopf\">Victor Weisskopf<\/a> at CERN in Geneva in December 1962.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>In 1964, the <a href=\"https:\/\/www.embo.org\/\">European Molecular Biology Organization (EMBO)<\/a> was founded, followed by the establishment of the <a href=\"https:\/\/embc.embo.org\/about-embc\/\">European Molecular Biology Conference<\/a> (EMBC) in 1970. Prolonged discussions followed, allaying the concerns of the national governments \u2013 member states of the EMBC \u2013 that a central laboratory might <a href=\"https:\/\/www.embl.org\/documents\/wp-content\/uploads\/2023\/09\/40-years-embl-magazine.pdf\">\u201cdraw away <\/a>their country\u2019s best scientists\u201d<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>In 1971, the EMBC took significant steps towards the creation and growth of the laboratory. They did so by forming four working groups, in cooperation with EMBO, to guide the establishment process. These efforts were overseen by a Steering Committee, and as a result of these collaborative endeavours, a draft agreement outlining the establishment of the laboratory was prepared (EMBC, Annual Report April 1971, <a href=\"https:\/\/archive.embl.org\/europ-conference-on-m-b\">Kenneth Holmes material<\/a>, EMBL Archive).<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p><a href=\"https:\/\/archive.embl.org\/embo-allgemeine-mitteilungen\">Between 1972 and 1973<\/a> the EMBL Agreement was ready to be signed by ten out of the thirteen member states of the EMBC. On April 12, 1973, the EMBL Secretary-General announced to the EMBO members the date and location of the EMBL Signing Agreement Ceremony, pointing out that \u201cthe Agreement will take legal effect when a sufficient number of the Governments have ratified the representatives\u2019 signatures \u2013 a process that is estimated to require about one year\u201d.&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>The pivotal moment in the establishment of EMBL was the signing of the <a href=\"https:\/\/archive.embl.org\/uploads\/r\/archive-of-european-molecular-biology-laboratory\/3\/3\/9\/33933be653199adc40e6b9cf5ef0bf96855c0dae0234ed8e68444168a9c40d23\/DE_2324_SEC-A-1.pdf\">EMBL Agreement<\/a> on May 10, 1973, followed by its ratification in July 1974.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:image {\"align\":\"right\",\"id\":64095,\"width\":\"572px\",\"height\":\"auto\",\"aspectRatio\":\"1.3194444444444444\"} -->\n<figure class=\"wp-block-image alignright is-resized\"><img src=\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2023\/11\/image-2.png\" alt=\"\" class=\"wp-image-64095\" style=\"aspect-ratio:1.3194444444444444;width:572px;height:auto\"\/><figcaption class=\"wp-element-caption\">Photo: EMBL Archive, Grenoble Anniversary material (unprocessed collection)&nbsp;<\/figcaption><\/figure>\n<!-- \/wp:image -->\n\n<!-- wp:paragraph -->\n<p>The agreement signing ceremony took place at CERN in Geneva on the 10th of May 1973, where the document was signed by representatives of ten of EMBC member states \u2013 <a href=\"https:\/\/de.wikipedia.org\/wiki\/Josef_L%C3%B6ns\">Josef L\u00f6ns<\/a> for the Federal Republic of Germany, <a href=\"https:\/\/en.wikipedia.org\/wiki\/Frederick_Mason_(diplomat)\">Frederick Mason<\/a> for the United Kingdom of Great Britain and Northern Ireland, Erik Thrane for Denmark, <a href=\"https:\/\/de.wikipedia.org\/wiki\/Emanuel_Treu\">Rudolf Martins <\/a>for Austria, Bernard Dufournier for France, <a href=\"https:\/\/en.wikipedia.org\/wiki\/Shabtai_Rosenne\">Shabtai Rosenne<\/a> for Israel, <a href=\"https:\/\/de.wikipedia.org\/wiki\/Ren%C3%A9_Keller_(Diplomat)\">Rene Keller<\/a> for Switzerland, Polak Rosenberg for the Netherlands, Smoquina from Italy, and Rune Fremlin from Sweden.&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>The EMBL Signing Agreement Ceremony was a significant event attended by prominent figures from the worlds of science and politics who had played pivotal roles in the establishment of EMBL.&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:image {\"align\":\"left\",\"id\":64097,\"width\":\"410px\",\"height\":\"auto\",\"aspectRatio\":\"0.7578323454699407\"} -->\n<figure class=\"wp-block-image alignleft is-resized\"><img src=\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2023\/11\/image-3.png\" alt=\"\" class=\"wp-image-64097\" style=\"aspect-ratio:0.7578323454699407;width:410px;height:auto\"\/><figcaption class=\"wp-element-caption\">Photo: <a href=\"https:\/\/archive.embl.org\/frieda-glockner-material\">Frieda Gl\u00f6ckner material<\/a>, EMBL Archive&nbsp;<\/figcaption><\/figure>\n<!-- \/wp:image -->\n\n<!-- wp:paragraph -->\n<p>Among the distinguished participants were: <a href=\"https:\/\/en.wikipedia.org\/wiki\/Fran%C3%A7ois_Jacob\">Fran\u00e7ois Jacob<\/a>, a renowned French biologist and Nobel Prize laureate, <a href=\"https:\/\/cordis.europa.eu\/article\/id\/13161-prof-paolo-fasella-19301999\">Paolo Fasella<\/a>, the Director General of the European Commission's Directorate General for Science, <a href=\"https:\/\/100jahre.ggh-heidelberg.de\/geschichten\/geschichte\/erster-buergermeister-dr-karl-korz\">Karl Korz<\/a>, the First Mayor of Heidelberg (Baub\u00fcrgermeister), <a href=\"https:\/\/prabook.com\/web\/raymond.appleyard\/1297824\">Raymond Appleyard<\/a>, the Executive Director of EMBO, <a href=\"https:\/\/en.wikipedia.org\/wiki\/Willibald_Jentschke\">Willibald Jentschke<\/a>, the Director General of CERN (the European Organization for Nuclear Research), H. Voirier, the president of EMBC,&nbsp; and <a href=\"https:\/\/en.wikipedia.org\/wiki\/Jeffries_Wyman_(biologist)\">Jeffries Wyman<\/a>, who served as the Chairman of the Steering Committee responsible for the establishment of EMBL, and was also the first Secretary General of EMBO and professor at the University of Rome.&nbsp;<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>These individuals came together to mark the historic occasion of the EMBL Signing Agreement Ceremony, symbolising the culmination of dedicated efforts to bring this vital scientific institution into existence.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>The EMBL Agreement officially became effective on July 4 1974, coinciding with the day when most of the signatory states had ratified, approved, and accepted the agreement, in accordance with <a href=\"https:\/\/www.embl.org\/documents\/wp-content\/uploads\/2020\/05\/un_agreement.pdf\">its Article XV<\/a>. Notably, France's participation in the ratification process at this stage was pivotal in bringing the agreement into force on this date.<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>In 2024, EMBL will celebrate its fiftieth anniversary. The occasion not only offers us an opportunity to share stories from past achievements but more importantly, provides us a reminder to keep moving forward to pursue tough scientific questions, pushing forward the frontiers of molecular biology and further integrating with other disciplines to have real-world impact. We hope you will join us in celebrating and reflecting on EMBL\u2019s tradition of pushing the envelope of scientific possibility as well as leading European life sciences to new heights.<\/p>\n<!-- \/wp:paragraph -->","post_title":"Remembering the moment EMBL was established","post_excerpt":"The EMBL agreement, which made reality the idea of creating a European laboratory for life sciences, was signed in 1973 and ratified in 1974. Fifty years later, we look back at this historic moment.","post_status":"publish","comment_status":"closed","ping_status":"closed","post_password":"","post_name":"remembering-the-moment-embl-was-established","to_ping":"","pinged":"","post_modified":"2023-11-15 10:17:54","post_modified_gmt":"2023-11-15 09:17:54","post_content_filtered":"","post_parent":0,"guid":"https:\/\/www.embl.org\/news\/?post_type=embletc&#038;p=64005","menu_order":0,"post_type":"embletc","post_mime_type":"","comment_count":"0","filter":"raw"}]},"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.2 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Deciphering the data deluge: how large language models are transforming scientific data curation | EMBL<\/title>\n<meta name=\"description\" content=\"Large language models are changing the way we carry out scientific data curation, annotation, and research.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Deciphering the data deluge: how large language models are transforming scientific data curation | EMBL\" \/>\n<meta property=\"og:description\" content=\"Large language models are changing the way we carry out scientific data curation, annotation, and research.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/\" \/>\n<meta property=\"og:site_name\" content=\"EMBL\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/embl.org\/\" \/>\n<meta property=\"article:modified_time\" content=\"2023-11-15T09:09:02+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2023\/11\/LLMs-EMBL-EBI-EMBLetc-1000x600-1.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"600\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:site\" content=\"@embl\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"8 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/\",\"url\":\"https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/\",\"name\":\"Deciphering the data deluge: how large language models are transforming scientific data curation | EMBL\",\"isPartOf\":{\"@id\":\"https:\/\/www.embl.org\/news\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2023\/11\/LLMs-EMBL-EBI-EMBLetc-1000x600-1.jpg\",\"datePublished\":\"2023-11-15T09:00:00+00:00\",\"dateModified\":\"2023-11-15T09:09:02+00:00\",\"description\":\"Large language models are changing the way we carry out scientific data curation, annotation, and research.\",\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/#primaryimage\",\"url\":\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2023\/11\/LLMs-EMBL-EBI-EMBLetc-1000x600-1.jpg\",\"contentUrl\":\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2023\/11\/LLMs-EMBL-EBI-EMBLetc-1000x600-1.jpg\",\"width\":1000,\"height\":600,\"caption\":\"Credit: Karen Arnott\/EMBL-EBI\"},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.embl.org\/news\/#website\",\"url\":\"https:\/\/www.embl.org\/news\/\",\"name\":\"European Molecular Biology Laboratory News\",\"description\":\"News from the European Molecular Biology Laboratory\",\"publisher\":{\"@id\":\"https:\/\/www.embl.org\/news\/#organization\"},\"alternateName\":\"EMBL News\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.embl.org\/news\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.embl.org\/news\/#organization\",\"name\":\"European Molecular Biology Laboratory\",\"alternateName\":\"EMBL\",\"url\":\"https:\/\/www.embl.org\/news\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.embl.org\/news\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2025\/09\/EMBL_logo_colour-1-300x144-1.png\",\"contentUrl\":\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2025\/09\/EMBL_logo_colour-1-300x144-1.png\",\"width\":300,\"height\":144,\"caption\":\"European Molecular Biology Laboratory\"},\"image\":{\"@id\":\"https:\/\/www.embl.org\/news\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/embl.org\/\",\"https:\/\/x.com\/embl\",\"https:\/\/www.instagram.com\/embl_org\/\",\"https:\/\/www.linkedin.com\/company\/15813\/\",\"https:\/\/www.youtube.com\/user\/emblmedia\/\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Deciphering the data deluge: how large language models are transforming scientific data curation | EMBL","description":"Large language models are changing the way we carry out scientific data curation, annotation, and research.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/","og_locale":"en_US","og_type":"article","og_title":"Deciphering the data deluge: how large language models are transforming scientific data curation | EMBL","og_description":"Large language models are changing the way we carry out scientific data curation, annotation, and research.","og_url":"https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/","og_site_name":"EMBL","article_publisher":"https:\/\/www.facebook.com\/embl.org\/","article_modified_time":"2023-11-15T09:09:02+00:00","og_image":[{"width":1000,"height":600,"url":"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2023\/11\/LLMs-EMBL-EBI-EMBLetc-1000x600-1.jpg","type":"image\/jpeg"}],"twitter_card":"summary_large_image","twitter_site":"@embl","twitter_misc":{"Est. reading time":"8 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/","url":"https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/","name":"Deciphering the data deluge: how large language models are transforming scientific data curation | EMBL","isPartOf":{"@id":"https:\/\/www.embl.org\/news\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/#primaryimage"},"image":{"@id":"https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/#primaryimage"},"thumbnailUrl":"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2023\/11\/LLMs-EMBL-EBI-EMBLetc-1000x600-1.jpg","datePublished":"2023-11-15T09:00:00+00:00","dateModified":"2023-11-15T09:09:02+00:00","description":"Large language models are changing the way we carry out scientific data curation, annotation, and research.","inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.embl.org\/news\/embletc\/issue-101\/deciphering-the-data-deluge-how-large-language-models-are-transforming-scientific-data-curation\/#primaryimage","url":"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2023\/11\/LLMs-EMBL-EBI-EMBLetc-1000x600-1.jpg","contentUrl":"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2023\/11\/LLMs-EMBL-EBI-EMBLetc-1000x600-1.jpg","width":1000,"height":600,"caption":"Credit: Karen Arnott\/EMBL-EBI"},{"@type":"WebSite","@id":"https:\/\/www.embl.org\/news\/#website","url":"https:\/\/www.embl.org\/news\/","name":"European Molecular Biology Laboratory News","description":"News from the European Molecular Biology Laboratory","publisher":{"@id":"https:\/\/www.embl.org\/news\/#organization"},"alternateName":"EMBL News","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.embl.org\/news\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.embl.org\/news\/#organization","name":"European Molecular Biology Laboratory","alternateName":"EMBL","url":"https:\/\/www.embl.org\/news\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.embl.org\/news\/#\/schema\/logo\/image\/","url":"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2025\/09\/EMBL_logo_colour-1-300x144-1.png","contentUrl":"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2025\/09\/EMBL_logo_colour-1-300x144-1.png","width":300,"height":144,"caption":"European Molecular Biology Laboratory"},"image":{"@id":"https:\/\/www.embl.org\/news\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/embl.org\/","https:\/\/x.com\/embl","https:\/\/www.instagram.com\/embl_org\/","https:\/\/www.linkedin.com\/company\/15813\/","https:\/\/www.youtube.com\/user\/emblmedia\/"]}]}},"_links":{"self":[{"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/embl-etc\/64001","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/embl-etc"}],"about":[{"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/types\/embletc"}],"author":[{"embeddable":true,"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/users\/77"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/media\/64039"}],"wp:attachment":[{"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/media?parent=64001"}],"wp:term":[{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/tags?post=64001"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}