{"id":48731,"date":"2022-05-12T10:33:25","date_gmt":"2022-05-12T08:33:25","guid":{"rendered":"https:\/\/www.embl.org\/news\/?p=48731"},"modified":"2024-03-22T12:59:40","modified_gmt":"2024-03-22T11:59:40","slug":"europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research","status":"publish","type":"post","link":"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/","title":{"rendered":"Europe PMC: Harnessing the power of text mining to accelerate life sciences research"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">Text mining is the process of analysing vast amounts of textual material to extract meaningful concepts, relationships, and trends using machine learning approaches. It enables researchers to rapidly find new and hidden information in text-based sources. When these techniques are applied to scientific publications, it becomes possible to uncover new meaning and hidden patterns that would otherwise take years to manually curate.&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Tackling data challenges and ensuring that we are able to exploit large datasets to their full potential for life science research is a key part of the <a href=\"https:\/\/www.embl.org\/about\/programme\/data-sciences-plans\/\">Data Sciences Plans<\/a> within EMBL\u2019s <a href=\"https:\/\/www.embl.org\/about\/programme\/\">Molecules to Ecosystems Programme<\/a>. This includes developing and experimenting with new technologies and machine learning approaches. For example, these methods are used in a variety of projects to extract new information from publications. This includes mining and extraction of gene\u2013disease associations for drug discovery, enriching our services with metagenomics data, and providing information to the wider text mining community to help others train their own machine learning algorithms.&nbsp;<\/p>\n\n\n\n<div class=\"vf-box vf-box--normal vf-box-theme--primary | vf-u-margin__bottom--400\">\n      <h3 class=\"vf-box__heading\">\n                What is Europe PMC?                  <\/h3> \n        <p class=\"vf-box__text\"><a href=\"https:\/\/europepmc.org\/\">Europe PMC<\/a> is EMBL-EBI\u2019s open science platform for life science publications. It&#8217;s available to anyone, anywhere for free. With Europe PMC, scientists can search and read over 40 million publications, preprints, and other documents enriched with links to supporting data, protocols, etc.<\/p>\n<\/div>\n\n\n\n<div class=\"vf-grid | vf-grid__col-1\"><div class=\"\"><!--[vf\/content]-->\n<div class=\"vf-content\">\n\n<div style=\"height:17px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n<\/div>\n<\/div>\n<\/div>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Mining for gene\u2013disease associations<\/strong><\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Text mining approaches are hugely beneficial for improving the way we identify novel drug targets. A vast amount of information on gene\u2013disease associations and associated drug targets already exists online, hidden within millions of scientific publications. Manually sorting through these texts would take decades. However, using text mining to search the literature allows data to be accessed and analysed for more rapid drug discovery.&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In collaboration with <a href=\"https:\/\/www.opentargets.org\/\">Open Targets<\/a>, researchers at Europe PMC are doing just this by creating a pipeline that maximises literature information extraction using named entity recognition (NER) models. Named Entity Recognition (NER) is a widely used natural language processing approach to identify real-world objects, such as people, location, and time within text. The Europe PMC team uses this approach to identify genes, proteins, diseases, chemicals, and other biomedical concepts from life science literature. These bioNERs form the basis of gene<meta charset=\"utf-8\"\/>\u2013disease association identification from literature for Open Targets.&nbsp;<\/p>\n\n\n\n<div class=\"vf-box vf-box--normal vf-box-theme--primary | vf-u-margin__bottom--400\">\n      <h3 class=\"vf-box__heading\">\n                What are NER models?                  <\/h3> \n        <p class=\"vf-box__text\"><!-- wp:paragraph --><\/p>\n<p class=\"vf-box__text\">NER models are a form of natural language processing (NLP) \u2013 a type of machine learning method which allows computers to analyse text rather than computer code. In this case, the natural language being detected consists of disease and gene terms found within life science literature.<\/p>\n<p class=\"vf-box__text\"><!-- \/wp:paragraph --><\/p>\n<\/div>\n\n\n\n<div class=\"vf-grid | vf-grid__col-1\"><div class=\"\"><!--[vf\/content]-->\n<div class=\"vf-content\">\n\n<\/div>\n<\/div>\n<\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><br \/>\u201cFor our machine learning algorithms to work effectively we needed to train them with high-quality data,\u201d said <a href=\"https:\/\/www.ebi.ac.uk\/people\/person\/9da82d70eeb9541f939fcc8e7ad91252a629d5e2ece7d1e1655097c7f50c8747\/\">Shyamasree Saha, Machine Learning and Text Mining Scientist at EMBL-EBI<\/a>. \u201cAt Europe PMC, we developed a gold standard dataset for genes, proteins, disease, and organisms. We are using BioBERT, a domain-specific language model pre-trained on a large biomedical corpora and fine-tuning the model for the NER task using our gold standard dataset. The model replaces our old dictionary based NER approach and significantly improves entity association identification accuracy.\u201d&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><a href=\"https:\/\/blog.opentargets.org\/developing-the-open-targets-literature-pipeline\/\">Learn more about how NER is being used to develop the Open Targets Platform<\/a>.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Generating metadata descriptions<\/strong><\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Metadata \u2013 the information that describes where, when, and how specific data are obtained \u2013 enriches the scientific value of genomic sequencing data and makes data FAIR (Findable, Accessible, Interoperable, and Reproducible). However, these metadata are frequently missing from databases or contain poor quality descriptions, meaning they cannot be used to interpret the data. For metagenomics \u2013 the direct analysis of genomes contained within an environmental sample \u2013 the use of metadata is of vital importance to increase data reuse and improve interpretation.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Researchers from Europe PMC and EMBL-EBI\u2019s metagenomics data resource <a href=\"https:\/\/www.ebi.ac.uk\/metagenomics\/\">MGnify<\/a>, have found a solution to this challenge by automatically extracting relevant metadata key terms straight from the literature. This is done using a <a href=\"https:\/\/europepmc.org\/article\/PPR\/PPR462054\">machine learning framework to mine a wide range of metagenomics studies found in publications<\/a> stored within the Europe PMC database. The project is called <a href=\"https:\/\/gtr.ukri.org\/projects?ref=BB%2FS009043%2F1\">Enriching MEtagenomics Results using Artificial intelligence and Literature Data (EMERALD)<\/a>.\u00a0<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cOne of the major limitations when comparing datasets is the lack of contextual metadata relating to a sample,\u201d said <a href=\"https:\/\/www.ebi.ac.uk\/people\/person\/42359e9015760d597a620628b55acb65c55f41e66e5d7ae10bd4cf7e5eaa3489\/\">Lorna Richardson, Coordinator for MGnify at EMBL-EBI<\/a>. \u201cTo address this, we partnered with Europe PMC to automatically extract relevant metadata terms from publications, improving the range and depth of metadata available to our users. This metadata includes terms relating to the sequencing platform used, extraction kits, primers, the environment of the sample, and much more, which will help researchers get the most out of the data stored in MGnify.\u201d<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><a href=\"https:\/\/ebi-metagenomics.github.io\/blog\/2021\/11\/17\/Publication-Annotations\/\">Find out more about how the EMERALD project is benefiting MGnify users<\/a>.&nbsp;<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Annotations for the text mining community<\/strong><\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Finally, the Europe PMC database itself is helping to advance the field of text mining by simplifying the way its users can find and access data from scientific literature. One of the tools available within Europe PMC is the <a href=\"https:\/\/europepmc.org\/Annotations\">annotation tool<\/a>. This allows users developing their own text mining algorithms to quickly extract relevant terms and use them to develop their own text mining pipelines.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The annotations within this tool are collected by both Europe PMC and the wider text mining community and they include biological terms such as disease names, chemicals, and proteins. The annotation terms available for each article are located in the tools menu within Europe PMC and can also be accessed programmatically using the <a href=\"https:\/\/europepmc.org\/AnnotationsApi\">annotations API<\/a>.&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cWe have close to 1.6 billion annotations available to help our users locate entities in the full text and abstracts of articles stored in Europe PMC,\u201d said&nbsp; <a href=\"https:\/\/www.ebi.ac.uk\/people\/person\/a5c34677c7fde76dbd96602d08087e22fe7524f8a8b70cdb476199133b9a182b\/\">Aravind Venkatesan, Senior Data Scientist at EMBL-EBI<\/a>. \u201cThese are available through the Europe PMC annotations tool, which supports scientists and database curators in their literature research by making it easy to find the relevant annotation terms they need to train their text mining models. This will help advance a range of research fields and also accelerate the field of text mining itself.\u201d<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Text mining is a tool which can benefit many research areas by increasing the rate at which we can unlock uncharted information already present in the millions of life science articles published online. Here we have shown how EMBL-EBI scientists have been able to harness the power of text mining to accelerate fields including drug discovery and metagenomics research. But it doesn\u2019t stop there; this same approach can be used to leverage a vast range of fields with endless possibilities. Text mining to advance the life sciences is still a young field, but it is an exciting one to be a part of right now.&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Find out more about the <a href=\"https:\/\/www.embl.org\/about\/programme\/data-sciences-plans\/\">Data Sciences Plans<\/a> at EMBL.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Funding<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\"><a href=\"https:\/\/gtr.ukri.org\/projects?ref=BB%2FS009043%2F1\">The EMERALD project is funded by the UK Research and Innovation (UKRI)<\/a>.&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Open Targets funding.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>How text mining collaborations benefit our research, data resources, and the wider scientific community.<\/p>\n","protected":false},"author":77,"featured_media":48737,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[2,17591,11056],"tags":[4718,28,5780,1345,36,428,677,604,556,315],"embl_taxonomy":[14027],"class_list":["post-48731","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-science","category-science-technology","category-technology-and-innovation","tag-artificial-intelligence","tag-bioinformatics","tag-data-science","tag-data-sharing","tag-embl-ebi","tag-europepmc","tag-literature","tag-machine-learning","tag-open-access","tag-open-data","embl_taxonomy-literature-services"],"embl_taxonomy_terms":[{"uuid":"a:3:{i:0;s:36:\"302cfdf7-365b-462a-be65-82c7b783ebf7\";i:1;s:36:\"18699e63-ed43-40c6-8d1c-203db7ed72ee\";i:2;s:36:\"9d1440f0-a307-4a28-9262-a3fdcf9ecb3d\";}","parents":[],"name":["Literature Services"],"slug":"literature-services","description":"What &gt; EMBL-EBI Services &gt; Literature Services"}],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.2 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Europe PMC: Harnessing the power of text mining to accelerate life sciences research | EMBL<\/title>\n<meta name=\"description\" content=\"How text mining collaborations at EMBL-EBI benefit our research, data resources, and the wider scientific community\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Europe PMC: Harnessing the power of text mining to accelerate life sciences research | EMBL\" \/>\n<meta property=\"og:description\" content=\"How text mining collaborations at EMBL-EBI benefit our research, data resources, and the wider scientific community\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/\" \/>\n<meta property=\"og:site_name\" content=\"EMBL\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/embl.org\/\" \/>\n<meta property=\"article:published_time\" content=\"2022-05-12T08:33:25+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-03-22T11:59:40+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2022\/05\/2022-Europe-PMC-Text-Mining-AI-1000x600-1.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"600\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Vicky Hatch\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@embl\" \/>\n<meta name=\"twitter:site\" content=\"@embl\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Vicky Hatch\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/\"},\"author\":{\"name\":\"Vicky Hatch\",\"@id\":\"https:\/\/www.embl.org\/news\/#\/schema\/person\/d8477ba2d7a6164b141a3872a25ee982\"},\"headline\":\"Europe PMC: Harnessing the power of text mining to accelerate life sciences research\",\"datePublished\":\"2022-05-12T08:33:25+00:00\",\"dateModified\":\"2024-03-22T11:59:40+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/\"},\"wordCount\":1040,\"publisher\":{\"@id\":\"https:\/\/www.embl.org\/news\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2022\/05\/2022-Europe-PMC-Text-Mining-AI-1000x600-1.jpg\",\"keywords\":[\"artificial intelligence\",\"bioinformatics\",\"data science\",\"data sharing\",\"embl-ebi\",\"europepmc\",\"literature\",\"machine learning\",\"open access\",\"open data\"],\"articleSection\":[\"Science\",\"Science &amp; Technology\",\"Technology and innovation\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/\",\"url\":\"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/\",\"name\":\"Europe PMC: Harnessing the power of text mining to accelerate life sciences research | EMBL\",\"isPartOf\":{\"@id\":\"https:\/\/www.embl.org\/news\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2022\/05\/2022-Europe-PMC-Text-Mining-AI-1000x600-1.jpg\",\"datePublished\":\"2022-05-12T08:33:25+00:00\",\"dateModified\":\"2024-03-22T11:59:40+00:00\",\"description\":\"How text mining collaborations at EMBL-EBI benefit our research, data resources, and the wider scientific community\",\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/#primaryimage\",\"url\":\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2022\/05\/2022-Europe-PMC-Text-Mining-AI-1000x600-1.jpg\",\"contentUrl\":\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2022\/05\/2022-Europe-PMC-Text-Mining-AI-1000x600-1.jpg\",\"width\":1000,\"height\":600,\"caption\":\"Harnessing the power of text mining. Image credit: Karen Arnott\/EMBL-EBI\"},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.embl.org\/news\/#website\",\"url\":\"https:\/\/www.embl.org\/news\/\",\"name\":\"European Molecular Biology Laboratory News\",\"description\":\"News from the European Molecular Biology Laboratory\",\"publisher\":{\"@id\":\"https:\/\/www.embl.org\/news\/#organization\"},\"alternateName\":\"EMBL News\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.embl.org\/news\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.embl.org\/news\/#organization\",\"name\":\"European Molecular Biology Laboratory\",\"alternateName\":\"EMBL\",\"url\":\"https:\/\/www.embl.org\/news\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.embl.org\/news\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2025\/09\/EMBL_logo_colour-1-300x144-1.png\",\"contentUrl\":\"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2025\/09\/EMBL_logo_colour-1-300x144-1.png\",\"width\":300,\"height\":144,\"caption\":\"European Molecular Biology Laboratory\"},\"image\":{\"@id\":\"https:\/\/www.embl.org\/news\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/embl.org\/\",\"https:\/\/x.com\/embl\",\"https:\/\/www.instagram.com\/embl_org\/\",\"https:\/\/www.linkedin.com\/company\/15813\/\",\"https:\/\/www.youtube.com\/user\/emblmedia\/\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.embl.org\/news\/#\/schema\/person\/d8477ba2d7a6164b141a3872a25ee982\",\"name\":\"Vicky Hatch\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.embl.org\/news\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/6d864d58088d9e60f42c501aa30714d303efc1ca5aed268210905409910b90d5?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/6d864d58088d9e60f42c501aa30714d303efc1ca5aed268210905409910b90d5?s=96&d=mm&r=g\",\"caption\":\"Vicky Hatch\"},\"url\":\"https:\/\/www.embl.org\/news\/author\/vicky-hatch-2-2-2-2-2-2-2-2-2-2-2-2-2-2-2-2-2-2--2\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Europe PMC: Harnessing the power of text mining to accelerate life sciences research | EMBL","description":"How text mining collaborations at EMBL-EBI benefit our research, data resources, and the wider scientific community","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/","og_locale":"en_US","og_type":"article","og_title":"Europe PMC: Harnessing the power of text mining to accelerate life sciences research | EMBL","og_description":"How text mining collaborations at EMBL-EBI benefit our research, data resources, and the wider scientific community","og_url":"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/","og_site_name":"EMBL","article_publisher":"https:\/\/www.facebook.com\/embl.org\/","article_published_time":"2022-05-12T08:33:25+00:00","article_modified_time":"2024-03-22T11:59:40+00:00","og_image":[{"width":1000,"height":600,"url":"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2022\/05\/2022-Europe-PMC-Text-Mining-AI-1000x600-1.jpg","type":"image\/jpeg"}],"author":"Vicky Hatch","twitter_card":"summary_large_image","twitter_creator":"@embl","twitter_site":"@embl","twitter_misc":{"Written by":"Vicky Hatch","Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/#article","isPartOf":{"@id":"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/"},"author":{"name":"Vicky Hatch","@id":"https:\/\/www.embl.org\/news\/#\/schema\/person\/d8477ba2d7a6164b141a3872a25ee982"},"headline":"Europe PMC: Harnessing the power of text mining to accelerate life sciences research","datePublished":"2022-05-12T08:33:25+00:00","dateModified":"2024-03-22T11:59:40+00:00","mainEntityOfPage":{"@id":"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/"},"wordCount":1040,"publisher":{"@id":"https:\/\/www.embl.org\/news\/#organization"},"image":{"@id":"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/#primaryimage"},"thumbnailUrl":"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2022\/05\/2022-Europe-PMC-Text-Mining-AI-1000x600-1.jpg","keywords":["artificial intelligence","bioinformatics","data science","data sharing","embl-ebi","europepmc","literature","machine learning","open access","open data"],"articleSection":["Science","Science &amp; Technology","Technology and innovation"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/","url":"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/","name":"Europe PMC: Harnessing the power of text mining to accelerate life sciences research | EMBL","isPartOf":{"@id":"https:\/\/www.embl.org\/news\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/#primaryimage"},"image":{"@id":"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/#primaryimage"},"thumbnailUrl":"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2022\/05\/2022-Europe-PMC-Text-Mining-AI-1000x600-1.jpg","datePublished":"2022-05-12T08:33:25+00:00","dateModified":"2024-03-22T11:59:40+00:00","description":"How text mining collaborations at EMBL-EBI benefit our research, data resources, and the wider scientific community","inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.embl.org\/news\/science\/europe-pmc-harnessing-the-power-of-text-mining-to-accelerate-life-sciences-research\/#primaryimage","url":"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2022\/05\/2022-Europe-PMC-Text-Mining-AI-1000x600-1.jpg","contentUrl":"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2022\/05\/2022-Europe-PMC-Text-Mining-AI-1000x600-1.jpg","width":1000,"height":600,"caption":"Harnessing the power of text mining. Image credit: Karen Arnott\/EMBL-EBI"},{"@type":"WebSite","@id":"https:\/\/www.embl.org\/news\/#website","url":"https:\/\/www.embl.org\/news\/","name":"European Molecular Biology Laboratory News","description":"News from the European Molecular Biology Laboratory","publisher":{"@id":"https:\/\/www.embl.org\/news\/#organization"},"alternateName":"EMBL News","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.embl.org\/news\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.embl.org\/news\/#organization","name":"European Molecular Biology Laboratory","alternateName":"EMBL","url":"https:\/\/www.embl.org\/news\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.embl.org\/news\/#\/schema\/logo\/image\/","url":"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2025\/09\/EMBL_logo_colour-1-300x144-1.png","contentUrl":"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2025\/09\/EMBL_logo_colour-1-300x144-1.png","width":300,"height":144,"caption":"European Molecular Biology Laboratory"},"image":{"@id":"https:\/\/www.embl.org\/news\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/embl.org\/","https:\/\/x.com\/embl","https:\/\/www.instagram.com\/embl_org\/","https:\/\/www.linkedin.com\/company\/15813\/","https:\/\/www.youtube.com\/user\/emblmedia\/"]},{"@type":"Person","@id":"https:\/\/www.embl.org\/news\/#\/schema\/person\/d8477ba2d7a6164b141a3872a25ee982","name":"Vicky Hatch","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.embl.org\/news\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/6d864d58088d9e60f42c501aa30714d303efc1ca5aed268210905409910b90d5?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/6d864d58088d9e60f42c501aa30714d303efc1ca5aed268210905409910b90d5?s=96&d=mm&r=g","caption":"Vicky Hatch"},"url":"https:\/\/www.embl.org\/news\/author\/vicky-hatch-2-2-2-2-2-2-2-2-2-2-2-2-2-2-2-2-2-2--2\/"}]}},"field_target_display":"both","field_article_language":{"value":"english","label":"English"},"fimg_url":"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2022\/05\/2022-Europe-PMC-Text-Mining-AI-1000x600-1.jpg","featured_image_src":"https:\/\/www.embl.org\/news\/wp-content\/uploads\/2022\/05\/2022-Europe-PMC-Text-Mining-AI-1000x600-1.jpg","acf":{"featured":true,"show_featured_image":false,"field_target_display":"both","article_intro":"<p>How text mining collaborations benefit our research, data resources, and the wider scientific community<\/p>\n","related_links":[{"link_description":"Europe PMC","link_url":"https:\/\/europepmc.org\/"},{"link_description":"MGnify","link_url":"https:\/\/www.ebi.ac.uk\/metagenomics\/"},{"link_description":"Open Targets","link_url":"https:\/\/www.opentargets.org\/"}],"source_article":false,"in_this_article":false,"press_contact":"None"},"_links":{"self":[{"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/posts\/48731","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/users\/77"}],"replies":[{"embeddable":true,"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/comments?post=48731"}],"version-history":[{"count":32,"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/posts\/48731\/revisions"}],"predecessor-version":[{"id":67347,"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/posts\/48731\/revisions\/67347"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/media\/48737"}],"wp:attachment":[{"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/media?parent=48731"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/categories?post=48731"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/tags?post=48731"},{"taxonomy":"embl_taxonomy","embeddable":true,"href":"https:\/\/www.embl.org\/news\/wp-json\/wp\/v2\/embl_taxonomy?post=48731"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}