The first public nucleotide sequence database turns 25
Today EMBL-Bank, the nucleotide sequence database of the European Molecular Biology Laboratory (EMBL), celebrates its 25th anniversary.
It was the world’s earliest public database of DNA and RNA sequences and remains Europe’s primary nucleotide sequence resource. The database is maintained by EMBL’s European Bioinformatics Institute in Hinxton (UK) in collaboration with its US and Japanese counterparts GenBank and the DNA Databank of Japan.
EBI Associate Director Graham Cameron commented: “In the early days, databases were an adjunct to scientific publications and sequences were transcribed from the literature. Times have moved on. The databases are now the primary record for high-throughput science. We and our partners in Japan and the USA are custodians of that record, and proud of the long-standing collaboration which has kept all of the data available to scientists worldwide.”
Over the years EMBL-Bank has grown exponentially and currently contains over 96 million entries corresponding to 170 gigabases of sequence from over 280.000 organisms. New sequences are submitted at a rate of more than one sequence every two seconds and the database receives millions of accesses every day.
Today, half an hour at the computer can suggest a function for a new gene – a task that might previously have occupied a researcher for a year. In future, connections to diverse data from new high-throughput methods will help create an information space crucial to interdisciplinary systems biology.
It’s almost a year since the coronavirus outbreak was declared a pandemic, affecting all our lives. While the virus continues its grip on the world, scientists are understanding it better and better, increasing our knowledge about it and opening up new ways to fight it.