UF scientists encounter holes in tree of life, push for better data storage
Date:9/3/2013

GAINESVILLE, Fla. --- When it comes to public access, the tree of life has holes.

A new study co-authored by University of Florida researchers shows about 70 percent of published genetic sequence comparisons are not publicly accessible, leaving researchers worldwide unable to get to critical data they may need to tackle a host of problems ranging from climate change to disease control.

Scientists are using the genetic data to construct the largest open-access tree of life as part of the National Science Foundation's $5.6-million Assembling, Visualizing and Analyzing the Tree of Life project. Understanding organismal relationships is increasingly valuable for tracking the origin and spread of emerging diseases, creating agricultural and pharmaceutical products, studying climate change, controlling invasive species and establishing plans for conservation and ecosystem restoration.

The study appearing today in PLoS Biology describes a significant challenge for the project, which is expected to produce an initial draft tree by the end of the year. It highlights the need for developing more effective methods for storing data for long-term use and urges journals to adopt more stringent data-sharing policies.

"I think what we need is a major change in our mindset about just how important it is to deposit your data this has to be a standard part of what we do," said co-author Doug Soltis, a distinguished professor at the Florida Museum of Natural History on the UF campus and UF's biology department. "Because if it's not there, it's lost forever. These are really, really important for long-term use, as we're seeing now in our efforts to build a tree."

Estimates of the amount of missing data were based on 7,539 peer-reviewed studies about animals, fungi, seed plants, bacteria and various microscopic organisms. Soltis said the missing genetic data has required project collaborators to contact hundreds of researchers to request information, or attempt to reproduce the sequence alignments and analyses, which is extremely labor intensive.

"There are ambiguities with the alignments, you have to make certain judgment calls, and so an alignment that I do is not going to be the same as an alignment that somebody else does," said lead author Bryan Drew, a postdoctoral researcher in UF's biology department. "It's hard to assess a publication's validity in a lot of cases if you don't have access to the alignments. To me, that's the biggest problem with all of this."

Challenges include complicated mechanisms for uploading data and inconsistencies between journals: some require or strongly recommend that data be stored in an online database and others do not, Drew said. The most widely used, publicly accessible databases include GenBank, TreeBASE and Dryad. Most journals require DNA sequences be deposited in GenBank, but comparatively few require the sequence alignments to be publicly archived. When study co-authors emailed researchers to obtain missing information, a majority did not respond, and the co-authors were rarely successful in retrieving the data.

"A lot of the authors I contacted said their data was in TreeBASE, but they were unaware of the next step needed after acceptance by the journal the researchers didn't know they had to go back into TreeBASE and actually make the data available to the public," Drew said.

Elizabeth Kellogg, a professor in the department of biology at the University of Missouri-St. Louis who was not involved with the study, said she is not surprised about the large amount of missing information.

"They're absolutely right that when people are publishing papers, you want to document your results as much as you can," Kellogg said. "But many journals aren't requiring that extra step, so some researchers are only submitting the minimum to have their studies published. "There are databases for archiving, but some of their interfaces are somewhat cumbersome, and if you haven't previously done this, it can appear to be a daunting task."



Contact: Doug Soltis
dsoltis@botany.ufl.edu
352-273-1963
University of Florida
Source:Eurekalert
