Innovation

AI Tool Guides Researchers to Coronavirus Insights

May 13, 2020 2920

The big idea

The scientific community worldwide has mobilized with unprecedented speed to tackle the COVID-19 pandemic, and the emerging research output is staggering. Every day, hundreds of scientific papers about COVID-19 come out, in both traditional journals and non-peer-reviewed preprints. There’s already far more than any human could possibly keep up with, and more research is constantly emerging.

And it’s not just new research. We estimate that there are as many as 500,000 papers relevant to COVID-19 that were published before the outbreak, including papers related to the outbreaks of SARS in 2002 and MERS in 2012. Any one of these might contain the key information that leads to effective treatment or a vaccine for COVID-19.

Traditional methods of searching through the research literature just don’t cut it anymore. This is why we and our colleagues at Lawrence Berkeley National Lab are using the latest artificial intelligence techniques to build COVIDScholar, a search engine dedicated to COVID-19. COVIDScholar includes tools that pick up subtle clues like similar drugs or research methodologies to recommend relevant research to scientists. AI can’t replace scientists, but it can help them gain new insights from more papers than they could read in a lifetime.

COVIDScholar is a search engine with machine learning algorithms under the hood. Screen capture by The Conversation

Why it matters

When it comes to finding effective treatments for COVID-19, time is of the essence. Scientists spend 23% of their time searching for and reading papers. Every second our search tools can save them is more time to spend making discoveries in the lab and analyzing data.

AI can do more than just save scientists time. Our group’s previous work showed that AI can capture latent scientific knowledge from text, making connections that humans missed. There, we showed that AI was able to suggest new, cutting-edge functional materials years before their discovery by humans. The information was there all along, but it took combining information from hundreds of thousands of papers to find it.

We are now applying the same techniques to COVID-19, to find existing drugs that could be repurposed, genetic links that might help develop a vaccine or effective treatment regimens. We’re also starting to build in new innovations, like using molecular structures to help find which drugs are similar to each other, including those that are similar in unexpected ways.

The Conversation logo
This article by Amalie Trewartha and John Dagdelen originally appeared at The Conversation, a Social Science Space partner site, under the title “AI tool searches thousands of scientific papers to guide researchers to coronavirus insights.”

How we do this work

The most important part of our work is the data. We’ve built web scrapers that collect new papers as they’re published from a wide variety of sources, making them available on our website within 15 minutes of their appearance online. We also clean the data, fixing mistakes in formatting and comparing the same paper from multiple sources to find the best version. Our machine learning algorithms then go to work on the paper, tagging it with subject categories and marking work important to COVID-19.

COVIDScholar labels and categorizes about 250 journal papers a day to help researchers make connections they might otherwise miss. Kevin Cruse and Haoyan Huo, CC BY-ND

We’re also continuously seeking out experts in new areas. Their input and annotation of data is what allows us to train new AI models.

What’s next

So far, we have assembled a collection of over 60,000 papers on COVID-19, and we’re expanding the collection daily. We’ve also built search tools that group research into categories, suggest related research and allow users to find papers that connect different concepts, such as papers that connect a specific drug to the diseases it’s been used to treat in the past. We’re now building AI algorithms that allow researchers to plug search results into quantitative models for studying topics like protein interactions. We’re also starting to dig through the past literature to find hidden gems.

We hope that very soon, researchers using COVIDScholar will start to identify relationships that they might never have imagined, bringing us closer to treatments and a remedy for COVID-19.

Amalie Trewartha (pictured) is a postdoc in Gerbrand Ceder's group in the materials science division at Lawrence Berkeley National Lab. She began her career as a nuclear physicist, before moving into materials science in 2019, with a focus on machine learning. Her research interests include the application of natural language processing techniques to scientific literature, and building thermodynamically-motivated ML models for materials property prediction. John is a PhD student studying materials science in the Persson Group at the University of California, Berkeley and Lawrence Berkeley National Laboratory. He is using natural language processing and other statistical learning techniques to discover materials with extraordinary properties.

View all posts by Amalie Trewartha and John Dagdelen

Related Articles

The Conversation Podcast Series Examines Class in British Politics
Communication
October 25, 2024

The Conversation Podcast Series Examines Class in British Politics

Read Now
The Cult of Donald Trump
Insights
October 16, 2024

The Cult of Donald Trump

Read Now
Viewing 2024 Economics Nobel Through Lens of Colonialism’s Impact on Institutions
Recognition
October 15, 2024

Viewing 2024 Economics Nobel Through Lens of Colonialism’s Impact on Institutions

Read Now
Exploring the ‘Publish or Perish’ Mentality and its Impact on Research Paper Retractions
Research
October 10, 2024

Exploring the ‘Publish or Perish’ Mentality and its Impact on Research Paper Retractions

Read Now
Neuromania – Or Where Did the Person Go?

Neuromania – Or Where Did the Person Go?

David Canter bemoans how people are disappearing as ‘brains’ take over.

Read Now
Nick Camp on Trust in the Criminal Justice System

Nick Camp on Trust in the Criminal Justice System

The relationship between citizens and their criminal justice systems comes down to just that – relationships. And those relations generally start with […]

Read Now
This Anthropology Course Looks at Built Environment From Animal Perspective

This Anthropology Course Looks at Built Environment From Animal Perspective

Title of course: Space/Power/Species What prompted the idea for the course? A few years ago, I came across the architect Joyce Hwang’s […]

Read Now
0 0 votes
Article Rating
Subscribe
Notify of
guest

This site uses Akismet to reduce spam. Learn how your comment data is processed.

0 Comments
Newest
Oldest Most Voted
Inline Feedbacks
View all comments