Innovation

AI Tool Guides Researchers to Coronavirus Insights

May 13, 2020 2979

The big idea

The scientific community worldwide has mobilized with unprecedented speed to tackle the COVID-19 pandemic, and the emerging research output is staggering. Every day, hundreds of scientific papers about COVID-19 come out, in both traditional journals and non-peer-reviewed preprints. There’s already far more than any human could possibly keep up with, and more research is constantly emerging.

And it’s not just new research. We estimate that there are as many as 500,000 papers relevant to COVID-19 that were published before the outbreak, including papers related to the outbreaks of SARS in 2002 and MERS in 2012. Any one of these might contain the key information that leads to effective treatment or a vaccine for COVID-19.

Traditional methods of searching through the research literature just don’t cut it anymore. This is why we and our colleagues at Lawrence Berkeley National Lab are using the latest artificial intelligence techniques to build COVIDScholar, a search engine dedicated to COVID-19. COVIDScholar includes tools that pick up subtle clues like similar drugs or research methodologies to recommend relevant research to scientists. AI can’t replace scientists, but it can help them gain new insights from more papers than they could read in a lifetime.

COVIDScholar is a search engine with machine learning algorithms under the hood. Screen capture by The Conversation

Why it matters

When it comes to finding effective treatments for COVID-19, time is of the essence. Scientists spend 23% of their time searching for and reading papers. Every second our search tools can save them is more time to spend making discoveries in the lab and analyzing data.

AI can do more than just save scientists time. Our group’s previous work showed that AI can capture latent scientific knowledge from text, making connections that humans missed. There, we showed that AI was able to suggest new, cutting-edge functional materials years before their discovery by humans. The information was there all along, but it took combining information from hundreds of thousands of papers to find it.

We are now applying the same techniques to COVID-19, to find existing drugs that could be repurposed, genetic links that might help develop a vaccine or effective treatment regimens. We’re also starting to build in new innovations, like using molecular structures to help find which drugs are similar to each other, including those that are similar in unexpected ways.

This article by Amalie Trewartha and John Dagdelen originally appeared at The Conversation, a Social Science Space partner site, under the title “AI tool searches thousands of scientific papers to guide researchers to coronavirus insights.”

How we do this work

The most important part of our work is the data. We’ve built web scrapers that collect new papers as they’re published from a wide variety of sources, making them available on our website within 15 minutes of their appearance online. We also clean the data, fixing mistakes in formatting and comparing the same paper from multiple sources to find the best version. Our machine learning algorithms then go to work on the paper, tagging it with subject categories and marking work important to COVID-19.

COVIDScholar labels and categorizes about 250 journal papers a day to help researchers make connections they might otherwise miss. Kevin Cruse and Haoyan Huo, CC BY-ND

We’re also continuously seeking out experts in new areas. Their input and annotation of data is what allows us to train new AI models.

What’s next

So far, we have assembled a collection of over 60,000 papers on COVID-19, and we’re expanding the collection daily. We’ve also built search tools that group research into categories, suggest related research and allow users to find papers that connect different concepts, such as papers that connect a specific drug to the diseases it’s been used to treat in the past. We’re now building AI algorithms that allow researchers to plug search results into quantitative models for studying topics like protein interactions. We’re also starting to dig through the past literature to find hidden gems.

We hope that very soon, researchers using COVIDScholar will start to identify relationships that they might never have imagined, bringing us closer to treatments and a remedy for COVID-19.

Amalie Trewartha and John Dagdelen

Amalie Trewartha (pictured) is a postdoc in Gerbrand Ceder's group in the materials science division at Lawrence Berkeley National Lab. She began her career as a nuclear physicist, before moving into materials science in 2019, with a focus on machine learning. Her research interests include the application of natural language processing techniques to scientific literature, and building thermodynamically-motivated ML models for materials property prediction. John is a PhD student studying materials science in the Persson Group at the University of California, Berkeley and Lawrence Berkeley National Laboratory. He is using natural language processing and other statistical learning techniques to discover materials with extraordinary properties.

View all posts by Amalie Trewartha and John Dagdelen

Published

May 13, 2020

NAS Report Examines Nexus of AI and Workplace

By Jenna Collins

Read Now

Watch Now: ‘All Change! 2024 – A Year of Elections’

Insights

December 17, 2024

Watch Now: ‘All Change! 2024 – A Year of Elections’

By Joe Sweeney

Read Now

What European SMEs Can Teach Us About Innovation and Informal Human Resource Management

Business and Management INK

December 16, 2024

What European SMEs Can Teach Us About Innovation and Informal Human Resource Management

By Desiree Meurs, Marise Born, Yolanda Grift, Maaike Lycklama à Nijeholt, and Joop Schippers

Read Now

When Do You Need to Trust a GenAI’s Input to Your Innovation Process?

Business and Management INK

December 13, 2024

When Do You Need to Trust a GenAI’s Input to Your Innovation Process?

By Frank T. Piller, Tucker J. Marion and Mahdi Srour

Read Now

The Authors of ‘Artificial Intelligence and Work’ on Future Risk

John Bratton and Laura Steele 869 Business and Management INK, Innovation, Public Policy

During the final stages of editing the proofs for Artificial Intelligence and Work: Transforming Work, Organizations, and Society in an Age of Insecurity, […]

Read Now

Beware! AI Can Lie.

David Canter 696 Innovation

David Canter reveals how he discovered Microsoft Copilot acted like a lazy student, inventing responses with apparent confidence that were blatantly wrong. […]

Read Now

Joshua Greene on Effective Charities

Social Science Space 861 Insights, Public Policy, Social Science Bites

Harvard psychology professor Joshua Greene studies the back-and-forth between emotion and reason in how human beings make moral decisions. In this Social […]

Read Now