PSC Provides Researchers Access to China’s National Genomics Data Center COVID-19 Database
Working with collaborators at Weill Cornell Medicine, PSC is currently hosting the COVID-19 database for the National Genomics Data Center of the Chinese People’s Republic. By providing ready access to this important dataset, we hope to enable researchers to better understand the COVID-19 virus, helping to control the pandemic in the U.S. and around the world. PSC will host the database on its Bridges platform, which is optimized for the kind of Big Data analysis that will be necessary for the task.
Thanks in part to PSC resources, the scientific platform Galaxy at the Pennsylvania State University has developed an open source set of computer tools for helping scientists studying the COVID-19 virus obtain transparent and reproducible results.
Galaxy has long been a popular set of tools for scientists studying genetic data and other Big Data research tasks. Galaxy relies in part on PSC’s Bridges platform, automatically routing the largest analysis jobs to Bridges to take advantage of the system’s unique capabilities for managing and computing the most massive data projects.
The purpose of the current open source tools—so named because all the computer code is readily available for scientists to study and critique their programming—is to create a set of standards and rules for handling and calculating using COVID-19 data. Such rules are needed to ensure that a given computation will turn out the same way no matter how many times it is done or which laboratory/computer system carries it out. This, in turn, will help ensure that measures to control or fight the virus will be more reliable and predictable.
For more information on the Galaxy toolkit for COVID-19, see https://covid19.galaxyproject.org/
For more information on Bridges work on COVID-19, see https://www.psc.edu/bridges-for-covid-19-research