The initiative will create an analytics platform to systematically collect clinical, laboratory and diagnostic data from health care provider organizations nationwide. It will then harmonize the aggregated information into a standard format and make it available rapidly for researchers and health care providers to accelerate COVID-19 research and provide information that may improve clinical care.
Having access to a centralized enclave of this magnitude will help researchers and health care providers answer clinically important questions they previously could not, such as, "Can we predict who might need dialysis because of kidney failure?" or "Who might need to be on a ventilator because of lung failure?" and "Are there different patient responses to coronavirus infection that require distinct therapies?"
"NCATS initially supported the development of this innovative collaborative technology platform to speed the process of understanding the course of diseases, and identifying interventions to effectively treat them," said NCATS Director Christopher P. Austin, M.D. "This platform was deployed to stand up this important COVID-19 effort in a matter of weeks, and we anticipate that it will serve as the foundation for addressing future public health emergencies."
Data access will be open to all approved users, regardless of whether they contribute data. The data are being provided to NCATS as a Limited Data Set (LDS) that retains only two of 18 HIPAA-defined elements: health care provider zip code and dates of service.
NCATS, which is serving as stewards of the data, is taking multiple security and privacy measures. For example, NCATS oversees the use of N3C through user registration, federated login, data use agreements with institutions and data use requests with users. The data reside and remain in NCATS' secure, cloud-based database certified through the Federal Risk and Authorization Management Program, or FedRAMP, which provides standardized assessment, authorization, and continuous monitoring for cloud products and services ensuring the validity of the data while protecting patient privacy. Approved users must analyze data within the platform. In addition, the N3C data will be used only for COVID-19 research purposes, including clinical and translational research and public health surveillance.
The information available via the N3C enclave will be rich in scope and scale. There currently are 35 collaborating sites across the country and the platform contains diverse data from individuals tested for COVID-19. A key component is the harmonization of data, which translates the different ways that contributing hospitals store patient data into a single, common format to enable combined 'apples to apples' analyses. Contributing sites add demographics, symptoms, medications, lab test results, and outcomes data regularly over a five-year period, enabling both the immediate and long-term study of the impact of COVID-19 on health outcomes.
The platform is built to enable machine learning approaches and rigorous statistical analyses, identifying connections and patterns more quickly than can be done through traditional methodologies. These advanced analytics approaches require large, robust datasets to generate statistically valid results and can lead to the simultaneous exploration of multiple questions - and the revealing of likely answers - on a powerful scale.
MEDICA-tradefair.com; Source: NIH / National Center for Advanced Translational Sciences (NCATS)