The Challenges and Lessons Learned Building a New UK Infrastructure for Finding and Accessing Population-Wide COVID-19 Data for Research and Public Health Analysis: The CO-CONNECT Project

J Med Internet Res. 2024 Nov 20:26:e50235. doi: 10.2196/50235.

Abstract

The COVID-19-Curated and Open Analysis and Research Platform (CO-CONNECT) project worked with 22 organizations across the United Kingdom to build a federated platform, enabling researchers to instantaneously and dynamically query federated datasets to find relevant data for their study. Finding relevant data takes time and effort, reducing the efficiency of research. Although data controllers could understand the value of such a system, there were significant challenges and delays in setting up the platform in response to COVID-19. This paper aims to present the challenges and lessons learned from the CO-CONNECT project to support other similar initiatives in the future. The project encountered many challenges, including the impacts of lockdowns on collaboration, understanding the new architecture, competing demands on people's time during a pandemic, data governance approvals, different levels of technical capabilities, data transformation to a common data model, access to granular-level laboratory data, and how to engage public and patient representatives meaningfully on a highly technical project. To overcome these challenges, we developed a range of methods to support data partners such as explainer videos; regular, short, "touch base" videoconference calls; drop-in workshops; live demos; and a standardized technical onboarding documentation pack. A 4-stage data governance process emerged. The patient and public representatives were fully integrated team members. Persistence, patience, and understanding were key. We make 8 recommendations to change the landscape for future similar initiatives. The new architecture and processes developed are being built upon for non-COVID-19-related data, providing an infrastructural legacy.

Keywords: CO-CONNECT; COVID-19; analysis; challenges; cohort discovery; data; data transformation; feasibility analysis; federated analytics; federated discovery; infrastructure; lessons learned; population wide; public health; safe havens; trusted research environments.

MeSH terms

  • COVID-19* / epidemiology
  • COVID-19* / prevention & control
  • Humans
  • Pandemics / prevention & control
  • Public Health / methods
  • SARS-CoV-2
  • United Kingdom