posted on 2023-06-08, 16:23authored byRosemary Tate, Natalia BeloffNatalia Beloff, Balques Al-Radwan, Joss Wickson, Shivani Puri, Timothy Williams, Tijeed Van Staa, Adrian Bleach
Objective: UK primary care databases, which contain diagnostic, demographic and prescribing information for millions of patients geographically representative of the UK, represent a significant resource for health services and clinical research. They can be used to identify patients with a specified disease or condition (phenotyping) and to investigate patterns of diagnosis and symptoms. Currently, extracting such information manually is time-consuming and requires considerable expertise. In order to exploit more fully the potential of these large and complex databases, our interdisciplinary team developed generic methods allowing access to different types of user. Materials and methods: Using the Clinical Practice Research Datalink database, we have developed an online user-focused system (TrialViz), which enables users interactively to select suitable medical general practices based on two criteria: suitability of the patient base for the intended study (phenotyping) and measures of data quality. Results: An end-to-end system, underpinned by an innovative search algorithm, allows the user to extract information in near real-time via an intuitive query interface and to explore this information using interactive visualization tools. A usability evaluation of this system produced positive results. Discussion: We present the challenges and results in the development of TrialViz and our plans for its extension for wider applications of clinical research. Conclusions: Our fast search algorithms and simple query algorithms represent a significant advance for users of clinical research databases.
History
Publication status
Published
File Version
Published version
Journal
Journal of the American Medical Informatics Association
<28>This very recent article reports a major advance in algorithms (patent pending) for exploratory search and visualisation of the content of large bioinformatics relational databases such as the Clinical Practice Research Datalink (CPRD) database. With high speed search, the CPRD database can now be used for a wider range of epidemiological studies, randomised controlled trials and pragmatic trials. The technology is being made available to CPRD users, which include all the major pharmaceutical companies with a presence in the UK. It also forms a core part of a Horizon2020 proposal under development with the University of Bordeaux.