MedPath

Study to Develop a Tool to Estimate the Kidney Function in Databases Without Laboratory Data

Completed
Conditions
Renal Function
Registration Number
NCT03605810
Lead Sponsor
Bayer
Brief Summary

Scientific analyses are frequently performed on e.g. health insurance databases to study the usage and effectiveness of drugs in real life.

Kidney function is known to have an influence on a patients disease development and/or drug levels in blood.

However, often direct measures for kidney function are not available in databases.

This study plans to develop tools to classify the renal function of patients, which helps scientists to identify patient cohorts (groups of patients sharing same characteristics) for scientific analyses.

Detailed Description

Renal impairment is a common comorbidity in patients with diverse main underlying diseases and a pathology accompanying increasing age. Renal function might be an important modifier of treatment effects.

Population-based administrative claims databases are increasingly used in large-scale comparative outcomes studies of drug treatments. However, claims databases often lack information on laboratory tests results limiting their usefulness in Real-World Evidence(RWE) research of patients with renal impairment.

There is a need to develop methods for identification of patients with renal dysfunction from healthcare administrative claims-based proxies.

The main objective of this study is the development of algorithms/models to predict eGFR values and/or classes for patients at certain time point based on entries in claims database (demographic characteristics, clinical diagnoses, procedures and drug treatments) for a general population and a variety of use-cases (atrial fibrillation, coronary artery disease, type 2 diabetes mellitus patients sub-populations). To achieve this, modern data-driven machine learning techniques will be applied to discover relationships between renal status, measured by eGFR, and longitudinal patient-level data.

Evaluation of models' performance (out of sample validation, benchmark test, performance differences between eGFR value prediction algorithms and classification models tailored for the pre-defined eGFR classes) will be done as well.

Recruitment & Eligibility

Status
COMPLETED
Sex
All
Target Recruitment
5132200
Inclusion Criteria

Not provided

Exclusion Criteria

Not provided

Study & Design

Study Type
OBSERVATIONAL
Study Design
Not specified
Primary Outcome Measures
NameTimeMethod
Performance of classification to predict eGFRFrom eGRF values starting and lasting 180d + 370d

For numeric models cross-validated performance is measured as correlation via r\*2.

Class based performances are measured as cross-validated sensitivities given pre-defined false discovery rates with following definition for positives and negatives:

Observed eGFR class X:

* positive: eGFR measured at begin of time frame is in class X

* negative: eGFR measured at begin of time frame is not in class X

Class predicted by model:

* positive: eGFR predicted is class X

* negative: eGFR predicted is not class X

Secondary Outcome Measures
NameTimeMethod

Trial Locations

Locations (1)

US OPTUM CDM database

🇺🇸

Whippany, New Jersey, United States

US OPTUM CDM database
🇺🇸Whippany, New Jersey, United States

MedPath

Empowering clinical research with data-driven insights and AI-powered tools.

© 2025 MedPath, Inc. All rights reserved.