Weighted Cox regression for the prediction of heterogeneous patient subgroups

03/19/2020
by   Katrin Madjar, et al.
0

An important task in clinical medicine is the construction of risk prediction models for specific subgroups of patients based on high-dimensional molecular measurements such as gene expression data. Major objectives in modeling high-dimensional data are good prediction performance and feature selection to find a subset of predictors that are truly associated with a clinical outcome such as a time-to-event endpoint. In clinical practice, this task is challenging since patient cohorts are typically small and can be heterogeneous with regard to their relationship between predictors and outcome. When data of several subgroups of patients with the same or similar disease are available, it is tempting to combine them to increase sample size, such as in multicenter studies. However, heterogeneity between subgroups can lead to biased results and subgroup-specific effects may remain undetected. For this situation, we propose a penalized Cox regression model with a weighted version of the Cox partial likelihood that includes patients of all subgroups but assigns them individual weights based on their subgroup affiliation. Patients who are likely to belong to the subgroup of interest obtain higher weights in the subgroup-specific model. Our proposed approach is evaluated through simulations and application to real lung cancer cohorts. Simulation results demonstrate that our model can achieve improved prediction and variable selection accuracy over standard approaches.

READ FULL TEXT
research
04/16/2020

Combining heterogeneous subgroups with graph-structured variable selection priors for Cox regression

Important objectives in cancer research are the prediction of a patient'...
research
03/23/2016

Predicting Glaucoma Visual Field Loss by Hierarchically Aggregating Clustering-based Predictors

This study addresses the issue of predicting the glaucomatous visual fie...
research
07/22/2021

Inference for High Dimensional Censored Quantile Regression

With the availability of high dimensional genetic biomarkers, it is of i...
research
07/19/2020

Supervised clustering of high dimensional data using regularized mixture modeling

Identifying relationships between molecular variations and their clinica...
research
11/28/2022

Regression-based heterogeneity analysis to identify overlapping subgroup structure in high-dimensional data

Heterogeneity is a hallmark of complex diseases. Regression-based hetero...
research
05/06/2023

A Nonparametric Mixed-Effects Mixture Model for Patterns of Clinical Measurements Associated with COVID-19

Some patients with COVID-19 show changes in signs and symptoms such as t...
research
05/10/2023

Flexible cost-penalized Bayesian model selection: developing inclusion paths with an application to diagnosis of heart disease

We propose a Bayesian model selection approach that allows medical pract...

Please sign up or login with your details

Forgot password? Click here to reset