Estimating Incident Population Distribution from Prevalent Data

Kwun Chuen Gary Chan; Mei Cheng Wang

doi:10.1111/j.1541-0420.2011.01708.x

Bloomberg School of Public Health

Research output: Contribution to journal › Article › peer-review

9 Scopus citations

Abstract

A prevalent sample consists of individuals who have experienced disease incidence but not the failure event at the sampling time. We discuss methods for estimating the distribution function of a random vector defined at baseline for an incident disease population when data are collected by prevalent sampling. A prevalent sampling design is often more focused and economical than an incident study design for studying the survival distribution of a diseased population, but prevalent samples are biased by design. Subjects with longer survival times are more likely to be included in a prevalent cohort, and other baseline variables of interest that are correlated with survival time are also subject to sampling bias induced by the prevalent sampling scheme. Without recognition of the bias, applying the empirical distribution function to estimate the population distribution of baseline variables can lead to serious bias. In this article, nonparametric and semiparametric methods are developed for distribution estimation of baseline variables using prevalent data.
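
The sampling bias described in the abstract can be illustrated with a small simulation. This is a minimal sketch under assumed conditions, not the estimators developed in the article: onset times are taken to be uniform over a window ending at the sampling time, survival is exponential, and a binary baseline covariate `z` triples the mean survival time. The function name `simulate`, the parameter values, and the weighting step (which uses the true mean survival, known only because the data are simulated) are all hypothetical choices made for illustration.

```python
import random


def simulate(n_incident=200_000, tau=50.0, seed=0):
    """Compare a baseline covariate in an incident vs. a prevalent cohort.

    Assumptions (illustrative only): onset ~ Uniform(0, tau), sampling at
    time tau, survival ~ Exponential with mean 1 (z=0) or 3 (z=1).
    """
    rng = random.Random(seed)
    mean_survival = {0: 1.0, 1: 3.0}  # assumed values, not from the article

    incident, prevalent = [], []
    for _ in range(n_incident):
        z = 1 if rng.random() < 0.5 else 0
        onset = rng.uniform(0.0, tau)
        survival = rng.expovariate(1.0 / mean_survival[z])
        incident.append(z)
        # Prevalent sampling: diseased before tau and still alive at tau.
        if onset + survival > tau:
            prevalent.append(z)

    incident_prop = sum(incident) / len(incident)
    naive_prop = sum(prevalent) / len(prevalent)

    # Inverse weighting by the (here known) mean survival time, which is
    # approximately proportional to the inclusion probability.
    weights = [1.0 / mean_survival[z] for z in prevalent]
    weighted_prop = sum(w * z for w, z in zip(weights, prevalent)) / sum(weights)
    return incident_prop, naive_prop, weighted_prop


if __name__ == "__main__":
    inc, naive, weighted = simulate()
    print(f"incident P(z=1):           {inc:.3f}")
    print(f"prevalent P(z=1), naive:   {naive:.3f}")
    print(f"prevalent P(z=1), weighted:{weighted:.3f}")
```

Under these assumptions the inclusion probability is roughly proportional to mean survival, so the naive prevalent proportion should land near 0.75 rather than the incident value of 0.5, while the weighted proportion should return to about 0.5. In real prevalent data the survival distribution is unknown and must itself be estimated, which is the problem the article addresses.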

Original language	English (US)
Pages (from-to)	521-531
Number of pages	11
Journal	Biometrics
Volume	68
Issue number	2
DOIs	https://doi.org/10.1111/j.1541-0420.2011.01708.x
State	Published - Jun 2012

Keywords

Accelerated failure time model
Cross-sectional sampling
Left truncation
Proportional hazards model

ASJC Scopus subject areas

Statistics and Probability
General Biochemistry, Genetics and Molecular Biology
General Immunology and Microbiology
General Agricultural and Biological Sciences
Applied Mathematics

Cite this

@article{57fdd0463b264379b965f4bbba488f4f,
title = "Estimating Incident Population Distribution from Prevalent Data",
abstract = "A prevalent sample consists of individuals who have experienced disease incidence but not failure event at the sampling time. We discuss methods for estimating the distribution function of a random vector defined at baseline for an incident disease population when data are collected by prevalent sampling. Prevalent sampling design is often more focused and economical than incident study design for studying the survival distribution of a diseased population, but prevalent samples are biased by design. Subjects with longer survival time are more likely to be included in a prevalent cohort, and other baseline variables of interests that are correlated with survival time are also subject to sampling bias induced by the prevalent sampling scheme. Without recognition of the bias, applying empirical distribution function to estimate the population distribution of baseline variables can lead to serious bias. In this article, nonparametric and semiparametric methods are developed for distribution estimation of baseline variables using prevalent data.",
keywords = "Accelerated failure time model, Cross-sectional sampling, Left truncation, Proportional hazards model",
author = "Chan, {Kwun Chuen Gary} and Wang, {Mei Cheng}",
year = "2012",
month = jun,
doi = "10.1111/j.1541-0420.2011.01708.x",
language = "English (US)",
volume = "68",
pages = "521--531",
journal = "Biometrics",
issn = "0006-341X",
publisher = "Wiley-Blackwell",
number = "2",
}

TY - JOUR
T1 - Estimating Incident Population Distribution from Prevalent Data
AU - Chan, Kwun Chuen Gary
AU - Wang, Mei Cheng
PY - 2012/6
Y1 - 2012/6
N2 - A prevalent sample consists of individuals who have experienced disease incidence but not failure event at the sampling time. We discuss methods for estimating the distribution function of a random vector defined at baseline for an incident disease population when data are collected by prevalent sampling. Prevalent sampling design is often more focused and economical than incident study design for studying the survival distribution of a diseased population, but prevalent samples are biased by design. Subjects with longer survival time are more likely to be included in a prevalent cohort, and other baseline variables of interests that are correlated with survival time are also subject to sampling bias induced by the prevalent sampling scheme. Without recognition of the bias, applying empirical distribution function to estimate the population distribution of baseline variables can lead to serious bias. In this article, nonparametric and semiparametric methods are developed for distribution estimation of baseline variables using prevalent data.
AB - A prevalent sample consists of individuals who have experienced disease incidence but not failure event at the sampling time. We discuss methods for estimating the distribution function of a random vector defined at baseline for an incident disease population when data are collected by prevalent sampling. Prevalent sampling design is often more focused and economical than incident study design for studying the survival distribution of a diseased population, but prevalent samples are biased by design. Subjects with longer survival time are more likely to be included in a prevalent cohort, and other baseline variables of interests that are correlated with survival time are also subject to sampling bias induced by the prevalent sampling scheme. Without recognition of the bias, applying empirical distribution function to estimate the population distribution of baseline variables can lead to serious bias. In this article, nonparametric and semiparametric methods are developed for distribution estimation of baseline variables using prevalent data.
KW - Accelerated failure time model
KW - Cross-sectional sampling
KW - Left truncation
KW - Proportional hazards model
UR - http://www.scopus.com/inward/record.url?scp=84862887417&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84862887417&partnerID=8YFLogxK
U2 - 10.1111/j.1541-0420.2011.01708.x
DO - 10.1111/j.1541-0420.2011.01708.x
M3 - Article
C2 - 22313264
AN - SCOPUS:84862887417
SN - 0006-341X
VL - 68
SP - 521
EP - 531
JO - Biometrics
JF - Biometrics
IS - 2
ER -
