Skip to content

Abstract Search

Primary Submission Category: Generalizability/Transportability

Paradoxes and resolutions for semiparametric fusion of individual and summary data

Authors: Wenjie Hu, Ruoyu Wang, Wei Li, Wang Miao,

Presenting Author: Wenjie Hu*

Suppose we have available individual data from an internal study and various types of summary statistics from relevant external studies. External summary statistics have been used as constraints on the internal data distribution, which promised to improve the statistical inference in the internal data; however, the additional use of external summary data may lead to paradoxical results: efficiency loss may occur if the uncertainty of summary statistics is not negligible and a large estimation bias can emerge even if the bias of external summary statistics is small. We investigate these paradoxical results in a semiparametric framework. We establish the semiparametric efficiency bound for estimating a general functional of the internal data distribution, which is shown to be no larger than that using only internal data. We propose a data-fused efficient estimator that achieves this bound so that the efficiency paradox is resolved. Besides, we propose a debiased estimator that can achieve the same asymptotic distribution as the oracle estimator as if one knew whether the summary statistics were biased or not. Simulations and
application to a Helicobacter pylori infection dataset are used to illustrate the proposed methods.