Title


mregger -- Mendelian randomization Egger regression

Syntax

mregger varname_gd varname_gp [aweight] [if] [in] [, options]

options Description ---------------------------------------------------------------------------------------------- fe Fixed effect standard errors (default is multiplicative) gxse(varname) variable of genotype-phenotype (SNP-exposure) SEs heterogi Display heterogeneity/pleiotropy statistics ivw Inverse-variance weighted estimator (default is MR-Egger) level(#) set confidence level; default is level(95) norescale Do not rescale residual variance to be 1 (if less than 1) oldnames Revert to using longer outcome name in b and V ereturned matrices penweighted Use penalized weights radial Use radial formulations of the models tdist Use t-distribution for Wald test and CI limits unwi2gx Additionally report unweighted Q_GX and I^2_GX statistics

Description

mregger performs inverse-variance weighted (IVW; Burgess et al., 2013) and Mendelian randomization Egger (MR-Egger) regression (Bowden et al., 2015) using summary level data (i.e. using genotype-disease [SNP-outcome] and genotype-phenotype [SNP-exposure] association estimates and their standard errors).

varname_gd variable containing the genotype-disease (SNP-outcome) association estimates.

varname_gp variable containing the genotype-phenotype (SNP-exposure) association estimates.

For the analytic weights you need to specify the inverse of the genotype-disease (SNP-outcome) standard errors squared, i.e. aw=1/(gdse^2).

Options

fe specifies fixed effect standard errors (i.e. variance of residuals constrained to 1 as in fixed effect meta-analysis models). The default is to use multiplicative standard errors (i.e. variance of residuals unconstrained as in standard linear regression), see Thompson and Sharp (1999) for further details. We recommend specifying this option when using an allelic score as the instrumental variable.

gxse(varname) specifies the variable containing the genotype-phenotype (SNP-exposure) association standard errors. These are required for calculating the I^2_GX statistic (Bowden et al., 2016). An I^2_GX statistic of 90% means that the likely bias due measurement error in the MR-Egger slope estimate is around 10%. If I^2_GX values are less than 90% estimates should be treated with caution.

heterogi displays heterogeneity/pleiotropy statistics. In the heterogeneity output the model based Q-statistic is reported by multiplying the variance of the residuals by the degrees of freedom (Del Greco et al., 2015). For the IVW model this is the Cochran Q-statistic, and for the MR-Egger model this is the Ruecker's Q-statistic. The corresponding I-squared statistic and its 95% CI is also reported.

ivw specifies inverse-variance weighted (IVW) model (Burgess et al., 2013), the default is MR-Egger.

level(#); see [R] estimation options.

norescale specifies that the residual variance is not set to 1 (if it is found to be less than 1). Bowden et al. (2016) rescale the residual variance to be 1 if it is found to be less than 1.

oldnames revert to using the longer outcome variable name in the b and V ereturned matrices.

penweighted specifies using penalized weights as described in Burgess et al. (2016).

radial specifies the radial formulation of the IVW and MR-Egger models (Bowden et al., 2017). Note there is only a difference for the MR-Egger model.

tdist specifies using the t-distribution, instead of the normal distribution, for calculating the Wald test and the confidence interval limits.

unwi2gx specifies the unweighted Q_GX and I^2_GX statistics to be additionally displayed in the output and in the ereturn scalars.

Examples

Using the data provided by Do et al. (2013) recreate Bowden et al. (2016), Table 4, LDL-c "All genetic variants" estimates.

Setup . use https://raw.github.com/remlapmot/mrrobust/master/dodata, clear

Select observations (p-value with exposure < 10^-8) . gen byte sel1 = (ldlcp2 < 1e-8)

IVW (with fixed effect standard errors) . mregger chdbeta ldlcbeta [aw=1/(chdse^2)] if sel1==1, ivw fe

MR-Egger (with SEs using an unconstrained residual variance) . mregger chdbeta ldlcbeta [aw=1/(chdse^2)] if sel1==1

MR-Egger reporting I^2_GX statistic and heterogeneity Q-test . mregger chdbeta ldlcbeta [aw=1/(chdse^2)] if sel1==1, gxse(ldlcse) heterogi

MR-Egger using a t-distribution for inference & CI limits . mregger chdbeta ldlcbeta [aw=1/(chdse^2)] if sel1==1, tdist

MR-Egger using the radial formulation . mregger chdbeta ldlcbeta [aw=1/(chdse^2)] if sel1==1, radial

MR-Egger using the radial formulation and reporting heterogeneity Q-test . mregger chdbeta ldlcbeta [aw=1/(chdse^2)] if sel1==1, radial heterogi



Stored results

mregger stores the following in e():

Scalars e(df_r) residual degrees of freedom (with tdist option) e(k) number of instruments e(I2GX) I^2_GX (with gxse() option) e(QGX) Q_GX (with gxse() option) e(phi) Scale parameter (root mean squared error)

Macros e(cmd) mregger e(cmdline) command as typed

Matrices e(b) coefficient vector e(V) variance-covariance matrix of the estimates

If heterogi is specified mregger additionally returns the r-class results of heterogi in the e-class results.

If unwi2gx is specified mregger additionally returns e(I2GXunw) Unweighted I^2_GX statistic e(QGXunw) Unweighted Q_GX statistic

mregger stores the following in r():

Matrices r(table) Coefficient table with rownames: b, se, z, pvalue, ll, ul, df, crit, eform

References

Bowden J, Davey Smith G, Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. International Journal of Epidemiology, 2015, 44, 2, 512-525. DOI

Bowden J, Davey Smith G, Haycock PC, Burgess S. Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genetic Epidemiology, 2016, 40, 4, 304-314. DOI

Bowden J, Del Greco F, Minelli C, Davey Smith G, Sheehan NA, Thompson JR. Assessing the suitability of summary data for two-sample Mendelian randomization analyses using MR-Egger regression: the role of the I-squared statistic. International Journal of Epidemiology, 2016, 45, 6, 1961-1974. DOI

Bowden J, Spiller W, Del-Greco F, Sheehan NA, Thompson JR, Minelli C, Davey Smith G. Improving the visualisation, interpretation and analysis of two-sample summary data Mendelian randomization via the radial plot and radial regression. International Journal of Epidemiology, 2018, 47, 4, 1264-1278. DOI

Burgess S, Bowden J, Dudbridge F, Thompson SG. Robust instrumental variable methods using candidate instruments with application to Mendelian randomization. arXiv:1606.03729v1, 2016. Link

Burgess S, Butterworth A, Thompson S. Mendelian randomization analysis with multiple genetic variants using summarized data. Genetic Epidemiology, 2013, 37, 7, 658-665. DOI

Del Greco F M, Minelli C, Sheehan NA, Thompson JR. Detecting pleiotropy in Mendelian randomization studies with summary data and a continuous outcome. Statistics in Medicine, 2015, 34, 21, 2926-2940. DOI

Do R et al. Common variants associated with plasma triglycerides and risk for coronary artery disease. Nature Genetics, 2013, 45, 1345-1352. DOI: DOI

Thompson SG, Sharp SJ. Explaining heterogeneity in meta-analysis: a comparison of methods. Statistics in Medicine, 1999, 18, 20, 2693-2708. DOI

Author

INCLUDE help mrrobust-author