Zhiqiang Tan's Research
Return to the home page.
Software
R package iWeigReg: Improved methods for causal inference and missing data problems.
Description: Improved methods based on inverse probability weighting and outcome regression for causal inference and missing data problems.
References: Tan (2006), Tan (2010), Tan (2013).
Vignette: here.
R package UWHAM: Unbinned weighted histogram analysis method.
Description: A method for estimating log-normalizing constants (or free energies) and expectations from multiple distributions (such as multiple generalized ensembles).
References: Tan et al. (2012).
Working Papers
Journal articles (including refereed conference papers)
Probability & Statistics (see below for Interdisciplinary fields)
Yang, T. and Tan, Z. (2018) Backfitting algorithms for total-variation and empirical-norm penalized additive modeling with high-dimensional data, STAT, to appear. (Supplement)
Small, D., Tan, Z., Ramsahai, R.R., Lorch, S.A., and Brookhart, A. (2017) Instrumental variable estimation with a stochastic monotonicity assumption, Statistical Science, 32, 561-579.
Tan, Z. (2017) Optimally adjusted mixture sampling and locally weighted histogram analysis, Journal of Computational and Graphical Statistics, 26, 54-65. (Supplement)
Li, W., Chen, R., and Tan, Z. (2016) Efficient sequential Monte Carlo with multiple proposals and control variates, Journal of the American Statistical Association, 111, 298-313.
Tan, Z. (2016) Steinized empirical Bayes estimation for heteroscedastic data, Statistica Sinica, 26, 1219-1248. (Supplement)
Tan, Z. (2015) Resampling Markov chain Monte Carlo algorithms: Basic analysis and empirical comparisons, Journal of Computational and Graphical Statistics, 24, 328-356. (Supplement)
Tan, Z. (2015) Improved minimax estimation of a multivariate normal mean under heteroscedasticity, Bernoulli, 21, 574-603. (Supplement)
Tan, Z. and C. Wu (2015) Generalized pseudo empirical likelihood inferences for complex surveys, Canadian Journal of Statistics, 43, 1-17.
Tan, Z. (2014) Second-order asymptotic theory for calibration estimators in sampling and missing-data problems, Journal of Multivariate Analysis, 131, 240-253. (Supplement)
Wang, C., Tan, Z., and Louis, T.A. (2014) An exponential tilt mixture model for time-to-event data to evaluate treatment effect heterogeneity in randomized clinical trials, Biometrics & Biostatistics International Journal, 1, 00006.
Wang, C., Tan, Z., and Louis, T.A. (2014) An exponential tilt model for quantitative trait loci mapping with time-to-event data, Journal of Bioinformatics Research Studies, 1, 2.
Li, W., Tan, Z., and Chen, R. (2013) Two-stage importance sampling with mixture proposals, Journal of the American Statistical Association, 108, 1350-1365.
Tan, Z. (2013) Simple design-efficient calibration estimators for rejective and high-entropy sampling, Biometrika, 100, 399-415. (Supplement) (Correction)
Tan, Z. (2013) Calibrated path sampling and stepwise bridge sampling, Journal of Statistical Planning and Inference, 143, 675-690.
Tan, Z. (2013) A cluster-sample approach for Monte Carlo integration using multiple samplers, Canadian Journal of Statistics, 41, 151-173. (Supplement)
Okui, R., Small, D., Tan, Z., and Robins, J.M. (2012) Doubly robust instrumental variables regression, Statistica Sinica, 22, 173-205.
VanderWeele, T.J. and Tan, Z. (2012) Directed acyclic graphs with edge-specific bounds, Biometrika, 99, 115-126.
Tan, Z. (2011) Efficient restricted estimators for conditional mean models with missing data, Biometrika, 98, 663-684.
Wang, C., Tan, Z., and Louis, T.A. (2011) Exponential tilt models for two-group comparison with censored data, Journal of Statistical Planning and Inference, 141, 1102-1117.
Tan, Z. (2010) Bounded, efficient, and doubly robust estimation with inverse weighting, Biometrika, 97, 661-682.
Tan, Z. (2010) Marginal and nested structural models using instrumental variables, Journal of the American Statistical Association, 105, 157-169.
Tan, Z. (2010) On estimation of conditional density models with two-phase sampling, Journal of Statistical Planning and Inference, 140, 1986-2002.
Tan, Z. (2010) Nonparametric likelihood and doubly robust estimating equations for marginal and nested structural models, Canadian Journal of Statistics, 38, 609-632. (Supplement)
Cheng, J., Small, D., Tan, Z., and TenHave, T.R. (2009) Efficient nonparametric estimation of causal effects in randomized trials with noncompliance, Biometrika, 96, 19-36.
Tan, Z. (2009) On profile likelihood for exponential tilt mixture models, Biometrika, 96, 229-236.
Wang, W., Scharfstein, D.O., Tan, Z., MacKenzie, E.J. (2009) Causal inference in outcome-dependent two-phase sampling designs, Journal of of the Royal Statistical Society, Ser. B, 71, 947-969.
Chi, Z. and Tan, Z. (2008) Positive false discovery proportions: Intrinsic bounds and adaptive control, Statistica Sinica, 18, 837-860. (Supplement)
Tan, Z. (2008) Monte Carlo integration with Markov chain,
Journal of Statistical Planning and Inference, 138, 1967-1980.
Tan, Z. (2006) Regression and weighting methods for causal inference using instrumental variables, Journal of the American Statistical Association, 101, 1607-1618. (Correction)
Tan, Z. (2006) A distributional approach for causal inference using propensity scores,
Journal of the American Statistical Association, 101, 1619-1637.
Tan, Z. (2006) Monte Carlo integration with acceptance-rejection,
Journal of Computational and Graphical Statistics, 15, 735-752.
Tan, Z. (2004) On a likelihood approach for Monte Carlo integration, Journal of the American Statistical Association, 99, 1027-1036.
Kong, A., McCullagh, P., Meng, X.-L., Nicolae, D., and Tan, Z. (2003) A theory of statistical models for Monte Carlo integration (with discussion), Journal of the Royal Statistical Society, Ser. B, 65, 585-618.
Interdisciplinary fields (see above for Probability & Statistics)
Gerhard, T., Stroup, T.S., Correll, C.U., Huang, C., Tan, Z., Crystal, S., and Olfson, M. (2017) Antipsychotic medication treatment patterns in adult depression, Journal of Clinical Psychiatry, to appear.
Wang, B., Ou, Z., and Tan, Z. (2017) Learning trans-dimensional random fields with applications to language modeling, IEEE Transactions on Pattern Analysis and Machine Intelligence, to appear. (Supplement)
Zhang, B.W., Deng, N., Tan, Z., and Levy, R.M. (2017) Stratified UWHAM and its stochastic approximation for multicanonical simulations which are far from equilibrium, Journal of Chemical Theory and Computation, 13, 4660-4674.
Tan, Z., Xia, J., Zhang, B.W., and Levy, R.M. (2016) Locally weighted histogram analysis and stochastic solution for large-scale multistate free energy estimation, Journal of Chemical Physics, 144, 034107.
Zhang, B.W., Gallicchio, E., Dai, W., He, P., Tan, Z., and Levy, R.M. (2016) Replica exchange simulations of binding free energies: Markov state models, proposal schemes, and reweighting techniques, Journal of Physical Chemistry B, 120, 8289-8301.
Flynn, W.F., Chang, M.W., Tan, Z., Oliveira, G., Yuan, J., Okulicz, J.F., Torbett, B.E., and Levy, R.M. (2015) Deep sequencing of protease inhibitor resistant HIV patient isolates reveals patterns of correlated mutations in Gag and Protease, PLoS Computational Biology, 11, e1004249.
Wang, B., Ou, Z., and Tan, Z. (2015) Trans-dimensional random fields for language modeling, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics (ACL), 785-794. (See Zhijian Ou's webpage for codes.)
Xia, J., Flynn, W.F., Gallicchio, E., Zhang, B.W., He, P., Tan, Z., and Levy, R.M. (2015) Large-scale asynchronous and distributed multidimensional replica exchange molecular simulations and efficiency analysis, Journal of Computational Chemistry, 36, 1772-1785.
Zhang, B.W., Xia, J., Tan, Z., and Levy, R.M. (2015) A stochastic solution to the unbinned WHAM equations, Journal of Physical Chemistry Letters, 6, 3834-3840.
Tan, Z., Gallicchio, E., Lapelosa, M., and Levy, R.M. (2012) Theory of binless multi-state free energy estimation with applications to protein-ligand binding, Journal of Chemical Physics, 136, 144102.
Pluzhnikov, A., Nolan, D.K., Tan, Z., McPeek, M.S., and Ober, C. (2007) Correlation of intergenerational family sizes suggests a genetic component to reproductive fitness, American Journal of Human Genetics, 81, 165-169.
Discussions
Tan, Z. (2008) Improved local efficiency and double robustness , Comment on "Empirical efficiency maximization: Improved locally efficient covariate adjustment in randomized experiments and survival analysis" by Rubin and van der Laan, International Journal of Biostatistics, 4, Article 10.
Tan, Z. (2007) Understanding OR, PS, and DR , Comment on "Demystifying double robustness: A comparison of alternative strategies for estimating a population mean from incomplete data" by Kang and Schafer, Statistical Science, 22, 560-568.
Unpublished manuscripts