Home
Students
Papers
Talks
Software
Books
Links
Data Sets

Papers

  • Selected Papers in 2017-2018
    • Ma, S., Ma, Y. Wang, Y. and Carroll, R. J. (2017). A semiparametric single-index risk score across populations. JASA, to appear.
    • Zhang, X., Wang, H., Ma, Y. and Carroll, R. J. (2017). Linear model selection when covariates contain errors. JASA, to appear.
    • Bertrand, A., Legrand, C., Carroll, R. J., de Meester, C. and Van Keilegom, I. (2017). Inference in a survival cure model with mismeasured covariates using a SIMEX approach. Biometrika, to appear.
    • Sarkar, A., Pati, D., Chakraborty, A., Mallick, B. K. and Carroll, R. J. (2017). Bayesian semiparametric multivariate density deconvolution. JASA, to appear.
    • Cao, J., Zhang, X. and Carroll, R. J. (2017). Estimating varying coefficients for partial differential equation models. Biometrics, to appear.
    • Sampson, J. N., Matthews, C. E., Freedman, L. S., Carroll, R. J. and Kipnis, V. (2017). Methods to assess measurement error in questionnaires of sedentary behavior. Journal of Applied Statistics, 43, 1706-1721. Online version pdf
    • Keadle, S., Sampson, J., Li, H., Lyden, K., Matthews, C. E. and Carroll, R. J. (2017). An evaluation of accelerometer-derived metrics to assess daily behavioral patterns. Medicine & Science in Sports & Exercise, 49, 54-63. PMC5176102 pdf
    • Cook, S. J., Blas, B., Carroll, R. J. and Sinha, S. (2017). Two wrongs make a right: addressing underreporting in binary data from multiple sources. Political Analysis, to appear. The journal has an impact factor of 6.098, and it has the largest impact factor of all political science journals.
    • Pfeiffer, R. M., Redd, A. and Carroll, R. J. (2017). On the impact of model selection on predictor identification and parameter inference. Computational Statistics, to appear.

  • Selected Papers in 2016
    • Chatterjee, N., Chen, Y.-H., Maas, P. and Carroll, R. J. (2016). Constrained maximum likelihood estimation for model calibration using summary-level information from external big data sources. JASA, 111, 107-117. pdf
    • Ma, Y. and Carroll, R. J. (2016). Semiparametric estimation in the secondary analysis of case-control studies. Journal of the Royal Statistical Society, Series B, 78, 127-151. pdf
    • Delaigle, A. and Carroll, R. J. (2016). Obituary of Peter Gavin Hall, 1951-2016. IMS Bulletin, 45, 4-5. pdf
    • de la Cruz, R., Meza, C., Arribas-Gil, A. and Carroll, R. J. (2016). Journal of Multivariate Analysis, to appear.Bayesian regression analysis of data with random effects covariates from nonlinear longitudinal measurements. Journal of Multivariate Analysis, 143, 94-106. pdf
    • Potgieter, C. J., Wei, R., Kipnis, V., Freedman, L. S. and Carroll, R. J. (2016). Moment reconstruction and moment-adjusted imputation when exposure is generated by a complex, nonlinear random effects modeling process. Biometrics, to appear. NIHMSID 794866 pdf
    • Gail, M. H., Wu, Jincao, Wang, M., Yaune, S.-S., Cook, N. R., Eliassend, A. H., McCullough, M. L., Yu, K., Zeleniuch-Jacquottei, A., Smith-Warner, S., Ziegler, R. G. and Carroll, R. J. (2017). Calibration and seasonal adjustment for matched case-control studies of Vitamin D and cancer. Statistics in Medicine, 35, 2133-2148. pdf
    • Huque, M. H., Bondell, H., Carroll, R. J. and Ryan, L. (2016). Spatial regression with covariate measurement error: a semi-parametric approach. Biometrics, 72, 678–686. pdf
    • Bhadra, A. and Carroll, R. J. (2016). Exact sampling of the unobserved covariates in Bayesian spline models for measurement error problems. Statistics and Computing, to appear. Online version pdf
    • Alexeff, S. E., Carroll, R. J. and Coull, B. (2014). Spatial measurement error in linear regression models when using predicted air pollution exposures and correction by spatial SIMEX. Biostatistics, 17, 377-389. pdf
    • Kipnis, V., Freedman, L. S., Carroll, R. J. and Midthune, D. (2016). A bivariate measurement error model for semicontinuous and continuous variables: application to nutritional epidemiology. Biometrics, 76,106-115 pdf
    • Midthune, D., Carroll, R. J., Freedman, L. S. and Kipnis, V. (2016). Measurement error models with interactions. Biostatistics, 17, 277-290. pdf
    • Masiuk, S., Shklyar, S., Kukush, A., Carroll, R. J., Kovgan, L. and Likhtarov, I. A. (2016). Estimation of radiation risk in presence of classical additive and Berkson multiplicative errors in exposure doses. Biostatistics, 17, 422-436. pdf
    • Zoh, R., Mallick, B. K., Ivanov, I., Baladandayuthapani, V., Manyam, G., Chapkin, R., Lampe, J. W. and Carroll, R. J. (2016). PCAN: probabilistic correlation analysis of two non-normal data sets. Biometrics, 72, 1358-1368. PMC5045754 pdf
    • Li, H., Keadle, S., Kipnis, V. and Carroll, R. J. (2016). Longitudinal functional additive model with continuous proportional outcomes for physical activity data. STAT, 5, 242-250. pdf
    • Huque, M. H., Carroll, R. J., Christiani, D. C. and Ryan, L. M. (2017). Exposure enriched case-control (EECC) design for the assessment of gene-environment interaction. Genetic Epidemiology, 40, 570--578. PMCID: PMC5069109 pdf
    • Keogh, R. H., Carroll, R. J., Tooze, J., Kirkpatrick, S. I. and Freedman, L. S. (2016). Statistical issues related to dietary intake as the response variable in intervention trials. Statistics in Medicine, 35, 4493-4508. PMID 27324170 pdf


  • Selected Papers in 2015
    • Freedman, L. S., Midthune, D., Dodd, K., Carroll, R. J. and Kipnis, V. (2015). A statistical model for measurement error that incorporates variation over time in the target measure, with application to nutritional epidemiology. Statistics in Medicine, 34, 3590-3605. pdf
    • Li, H., Keadle, S. K., Staudenmayer, J., Assaad, H., Huang, J. Z. and Carroll, R. J. (2015). Methods to assess an exercise intervention trial based on three-level functional data. Biostatistics, 16, 754-771. pdf
    • Wang, Y., Wang, S. and Carroll, R. J. (2015). The direct integral method for confidence intervals for the ratio of two location prameters. Biometrics, to appear. PMID25939421 pdf
    • Zhang, X., Zou, G. and Carroll, R. J. (2015). Model averaging based on Kullback-Leibler distance. Statistica Sinica, 25, 1583-1598. pdf
    • Freedman, L. S., Midthune, D., Carroll, R. J., Commins, J. M., Arab, L., Baer, D. J., Moler, J. E., Moshfegh, A. J., Neuhouser, M. L., Prentice, R. L. and Rhodes, D. (2015). Application of a new statistical model for measurement error to the evaluation of dietary self-report instruments. Epidemiology, 26, 925-933. pdf
    • Ma, S., Carroll, R. J., Liang, H. and Xu, S. (2015). Estimation and inference in generalized additive coefficient models for nonlinear interactions with high-dimensional covariates. Annals of Statistics, 43, 2102-2131. pdf
    • Assaad, H. I., Hou, Y., Zhou, L., Carroll, R. J., and Wu, G. (2015). Rapid publication-ready MS-Word tables for two-way ANOVA. SpringerPlus, 4, 1-9. PMC4305362
    • Yi, G. Y., Ma, Y., Spiegelman, D. and Carroll, R. J. (2015). Functional and structural methods with mixed measurement error and misclassification in covariates. Journal of the American Statistical Association, to appear. Online version pdf
    • Zhang, X., Cao, J. and Carroll, R. J. (2015). On the selection of ordinary differential equation models with application to predator-prey dynamical models. Biometrics, to appear. Online version pdf
    • Gregory, K. B., Carroll, R. J., Baladandayuthapani, V. and Lahiri, S. N. (2015). A two-sample test for equality of means in high dimension. Journal of the American Statistical Association, advance access version. PDF File
    • Staicu, A.-M., Lahiri, S. and Carroll, R. J. (2015). Significance tests for functional data with complex dependence structure. Journal of Statistical Planning and Inference, 156, 1-13. PDF File
    • Lian, H., Liang, H. and Carroll, R. J. (2014). Variance function partially linear single-index models. Journal of the Royal Statistical Society, Series B, 77, 171-194. PDF File
    • Li, H., Staudenmayer, J. and Carroll, R. J. (2015). Hierarchical functional data with mixed continuous and binary measurements. Biometrics, 70, 802-811. PDF File

  • Selected Papers in 2014
    • Carroll, R. J. (2014). Estimating the distribution of dietary consumption patterns. Statistical Science, 29, 2-8. PDF File
    • Sarkar, A., Mallick, B. K. and Carroll, R. J. (2014). Bayesian semiparametric regression in the presence of conditionally heteroscedastic measurement and regression errors. Biometrics, 70, 823-834. PDF File
    • Martinez, J. G., Bohn, K. M., Carroll, R. J. and Morris, J. S. (2014). A study of Mexican Free-Tailed Bat chirp syllables: Bayesian functional mixed models for nonstationary acoustic time series. Journal of the American Statistical Association, 108, 514-526. PDF File
    • Gazioglu, S., Wei, J., Jennings, E. M. and Carroll, R. J. (2014). A note on penalized regression spline estimation in the secondary analysis of case-control data. Statistics in Biosciences, to appear.
    • Ward, R. and Carroll, R. J. (2014). Testing Hardy-Weinberg equilibrium with a simple root-mean-square statistic. Biostatistics, in press. Advanced access PDF File
    • Garcia, T. P., M\"uller, S., Carroll, R. J., and Walzem, R. L. (2014). Identification of important regressor groups, subgroups, and individuals via regularization methods: application to gut microbial data. {\it Bioinformatics}, to appear. doi: 10.1093/bioinformatics/btt608 Advanced access PDF File
    • Tekwe, C. D., Carter, R. L., Cullings, H. M. and Carroll, R. J. (2014). Multiple indicators, multiple causes measurement error models. Statistics in Medicine, to appear. Advanced access PDF File
    • Guenther, P. M., Kirkpatrick, S. L., Reedy, J., Krebs-Smith, S. M., Buckman, D. W., Dodd, K. W. Casavale, K. O. and Carroll, R. J. (2014). Healthy Eating Index-2010 is a valid and reliable measure of diet quality according to the 2010 Dietary Guidelines for Americans. Journal of Nutrition, to appear. Advanced access PDF File
    • Sarkar, A., Mallick, B. K., Staudenmayer, J., Pati, D. and Carroll, R. J. (2014). Bayesian semiparametric density deconvolution in the presence of conditionally heteroscedastic measurement errors. Journal of Computational and Graphical Statistics, 25, 1101-1125. PDF File
    • Qi, X., Luo, R., Carroll, R. J. and Zhao, H. (2014). Sparse regression by projection and sparse discriminant analysis. Journal of Computational and Graphical Statistics, to appear. Advanced access PDF File
    • Little, M. P., Kukush, A. G., Masiuk, S. V., Shklyar, S. V., Carroll, R. J., Lubin, J. H., Kwon, D., Brenner, A. V., Tronko, M. D., Mabuchi, K., Bogdanova, T. I., Hatch, M., Zablotska, L. B., Tereschenko,V. P., Ostroumova, E., Bouville, A. C., Drozdovitch, V., Chepurny, M. I., Kovgan, L. N., Simon, S. L., Shpak, V. M. and Likhtarev, I. A. (2014). Impact of uncertainties in exposure assessment on thyroid cancer risk among Ukrainian children and adolescents exposed from the Chornobyl accident. PLoS ONE, 9, e85723. PDF File
    • Qahtan, A., Wang, S., Carroll, R. J., and Zhang, X. (2014). A new study of two divergence metrics for change detection in data streams. {\it Proceedings of the 21st European Conference on Artificial Intelligence (ECAI 2014)}, to appear. Advanced access PDF File


  • Selected Papers in 2013
    • Serban, N., Staicu, A.-M. and Carroll, R. J. (2014). Multilevel cross-dependent binary longitudinal data. Biometrics, 69, 903-913. PDF File
    • Li, Y., Wang, N. and Carroll, R. J. (2013). Selecting the number of principal components in functional data. Journal of the American Statistical Association, 108, 1284-1291. PDF File
    • Carroll, R. J., Delaigle, A. and Hall, P. (2013). Unexpected properties of bandwidth choice when smoothing discrete data for constructing a functional data classifier. Annals of Statistics, 41, 2739-2767. PDF File
    • Xun, X., Cao, J., Mallick, B. K., Maity, A. and Carroll, R. J. (2013). Parameter estimation of partial differential equation models. Journal of the American Statistical Association, 108, 1009-1020.
      PDF File
    • Garcia, T. P., Mueuller, S., Carroll, R. J., Dunn, T. N., Thomas, A. P., Adams, S. H., Pillai, S. D. and Walzem, R. S. (2013). Structured variable selection with q-values. Biostatistics, 14, 695-707.
      PDF File
    • Sampson, J. N., Chatterjee, N., Carroll, R. J. and Mueller, S. (2013). Controlling the local false discovery rate in the Adaptive Lasso. Biostatistics, 14, 653-666.
      PDF File
    • Wei, J., Carroll, R. J., Mueller, U., Van Keilegom, I. and Chatterjee, N. (2013). Locally efficient estimation for homoscedastic regression in the secondary analysis of case-control data. Journal of the Rpyal Statistical Society, Series B, 75, 186-206.
      PDF File
    • Tooze, J. A., Troiano, R. P., Carroll, R. J., Moshfegh, A. L. and Freedman, L. S. (2013). A measurement error model for physical activity level as measured by a questionnaire with application to the NHANES 1999-2006 questionnaire. American Journal of Epidemiology, online version.
      PDF File
    • Chen, Y.-H., Chatterjee, N. and Carroll, R. J. (2013). Using shared genetic controls in studies of gene-environment interactions. Biometrika, 100, 319-338.
      PDF File
    • Jennings, E. M., Morris, J. S., Carroll, R. J., Ganiraju, M. C. and Baladandayuthapani V. (2013). Bayesian methods for expression-based integration of various types of genomics data. EURASIP Journal on Bioinformatics and Systems Biology, 2013.13, http://bsb.eurasipjournals.com/content/2013/1/13.
      PDF File


  • Selected Papers in 2012
    • Identifying genetic marker sets associated with phenotypes via an ffficient adaptive score test. Biostatistics, 13, 776-790 (with Tianxi Cai and Xihong Lin).
      PDF File
    • Multiple imputation in quantile regression. Biometrika, 99, 423-438 (with Y. Wei and Y. Ma).
      PDF File
    • Collier, B. A., Groce, J. E., Morrison, M. L., Newnam, J. C., Campomizzi, A.J., Farrell, S. J., Mathewson, H. A., Snelgrove, R. T., Carroll, R. J. and Wilkins, R. N. (2012). Predicting patch occupancy in fragmented landscapes at the rangewide scale for endangered species: an example of an American warbler. Diversity and Distributions, 18, 158-167.
      PDF File
    • Park, J.-H., Gail, M. H., Weinberg, C., Carroll, R. J., Chung, C., Wang, Z., Chanock, S., Fraumeni, J. F and Chatterjee, N. (2012). Distribution of allele frequencies, effect-sizes and their interrelationships for common susceptibility variants. Proceedings of the National Academy of Sciences, 108, 18026-18031.
      PDF File
    • Ma, S., Yang, L. and Carroll, R. J. (2012). A simultaneous confidence band for sparse longitudinal regression. Statistica Sinica, 22, 95-122.
      PDF File
    • Yi, G. Y. Y., Ma, Y and Carroll, R. J. (2012). A robust, functional generalized method of moments approach for longitudinal studies with missing responses and covariate measurement error. Biometrika, 99, 151-165.
      PDF File
    • Bliznyuk, N., Carroll, R. J., Genton, M. and Wang, Y. (2012). Variogram Estimation in the presence of trend. Statistics and its Interface, 5, 159-168..
      PDF File
    • Carroll, R. J., Midthune, D., Subar, A. F., Shumakovich, M., Freedman, L. S., Thompson, F. E. and Kipnis, V. (2012). Taking advantage of the strengths of two different dietary assessment instruments to improve intake estimates for nutritional epidemiology. American Journal of Epidemiology, 175, 340-347.
      PDF File of advanced access version
    • Kipnis, V., Midthune, D., Freedman, L. S. and Carroll, R. J. (2012). Regression calibration with more instruments than mismeasured variables. Statistics in Medicine, 31, 2713-2732.
      PDF File
    • Carroll, R. J., Delaigle, A. and Hall, P. (2012). Deconvolution when classifying noisy data involving transformations. Journal of the American Statistical Association, 106, 1166-1177.
      PDF File
    • Tekwe, C. D., Dabney, A. R. and Carroll, R. J. (2012). Application of survival analysis methodology to the quantitative analysis of LC-MS proteomics data. Bioinformatics, 28, 1998-2003
      PDF File of advanced access version



  • Selected Papers in 2011
    • Estimation and variable selection for generalized additive partial linear models. Annals of Statistics, 39, 1827-1851 (with L. Wang, X. Liu and Hua Liang).
      PDF File
    • Density estimation in several populations with uncertain population membership. Journal of the American Statistical Association, 106, 1180-1192 (with Yanyuan Ma and Jeffrey D. Hart).
      PDF File
    • A Bayesian approach to detection of small low emission sources. Inverse Problems, electronic version (with Xiaolei Xun, Bani Mallick and Peter Kuchment).
      PDF File
    • Semiparametric Bayesian analysis of gene-environment interactions with error in measurement of environmental covariates and missing genetic data. Statistics and its Interface, 4, 305-315 (with Iryna Lobach and Bani Mallick)
      PDF File
    • Testing and estimating shape-constrained nonparametric density and regression in the presence of measurement error. JASA, 106, 191-202 (with A. Delaigle and P. Hall).
      PDF File
    • Testing for constant nonparametric effects in general semiparametric regression models with interactions. Statistics and Probability Letters, 81, 717-723 (with J. Wei and A. Maity).
      PDF File
    • Local and omnibus tests in classical measurement error models. Journal of the Royal Statistical Society, Series B, 73, 81-98 (with Y. Ma, JR. Janicki and J. D. Hart).
      PDF File
    • P-splines using derivative information. Multiscale Modeling and Simulation, 8, 1562-1580 (with C. P. Calderon, J. G. Martinez and D. C. Sorensen).
      PDF File
    • A mixed-effects model approach for estimating the distribution of usual intake of nutrients: the NCI method. Statistics in Medicine, 29, 2857-2868 (with J. A. Tooze, V. Kipnis, D. W. Buckman, L. S. Freedman, P. M. Guenther, S. M. Krebs-Smith, A. F. Subar and K. W. Dodd).
      PDF File
    • A new multivariate measurement error model with zero-inflated dietary data, and its application to dietary assessment. Annals of Applied Statistics, 5, 1456-1487 (with S. Zhang, J. A. Tooze, V. Kipnis, D. W. Buckman, L. S. Freedman, P. M. Guenther, S. M. Krebs-Smith, A. F. Subar and K. W. Dodd).
      PDF File
    • A simultaneous confidence band for sparse longitudinal regression. Statistica Sinica, 22, 95-122 (with S. Ma and L. Yang).
      PDF File
    • How to estimate the measurement error variance associated with ancestry proportion estimates. Statistics and its Interface, 4, 327-337 (with J. Divers, D. T. Redden and D. B. Allison).
      PDF File
    • Generalized additive partial linear models--- polynomial spline smoothing estimation and variable selection procedures. Annals of Statistics, 39, 1827-1851 (with L. Wang, X. Liu and H. Liang).
      PDF File
    • Fitting a bivariate measurement error model for episodically consumed dietary components. International Journal of Biostatistics, Volume 7, Issue 1, Article 1, DOI: 10.2202/1557-4679.1267 (with S. Zhang, J. A. Tooze, V. Kipnis, D. W. Buckman, L. S. Freedman, P. M. Guenther, S. M. Krebs-Smith, A. F. Subar and K. W. Dodd).
      PDF File
    • Mixtures of classical and Berkson uncertainties in the analysis of the Chernobyl accident. International Journal of Biostatistics, Volume 7 : Issue 1, Article 15.DOI: 10.2202/1557-4679.1281 (with Kukush, A., Shklyar, S., Masiuk, S., Likhtarov, I., Kovgan, L. and Bouville, A.).
      PDF File
    • Statistical methods for comparative phenomics using high-throughput phenotype microarrays. {\it International Journal of Biostatistics}, 6, Issue 1, Article 29, DOI: 10.2202/1557-4679.1227 (with Sturino, J. M., Zorych, I., Mallick, B., Chang, Y.-Y. and Bliznuyk, N.).
      PDF File



  • Selected Papers in 2010
    • Genotype-based association mapping of complex diseases: gene-environment interactions with multiple genetic markers and measurement error in environmental exposures. Genetic Epidemiology, 34, 792-802 (with I. Lobach and R. Fan).
      PDF File
    • Generalized empirical likelihood methods for analyzing longitudinal data. Biometrika, 97, 79-93 (with S. Wang and L. Qian).
      PDF File
    • A nonparametric approach to detect nonlinear correlation in gene expression. Journal of Computational and Graphical Statistics, 19, 552-568 (with Y. A. Chen, J. Almeida, A. Richards, P. Mueller and B. Rohrer).
      PDF File
    • Semiparametric estimation of fixed effects panel data varying coefficient models. Nonparametric Econometric Methods, Advances in Econometrics, 24, (with Y. Sun and D. Li).
      PDF File
    • Use of multiple singular value decompositions to analyze complex intracellular calcium ion signals. Annals of Applied Statistics, 3, 1467-1492 (with J. G. Martinez, J. Z. Huang, R. C. Burghardt and R. Barhoumi).
      PDF File
    • Semiparametric Bayesian analysis of nutritional epidemiology data in the presence of measurement error. Biometrics, 66, 444-454 (with S. Sinha, B. K. Mallick and V. Kipnis).
      PDF File
    • Functional principal component selection via Stochastic Approximation Monte Carlo (SAMC). Canadian Journal of Statistics, 38, 256-270 (with J. G. Martinez and F. Liang).
      PDF File
    • Reduced rank mixed effects models for spatially correlated hierarchical functional data. Journal of the American Statistical Association, 105, 390-400 (with L. Zhou, J. Z. Huang, J. G. Martinez, A. Maity and V. Baladandayuthapani).
      PDF File
    • Fast methods for spatially correlated multilevel functional data. Biostatistics, 11, 177-194 (with A.-M. .Staicu and C. M. Crainiceanu).
      PDF File
      Supplement
    • Statistics and bioinformatics in nutritional sciences: analysis of complex data in the era of systems biology. Journal of Nutritional Biochemistry, 21, 561-572 (with W. Fu, A. J. Stromberg, K. Viele and G. Wu).
      PDF File
    • Bayesian modeling of MPSS data: gene expression analysis of bovine salmonella infection. Journal of the American Statistical Association, 105, 956-967 (with S. Dhavala and B. K. Mallick).
      PDF File
    • Generalized functional latent feature models with single-index interactions. Journal of the American Statistical Association, 105, 621-633 (with Y. Li and N. Wang).
      PDF File
    • Marginal longitudinal semiparametric regression via penalized splines. Statistics and Probability Letters, 80, 1242-1252 (with M. Al-Kadiri and M. P. Wand).
      PDF File
    • A note on the effect on power of score tests via dimension reduction by penalized regression under the null. International Journal of Biostatistics, Vol. 6 : Issue 1, Article 12. DOI: 10.2202/1557-4679.1231 (with J. G. Martinez, S. Mueller, J. N. Sampson and N. Chatterjee)
      PDF File


    Selected Papers in 2009
    • Analysis of case-control association studies: SNPs, Imputation and Haplotypes. Statistical Science, 24, 489-502 (with N. Chatterjee, Y.-H. Chen and S. Luo).
      PDF File
    • Modeling data with excess zeros and measurement error: application to evaluating relationships between episodically consumed foods and health outcomes. Biometrics, 65, 1003-1010 (with V. Kipnis, D. Midthune, L. S. Freedman and K. Dodd).
      PDF File
    • Quantile regression with measurement error. Journal of the American Statistical Association, 1129-1143 (with Ying Wei).
      PDF File
    • Semiparametric regression during 2003-2007. Electronic Journal of Statistics, 3, 1192-1256 (with D. Ruppert and M. P. Wand).
      PDF File
    • Efficient semiparametric marginal estimation for the partially linear additive model for longitudinal/clustered data. Statistics in Bioscience, 1, 10-31 (with A. Maity, E. Mammen and K. Yu).
      PDF File
    • Covariate adjusted correlation analysis with application to FMR1 permutation female carrier data. Biometrics, 65, 781-792 (with D. Senturk and D. Nguyen).
      PDF File
    • Nonparametric additive regression for repeatedly measured data. Biometrika, 96, 383-398 (with A. Maity, E. Mammen and K. Yu).
      PDF File
    • SIMEX and variance estimation in semiparametric measurement error models. Electronic Journal of Statistics, 3, 318-348 (with T. Apanasovich and A. Maity).
      PDF File
    • Testing in semiparametric models with interaction, with applications to gene-environment interactions. Journal of the Royal Statistical Society, Series B, 71, 75-96 (with A. Maity, N. Chatterjee and E. Mammen).
      PDF File
    • Shrinkage estimators for robust and efficient inference in haplotype-based case-control studies. Journal of the American Statistical Association, 104, 220-233 (with Y.-H. Chen and N. Chatterjee).
      PDF File
    • A design-adaptive local polynomial estimator for the errors-in-variables problem. Journal of the American Statistical Association, 104, 348-359 (with Aurore Delaigle and Jianqing Fan).
      PDF File
    • Variance estimation in the analysis of microarray data. Journal of the Royal Statistical Society, Series B, 71, 425-445 (with Yuedong Wang and Yanyuan Ma).
      PDF File
    • Nonparametric prediction in measurement error models. Journal of the American Statistical Association, 104, 993-1014 (with A. Delaigle and P. Hall).
      PDF File
    • Covariate-adjusted linear mixed effects model with an application to longitudinal data. Journal of Nonparametric Statistics, 20, 459-481 (with D. Senturk and D. Nguyen).
      PDF File
    • A Bayesian multi-level model for estimating the diet/disease relationship in a multicenter study with exposure measured with error: The EPIC study. Statistics in Medicine, 27, 6037-6054 (with P. Ferrari, P. Gustafson and E. Riboli).
      PDF File
    • Identification and estimation of nonlinear models using two samples with nonclassical measurement errors (with Xiaohong Chen and Yingyao Hu).
      PDF File of actual paper
      PDF File of the long version of the paper


  • Selected Papers in 2008
    • Semiparametric analysis of heterogeneous data using varying scale generalized linear models. Journal of the American Statistical Association, 103, 650-660 (with Minge Xie and Douglas Simpson).
      PDF File
    • Nonparametric estimation and testing of fixed effects panel data models. Journal of Econometrics, 144, 257-276 (with D. Henderson and Q. Li).
      PDF File
    • Nonparametric variance estimation in the analysis of microarray data: a measurement error approach. Biometrika, 95, 437-449 (with Yuedong Wang).
      PDF File
    • A comparison of regression calibration, moment reconstruction and imputation for adjusting for covariate measurement error in regression. Statistics in Medicine, 27, 5195-5216 (with L. S. Freedman, D. Midthune and V. Kipnis).
      PDF File
    • Combining assays for estimating prevalence of human herpesvirus 8 infection using multivariate mixture models. Biostatistics, 9, 137-151 (with R. Pfeiffer).
      PDF File
    • Semiparametric longitudinal-spatial binary regression, with application to colon carcinogenesis. Biometrics, 94, 490-500 (with T. Apanasovich, D. Ruppert, J. Lupton, N. Popovic, N. Turner and R. Chapkin).
      PDF File
    • Haplotype-based regression analysis and inference of case-control studies with unphased genotypes and measurement errors in environmental exposures. Biometrics, 64, 673-684 (with I. Lobach, N. Chatterjee, C. Spinka and M. H. Gail).
      PDF File
      Web Appendix
    • Binary regression in truncated samples, with application to comparing dietary instruments in a large prospective study. Biometrics, 64, 289-298 (with Doug Midthune, Victor Kipnis and Laurence Freedman).
      PDF File
    • Joint modeling of paired sparse functional data using principal components. Biometrika, 95. 601-619 (with Lan Zhou and Jianhua Huang).
      PDF File
    • Bayesian hierarchical spatially correlated functional data analysis with application to colon carcinogenesis. Biometrics, 64, 64-73 (with Veerabhadran Baladandayuthapani, Bani Mallick, Nancy Turner, MeeYoung Hong, Robert Chapkin and Joanne Lupton).
      PDF File


  • Selected Papers in 2007
    • Nonparametric regression function estimation from data contaminated by a mixture of classical and Berkson measurement errors, Journal of the Royal Statistical Society, Series B, 69, 859-878 (with Aurore Delaigle and Peter Hall).
      PDF File
    • Spatially adaptive Bayesian P-splines with heteroscedastic errors. Journal of Computational and Graphical Statistics, 16, 265-288 (with Ciprian Crainiceanu, David Ruppert, Adarsh Joshi and Billy Goodwin).
      PDF File
    • Retrospective analysis of haplotype-based case-control studies under a flexible model for gene-environment association, Biostatistics, 9, 81-99 (with Nilanjan Chatterjee and Yi-hau Chen).
      PDF File
    • Estimation of population-level summaries in general semiparametric repeated measures regression models. IMS Monograph Series Festschrift for P. K. Sen (with Arnab Maity and Tatiyana Apanasovich).
      PDF File
    • Shared uncertainty in measurement error problems, With application to Nevada Test Site fallout data. Biometrics, 63, 1226-1236 (with Yehua Li, Owen Hoffman and Annamaria Guolo).
      PDF File
    • An asymptotic theory for model selection inference in general semiparametric problems. Biometrika, published online May 15, 2007 (with Gerda Claeskens).
      Please note that the second formula on page 10 has a typesetter error: the left square bracket should be after the convergence symbol, not well before it.
      PDF File
    • Nonparametric estimation of correlation functions in longitudinal and spatial data, with application to colon carcinogenesis experiments. Annals of Statistics, 35, (with Yehua Li, Naisyin Wang, Nancy Turner, Mee Young Hong, Robb Chapkin and Joanne Lupton).
      PDF File
    • The Hanford Thyroid Disease Study: an alternative view of the findings. Health Physics, 92, 99-111 (with F. Owen Hoffman, J. Ruttenber and Sander Greenland).
      PDF File
    • Partially linear models with missing response variables and error-prone covariates.Biometrika, 94, 185-198 (with Hua Liang and Suojin Wang).
      PDF File of published paper
      PDF File with proofs and technical arguments
    • Stochastic approximation in Monte-Carlo Computation. Journal of the American Statistical Association, 102, 305-320 (with Faming Liang and Chuanhai Liu).
      PDF File
    • Efficient estimation of population-level summaries in general semiparametric regression models. Journal of the American Statistical Association, 102, 123-139 (with Arnab Maity and Yanyuan Ma).
      PDF File
    • Backfitting versus profiling in general criterion functions. Statistica Sinica, 17, 797-816 (with Ingrid Van Keilegom).
      PDF File
  • Selected Papers in 2006
    • Bayesian error analysis model (BEAM) for reconstructing transcriptional regulatory networks. PNAS, 103, 7988-7993 (With Ning Sun and Hongyu Zhao).
      PDF File
    • Radiation exposure and thyroid cancer: Letter to the editor. JAMA (with Owen Hoffman, James Ruttenber and Sander Greenland), 296, 513.
      PDF File
    • Semiparametric estimation in general repeated measures problems. Journal of the Royal Statistical Society, Series B, 68, 69-88 (with Xihong Lin).
      PDF File of published paper
      PDF File with proofs and technical arguments (of major interest in their own right.)
    • Wavelet-based functional mixed models. Journal of the Royal Statistical Society, Series B, 68, 179-199 (with Jeffrey Morris).
      PDF File Web site with executable code and examples of spiky data
    • Seemingly unrelated measurement error models, with application to nutritional epidemiology. Biometrics, 62, 75-84 (with Doug Midthune, Larry Freedman and Victor Kipnis).
      PDF File
    • Locally efficient estimators for semiparametric models with measurement error. Journal of the American Statistical Association, 101, 1465-1474 (with Yanyuan Ma).
      PDF File
    • Thyroid disease associated with exposure to the Nevada Test Site radiation: a reevaluation based on corrected dosimetry and examination data. Epidemiology, 17, 604-614 (with J. Lyon, F. O. Hoffman, and others)
      PDF File
    • Discussion of Conditional Growth Charts by Ying Wei and Xuming He. Annals of Statistics (with David Ruppert)
      PDF File
  • Selected Papers in 2005
    • Semiparametric maximum likelihood estimation exploiting gene-environment independence in case-control studies. Biometrika, 92, 399-418 (with Nilanjan Chatterjee).
      PDF File
    • Analysis of case-control studies of genetic and environmental factors with missing genetic information and haplotype-phase ambiguity. Genetric Epidemiology, 29, 108-127 (with Christine Spinka and Nilanjan Chatterjee).
      PDF File
    • Efficient semiparametric marginal estimation for longitudinal/clustered data Journal of the American Statistical Association, (with Naisyin Wang and Xihong Lin), 100, 147-157.
      PDF File
    • Estimating misclassification error with small samples via bootstrap cross-validation, Bioinformatics, 21, 1979-1986 (with Wenjiang Fu and Suojin Wang).
      PDF File
    • Exploiting gene-environment independence in family-based case-control studies: increased power for detecting associations, interactions and joint effects. Genetic Epidemiology, 28, 138-156 (with Nilanjan Chatterjee and Zeynep Kalaylioglu)
      PDF File
    • Spatially adaptive Bayesian penalized regression splines (P-splines). Journal of Computational and Graphical Statistics, 14, 378-394 (with Veera Baladandayuthapani and Bani Mallick).
      PDF File
    • How many samples are needed to build a classifier: a general sequential approach. Bioinformatics, 21, 63-70 (with Wenjiang Fu, Ed Dougherty and Bani Mallick).
      PDF File
    • Semiparametric Bayesian analysis of matched case-control studies with missing exposure. Journal of the American Statistical Association, 100, 591-601 (with Samiran Sinha, Bhramar Mukherjee, Malay Ghosh and Bani K. Mallick). PDF File
    • On Estimation in binary autologistic spatial models. Journal of Statistical Computation and Simulation (with Mike Sherman and Tanya Apanasovich).
      PDF File
  • Selected Papers in 2004
    • Profile-kernel versus backfitting in partially linear models for longitudinal/clustered data. Biometrika, 91, 252-261 (with Zhonghui Hu and Naisyin Wang).
      PDF File
    • Simple fitting of subject-specific curves for longitudinal data. Statistics in Medicine (with Maria Durban, Jarek Harelezk and Matt Wand).
      PDF File
    • Noise factor analysis for cDNA microarrays. Journal of Biomedical Optics, 9, 663-678 (with Yoganand Balagurunathana, Naisyin Wang, Edward R. Dougherty, Danh V. Nguyen, Yidong Chen, Michael L. Bittner and Jeffrey. M. Trent).
      PDF File
    • Estimation in Partially Linear Models with Missing Covariates. Journal of the American Statistical Association, 99, 357-367 (with Hua Liang, Suojin Wang and Jamie Robins).
      PDF File
    • Instrumental variables and nonparametric regression. Journal of the American Statistical Association, 99, 736-750 (with David Ruppert, Tor Tosteson, Ciprian Crainiceanu and Margaret Karagas).
      PDF File
    • Equivalent kernels of smoothing splines in nonparametric regression for clustered/longitudinal data. Biometrika, 91, 177-193 (with Xihong Lin, Naisyin Wang and Alan Welsh)
      PDF File
    • Is crossvalidation better than resubstitution for ranking genes? Bioinformatics.
      PDF File
    • Histospline method in nonparametric regression models with applications. Statistica Sinica, 14, 633-658 (with Peter Hall, Xihong Lin and Tanya Apanasovich).
      PDF File
    • A new method for dealing with measurement error in explanatory variables of regression models. Biometrics, 60, 172-181 (with Larry Freedman, Victor Kipnis, Vitali Fainberg and Douglas Midthune).
      PDF File
    • Missing value estimation for cancer microarray gene expression data. Journal of Data Science, 2, 347-370 (with Danh Nguyen and Naisyin Wang).
      PDF File
    • Low--Order Approximations in Deconvolution and Regression With Errors in Variables. Journal of the Royal Statistical Society, Series B (with Peter Hall), 66, 31-46.
      PDF File
      Regression simulation results not included in the paper: PDF File - PostScript
  • Selected Papers in 2003
    • A comparison of a food frequency questionnaire with a 24-hour recall for use in an epidemiological cohort study: results from the biomarker-based OPEN study. International Journal of Epidemiology.
      PDF File
    • The structure of dietary measurement error: results from the OPEN biomarker study. American Journal of Epidemiology.
      PDF File
    • Variance are Not Always Nuisance Parameters: The 2002 R. A. Fisher Lecture. Biometrics
      PDF File
    • Semiparametric regression splines in matched case-control studies Biometrics (With Inyoung Kim and Noah Cohen).
      PDF File
    • Testing for spatial correlation in nonstationary binary data with application to Aberrant Crypt Foci in colon carcinogenesis. Biometrics, (with T. Apanasovich, S. Sheather, N. Popovic, N. Turner, R. Chapkin and J. Lupton).
      PDF File
    • Color images of aberrant crypt foci in colon carcinogenesis.
      PDF File - PostScript
    • Wavelet-based nonparametric modeling of hierarchical functions in colon carcinogenesis Journal of the American Statistical Association, Editor Invited Paper for 2003 (with Jeffrey Morris, Philip Brown and Marina Vannucci).
      PDF File
    • More efficient kernel estimation in nonparametric regression with correlated errors. (With Zhijie Xiao, Oliver Linton and Enno Mammen), Journal of the American Statistical Association.
      PDF File
    • Accounting for correlation in marginal longitudinal nonparametric regression. Seattle Biostatistics Symposium (with Oliver Linton, Enno Mammen and Xihong Lin).
      PDF File - PostScript
    • Semiparametric inference in matched case--control studies with missing covariate data. Biometrika (with Paul Rathouz and Glen Satten).
      PDF File - PostScript
    • The relationship between virologic and immunologic responses in AIDS clinical research using mixed--effects varying--coefficient semiparametric models with measurement error. Biostatistics (with Hua Liang and Hulin Wu).
      PDF File
  • Selected Papers in 2002
    • DNA microarray experiments: biological and technological aspects Biometrics (with Danh Nguyen, A. Bulak Arpat and Naisyin Wang).
      PDF File
      Tutorial Figures: PDF File - Color Figures: PDF File
    • Marginal Longitudinal Nonparametric Regression: Locality and Efficiency of Spline and Kernel Methods. Journal of the American Statistical Association (with Alan Welsh and Xihong Lin).
      PDF File - PostScript
    • Morris, J. S., Wang, N., Lupton, J. R., Chapkin, R. S., Turner, N. D. Hong, M. Y. and Carroll, R. J. (2002). A Bayesian analysis of colonic crypt structure and coordinated response incorporating missing crypts. Biostatistics.
      PDF File - PostScript
    • Semiparametric regression modeling with mixtures of Berkson and classical error, with application to fallout from the Nevada test site. Biometrics (with Bani Mallick, F. Owen Hoffman).
      PDF File
    • Bayesian smoothing and regression splines for measurement error problems. Journal of the American Statistical Association (with Scott Berry and David Ruppert).
      PDF File - PostScript
    • Semiparametric regression for clustered data using generalized estimating equations. Journal of the American Statistical Association (with Xihong Lin).
      PDF File - PostScript
    • Estimation in an additive model when components are linked parametrically. Econometric Theory (with Wolfgang Haerdle and Enno Mammen).
      PDF File
    • Berry, S. A., Carroll, R. J. & Ruppert, D. (2002). Bayesian smoothing for measurement error problems. Total Least Squares and Errors--in--Variables Modeling: Analysis, Algorithms and Applications, editors S. van Huffel and P. Lemmerling}. Kluwer Academic Publishers. (With S. Berry and D. Ruppert).
      PDF File - PostScript
    • Covariate measurement error adjustment for matched case-control studies. Biometrics, 57, 62-73 (with Lisa McShane, Doug Midthune, J. Dorgan and Larry Freedman).
      PDF File
  • Selected Papers in 2001
    • Parametric and nonparametric methods for understanding the relationship between carcinogen--induced DNA adduct levels in distal and proximal regions of the colon. Journal of the American Statistical Association (with Jeffrey Morris, Naisyin Wang, Joanne Lupton, Nancy Turner, Mee-Young Hong and Robert Chapkin).
      PDF File - PostScript
    • A note on the efficiency of sandwich covariance estimation. Journal of the American Statistical Association (with Goeran Kauermann).
      PDF File
    • Thyroid Cancer Following Scalp Irradiation: A Reanalysis Accounting for Uncertainty in Dosimetry. Biometrics (with Dan Schafer, Elaine Ron, Jay Lubin and Marilyn Stovall).
      PDF File
    • Semiparametric regression for clustered data with a nonparametric cluster--level component. Biometrika (with Xihong Lin).
      PDF File - PostScript
    • Parameterization and inference for nonparametric regression problems. Journal of the Royal Statistical Society Series B (with Wenxin Jiang, Victor Kipnis and Doug Midthune).
      PDF File
    • Review times in Statistics: tilting at windmills? (Biometrics)
      PDF File
    • Bootstrap confidence intervals for local likelihood, local estimating equations and varying coefficient models. Statistica Sinica (with C. Galindo, G. Kauermann and H. Liang)
      PDF File - PostScript
    • Combining datasets to predict the effects of regulation of environmental lead exposure in housing stock. Biometrics (with W. Strauss, S. Bortnick, J. Menkedick and B Schulz)
      PDF File
    • Empirical evidence of correlated biases in dietary instruments and its implication. American Journal of Epidemiology (with V. Kipnis, L. Freedman and D. Midthune).
      PDF File
    • Discussion of the Journal of the American Statistical Association paper by Lin and Ying (with Xihong Lin)
      PDF File
    • Semiparametric Regression for Clustered Data Using Generalized Estimating Equations. Journal of the American Statistical Association, 96, 1045-1054. (with Xihong Lin)
      PDF File
  • Selected Papers in 2000
      Nonparametric Function Estimation for Clustered Data When the Predictor is Measured Without/With Error. Journal of the American Statistical Association (with Xihong Lin)
      PDF File
    • Nonparametric Function Estimation of the Relationship Between Two Repeatedly Measured Variables. Statistica Sinica (with Andreas Ruckstuhl and Alan Welsh)
      PDF File
    • Random effects in interval-censored ordinal regression: latent structure and Bayesian approach. Biometrics 56, 376-383 (with Minge Xie and Douglas Simpson).
      PDF File
    • On Meta-Analytic Assessment of Surrogate Outcomes. Biostatistics (with Mitchell H. Gail, Ruth Pfeiffer and Hans C. van Houwelingen)
      PDF File - PostScript
    • Score tests for familial correlation in genotyped proband designs. Genetic Epidemiology (with M. H. Gail, D. Pee and J. Benichou)
      PDF File
    • Spatially adaptive penalties for spline fitting. Australian Journal of Statistics (with David Ruppert)
      PDF File - Correcttion
    • Conditional and Unconditional Categorical Regression Models With Missing Covariates. Biometrics (with Glen Satten)
      PDF File
    • Efficient Regression Calibration for Logistic Regression in Main Study/Internal Validation Study Designs with an Imperfect Reference Instrument Statistics in Medicine (with Donna Spiegelman and Victor Kipnis)
      PDF File - PostScript
    • Phase II clinical trial design for noncytotoxic anticancer agents for which time to disease progression is the primary endpoint Controlled Clinical Trials (with Rosie Mick and John Crowley)
      PDF File - PostScript
  • Selected Papers in 1999
    • Large sample theory in a semiparametric partially linear errors in variables model. Annals of Statistics (with Hua Liang and Wolfgang Haerdle)
      PDF File
    • SIMEX variance component tests in generalized linear mixed measurement error models. Biometrics (with Xihong Lin)
      PDF File
    • The efficiency of bias--corrected estimators for nonparametric kernel estimation based local estimating equations.
      PDF File
    • Polynomial regression and estimating functions in the presence of multiplicative measurement error. Journal of the Royal Statistical Society, Series B (with Steve Iturria and David Firth) PDF File
    • Nonparametric regression in the presence of measurement error Biometrika (with Jeff Maca and David Ruppert)
      PDF File
    • Flexible parametric measurement error models. Biometrics (with Kathryn Roeder and Larry Wasserman)
      PDF File
    • High-order asymptotics for retrospective sampling problems. Biometrika (with Suojin Wang)
      PDF File
  • Old Favorite Papers
    • Deconvolution kernel density estimators. Statistics, 1990 (with Len Stefanski)
      PDF File
    • Generalized partially linear single-index models. Journal of the American Statistical Association, 1997, pp 477-- (with J. Fan, M. Wand and I. Gijbels)
      PDF File
    • Asymptotics for prospective analysis of stratified logistic case-control studies. (Journal of the American Statistical Association, 1995, with C. Y. Wang and Suojin Wang)
      PDF file
    • A nonparametric mixture approach to case-control studies with errors in covariables. Journal of the American Statistical Association, 1996 (with Kathryn Roeder and Bruce Lindsay)
      PDF
    • The use and misuse of orthogonal regression estimation in linear errors-in-variables models.
      PDF
    • Local Estimating Equations (Journal of the American Statistical Association, 1998, with David Ruppert and Alan Welsh)
      PDF File
  • Papers 1998 or previously
    • On robustness in the logistic regression model.
      PDF File
    • Robust linear regression in replicated measurement error models.
      PDF File
    • Robust estimation in case-control studies with errors in predictors.
      PDF File
    • Estimation of lag in misregistration problems for averaged signals.
      PDF File
    • Meta-analysis, measurement error and corrections for attenuation.
      PDF File
    • A semiparametric correction for attenuation.
      PostScript
    • Adjusting for time trends when estimating the relationship between dietary intake obtained from a food frequency questionnaire and true average intake.
      PostScript
    • Transformation to additivity in measurement error models. Biometrics, 1997, 53, 262-272 (with Stephen Eckert and Naisyin Wang)
      PDF file
    • Binary regressors in dimension reduction models: a new look at treatment comparisons.
      PostScript
    • Dimension reduction in semiparametric measurement error models.
      PostScript
    • Estimating the reliability of an exposure variable in the presence of confounders.
    • Analysis of tomato root initiation using a mixture normal distribution.
      PostScript
    • Quasilikelihood estimation in measurement error models with correlated replicates.
      PostScript
    • Asymptotics for the SIMEX estimator in structural measurement error models. (Journal of the American Statistical Association, 1996, with Fred Lombard, Helmut Kuechenhoff and Len Stefanski)
      PDF File
    • Estimation in choice-based sampling with measurement error and bootstrap analysis. Journal of Econometrics, 1997, with C.Y. Wang and S. Wang
      PDF file
    • The use of semiquantitative food frequency questionnaires to estimate the distribution of usual intake.
      PostScript
    • Review of Measurement, Regression and Calibration.
      PostScript
    • Interval censoring and marginal analysis in ordinal regression.
      PostScript
    • Segmented regression with errors in predictors.
      PostScript
    • Application of robust methods in combining toxicology data from diverse studies.
      PostScript
    • Analyzing bivariate continuous data that have been grouped into categories defined by sample quantiles of the marginal distribution.
      PDF File
    • Design aspects of calibration studies in nutrition, with analysis of missing data in linear measurement error models.
      PDF File
    • Measurement Error in Epidemiologic Studies
      PDF File
    • Nonparametric Function Estimation of the Relationship Between Two Repeatedly Measured Variables
      PDF File
    • A New Class of Measurement Error Models, With Applications to Dietary Data
      PDF File
    • The Efficiency of Bias-Corrected Estimators for Nonparametric Kernel Estimation Based on Local Estimating Equations
      PDF File
    • Large sample theory in a semiparametric partially linear errors in variables model.
      PDF File
    • Bias analysis and SIMEX approach in generalized linear mixed measurement error models.
      PDF File
    • Measurement error, biases, and the validation of complex models for blood lead levels in children.
      PDF File
    • Hunt, J. H., Carroll, R. J., Chinchilli, V. and Frankenberg, D. (1980). Relationship between environmental factors and brown shrimp production in Pamlico Sound, North Carolina. Report to the Division of Marine Fisheries, North Carolina Department of Natural Resources.
      PDF File