

Papers

Selected Papers in 2017-2018

Ma, S., Ma, Y., Wang, Y. and Carroll, R. J. (2017). A semiparametric single-index risk score across populations. JASA, to appear.

Zhang, X., Wang, H., Ma, Y. and Carroll, R. J. (2017). Linear model selection when covariates contain errors. JASA, to appear.

Bertrand, A., Legrand, C., Carroll, R. J., de Meester, C. and Van Keilegom, I. (2017). Inference in a survival cure model with mismeasured covariates using a SIMEX approach. Biometrika, to appear.

Sarkar, A., Pati, D., Chakraborty, A., Mallick, B. K. and Carroll, R. J. (2017). Bayesian semiparametric multivariate density deconvolution. JASA, to appear.

Cao, J., Zhang, X. and Carroll, R. J. (2017). Estimating varying coefficients for partial differential equation models. Biometrics, to appear.

Sampson, J. N., Matthews, C. E., Freedman, L. S., Carroll, R. J. and Kipnis, V. (2017). Methods to assess
measurement error in questionnaires of sedentary behavior. Journal of Applied Statistics, 43, 1706-1721.
Online version pdf

Keadle, S., Sampson, J., Li, H., Lyden, K., Matthews, C. E. and Carroll, R. J. (2017). An evaluation of accelerometer-derived metrics to assess daily behavioral patterns. Medicine & Science in Sports & Exercise, 49, 54-63. PMC5176102
pdf

Cook, S. J., Blas, B., Carroll, R. J. and Sinha, S. (2017). Two wrongs make a right: addressing underreporting in binary data from multiple sources. Political Analysis, to appear. The journal has an impact factor of 6.098, and it has the largest impact factor of all political science journals.

Pfeiffer, R. M., Redd, A. and Carroll, R. J. (2017). On the impact of model selection on predictor identification and parameter inference. Computational Statistics, to appear.

Selected Papers in 2016

Chatterjee, N., Chen, Y.-H., Maas, P. and Carroll, R. J. (2016). Constrained maximum likelihood estimation for
model calibration using summary-level information from external big data sources. JASA, 111, 107-117.
pdf

Ma, Y. and Carroll, R. J. (2016). Semiparametric estimation in the secondary analysis of case-control studies. Journal of the Royal Statistical Society, Series B, 78, 127-151.
pdf

Delaigle, A. and Carroll, R. J. (2016). Obituary of Peter Gavin Hall, 1951-2016. IMS Bulletin, 45, 45.
pdf

de la Cruz, R., Meza, C., Arribas-Gil, A. and Carroll, R. J. (2016). Bayesian regression analysis of data with
random effects covariates from nonlinear longitudinal measurements. Journal of Multivariate Analysis, 143, 94-106.
pdf

Potgieter, C. J., Wei, R., Kipnis, V., Freedman, L. S. and Carroll, R. J. (2016). Moment reconstruction and moment-adjusted imputation when exposure is generated by a complex, nonlinear random effects modeling process. Biometrics, to appear. NIHMSID 794866
pdf

Gail, M. H., Wu, Jincao, Wang, M., Yaun, S.-S., Cook, N. R., Eliassen, A. H., McCullough, M. L., Yu, K., Zeleniuch-Jacquotte, A., Smith-Warner, S., Ziegler, R. G. and Carroll, R. J. (2016). Calibration and seasonal adjustment for matched case-control studies of Vitamin D and cancer. Statistics in Medicine, 35, 2133-2148.
pdf

Huque, M. H., Bondell, H., Carroll, R. J. and Ryan, L. (2016). Spatial regression with covariate measurement error:
a semiparametric approach. Biometrics, 72, 678–686.
pdf

Bhadra, A. and Carroll, R. J. (2016). Exact sampling of the unobserved covariates in Bayesian spline models for measurement error problems. Statistics and Computing, to appear.
Online version pdf

Alexeff, S. E., Carroll, R. J. and Coull, B. (2016). Spatial measurement error in linear regression models when using predicted
air pollution exposures and correction by spatial SIMEX. Biostatistics, 17, 377-389.
pdf

Kipnis, V., Freedman, L. S., Carroll, R. J. and Midthune, D. (2016). A bivariate measurement error model for semicontinuous and continuous variables: application to nutritional epidemiology. Biometrics, 72, 106-115.
pdf

Midthune, D., Carroll, R. J., Freedman, L. S. and Kipnis, V. (2016). Measurement error models with interactions. Biostatistics, 17, 277-290.
pdf

Masiuk, S., Shklyar, S., Kukush, A., Carroll, R. J., Kovgan, L. and Likhtarov, I. A. (2016). Estimation of radiation risk in presence of classical additive and Berkson multiplicative errors in exposure doses. Biostatistics, 17, 422-436.
pdf

Zoh, R., Mallick, B. K., Ivanov, I., Baladandayuthapani, V., Manyam, G., Chapkin, R., Lampe, J. W. and Carroll, R. J. (2016). PCAN: probabilistic correlation analysis of two non-normal data sets. Biometrics, 72, 1358-1368. PMC5045754
pdf

Li, H., Keadle, S., Kipnis, V. and Carroll, R. J. (2016). Longitudinal functional additive model with continuous proportional outcomes for physical activity data. STAT, 5, 242-250.
pdf

Huque, M. H., Carroll, R. J., Christiani, D. C. and Ryan, L. M. (2016). Exposure enriched case-control (EECC) design for the assessment of gene-environment interaction. Genetic Epidemiology, 40, 570-578. PMCID: PMC5069109
pdf

Keogh, R. H., Carroll, R. J., Tooze, J., Kirkpatrick, S. I. and Freedman, L. S. (2016). Statistical issues related to dietary intake as the response variable in intervention trials. Statistics in Medicine, 35, 4493-4508. PMID 27324170
pdf

Selected Papers in 2015

Freedman, L. S., Midthune, D., Dodd, K., Carroll, R. J. and Kipnis, V. (2015).
A statistical model for measurement error that incorporates variation over time in the target measure, with application to nutritional epidemiology.
Statistics in Medicine, 34, 3590-3605.
pdf

Li, H., Keadle, S. K., Staudenmayer, J., Assaad, H., Huang, J. Z. and Carroll, R. J. (2015). Methods to assess an exercise intervention trial based on three-level functional data. Biostatistics, 16, 754-771.
pdf

Wang, Y., Wang, S. and Carroll, R. J. (2015). The direct integral method for confidence intervals for the ratio of two location parameters. Biometrics, to appear. PMID 25939421
pdf

Zhang, X., Zou, G. and Carroll, R. J. (2015). Model averaging based on Kullback-Leibler distance. Statistica Sinica, 25, 1583-1598.
pdf

Freedman, L. S., Midthune, D., Carroll, R. J., Commins, J. M., Arab, L., Baer, D. J., Moler, J. E., Moshfegh, A. J., Neuhouser, M. L., Prentice, R. L. and Rhodes, D. (2015). Application of a new statistical model for measurement error to the evaluation of dietary self-report instruments. Epidemiology, 26, 925-933.
pdf

Ma, S., Carroll, R. J., Liang, H. and Xu, S. (2015). Estimation and inference in generalized additive coefficient models for nonlinear interactions with high-dimensional covariates. Annals of Statistics, 43, 2102-2131.
pdf

Assaad, H. I., Hou, Y., Zhou, L., Carroll, R. J., and Wu, G. (2015). Rapid publication-ready MS-Word tables for two-way ANOVA. SpringerPlus, 4, 19. PMC4305362

Yi, G. Y., Ma, Y., Spiegelman, D. and Carroll, R. J. (2015). Functional and structural methods with mixed measurement error and misclassification in covariates.
Journal of the American Statistical Association, to appear.
Online version pdf

Zhang, X., Cao, J. and Carroll, R. J. (2015). On the selection of ordinary differential equation models with application to predator-prey dynamical models. Biometrics, to appear.
Online version pdf

Gregory, K. B., Carroll, R. J., Baladandayuthapani, V. and Lahiri, S. N. (2015). A two-sample test for equality of means in high dimension.
Journal of the American Statistical Association, advance access version.
PDF File

Staicu, A.-M., Lahiri, S. and Carroll, R. J. (2015). Significance tests for functional data with complex dependence structure. Journal of Statistical Planning and Inference, 156, 1-13.
PDF File

Lian, H., Liang, H. and Carroll, R. J. (2015). Variance function partially linear single-index models.
Journal of the Royal Statistical Society, Series B, 77, 171-194.
PDF File

Li, H., Staudenmayer, J. and Carroll, R. J. (2015). Hierarchical functional data with mixed continuous and binary measurements. Biometrics, 70, 802-811.
PDF File

Selected Papers in 2014

Carroll, R. J. (2014). Estimating the distribution of dietary consumption patterns. Statistical Science, 29, 2-8.
PDF File

Sarkar, A., Mallick, B. K. and Carroll, R. J. (2014). Bayesian semiparametric regression in the presence of conditionally heteroscedastic measurement and regression errors. Biometrics, 70, 823-834.
PDF File

Martinez, J. G., Bohn, K. M., Carroll, R. J. and Morris, J. S. (2014). A study of Mexican Free-Tailed Bat chirp syllables:
Bayesian functional mixed models for nonstationary acoustic time series.
Journal of the American Statistical Association, 108, 514-526.
PDF File

Gazioglu, S., Wei, J., Jennings, E. M. and Carroll, R. J. (2014). A note on penalized regression spline estimation in the secondary analysis of case-control data. Statistics in Biosciences, to appear.

Ward, R. and Carroll, R. J. (2014). Testing Hardy-Weinberg equilibrium with a simple root-mean-square statistic. Biostatistics, in press.
Advanced access PDF File

Garcia, T. P., Müller, S., Carroll, R. J., and Walzem, R. L. (2014). Identification of important regressor groups, subgroups, and individuals via regularization methods: application to gut microbial data. Bioinformatics, to appear. doi: 10.1093/bioinformatics/btt608
Advanced access PDF File

Tekwe, C. D., Carter, R. L., Cullings, H. M. and Carroll, R. J. (2014). Multiple indicators, multiple causes measurement error models. Statistics in Medicine, to appear.
Advanced access PDF File

Guenther, P. M., Kirkpatrick, S. L., Reedy, J., Krebs-Smith, S. M., Buckman, D. W., Dodd, K. W., Casavale, K. O. and Carroll, R. J. (2014). Healthy Eating Index-2010 is a valid and reliable measure of diet quality according
to the 2010 Dietary Guidelines for Americans. Journal of Nutrition, to appear.
Advanced access PDF File

Sarkar, A., Mallick, B. K., Staudenmayer, J., Pati, D. and Carroll, R. J. (2014). Bayesian semiparametric density deconvolution in the presence of
conditionally heteroscedastic measurement errors. Journal of Computational and Graphical Statistics, 25, 1101-1125.
PDF File

Qi, X., Luo, R., Carroll, R. J. and Zhao, H. (2014). Sparse regression by projection and sparse discriminant analysis.
Journal of Computational and Graphical Statistics, to appear.
Advanced access PDF File

Little, M. P., Kukush, A. G., Masiuk, S. V., Shklyar, S. V., Carroll, R. J., Lubin, J. H.,
Kwon, D., Brenner, A. V., Tronko, M. D., Mabuchi, K., Bogdanova, T. I., Hatch, M., Zablotska, L. B.,
Tereschenko, V. P., Ostroumova, E., Bouville, A. C., Drozdovitch, V., Chepurny, M. I., Kovgan, L. N., Simon, S. L.,
Shpak, V. M. and Likhtarev, I. A. (2014). Impact of uncertainties in exposure assessment on
thyroid cancer risk among Ukrainian children and adolescents exposed from the Chornobyl accident. PLoS ONE, 9, e85723.
PDF File

Qahtan, A., Wang, S., Carroll, R. J., and Zhang, X. (2014). A new study of two divergence metrics for change detection in data streams.
Proceedings of the 21st European Conference on Artificial Intelligence (ECAI 2014), to appear.
Advanced access PDF File

Selected Papers in 2013

Serban, N., Staicu, A.-M. and Carroll, R. J. (2013). Multilevel cross-dependent binary longitudinal data. Biometrics, 69, 903-913.
PDF File

Li, Y., Wang, N. and Carroll, R. J. (2013). Selecting the number of principal components in functional data. Journal of the American Statistical Association, 108, 1284-1291.
PDF File

Carroll, R. J., Delaigle, A. and Hall, P. (2013). Unexpected properties of bandwidth choice when smoothing discrete data for constructing a functional data classifier. Annals of Statistics, 41, 2739-2767.
PDF File

Xun, X., Cao, J., Mallick, B. K., Maity, A. and Carroll, R. J. (2013). Parameter estimation of partial differential equation models. Journal of the American Statistical Association, 108, 1009-1020.
PDF File

Garcia, T. P., Mueller, S., Carroll, R. J., Dunn, T. N., Thomas, A. P., Adams, S. H., Pillai, S. D. and Walzem, R. S. (2013). Structured variable selection with q-values. Biostatistics, 14, 695-707.
PDF File

Sampson, J. N., Chatterjee, N., Carroll, R. J. and Mueller, S. (2013). Controlling the local false discovery rate in the Adaptive Lasso. Biostatistics, 14, 653-666.
PDF File

Wei, J., Carroll, R. J., Mueller, U., Van Keilegom, I. and Chatterjee, N. (2013). Locally efficient estimation for homoscedastic regression in the secondary analysis of case-control data. Journal of the Royal Statistical Society, Series B, 75, 186-206.
PDF File

Tooze, J. A., Troiano, R. P., Carroll, R. J., Moshfegh, A. L. and Freedman, L. S. (2013). A measurement error model for physical activity level as measured by a questionnaire with application to the NHANES 1999-2006 questionnaire. American Journal of Epidemiology, online version.
PDF File

Chen, Y.-H., Chatterjee, N. and Carroll, R. J. (2013). Using shared genetic controls in studies of gene-environment interactions. Biometrika, 100, 319-338.
PDF File

Jennings, E. M., Morris, J. S., Carroll, R. J., Ganiraju, M. C. and Baladandayuthapani, V. (2013). Bayesian methods for expression-based integration of various types of genomics data. EURASIP Journal on Bioinformatics and Systems Biology, 2013.13, http://bsb.eurasipjournals.com/content/2013/1/13.
PDF File

Selected Papers in 2012

Identifying genetic marker sets associated with phenotypes via an efficient adaptive score test. Biostatistics, 13, 776-790 (with Tianxi Cai and Xihong Lin).
PDF File

Multiple imputation in quantile regression. Biometrika, 99, 423-438 (with Y. Wei and Y. Ma).
PDF File

Collier, B. A., Groce, J. E., Morrison, M. L., Newnam, J. C., Campomizzi, A. J., Farrell, S. J., Mathewson, H. A., Snelgrove, R. T., Carroll, R. J. and Wilkins, R. N. (2012). Predicting patch occupancy in fragmented landscapes at the range-wide scale for endangered species: an example of an American warbler. Diversity and Distributions, 18, 158-167.
PDF File

Park, J.-H., Gail, M. H., Weinberg, C., Carroll, R. J., Chung, C., Wang, Z., Chanock, S., Fraumeni, J. F. and Chatterjee, N. (2012). Distribution of allele frequencies,
effect sizes and their interrelationships for common susceptibility variants. Proceedings of the National Academy of Sciences, 108, 18026-18031.
PDF File

Ma, S., Yang, L. and Carroll, R. J. (2012). A simultaneous confidence band for sparse longitudinal regression. Statistica Sinica, 22, 95-122.
PDF File

Yi, G. Y., Ma, Y. and Carroll, R. J. (2012). A robust, functional generalized method of moments approach for longitudinal studies with missing responses and covariate measurement error. Biometrika, 99, 151-165.
PDF File

Bliznyuk, N., Carroll, R. J., Genton, M. and Wang, Y. (2012). Variogram estimation in the presence of trend. Statistics and its Interface, 5, 159-168.
PDF File

Carroll, R. J., Midthune, D., Subar, A. F., Shumakovich, M., Freedman, L. S., Thompson, F. E. and Kipnis, V. (2012). Taking advantage of the strengths of
two different dietary assessment instruments to improve intake estimates for nutritional epidemiology. American Journal of Epidemiology, 175, 340-347.
PDF File of advanced access version

Kipnis, V., Midthune, D., Freedman, L. S. and Carroll, R. J. (2012). Regression calibration with more instruments than mismeasured variables. Statistics in Medicine, 31, 2713-2732.
PDF File

Carroll, R. J., Delaigle, A. and Hall, P. (2012). Deconvolution when classifying noisy data involving transformations. Journal of the American Statistical Association, 106, 1166-1177.
PDF File

Tekwe, C. D., Dabney, A. R. and Carroll, R. J. (2012). Application of survival analysis methodology to the quantitative analysis of LC-MS proteomics data. Bioinformatics, 28, 1998-2003.
PDF File of advanced access version

Selected Papers in 2011

Estimation and variable selection for generalized additive partial linear models. Annals of Statistics, 39, 1827-1851 (with L. Wang, X. Liu and Hua Liang).
PDF File

Density estimation in several populations with uncertain population membership. Journal of the American Statistical Association, 106, 1180-1192 (with Yanyuan Ma and Jeffrey D. Hart).
PDF File

A Bayesian approach to detection of small low emission sources. Inverse Problems, electronic version (with Xiaolei Xun, Bani Mallick and Peter Kuchment).
PDF File

Semiparametric Bayesian analysis of gene-environment interactions with error in measurement of environmental covariates and missing genetic data. Statistics and its Interface, 4, 305-315 (with Iryna Lobach and Bani Mallick).
PDF File

Testing and estimating shape-constrained nonparametric density and regression in the presence of measurement error. JASA, 106, 191-202 (with A. Delaigle and P. Hall).
PDF File

Testing for constant nonparametric effects in general semiparametric regression models with interactions. Statistics and Probability Letters, 81, 717-723 (with J. Wei and A. Maity).
PDF File

Local and omnibus tests in classical measurement error models. Journal of the Royal Statistical Society, Series B, 73, 81-98 (with Y. Ma, JR. Janicki and J. D. Hart).
PDF File

P-splines using derivative information. Multiscale Modeling and Simulation, 8, 1562-1580 (with C. P. Calderon, J. G. Martinez and D. C. Sorensen).
PDF File

A mixed-effects model approach for estimating the distribution of usual intake of nutrients: the NCI method. Statistics in Medicine, 29, 2857-2868 (with J. A. Tooze, V. Kipnis, D. W. Buckman, L. S. Freedman, P. M. Guenther, S. M. Krebs-Smith, A. F. Subar and K. W. Dodd).
PDF File

A new multivariate measurement error model with zero-inflated dietary data, and its application to dietary assessment. Annals of Applied Statistics, 5, 1456-1487 (with S. Zhang, J. A. Tooze, V. Kipnis, D. W. Buckman, L. S. Freedman, P. M. Guenther, S. M. Krebs-Smith, A. F. Subar and K. W. Dodd).
PDF File

A simultaneous confidence band for sparse longitudinal regression. Statistica Sinica, 22, 95-122 (with S. Ma and L. Yang).
PDF File

How to estimate the measurement error variance associated with ancestry proportion estimates. Statistics and its Interface, 4, 327-337 (with J. Divers, D. T. Redden and D. B. Allison).
PDF File

Generalized additive partial linear models polynomial spline smoothing estimation and variable selection procedures. Annals of Statistics, 39, 1827-1851 (with L. Wang, X. Liu and H. Liang).
PDF File

Fitting a bivariate measurement error model for episodically consumed dietary components. International Journal of Biostatistics, Volume 7, Issue 1, Article 1, DOI: 10.2202/1557-4679.1267 (with S. Zhang, J. A. Tooze, V. Kipnis, D. W. Buckman, L. S. Freedman, P. M. Guenther, S. M. Krebs-Smith, A. F. Subar and K. W. Dodd).
PDF File

Mixtures of classical and Berkson uncertainties in the analysis of the Chernobyl accident. International Journal of Biostatistics, Volume 7, Issue 1, Article 15. DOI: 10.2202/1557-4679.1281 (with Kukush, A., Shklyar, S., Masiuk, S., Likhtarov, I., Kovgan, L. and Bouville, A.).
PDF File

Statistical methods for comparative phenomics using high-throughput phenotype microarrays. International Journal of Biostatistics, 6, Issue 1, Article 29, DOI: 10.2202/1557-4679.1227 (with Sturino, J. M., Zorych, I., Mallick, B., Chang, Y.-Y. and Bliznyuk, N.).
PDF File

Selected Papers in 2010

Genotype-based association mapping of complex diseases: gene-environment interactions with multiple genetic markers and measurement error in environmental exposures. Genetic Epidemiology, 34, 792-802 (with I. Lobach and R. Fan).
PDF File

Generalized empirical likelihood methods for analyzing
longitudinal data. Biometrika, 97, 79-93 (with S. Wang and L. Qian).
PDF File

A nonparametric approach to detect nonlinear correlation in gene expression. Journal of Computational and Graphical Statistics, 19, 552-568 (with Y. A. Chen, J. Almeida, A. Richards, P. Mueller and B. Rohrer).
PDF File

Semiparametric estimation of fixed effects panel data varying coefficient models. Nonparametric Econometric Methods, Advances in Econometrics, 24, (with Y. Sun and D. Li).
PDF File

Use of multiple singular value decompositions to analyze complex intracellular calcium ion signals. Annals of Applied Statistics, 3, 1467-1492 (with J. G. Martinez, J. Z. Huang, R. C. Burghardt and R. Barhoumi).
PDF File

Semiparametric Bayesian analysis of nutritional epidemiology data in the presence of measurement error. Biometrics, 66, 444-454 (with S. Sinha, B. K. Mallick and V. Kipnis).
PDF File

Functional principal component selection via Stochastic Approximation Monte Carlo (SAMC). Canadian Journal of Statistics, 38, 256-270 (with J. G. Martinez and F. Liang).
PDF File

Reduced rank mixed effects models for spatially correlated hierarchical functional data. Journal of the American Statistical Association, 105, 390-400 (with L. Zhou, J. Z. Huang, J. G. Martinez, A. Maity and V. Baladandayuthapani).
PDF File

Fast methods for spatially correlated multilevel functional data. Biostatistics, 11, 177-194 (with A.-M. Staicu and C. M. Crainiceanu).
PDF File
Supplement

Statistics and bioinformatics in nutritional sciences: analysis of complex data in the era of systems biology. Journal of Nutritional Biochemistry, 21, 561-572 (with W. Fu, A. J. Stromberg, K. Viele and G. Wu).
PDF File

Bayesian modeling of MPSS data: gene expression analysis of bovine salmonella infection. Journal of the American Statistical Association, 105, 956-967 (with S. Dhavala and B. K. Mallick).
PDF File

Generalized functional latent feature models with single-index interactions. Journal of the American Statistical Association, 105, 621-633 (with Y. Li and N. Wang).
PDF File

Marginal longitudinal semiparametric regression via penalized splines. Statistics and Probability Letters, 80, 1242-1252 (with M. Al-Kadiri and M. P. Wand).
PDF File

A note on the effect on power of score tests via dimension reduction by penalized regression under the null. International Journal of Biostatistics, Vol. 6, Issue 1, Article 12. DOI: 10.2202/1557-4679.1231 (with J. G. Martinez, S. Mueller, J. N. Sampson and N. Chatterjee).
PDF File

Selected Papers in 2009

Analysis of case-control association studies: SNPs, Imputation and Haplotypes. Statistical Science, 24, 489-502 (with N. Chatterjee, Y.-H. Chen and S. Luo).
PDF File

Modeling data with excess zeros and measurement error: application to evaluating relationships between episodically consumed foods and health outcomes. Biometrics, 65, 1003-1010 (with V. Kipnis, D. Midthune, L. S. Freedman and K. Dodd).
PDF File

Quantile regression with measurement error. Journal of the American Statistical Association, 104, 1129-1143 (with Ying Wei).
PDF File

Semiparametric regression during 2003-2007. Electronic Journal of Statistics, 3, 1192-1256 (with D. Ruppert and M. P. Wand).
PDF File

Efficient semiparametric marginal estimation for the partially linear additive model for longitudinal/clustered data. Statistics in Biosciences, 1, 10-31 (with A. Maity, E. Mammen and K. Yu).
PDF File

Covariate adjusted correlation analysis with application to FMR1 premutation female carrier data.
Biometrics, 65, 781-792 (with D. Senturk and D. Nguyen).
PDF File

Nonparametric additive regression for repeatedly measured data. Biometrika, 96, 383-398 (with A. Maity, E. Mammen and K. Yu).
PDF File

SIMEX and variance estimation in semiparametric measurement error models. Electronic Journal of Statistics, 3, 318-348 (with T. Apanasovich and A. Maity).
PDF File

Testing in semiparametric models with interaction, with applications to gene-environment interactions.
Journal of the Royal Statistical Society, Series B, 71, 75-96 (with A. Maity, N. Chatterjee and E. Mammen).
PDF File

Shrinkage estimators for robust and efficient inference in haplotype-based case-control studies.
Journal of the American Statistical Association, 104, 220-233 (with Y.-H. Chen and N. Chatterjee).
PDF File

A design-adaptive local polynomial estimator for the errors-in-variables problem.
Journal of the American Statistical Association, 104, 348-359 (with Aurore Delaigle and Jianqing Fan).
PDF File

Variance estimation in the analysis of microarray data. Journal of the Royal Statistical Society, Series B, 71, 425-445 (with Yuedong Wang and Yanyuan Ma).
PDF File

Nonparametric prediction in measurement error models. Journal of the American Statistical Association, 104, 993-1014 (with A. Delaigle and P. Hall).
PDF File

Covariate-adjusted linear mixed effects model with an application to longitudinal data. Journal of Nonparametric Statistics, 20, 459-481 (with D. Senturk and D. Nguyen).
PDF File

A Bayesian multilevel model for estimating the diet/disease relationship in a multicenter study with exposure measured with error: The EPIC study. Statistics in Medicine, 27, 6037-6054 (with P. Ferrari, P. Gustafson and E. Riboli).
PDF File

Identification and estimation of nonlinear models using two samples with nonclassical measurement errors (with Xiaohong Chen and Yingyao Hu).
PDF File of actual paper
PDF File of the long version of the paper

Selected Papers in 2008

Semiparametric analysis of heterogeneous data using varying scale generalized linear models. Journal of the American Statistical Association, 103, 650-660 (with Minge Xie and Douglas Simpson).
PDF File

Nonparametric estimation and testing of fixed effects panel data models.
Journal of Econometrics, 144, 257-276 (with D. Henderson and Q. Li).
PDF File

Nonparametric variance estimation in the analysis of
microarray data: a measurement error approach.
Biometrika, 95, 437-449 (with Yuedong Wang).
PDF File

A comparison of regression calibration, moment reconstruction and imputation for adjusting for covariate measurement error in regression. Statistics in Medicine, 27, 5195-5216 (with L. S. Freedman, D. Midthune and V. Kipnis).
PDF File

Combining assays for estimating prevalence of human herpesvirus 8 infection using multivariate mixture models.
Biostatistics, 9, 137-151 (with R. Pfeiffer).
PDF File

Semiparametric longitudinal-spatial binary regression,
with application to colon carcinogenesis. Biometrics, 64, 490-500
(with T. Apanasovich, D. Ruppert, J. Lupton, N. Popovic, N. Turner and
R. Chapkin).
PDF File

Haplotype-based regression analysis and inference of case-control
studies with unphased genotypes and measurement errors in
environmental exposures. Biometrics, 64, 673-684 (with I. Lobach, N. Chatterjee, C. Spinka and M. H. Gail).
PDF File
Web Appendix

Binary regression in truncated samples, with application to comparing dietary instruments in a large
prospective study. Biometrics, 64, 289-298 (with Doug Midthune, Victor Kipnis and Laurence Freedman).
PDF File

Joint modeling of paired sparse functional data using principal components. Biometrika, 95, 601-619 (with Lan Zhou and Jianhua Huang).
PDF File

Bayesian hierarchical spatially correlated functional data analysis with application to colon carcinogenesis.
Biometrics, 64, 64-73 (with Veerabhadran Baladandayuthapani, Bani Mallick, Nancy Turner, Mee Young Hong, Robert Chapkin and Joanne Lupton).
PDF File

Selected Papers in 2007

Nonparametric regression function estimation from data contaminated by a mixture of classical and Berkson measurement errors. Journal of the Royal Statistical Society, Series B, 69, 859-878 (with Aurore Delaigle and Peter Hall).
PDF File

Spatially adaptive Bayesian P-splines with heteroscedastic errors. Journal of Computational and Graphical Statistics, 16, 265-288 (with Ciprian Crainiceanu, David Ruppert, Adarsh Joshi and Billy Goodwin).
PDF File

Retrospective analysis of haplotype-based case-control studies under a flexible model for gene-environment association.
Biostatistics, 9, 81-99 (with Nilanjan Chatterjee and Yi-Hau Chen).
PDF File

Estimation of population-level summaries
in general semiparametric repeated measures regression models.
IMS Monograph Series Festschrift for P. K. Sen (with Arnab Maity and Tatiyana Apanasovich).
PDF File

Shared uncertainty in measurement error problems, with
application to Nevada Test Site fallout data.
Biometrics, 63, 1226-1236 (with Yehua Li, Owen Hoffman and Annamaria Guolo).
PDF File

An asymptotic theory for model selection inference in general semiparametric problems. Biometrika, published online May 15, 2007 (with Gerda Claeskens).
Please note that the second formula on page 10 has a typesetter error: the left square bracket should be after the convergence symbol, not well before it.
PDF File

Nonparametric estimation of correlation functions in longitudinal and spatial data, with
application to colon carcinogenesis experiments.
Annals of Statistics, 35, (with Yehua Li, Naisyin Wang, Nancy Turner, Mee Young Hong, Robb Chapkin and Joanne Lupton).
PDF File

The Hanford Thyroid Disease Study: an alternative view of the findings.
Health Physics, 92, 99-111 (with F. Owen Hoffman, J. Ruttenber and Sander Greenland).
PDF File

Partially linear models with missing response variables and error-prone covariates. Biometrika, 94, 185-198 (with Hua Liang and Suojin Wang).
PDF File of published paper
PDF File with proofs and technical arguments

Stochastic approximation in Monte Carlo computation. Journal of the American Statistical Association, 102, 305-320 (with Faming Liang and Chuanhai Liu).
PDF File

Efficient estimation of population-level summaries in general semiparametric regression models.
Journal of the American Statistical Association, 102, 123-139 (with Arnab Maity and Yanyuan Ma).
PDF File

Backfitting versus profiling in general criterion functions. Statistica Sinica, 17, 797-816 (with Ingrid Van Keilegom).
PDF File

Selected Papers in 2006

Bayesian error analysis model (BEAM) for reconstructing
transcriptional regulatory networks. PNAS, 103, 7988-7993 (with Ning Sun and Hongyu Zhao).
PDF File

Radiation exposure and thyroid cancer: Letter to the editor.
JAMA (with Owen Hoffman, James Ruttenber and Sander Greenland), 296, 513.
PDF File

Semiparametric estimation in general repeated measures problems. Journal of the Royal Statistical Society, Series B, 68, 69-88 (with Xihong Lin).
PDF File of published paper
PDF File with proofs and technical arguments (of major interest in their own right.)

Wavelet-based functional mixed models. Journal of the Royal Statistical Society, Series B, 68, 179-199 (with Jeffrey Morris).
PDF File
Web site with executable code and examples of spiky data

Seemingly unrelated measurement error models, with application to nutritional epidemiology. Biometrics, 62, 75-84 (with Doug Midthune, Larry Freedman and Victor Kipnis).
PDF File

Locally efficient estimators for semiparametric models with measurement error. Journal of the American Statistical Association, 101, 1465-1474 (with Yanyuan Ma).
PDF File

Thyroid disease associated with exposure to the Nevada Test Site radiation:
a reevaluation based on corrected dosimetry and examination data.
Epidemiology, 17, 604-614 (with J. Lyon,
F. O. Hoffman, and others)
PDF File

Discussion of Conditional Growth Charts by Ying Wei and Xuming He. Annals of Statistics (with David Ruppert)
PDF File

Selected Papers in 2005

Semiparametric maximum likelihood estimation exploiting gene-environment independence in case-control studies. Biometrika, 92, 399-418 (with Nilanjan Chatterjee).
PDF File

Analysis of case-control studies of genetic and
environmental factors with missing genetic information and
haplotype-phase ambiguity. Genetic Epidemiology, 29, 108-127 (with Christine Spinka and Nilanjan Chatterjee).
PDF File

Efficient semiparametric marginal estimation
for longitudinal/clustered data.
Journal of the American Statistical Association, 100, 147-157 (with Naisyin Wang and Xihong Lin).
PDF File

Estimating misclassification error with small samples via
bootstrap cross-validation. Bioinformatics, 21, 1979-1986 (with Wenjiang Fu and Suojin Wang).
PDF File

Exploiting gene-environment independence in family-based case-control
studies: increased power for detecting associations, interactions and joint
effects. Genetic Epidemiology, 28, 138-156 (with Nilanjan Chatterjee and
Zeynep Kalaylioglu)
PDF File

Spatially adaptive Bayesian penalized regression splines (P-splines).
Journal of Computational and Graphical Statistics, 14, 378-394 (with Veera Baladandayuthapani and Bani Mallick).
PDF File

How many samples are needed to build a classifier: a
general sequential approach. Bioinformatics, 21, 63-70
(with Wenjiang Fu, Ed Dougherty and Bani Mallick).
PDF File

Semiparametric Bayesian analysis of matched case-control
studies with missing exposure.
Journal of the American Statistical Association, 100, 591-601 (with Samiran Sinha, Bhramar Mukherjee, Malay Ghosh and
Bani K. Mallick).
PDF File

On Estimation in binary autologistic spatial models.
Journal of Statistical Computation and Simulation
(with Mike Sherman and Tanya Apanasovich).
PDF File

Selected Papers in 2004

Profile-kernel versus backfitting in partially linear models for longitudinal/clustered data.
Biometrika, 91, 252-261 (with Zhonghui Hu and Naisyin Wang).
PDF File

Simple fitting of subject-specific curves for longitudinal data.
Statistics in Medicine (with Maria Durban, Jarek Harezlak and Matt Wand).
PDF File

Noise factor analysis for cDNA microarrays.
Journal of Biomedical Optics, 9, 663-678
(with Yoganand Balagurunathan, Naisyin Wang, Edward R. Dougherty,
Danh V. Nguyen, Yidong Chen, Michael L. Bittner and
Jeffrey M. Trent).
PDF File

Estimation in Partially Linear Models with Missing Covariates.
Journal of the American Statistical Association, 99, 357-367 (with Hua Liang, Suojin Wang and Jamie Robins).
PDF File

Instrumental variables and nonparametric regression. Journal of the American Statistical Association, 99, 736-750 (with David Ruppert, Tor Tosteson, Ciprian Crainiceanu and Margaret Karagas).
PDF File

Equivalent kernels of smoothing splines in nonparametric regression for clustered/longitudinal data. Biometrika, 91, 177-193 (with Xihong Lin, Naisyin Wang and Alan Welsh).
PDF File

Is cross-validation better than resubstitution for ranking genes? Bioinformatics.
PDF File

Histospline method in nonparametric regression models with applications.
Statistica Sinica, 14, 633-658
(with Peter Hall, Xihong Lin and Tanya Apanasovich).
PDF File

A new method for dealing with measurement error in explanatory variables of regression models. Biometrics, 60, 172-181 (with Larry Freedman, Victor Kipnis, Vitali Fainberg and Douglas Midthune).
PDF File

Missing value estimation for cancer microarray gene expression data.
Journal of Data Science, 2, 347-370 (with Danh Nguyen and Naisyin Wang).
PDF File

Low-Order Approximations in Deconvolution and Regression With Errors in Variables. Journal of the Royal Statistical Society, Series B, 66, 31-46 (with Peter Hall).
PDF File
Regression simulation results not included in the paper: PDF File  PostScript

Selected Papers in 2003

A comparison of a food frequency questionnaire with a 24-hour recall for use in an epidemiological cohort study: results from the biomarker-based OPEN study. International Journal of Epidemiology.
PDF File

The structure of dietary measurement error: results from the OPEN biomarker study. American Journal of Epidemiology.
PDF File

Variances are Not Always Nuisance Parameters: The 2002 R. A. Fisher Lecture. Biometrics.
PDF File

Semiparametric regression splines in matched case-control studies. Biometrics (with Inyoung Kim and Noah Cohen).
PDF File

Testing for spatial correlation in nonstationary binary data with application to Aberrant Crypt Foci in colon carcinogenesis. Biometrics, (with T. Apanasovich, S. Sheather, N. Popovic, N. Turner, R. Chapkin and J. Lupton).
PDF File

Color images of aberrant crypt foci in colon carcinogenesis.
PDF File  PostScript

Wavelet-based nonparametric modeling of hierarchical functions in colon carcinogenesis. Journal of the American Statistical Association, Editor Invited Paper for 2003 (with Jeffrey Morris, Philip Brown and Marina Vannucci).
PDF File

More efficient kernel estimation in nonparametric regression with correlated errors. Journal of the American Statistical Association (with Zhijie Xiao, Oliver Linton and Enno Mammen).
PDF File

Accounting for correlation in marginal longitudinal nonparametric regression. Seattle Biostatistics Symposium (with Oliver Linton, Enno Mammen and Xihong Lin).
PDF File  PostScript

Semiparametric inference in matched case-control studies with missing covariate data. Biometrika (with Paul Rathouz and Glen Satten).
PDF File  PostScript

The relationship between virologic and immunologic responses in AIDS clinical research using mixed-effects varying-coefficient semiparametric models with measurement error. Biostatistics (with Hua Liang and Hulin Wu).
PDF File

Selected Papers in 2002

DNA microarray experiments: biological and technological aspects. Biometrics (with Danh Nguyen, A. Bulak Arpat and Naisyin Wang).
PDF File
Tutorial Figures: PDF File  Color Figures: PDF File

Marginal Longitudinal Nonparametric Regression: Locality and Efficiency of Spline and Kernel Methods. Journal of the American Statistical Association (with Alan Welsh and Xihong Lin).
PDF File  PostScript

Morris, J. S., Wang, N., Lupton, J. R., Chapkin, R. S., Turner, N. D., Hong, M. Y. and Carroll, R. J. (2002). A Bayesian analysis of colonic crypt structure and coordinated response incorporating missing crypts. Biostatistics.
PDF File  PostScript

Semiparametric regression modeling with mixtures of Berkson and classical error, with application to fallout from the Nevada test site. Biometrics (with Bani Mallick, F. Owen Hoffman).
PDF File

Bayesian smoothing and regression splines for measurement error problems. Journal of the American Statistical Association (with Scott Berry and David Ruppert).
PDF File  PostScript

Semiparametric regression for clustered data using generalized estimating equations. Journal of the American Statistical Association (with Xihong Lin).
PDF File  PostScript

Estimation in an additive model when components are linked parametrically. Econometric Theory (with Wolfgang Haerdle and Enno Mammen).
PDF File

Berry, S. A., Carroll, R. J. & Ruppert, D. (2002). Bayesian smoothing for measurement error problems. Total Least Squares and Errors-in-Variables Modeling: Analysis, Algorithms and Applications, editors S. van Huffel and P. Lemmerling. Kluwer Academic Publishers.
PDF File  PostScript

Covariate measurement error adjustment for matched case-control
studies. Biometrics, 57, 62-73 (with Lisa McShane, Doug Midthune, J. Dorgan and Larry Freedman).
PDF File

Selected Papers in 2001

Parametric and nonparametric methods for understanding the relationship between carcinogen-induced DNA adduct levels in distal and proximal regions of the colon. Journal of the American Statistical Association (with Jeffrey Morris, Naisyin Wang, Joanne Lupton, Nancy Turner, MeeYoung Hong and Robert Chapkin).
PDF File  PostScript

A note on the efficiency of sandwich covariance estimation. Journal of the American Statistical Association (with Goeran Kauermann).
PDF File

Thyroid Cancer Following Scalp Irradiation: A Reanalysis Accounting for Uncertainty in Dosimetry. Biometrics (with Dan Schafer, Elaine Ron, Jay Lubin and Marilyn Stovall).
PDF File

Semiparametric regression for clustered data with a nonparametric cluster-level component. Biometrika (with Xihong Lin).
PDF File  PostScript

Parameterization and inference for nonparametric regression problems. Journal of the Royal Statistical Society Series B (with Wenxin Jiang, Victor Kipnis and Doug Midthune).
PDF File

Review times in Statistics: tilting at windmills? (Biometrics)
PDF File

Bootstrap confidence intervals for local likelihood, local estimating equations and varying coefficient models. Statistica Sinica (with C. Galindo, G. Kauermann and H. Liang)
PDF File  PostScript

Combining datasets to predict the effects of regulation of environmental lead exposure in housing stock. Biometrics (with W. Strauss, S. Bortnick, J. Menkedick and B. Schulz).
PDF File

Empirical evidence of correlated biases in dietary instruments and its implication. American Journal of Epidemiology (with V. Kipnis, L. Freedman and D. Midthune).
PDF File

Discussion of the Journal of the American Statistical Association paper by Lin and Ying (with Xihong Lin)
PDF File

Semiparametric Regression for Clustered Data Using Generalized Estimating Equations. Journal of the American Statistical Association, 96, 1045-1054 (with Xihong Lin).
PDF File

Selected Papers in 2000

Nonparametric Function Estimation for Clustered Data When the Predictor is Measured Without/With Error. Journal of the American Statistical Association (with Xihong Lin)
PDF File

Nonparametric Function Estimation of the Relationship Between Two Repeatedly Measured Variables. Statistica Sinica (with Andreas Ruckstuhl and Alan Welsh)
PDF File

Random effects in interval-censored ordinal regression: latent structure
and Bayesian approach. Biometrics, 56, 376-383 (with Minge Xie and Douglas Simpson).
PDF File

On Meta-Analytic Assessment of Surrogate Outcomes. Biostatistics (with Mitchell H. Gail, Ruth Pfeiffer and Hans C. van Houwelingen)
PDF File  PostScript

Score tests for familial correlation in genotyped proband designs. Genetic Epidemiology (with M. H. Gail, D. Pee and J. Benichou)
PDF File

Spatially adaptive penalties for spline fitting. Australian Journal of Statistics (with David Ruppert)
PDF File  Correction

Conditional and Unconditional Categorical Regression Models With Missing Covariates. Biometrics (with Glen Satten)
PDF File

Efficient Regression Calibration for Logistic Regression in Main Study/Internal Validation Study Designs with an Imperfect Reference Instrument. Statistics in Medicine (with Donna Spiegelman and Victor Kipnis)
PDF File  PostScript

Phase II clinical trial design for noncytotoxic anticancer agents for which time to disease progression is the primary endpoint. Controlled Clinical Trials (with Rosie Mick and John Crowley)
PDF File  PostScript

Selected Papers in 1999

Large sample theory in a semiparametric partially linear errors in variables model. Annals of Statistics (with Hua Liang and Wolfgang Haerdle)
PDF File

SIMEX variance component tests in generalized linear mixed measurement error models. Biometrics (with Xihong Lin)
PDF File

The efficiency of bias-corrected estimators for nonparametric kernel estimation based on local estimating equations.
PDF File

Polynomial regression and estimating functions in the presence of multiplicative measurement error. Journal of the Royal Statistical Society, Series B (with Steve Iturria and David Firth)
PDF File

Nonparametric regression in the presence of measurement error. Biometrika (with Jeff Maca and David Ruppert)
PDF File

Flexible parametric measurement error models. Biometrics (with Kathryn Roeder and Larry Wasserman)
PDF File

High-order asymptotics for retrospective sampling problems. Biometrika (with Suojin Wang)
PDF File

Old Favorite Papers

Deconvolution kernel density estimators. Statistics, 1990 (with Len Stefanski)
PDF File

Generalized partially linear single-index models. Journal of the American Statistical Association, 1997, pp 477 (with J. Fan, M. Wand and I. Gijbels)
PDF File

Asymptotics for prospective analysis of stratified logistic case-control studies. (Journal of the American Statistical Association, 1995, with C. Y. Wang and Suojin Wang)
PDF file

A nonparametric mixture approach to case-control studies with errors in covariables. Journal of the American Statistical Association, 1996 (with Kathryn Roeder and Bruce Lindsay)
PDF

The use and misuse of orthogonal regression estimation in linear errors-in-variables models.
PDF

Local Estimating Equations (Journal of the American Statistical Association, 1998, with David Ruppert and Alan Welsh)
PDF File

Papers 1998 or previously

On robustness in the logistic regression model.
PDF File

Robust linear regression in replicated measurement error models.
PDF File

Robust estimation in case-control studies with errors in predictors.
PDF File

Estimation of lag in misregistration problems for averaged signals.
PDF File

Meta-analysis, measurement error and corrections for attenuation.
PDF File

A semiparametric correction for attenuation.
PostScript

Adjusting for time trends when estimating the relationship between dietary intake obtained from a food frequency questionnaire and true average intake.
PostScript

Transformation to additivity in measurement error models. Biometrics, 1997, 53, 262-272 (with Stephen Eckert and
Naisyin Wang)
PDF file

Binary regressors in dimension reduction models: a new look at treatment comparisons.
PostScript

Dimension reduction in semiparametric measurement error models.
PostScript

Estimating the reliability of an exposure variable in the presence of confounders.

Analysis of tomato root initiation using a mixture normal distribution.
PostScript

Quasilikelihood estimation in measurement error models with correlated replicates.
PostScript

Asymptotics for the SIMEX estimator in structural measurement error models. (Journal of the American Statistical Association, 1996, with Fred Lombard, Helmut Kuechenhoff and Len Stefanski)
PDF File

Estimation in choice-based sampling with measurement error and bootstrap analysis. Journal of Econometrics, 1997, with C.Y. Wang and S. Wang
PDF file

The use of semiquantitative food frequency questionnaires to estimate the distribution of usual intake.
PostScript

Review of Measurement, Regression and Calibration.
PostScript

Interval censoring and marginal analysis in ordinal regression.
PostScript

Segmented regression with errors in predictors.
PostScript

Application of robust methods in combining toxicology data from diverse studies.
PostScript

Analyzing bivariate continuous data that have been grouped into categories defined by sample quantiles of the marginal distribution.
PDF File

Design aspects of calibration studies in nutrition, with analysis of missing data in linear measurement error models.
PDF File

Measurement Error in Epidemiologic Studies
PDF File

Nonparametric Function Estimation of the Relationship Between Two Repeatedly Measured Variables
PDF File

A New Class of Measurement Error Models, With Applications to Dietary Data
PDF File

The Efficiency of Bias-Corrected Estimators for Nonparametric Kernel Estimation Based on Local Estimating Equations
PDF File

Large sample theory in a semiparametric partially linear errors in variables model.
PDF File

Bias analysis and SIMEX approach in generalized linear mixed measurement error models.
PDF File

Measurement error, biases, and the validation of complex models for blood lead levels in children.
PDF File

Hunt, J. H., Carroll, R. J., Chinchilli, V. and Frankenberg, D. (1980). Relationship between environmental factors and brown shrimp production in Pamlico Sound, North Carolina. Report to the Division of Marine Fisheries, North Carolina Department of Natural Resources.
PDF File


