Data Sets


  • Programs for the method of secondary linear regression analysis of case-control studies. The paper, Semiparametric Estimation in the Secondary Analysis of Case-Control Studies by Yanyuan Ma and Raymond J. Carroll, will appear in the Journal of the Royal Statistical Society, Series B, in 2015. There is a README file and an example data set to use as a test.
  • CGEN, an R package based on my work with Nilanjan Chatterjee and Yi-Hau Chen for analyzing genetic data from case-control samples, with particular emphasis on novel methods for detecting gene-gene and gene-environment interactions.

  • SAS Macro and Demonstrations For Estimating Usual Intake Distributions of Food and Associated Individual-Level Predictors for Diet-Disease Relationships
    This site is based on our work on estimating the usual intake distribution for nutrients and episodically consumed foods. In addition, we estimate individual-level usual intake for use in regression calibration analysis of diet-disease relationships.
  • SAS Macro for Haplotype Analysis
    The SAS macro HapReg implements haplotype-based genetic association analysis for case-control studies, using a flexible model for gene-environment association that allows haplotypes to be related to environmental exposures. The methodology was proposed by Chen, Chatterjee, and Carroll (2007, Retrospective Analysis of Haplotype-Based Case-Control Studies Under a Flexible Model for Gene-Environment Association, under revision).
  • Wavelet-Based Functional Mixed Model Methodology, Windows
    These are programs based upon the work of Jeffrey Morris and Raymond Carroll (2006, Journal of the Royal Statistical Society, Series B, 68, 179-199).
  • MATLAB and R Programs for Mixtures of Berkson and Classical Measurement Errors in the Nevada Test Site Study
    These are programs based upon the work of Yehua Li, Annamaria Guolo, F. Owen Hoffman and Raymond J. Carroll. There are Bayesian and Monte Carlo EM programs for the analysis of these important radiation data.
  • Programs called in the Measurement Error Short Course at ENAR, 2008
    These are Stata, R2WinBUGS, and some SAS code for our measurement error short course.
  • MATLAB
    The following programs are in zipped directories. The various utility directories must also be unzipped, since some of the functions call routines in them.
  • R programs
    The R programs are essentially translations of the MATLAB programs. They are all in one directory.
  • CatReg
  • XploRe