By Anonymous - May 7, 2019

Covariates

STATE / REGION

STATE contains the state in which the subject has lived the longest, taking into account:
* LIVLOCL: the number of years a subject has lived within 10 miles of their current address and
* censoring dates appropriate to the outcome (e.g. mortality, incidence of cancer).

Because many states are sparsely represented, our imputation function aregImpute() throws an error when imputing STATE. We circumvent this by grouping states into a coarser REGION variable and imputing REGION instead.

| REGION | STATE |
| --- | --- |
| New England | ME, NH, VT, MA, RI, CT, NY, NJ, PA |
| East North Central | OH, IN, IL, MI, WI |
| West North Central | MN, IA, MO, ND, SD, NE, KS |
| South Atlantic | DE, MD, DC, VA, WV, NC, SC, GA, FL |
| East South Central | KY, TN, AL, MS |
| West South Central | AR, LA, OK, TX |
| Mountain | MT, ID, WY, CO, NM, AZ, UT, NV |
| Pacific | WA, OR, CA, AK, HI |
| Canada | AB, BC, MB, NB, NL, NS, NT, NU, ON, PE, QC, SK, YT |
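
The grouping in the table above can be expressed as a lookup in R. The following is only a minimal sketch, not the code used for this analysis: the names `region_states`, `state_to_region`, `assign_region`, and the toy data frame `dat` are hypothetical, and it assumes STATE is stored as two-letter postal codes.

```r
# Hypothetical region definitions, transcribed from the REGION / STATE table.
region_states <- list(
  "New England"        = c("ME", "NH", "VT", "MA", "RI", "CT", "NY", "NJ", "PA"),
  "East North Central" = c("OH", "IN", "IL", "MI", "WI"),
  "West North Central" = c("MN", "IA", "MO", "ND", "SD", "NE", "KS"),
  "South Atlantic"     = c("DE", "MD", "DC", "VA", "WV", "NC", "SC", "GA", "FL"),
  "East South Central" = c("KY", "TN", "AL", "MS"),
  "West South Central" = c("AR", "LA", "OK", "TX"),
  "Mountain"           = c("MT", "ID", "WY", "CO", "NM", "AZ", "UT", "NV"),
  "Pacific"            = c("WA", "OR", "CA", "AK", "HI"),
  "Canada"             = c("AB", "BC", "MB", "NB", "NL", "NS", "NT", "NU",
                           "ON", "PE", "QC", "SK", "YT")
)

# Invert the list into a named character vector:
# names are state/province codes, values are region labels.
state_to_region <- setNames(
  rep(names(region_states), lengths(region_states)),
  unlist(region_states, use.names = FALSE)
)

# Map a vector of state codes to a REGION factor.
# Missing or unrecognized codes become NA.
assign_region <- function(state) {
  factor(unname(state_to_region[as.character(state)]),
         levels = names(region_states))
}

# Toy example (made-up values, not study data).
dat <- data.frame(STATE = c("MA", "TX", "ON", "HI", NA))
dat$REGION <- assign_region(dat$STATE)
```

Returning a factor with a fixed set of levels keeps the category set small and stable, which is the point of collapsing STATE before imputation. Codes that are not in the table are left as NA rather than guessed.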

Code