(Note: For every question, show your work and/or attach SAS output as evidence. Otherwise you will lose all points, even if your answer is right.)

2. A survey question was distributed to a total of 3214 students, asking for their opinion on requiring a mask mandate on campus.

Here are the counts of responses: 1665 students agreed and 324 strongly agreed with requiring a mask mandate on campus; 888 students disagreed, 123 strongly disagreed, and 214 were neutral.

1) Construct a summary table that displays the count, frequency, and cumulative frequency, as appropriate.

2) Interpret the table.
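The columns of such a table can be sketched in a few lines of Python, using only the counts given in the question. This is a cross-check under stated assumptions, not a replacement for the SAS output the assignment asks for: the ordinal ordering of the categories and the function name `frequency_table` are choices made here for illustration.

```python
# Response counts from the question, arranged in ordinal order.
# The ordering is an assumption; the question lists them in a different order.
counts = {
    "Strongly agree": 324,
    "Agree": 1665,
    "Neutral": 214,
    "Disagree": 888,
    "Strongly disagree": 123,
}


def frequency_table(counts):
    """Return rows of (label, count, percent, cumulative count, cumulative percent)."""
    total = sum(counts.values())
    rows, running = [], 0
    for label, n in counts.items():
        running += n
        rows.append((label, n, 100 * n / total, running, 100 * running / total))
    return rows


table = frequency_table(counts)
print(f"{'Response':<18}{'Count':>7}{'Percent':>9}{'Cum.Count':>11}{'Cum.Pct':>9}")
for label, n, pct, cum_n, cum_pct in table:
    print(f"{label:<18}{n:>7}{pct:>9.2f}{cum_n:>11}{cum_pct:>9.2f}")
```

Cumulative frequencies are only meaningful when the categories have a natural order, which is why the rows are arranged from "Strongly agree" to "Strongly disagree" before accumulating.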

3. Serum vitamin E levels for adults are normally distributed with mean = 900 μg/dL and standard deviation = 350 μg/dL.

a. If levels above 1,600 μg/dL are considered nearly toxic, the percentage of adults with nearly toxic levels is ___________

b. Ten percent of adults have serum vitamin E levels above _____ μg/dL.
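Both parts reduce to standard normal-distribution calculations: part (a) is an upper-tail probability and part (b) is the 90th percentile. A minimal sketch with Python's standard-library `statistics.NormalDist`, using the mean of 900 and standard deviation of 350 from the question, is shown here as a way to check hand or SAS results; the variable names are illustrative.

```python
from statistics import NormalDist

# Parameters as given in the question.
levels = NormalDist(mu=900, sigma=350)

# (a) Upper-tail probability: P(X > 1600) = 1 - CDF(1600).
z = (1600 - 900) / 350          # standardized cutoff
p_above = 1 - levels.cdf(1600)

# (b) The level exceeded by 10% of adults is the 90th percentile.
cutoff = levels.inv_cdf(0.90)

print(f"z = {z:.2f}, P(X > 1600) = {100 * p_above:.2f}%")
print(f"90th percentile = {cutoff:.1f}")
```

Equivalently, part (b) solves $x = \mu + z_{0.90}\,\sigma$, where $z_{0.90}$ is the standard normal 90th percentile.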

5. Summarize the following variables in the nh1516 dataset: RIDAGEYR (Age in years at screening), INDFMPIR (Ratio of family income to poverty), BMXWT (Weight (kg)), BMXHT (Standing Height (cm)), BMXWAIST (Waist Circumference (cm)), BMXBMI (Body Mass Index (kg/m**2)), BPXSY3 (Systolic: Blood pres (3rd rdg) mm Hg), BPXDI3 (Diastolic: Blood pres (3rd rdg) mm Hg), LBXHGB (Hemoglobin (g/dL)), LBXGLU (Fasting Glucose (mg/dL)), LBDINSI (Insulin (pmol/L)), DMDMARTL (Marital status), DMDEDUC2 (Education level – Adults 20+), DUQ200 (Ever used marijuana or hashish), DUQ240 (Ever used cocaine/heroin/methamphetamine), DUQ290 (Ever used heroin), and DUQ330 (Ever used methamphetamine).

(Make sure you add the other categorical variables listed above to the table.)

1) Summarize these variables and fill in the table below.

2) Interpret these summaries.

Variables | All subjects (N= ) |
Age (years) | |
  Mean (SD) | |
  Median (min, max) | |
Ratio of family income to poverty | |
  Mean (SD) | |
  Median (min, max) | |
Waist Circumference (cm) | |
  Mean (SD) | |
  Median (min, max) | |
Body Mass Index (kg/m**2) | |
  Mean (SD) | |
  Median (min, max) | |
Systolic: Blood pres (3rd rdg) mm Hg | |
  Mean (SD) | |
  Median (min, max) | |
Diastolic: Blood pres (3rd rdg) mm Hg | |
  Mean (SD) | |
  Median (min, max) | |
Marital status, n (%) | |
  Married | |
  Widowed | |
  Divorced | |
  Separated | |
  Never Married | |
  Living with partner | |
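The cell formats used in the table ("Mean (SD)", "Median (min, max)", and "n (%)") can be illustrated with a short Python sketch. The helper names `summarize` and `count_percent` are hypothetical, the sample values are made up and are NOT taken from nh1516, and dropping missing values before summarizing is an assumption about how the table should be filled.

```python
import math
import statistics
from collections import Counter


def summarize(values):
    """Summarize one continuous variable, skipping missing values (None or NaN)."""
    clean = [v for v in values
             if v is not None and not (isinstance(v, float) and math.isnan(v))]
    return {
        "n": len(clean),
        # statistics.stdev is the sample standard deviation (n - 1 denominator).
        "mean_sd": f"{statistics.mean(clean):.1f} ({statistics.stdev(clean):.1f})",
        "median_min_max": (f"{statistics.median(clean):.1f} "
                           f"({min(clean):.1f}, {max(clean):.1f})"),
    }


def count_percent(labels):
    """Summarize one categorical variable as 'n (%)' among non-missing values."""
    clean = [x for x in labels if x is not None]
    return {k: f"{n} ({100 * n / len(clean):.1f}%)" for k, n in Counter(clean).items()}


# Hypothetical values for illustration only; one entry in each list is missing.
ages = [34, 51, None, 27, 68, 45]
marital = ["Married", "Married", "Widowed", None]

print(summarize(ages))
print(count_percent(marital))
```

Reporting the non-missing n alongside each summary makes it clear when different variables are based on different numbers of subjects.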

4. Make boxplots of BPXSY3 (Systolic: Blood pres (3rd rdg) mm Hg) and BPXDI3 (Diastolic: Blood pres (3rd rdg) mm Hg), each by ethnicity. Compare the trends across ethnic groups and interpret.

5. Make bar graphs for DUQ200 (Ever used marijuana or hashish), DUQ240 (Ever used cocaine/heroin/methamphetamine), DUQ290 (Ever used heroin), and DUQ330 (Ever used methamphetamine), respectively. Compare and interpret the trends.

6. You collect information on a random sample of women and obtain estimates of vitamin D exposure. However, your results differ from those obtained at another study site. Should you be concerned? Explain your answer using the concept of sampling variability.
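Sampling variability can be illustrated with a small simulation: two samples drawn from the same population will generally give different estimates, and the spread of those estimates shrinks as the sample size grows. The sketch below is illustrative only. The population (mean 60, standard deviation 20, arbitrary units), the sample size of 100, and all names are assumptions, not values from any study.

```python
import random
import statistics

random.seed(0)  # fixed seed so the run is reproducible

# Hypothetical population of vitamin D values (mean 60, SD 20, arbitrary units).
population = [random.gauss(60, 20) for _ in range(100_000)]


def sample_mean(n):
    """Mean of a simple random sample of size n drawn without replacement."""
    return statistics.mean(random.sample(population, n))


# Two "study sites" drawing samples of the same size from the same population.
site_a, site_b = sample_mean(100), sample_mean(100)

# Spread of the sample mean over many repeated samples.
# Theory gives a standard error of about SD / sqrt(n) = 20 / 10 = 2.
means = [sample_mean(100) for _ in range(1000)]

print(f"site A = {site_a:.1f}, site B = {site_b:.1f}")
print(f"SD of sample means = {statistics.stdev(means):.2f}")
```

A difference between sites that is small relative to this standard error is what chance alone would produce; a much larger difference would suggest looking at differences in populations or methods.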