Use data from the 2010 Alberta Survey (posted on the course website) to answer the following questions in a separate lab report. All labs must be typed and double-spaced in a 12-point font. Write your answer to each question in essay format, and number your answers to show which question each one addresses. When including tables, make sure that they are neat and organized. Where an answer can be calculated in more than one way, you only have to use one method (either calculate by hand or use the R function). Spelling, grammar, style, and mechanics will be graded, so make sure to proofread. Attach your R output to your lab report as an appendix.
Describe your dataset. Where do the data come from? How were they collected? What is the population of interest? What is your sample size?
Summarize the following variables using the appropriate measures of central tendency and dispersion. For interval-ratio variables you should include the mean, median, and standard deviation. For nominal and ordinal variables you should use frequencies and percentages. You may use tables to present your summary measures, but you also need to discuss them in paragraph form. Variable names are in parentheses.
(a) Age (age)
(b) Household size (hh.size)
(c) Years of schooling (yrs.educ)
(d) Sex (sex)
(e) Marital status (mar.stat2 - condensed version)
(f) Employed (employed)
(g) Location (location)
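As an illustration of the two kinds of summary requested above, the following is a minimal sketch in base R. The data frame name ab, the values in it, and the category labels "Female" and "Male" are made up for this example; only the variable names age and sex come from the list above.

```r
# Made-up stand-in data; the real values come from the 2010 Alberta Survey file.
ab <- data.frame(
  age = c(23, 35, 47, 52, 61, 38, 29, 70),
  sex = factor(c("Female", "Male", "Female", "Female",
                 "Male", "Male", "Female", "Male"))  # labels are assumed
)

# Interval-ratio variable: mean, median, and standard deviation
age_summary <- c(
  mean   = mean(ab$age, na.rm = TRUE),
  median = median(ab$age, na.rm = TRUE),
  sd     = sd(ab$age, na.rm = TRUE)
)
print(round(age_summary, 2))

# Nominal variable: frequencies and percentages
sex_freq <- table(ab$sex)
sex_pct  <- round(100 * prop.table(sex_freq), 1)
print(cbind(Frequency = sex_freq, Percent = sex_pct))
```

The same pattern applies to the other variables: mean(), median(), and sd() for interval-ratio variables, and table() with prop.table() for nominal and ordinal ones.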
Note: When testing hypotheses, make sure to discuss each step in the process.
According to the 2011 Census of the Canadian population, the average age for adults 18 and over was 47.59 years. Is there a significant difference between the mean age of adults in Alberta and the mean age of adults in all of Canada? How can you tell? Dataset variable - age
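One way to approach a question of this type is a one-sample t-test of the sample mean against the census value. The sketch below uses synthetic ages generated with rnorm() in place of the survey data (the data frame name ab and all values are assumptions), and shows that the by-hand statistic matches what t.test() reports.

```r
# Synthetic ages standing in for the survey variable; not real data.
set.seed(1)
ab <- data.frame(age = round(rnorm(200, mean = 49, sd = 16)))

mu0 <- 47.59  # census mean age quoted in the question

# Built-in one-sample t-test (two-sided by default)
age_test <- t.test(ab$age, mu = mu0)
print(age_test)

# The same test statistic by hand: (sample mean - mu0) / standard error
n         <- sum(!is.na(ab$age))
std_error <- sd(ab$age, na.rm = TRUE) / sqrt(n)
t_by_hand <- (mean(ab$age, na.rm = TRUE) - mu0) / std_error
p_by_hand <- 2 * pt(-abs(t_by_hand), df = n - 1)
```

Either route gives the same t statistic and p-value, so only one needs to appear in the report.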
According to the 2011 Census of the Canadian population, 6.05% of adults were divorced at the time of the census. Is the divorce rate different in Alberta? Is there a significant difference between the proportion of divorced adults in Alberta and the proportion in Canada? How can you tell? Dataset variable - mar.stat1
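A one-sample test of a proportion can be sketched as follows. The counts, the data frame name ab, and the category labels (including "Divorced") are invented for this example; check the codebook for the actual coding of mar.stat1. The z statistic is computed by hand and compared with prop.test(), whose chi-square statistic equals z squared when the continuity correction is turned off.

```r
# Invented counts and labels; the real categories are in the codebook.
ab <- data.frame(
  mar.stat1 = rep(c("Divorced", "Married", "Never married"),
                  times = c(36, 300, 164))
)

p0 <- 0.0605  # census proportion divorced quoted in the question

n     <- sum(!is.na(ab$mar.stat1))
x     <- sum(ab$mar.stat1 == "Divorced", na.rm = TRUE)
p_hat <- x / n

# z statistic by hand, using the null proportion in the standard error
z       <- (p_hat - p0) / sqrt(p0 * (1 - p0) / n)
p_value <- 2 * pnorm(-abs(z))

# Built-in check: without continuity correction, X-squared equals z^2
div_test <- prop.test(x, n, p = p0, correct = FALSE)
print(div_test)
```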
As noted in the codebook, the 2010 Alberta Survey contains samples of residents in Edmonton, Calgary, and other areas of Alberta. For this question, you are to compare people from Calgary and Edmonton. Do education levels, as measured by years of schooling, differ across these cities? How can you tell? Dataset variables - location and yrs.educ
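Comparing two group means can be sketched with a two-sample t-test after restricting the data to the two cities. Everything below other than the variable names location and yrs.educ is an assumption: the data are synthetic, the data frame is called ab for convenience, and the location labels may be coded differently in the codebook. Note that t.test() runs the Welch (unequal variances) version by default; set var.equal = TRUE for the pooled-variance version if that is the one covered in class.

```r
# Synthetic data; location labels are assumed, not taken from the codebook.
set.seed(2)
ab <- data.frame(
  location = factor(rep(c("Calgary", "Edmonton", "Other Alberta"), each = 60)),
  yrs.educ = round(c(rnorm(60, 15, 3), rnorm(60, 14.5, 3), rnorm(60, 13.5, 3)))
)

# Keep only the two cities and drop the unused third level
cities <- droplevels(subset(ab, location %in% c("Calgary", "Edmonton")))

# Group means, then the two-sample t-test (Welch by default)
print(tapply(cities$yrs.educ, cities$location, mean))
educ_test <- t.test(yrs.educ ~ location, data = cities)
print(educ_test)
```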
As noted in the codebook, the 2010 Alberta Survey contains samples of residents in Edmonton, Calgary, and other areas of Alberta. For this question, you are to compare people from Calgary and Edmonton. Do employment rates, as measured by the proportion of people employed, differ across these cities? How can you tell? Dataset variables - location and employed
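A comparison of two proportions follows the same subsetting step and then uses prop.test() with one count of successes and one sample size per city. The counts and the labels "Yes" and "No" for employed are invented for this sketch, as is the data frame name ab.

```r
# Invented counts; the coding of employed and location is assumed.
ab <- data.frame(
  location = factor(rep(c("Calgary", "Edmonton", "Other Alberta"),
                        times = c(100, 100, 50))),
  employed = c(rep(c("Yes", "No"), times = c(68, 32)),   # Calgary
               rep(c("Yes", "No"), times = c(61, 39)),   # Edmonton
               rep(c("Yes", "No"), times = c(30, 20)))   # Other Alberta
)

cities <- droplevels(subset(ab, location %in% c("Calgary", "Edmonton")))

# Cross-tabulate city by employment status
tab <- table(cities$location, cities$employed)
print(tab)

# Two-sample test of proportions: employed counts and group sizes per city
emp_yes  <- tab[, "Yes"]
emp_n    <- rowSums(tab)
emp_test <- prop.test(x = emp_yes, n = emp_n, correct = FALSE)
print(emp_test)
```

The estimate component of the result holds the two sample proportions, which are worth reporting alongside the test statistic.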
Does the age of the population in Alberta differ by marital status? Are the average ages for populations of never married, married, and formerly married people the same? Use the 2010 Alberta Survey and an ANOVA test to determine whether age varies by marital status. Make sure to interpret your findings. Dataset variables - mar.stat2 and age
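A one-way ANOVA in base R can be sketched with aov(). The ages below are synthetic, the data frame name ab is arbitrary, and the three mar.stat2 labels are taken from the wording of the question rather than from the codebook, so they may not match the actual coding.

```r
# Synthetic ages; category labels are assumed from the question wording.
set.seed(3)
ab <- data.frame(
  mar.stat2 = factor(rep(c("Never married", "Married", "Formerly married"),
                         each = 50)),
  age = round(c(rnorm(50, 32, 10), rnorm(50, 48, 13), rnorm(50, 58, 14)))
)

# Group means give context for interpreting the F test
print(tapply(ab$age, ab$mar.stat2, mean))

# One-way ANOVA: does mean age differ across marital status groups?
age_aov   <- aov(age ~ mar.stat2, data = ab)
aov_table <- summary(age_aov)[[1]]
print(aov_table)
```

The table reports the between-group and within-group degrees of freedom, sums of squares, the F statistic, and its p-value. A significant F says only that at least one group mean differs, not which one.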
