Assignment – {HASO}

Description

In this third assignment, you’ll have to solve several applied tasks. This means you’ll have to use R in some parts to analyze data sets. In other parts, you’ll need to answer theoretical questions based on the slides provided in class. I will also provide the book chapters on Canvas. When I need you to pay attention to a detail in a book chapter, I’ll mention the chapter and page where you can find the detail I need you to pay attention.

Second Part: Short answer questions

In this section, I expect short answers, where you explain in your own words relevant concepts:

Explain in your own words how to transform a nominal variable into several dummy coded variables. You may provide an example to make your explanation easier to follow. (20 points)
Describe the assumptions of the Classical Regression Model, you may check the lecture Correlation and Regression Models Part 2. (15 points)

Third Part: Playing with `R` and perhaps `JAMOVI`.

In this part you will use R to answer several questions, the questions are a guide into different steps when performing an ANOVA.

You will use the data file named anovaData.csv, you may download the file from here.

You may also open the data file in R after running the following code:

url <- "https://raw.githubusercontent.com/blackhill86/mm2/refs/heads/main/dataSets/anovaData.csv"

camcog <- read.csv(url)

The data for this exercise is real data from an experimental intervention performed in Costa Rica in 2011. In this study, we aimed to answer the following question:

Will an intervention focus on improving autobiographical memory delay dementia symptoms?

Based on this question, we design an experiment where we assign aging adults to 4 different conditions:

Condition A: Participants with mild cognitive impairment received the intervention.
Condition B: Participants with mild cognitive impairment did not receive the intervention.
Condition C: Healthy participants without cognitive impairment received the treatment.
Condition D: Healthy participants without cognitive impairment did not receive the treatment.

We measured the cognitive performance of the participants as dependent variable. The cognitive performance was measured before the intervention started, and it was measured again once the intervention finished. We utilized the Cambridge Cognition Examination (CAMCOG) to determine their cognitive performance. This is a long battery of tests, and at the end you can compute a total score that represents the cognitive status.

Higher scores in the CAMCOG are evidence of better cognitive performance, while lower scores represent a low cognitive performance.

After this explanation I hope the content of the data is starting to make sense. In the data set anovaData.csv you will find the following columns:

ID: The study identification number for each participant.
CAMCOG_pre: the total score before the intervention started.
CAMCOG_post: the total score after the intervention finished.
Group: The intervention group where:
- A: Participants with mild cognitive impairment received the intervention.
- B: Participants with mild cognitive impairment did not receive the intervention.
- C: Healthy participants without cognitive impairment received the treatment.
- D: Healthy participants without cognitive impairment did not receive the treatment.

Report the means and standard deviations of the CAMCOG_post by Group, report the table from R. In addition, create a plot in R showing the mean values of CAMCOG_post by Group. (20 points)
Interpret the estimated mean values. Do you see a large difference in the mean score when comparing healthy aging adults versus aging adults with dementia? Do you observe in the bar plot a remarkable difference between groups A and B? ( 5 points)
ANOVA has two important assumptions, constant variance and normality. Perform the Shapiro-Wilk’s test, to asses the normality assumption, and the Levene’s test to evaluate the assumption of constant variance. (10 points).
Perform an omnibus ANOVA analysis to determine if there is any mean difference not explained by chance alone. Paste the table from R in your answer (10 points).
According to the \(p\)-value. What is your conclusion? (5 point)
At this point we have performed a omnibus test, this test does not tell which group differs beyond chance from the other groups. Perform a post-hoc analysis to determine where are the differences between groups and which differences are not explained by chance alone. Report the table generated in R. After that, interpret the results. (5 points)

Description

Part 1: Review important concepts

Second Part: Short answer questions

Third Part: Playing with R and perhaps JAMOVI.

Third Part: Playing with `R` and perhaps `JAMOVI`.