401077 Introduction to Biostatistics, Autumn 2019
Due Sunday June 9, 2019
Please answer each question in the template document provided and submit via Turnitin on or before the due date. The marks allocated to each question are shown in the assignment. A total of 40 marks are available and this assignment is worth 40% of your overall grade.
Question 1 (18 marks)
Read the paper Weston, G., Zilanawala, A., Webb, E., Carvalho, L. A. &McMunn, A. (2019). Long work hours, weekend working and depressive symptoms in men and women: findings from a UK population-based study. J Epidemiology Community Health, 0, 1-10.
Critically appraise of the statistical material in this paper against items 10, 12-17 of the STROBE checklist. Present your review as a 400-500 word (approx.) report.
Question 2(22 marks)
Using R Commander and the data set from the sample of full-time workers in Sydney assigned to you address the following research questions:
Note: To answer this question, you need to use R Commander and the data set assigned to you for assignments. This data set contains the (fictitious) data from a random sample of full-time workers in Sydney, Australia and is the same data set as that you have previously used in Assignments 1 and 2. See ‘Description of your data set.docx’ for the descriptions of the variables.
Note: This assignment is assessing your skills, not the skills of the computer. You will need to include graphs from R Commander into your assignment but all other R Commander output will attract 0 marks and is discouraged. It is your task to identify the relevant results in the R Commander output and write these up in your assignment.
“When submitting your assignment to Turnitin you are implicitly ticking these statements:
I retain a backup file of this assignment in case the original file is lost or damaged.
I hereby certify that no part of this assignment or product has been copied from any other student’s work or from any other source except where due acknowledgement is made in the assignment.
I hereby certify that no part of this assignment or product has been submitted by me in another (previous or current) assessment.
I hereby certify that no part of the assignment has been written or produced by any person.
I hereby certify that no part of this assignment has been made available to any other student.
I am aware that this work will be reproduced and submitted to plagiarism detection software for the purpose of detecting possible plagiarism. This software may retain a copy of this assignment on its database for future plagiarism detection.
I understand that failure to uphold this declaration may result in academic proceedings in line with the UWS Student Academic Misconduct Policy.”
Provide your appraisal of the strengths and weaknesses of the presentation of the statistical material in Weston et al (2019) against items 10, 12-17 of STROBE (about 400-500 words, 18 marks)
Answer 1: Item 10 – Seperated/divorced/widowed: Marital status impacts a lot on how long a person works. We can see that if they are married then they work for extra hours as compared to when they are separated. We can also see that if Men are separated/divorced/widowed it impacts on how long they work in weekday as well as on weekends. The numbers tell us that they work less than women on weekdays and weekends which results in depression of more men than women.
Item 12: Children None: If a person doesn’t have any children then their working hours are above average and they put in extra working hours on weekdays as well as on weekends and we can relate it with the study and say that in this category women working more than 55 hour/week are more depressed than men.
Item 13: Children 0-4 Years: Children affects the working hours of both men and women but if we only compare them on how long they work given the situation that they have a child at home, we can see that men give extra hours on weekdays and weekends as compared to women wherein if a woman is <35 years then they give extra hours on weekdays as well as weekends but if they are >35 years then they tend to give extra hours on weekends than on weekdays.
Item 14: Children 5-9 years: Children affects the working hours of both men and women but if we only compare them on how long they work given the situation that they have a child at home, we can see that it’s the opposite of what was in 0-4 years. Men give less hours on weekdays as well as weekends whereas if a woman is <35 years then they give extra hours on weekdays but not on weekends and if they are >35 years then they do not give any extra hours on weekdays or weekends.
Item 15: Children 10-15 years: Children affects the working hours of both men and women but if we only compare them on how long they work given the situation that they have a child at home, it is similar to what we understood in 5-9 years, where men don’t give extra hours on weekdays as well as on weekends but women gives long working hours on weekdays as well as weekends. We can say that women work more than men in this category.
We can relate this with the results that the depression symptoms in women is more than men if women are working for more than 55 hours/week but it is opposite in the case of men where the depression rate is lower if they are working more than 55 hours/week.
Item 16: Degree: It is seen that those having degree work more than average hours on weekdays as well as weekends compared to those having A level, GSCE or other qualification. This means that the depression rate in the other categories other than Degree holders is more as they don’t have work to do and they sit idle more often or not.
Question 2: Note: Students will get different answers as the data sets differ.
Present the findings of your descriptive analyses (graphs, tables and about 100-150 words, 8 marks)
I have trained two different linear regression models to predict workhours from age. First model is trained on data of male workers and second is trained on data of female workers.
For linear regression model for male, size of train and test set is:
For linear regression model for female, size of train and test set is:
The average work hours mentioned bellow are noted for test data, as evaluating model in train data is not a good idea.
Graph of Linear regression model for male:
Graph of Linear regression model for female:
Present the findings of relevant regression models and inferential analyses (about 150-200 words, 10 marks)
Coefficients of linear regression model for males:
Intercept = 43.289958, age coefficient = - 0.008368
Coefficients of linear regression model for females:
Intercept = 37.01796, age coefficient = - 0.00235
Provide your answer to the research question (40-80 words, 4 marks)
So, difference between male and female full-time workers on average in Sydney after correcting for age is 6.03952.
References, optional (0 marks)