AFE135 SP 3 – Business Data Analysis Computer Assignment

AFE135 SP 3 – Business Data Analysis

Computer Assignment
(Worth 20% of the overall assessment)

Due date: 11pm (AEST) on Sunday, 28 October 2018 (Week 9)

• All numerical calculations and graphs/plots should be done using EXCEL.
• Your assignment must be submitted in PDF format only!
• When answering questions, wherever required, you should cut and paste the Excel output (eg, plots, regression output etc) to show your working/output.
• Submit your assignment through Safe Assign in the L@G course website, under the Assessment 2, Computer Assignment before the due date.
• You are required to keep an electronic copy of your submitted assignment to re-submit, in case the original submission is failed and/or you are asked to resubmit.
• Please check your email prior to reporting your assignment mark regularly for possible communications due to failure in your submission.

Important Notice:
All assignments submitted undergo plagiarism checking; if found to have cheated, all involving submissions would receive a mark of zero for this assessment item.

Please read below information carefully and respond all questions listed.
Many college instructors believe that students need to spend at least 2 hours studying outside of class for every hour of lecture. They believe that the number of hours students study to prepare for the exam affect students’ marks significantly. As opposed, some believe that the number of preparation hours do not essentially affect students’ marks while some other factors are to be considered. To study the relationship between the preparation time spent by each student (in hours) for the exam and the reported mark, a sample of 100 students were selected randomly from a large statistics class. The data are stored in the file named “ASSIGNMENTDATA.XLS” in the course website. Using EXCEL, answer below 9 questions:
1. What type of survey method could be used? Explain your answer. (1 mark)
2. What sampling method could be used to select the sample? Explain your answer. (1 mark)
3. On the basis of given data, determine the dependent and independent variables we should use. Also, identify the data type(s) for each variable. (1.5 marks)
4. What kind of issues we may face in collecting the data using this type of survey method? List and explain two cases. (1 mark)
5. Using 8 classes and intervals of 20 – 30, 30 – 40, etc for both of the variables selected in question 3, develop a distribution table including class intervals, frequency, relative frequency and cumulative relative frequency for each variable. Then, draw frequency histogram, relative frequency histogram and cumulative relative frequency histogram for each variable. Also, Comment on the shape of frequency histogram for each variable and provide reason(s) for your comment. (5.5 marks)
6. Draw and use an appropriate scatter plot to investigate the relationship between the two variables. Also, briefly explain the selection of each variable on the X and Y axes and the reason? Finally, draw the fitting line for the plotted observations. (2.5 marks)
7. Present the equation of the estimated fitting line (regression) in your answer to Question 6. Then, estimate the effect of an increase in the independent variable by one unit on the dependent variable. Also, interpret your estimation. (2.5 marks)
8. Prepare a numerical summary report about the data on the two variables by including the mean, median, range, variance, standard deviation, smallest and largest values, quartiles, interquartile range and the 30th percentile for each variable. (3.5 marks)
9. Compute a numerical measurement which measures the strength and direction of the linear relationship between the two variables. Also, interpret this value. (1.5 marks)