There are two datasets involved in this assignment: Dataset 1 and Dataset 2, detailed below.

Dataset 1: You will receive an email containing a dataset allocated specifically to you. This dataset is a subset of the Service Station and Price History September 2016 individual sample file, provided by Australian Government Open Data, and has been edited to include only a subset of the cases and variables.

Dataset 2: Collect data (e.g. via a survey) that will answer your research question in Section 4. There are no requirements on the number of variables, the sampling method or the sample size beyond those stated in Section 4, but you need to justify your approach in Section 1 (see below).

Prepare a report as a document file (.doc or .docx) that includes all relevant tables and figures, using the following structure:

1. Section 1: Introduction
a. Give a brief introduction to the assignment. Find a related article and write a one-paragraph summary of it that supports your report. Give the full citation of the article.
b. Dataset 1: Give a short description of this dataset. Is it primary or secondary data? What type(s) of variable are involved? Briefly explain what the cases are in this study.
c. Dataset 2: Explain how you collected the data and discuss its limitations (e.g. whether your sample is biased). Is it primary or secondary data? What type(s) of variable are involved? Describe the cases you consider for this dataset.

2. Section 2: Analysis of a single variable in Dataset 1
a. To answer the research question “What is the shape of the distribution of the variable Price?”, provide a suitable numerical summary and graphical display for the variable Price in Dataset 1. Comment in detail to answer the research question, using all of your outputs.
b. To answer the research question “Is the average price of petrol across all service stations in September 2016 more than 115 Australian cents?”, set up appropriate hypotheses, perform the hypothesis test following all of its steps, and answer the research question by writing the conclusion of the test.
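
For illustration only, the calculations behind parts a and b can be sketched in Python using just the standard library. The assignment does not prescribe any software, so the language, the function names `summarise` and `one_sample_t`, and the ten price values below are assumptions made for this sketch. The values are made up and are not taken from Dataset 1.

```python
import math
import statistics

# Hypothetical prices in Australian cents per litre (NOT taken from Dataset 1).
prices = [112.9, 118.5, 121.9, 109.9, 116.7, 119.9, 114.5, 123.9, 117.3, 120.5]


def summarise(values):
    """Return a numerical summary for one quantitative variable."""
    # statistics.quantiles with n=4 returns the three quartile cut points.
    q1, median, q3 = statistics.quantiles(values, n=4)
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "sd": statistics.stdev(values),  # sample standard deviation
        "min": min(values),
        "q1": q1,
        "median": median,
        "q3": q3,
        "max": max(values),
    }


def one_sample_t(values, mu0):
    """Return the t statistic and degrees of freedom for H0: mean = mu0."""
    n = len(values)
    standard_error = statistics.stdev(values) / math.sqrt(n)
    t_stat = (statistics.mean(values) - mu0) / standard_error
    return t_stat, n - 1


summary = summarise(prices)
t_stat, df = one_sample_t(prices, mu0=115)

for name, value in summary.items():
    print(f"{name:>6}: {value:.2f}")
print(f"t = {t_stat:.3f} on {df} degrees of freedom")
```

Comparing the mean with the median, and the quartiles with the minimum and maximum, gives a first indication of the shape of the distribution; a histogram or boxplot of Price would supply the graphical display. For part b, the t statistic is compared with the one-sided critical value of the t distribution with `df` degrees of freedom (alternative hypothesis: mean greater than 115), or converted to a p-value with statistical software.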

3. Section 3: Analysis of two variables in Dataset 1
NRMA always reports to the media by comparing the price of petrol across the major service station brands, namely Caltex, Caltex Woolworths, Coles Express and 7-Eleven.
a. Give a numerical summary and an appropriate graphical display comparing the price of petrol across these four major brands.
b. Perform a suitable hypothesis test at the 5% level of significance to test whether there is a price difference among these four major brands.
c. Use the conclusion from part b and the outputs from part a to write an accurate statement about petrol prices. Your answer should state whether there is a price difference and, if there is, identify which brand has the lowest price.
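
One test that suits a comparison of a quantitative variable across four groups is a one-way analysis of variance (ANOVA), provided its assumptions are reasonable for your data. The sketch below shows how its F statistic is built from the group means, again in standard-library Python. The choice of ANOVA, the function name `one_way_anova` and the prices listed are assumptions for illustration; the prices are made up and are not taken from Dataset 1.

```python
import statistics

# Hypothetical prices in cents per litre for each brand (NOT taken from Dataset 1).
prices_by_brand = {
    "Caltex": [118.9, 121.5, 119.9, 122.3],
    "Caltex Woolworths": [116.9, 118.5, 117.9, 119.1],
    "Coles Express": [120.9, 122.5, 121.9, 123.1],
    "7-Eleven": [114.9, 116.5, 115.9, 117.1],
}


def one_way_anova(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA."""
    all_values = [x for group in groups for x in group]
    grand_mean = statistics.mean(all_values)
    # Variation of the group means around the grand mean.
    ss_between = sum(
        len(group) * (statistics.mean(group) - grand_mean) ** 2 for group in groups
    )
    # Variation of the observations around their own group mean.
    ss_within = sum(
        (x - statistics.mean(group)) ** 2 for group in groups for x in group
    )
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Part a: numerical summary per brand.
brand_means = {brand: statistics.mean(p) for brand, p in prices_by_brand.items()}
for brand, mean in brand_means.items():
    print(f"{brand:<20}{mean:>8.2f}")

# Part b: test statistic for H0: all four brand means are equal.
f_stat, df_between, df_within = one_way_anova(list(prices_by_brand.values()))
print(f"F = {f_stat:.2f} on ({df_between}, {df_within}) degrees of freedom")

# Part c: brand with the lowest sample mean (meaningful only if H0 is rejected).
lowest_brand = min(brand_means, key=brand_means.get)
print("Lowest mean price:", lowest_brand)
```

The F statistic is compared with the 5% critical value of the F distribution with (`df_between`, `df_within`) degrees of freedom, or converted to a p-value with statistical software. Side-by-side boxplots of price by brand would serve as the graphical display for part a.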

4. Section 4: Collection and analysis of Dataset 2
Choose at least 30 KOI students, find out which service station they prefer to buy petrol from, and provide an appropriate numerical and graphical summary. Use these outputs to write a comment.
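
Because the preferred service station is a categorical variable, a frequency table with percentages and a bar chart are a natural summary. The sketch below, in standard-library Python, builds such a table and prints a simple text bar chart. The function name `frequency_table` and the 30 responses are assumptions for illustration; the responses are made up and are not real survey data.

```python
from collections import Counter

# Hypothetical survey responses from 30 students (NOT real data).
responses = (
    ["Caltex"] * 8
    + ["Caltex Woolworths"] * 7
    + ["Coles Express"] * 6
    + ["7-Eleven"] * 9
)


def frequency_table(answers):
    """Return (category, count, percentage) rows, most frequent first."""
    counts = Counter(answers)
    total = sum(counts.values())
    return [
        (category, count, 100 * count / total)
        for category, count in counts.most_common()
    ]


table = frequency_table(responses)
for category, count, percentage in table:
    # One '#' per response gives a rough text bar chart.
    print(f"{category:<20}{count:>3}{percentage:>7.1f}%  {'#' * count}")
```

The same counts can be plotted as a bar chart in whichever software you use for the report.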

5. Section 5: Discussion and conclusion
a. Write an executive summary combining all of your findings from the previous sections. It must be valuable for NRMA to report to the media.
b. Give a suggestion for further research.

