Exhibit 1.1: Frequency Distribution and Graphing with Excel
Figure 1: TechTicket ticket agency sales in the last 240 days
Q1 (b)
- The shape of the histogram columns is normal distribution with a bell-shaped symmetrical density curve. Thus, the mean of the distribution equals its median (Berenson, 2014). The chart also has one peak at the center.
- The shape of the graph can be altered by changing the class width. For instance, increasing the class width to 1,500 gives a new graph as shown below.
Figure 2: New graph after increasing the class width to1,000
Increasing the class width from 500 to 1,000 reduces the number of histogram columns. However, the shape of the distribution does not change; the shape of the histogram columns is normal distribution with a bell-shaped symmetrical density curve, mean of the distribution equals its median and the chart also has one peak at the center.
Q1 (c)
The assumption made here is that the data is normally distributed. With normal distribution, the data lean towards to a central value with no bias left or right (Gupta, 2016).
To obtain the mean and standard deviation of the data in the Exhibit 1;
Step 1: We first calculate the midpoints of the classes using the formula
Midpoint of interval = 0.5 (Lower class limit + Upper class limit)
Step 2: We multiply the frequency of each interval by its mid-point
Step 3: We obtain the sum of all the frequencies (f) and the sum of all the fx.
Table 1: Frequency table for Calculation of mean (See excel calculations)
Sales ($) |
Number of days |
Mid-point (x) |
Frequency (f) |
fx |
|
0 – 1,000 |
5 |
500 |
5 |
2500 |
|
1,000 – 1,500 |
16 |
1250 |
16 |
20000 |
|
1,500 – 2,000 |
27 |
1750 |
27 |
47250 |
|
2,000 – 2,500 |
65 |
2250 |
65 |
146250 |
|
2,500 – 3,000 |
44 |
2750 |
44 |
121000 |
|
3,000 – 4,000 |
48 |
3500 |
48 |
168000 |
|
4,000 – 5,000 |
25 |
4500 |
25 |
112500 |
|
5,000 – 10,000 |
10 |
7500 |
10 |
75000 |
|
|
240 |
692500 |
To obtain the mean, we divide ‘sum of fx’ by ‘sum of f’ which in this case is 240 days. Thus the mea of the data in Exhibit 1 is given by
Mean=692,500/240= 2885.416667 ~ $2,885/day
The Standard Deviation of the same data is calculated using the formula
Table 1: Frequency table for calculation of standard deviation (See excel calculations)
Sales ($) |
Number of days |
Mid-point (x) |
Frequency (f) |
fx |
|
|
|
|||
0 – 1,000 |
5 |
500 |
5 |
2500 |
-385 |
148546 |
742730.0347 |
|||
1,000 – 1,500 |
16 |
1250 |
16 |
20000 |
20000 |
400000000 |
6400000000 |
|||
1,500 – 2,000 |
27 |
1750 |
27 |
47250 |
47250 |
2232562500 |
60279187500 |
|||
2,000 – 2,500 |
65 |
2250 |
65 |
146250 |
146250 |
21389062500 |
1.39029E+12 |
|||
2,500 – 3,000 |
44 |
2750 |
44 |
121000 |
121000 |
14641000000 |
6.44204E+11 |
|||
3,000 – 4,000 |
48 |
3500 |
48 |
168000 |
168000 |
28224000000 |
1.35475E+12 |
|||
4,000 – 5,000 |
25 |
4500 |
25 |
112500 |
112500 |
12656250000 |
3.16406E+11 |
|||
5,000 – 10,000 |
10 |
7500 |
10 |
75000 |
75000 |
5625000000 |
56250000000 |
|||
|
240 |
692500 |
3.82858E+12 |
In our case, from Excel (see the attached worksheet), S if given by the
3.66865E+12/240 =123,636.7573~$123,636/day
Q2
From the report, two measures of central tendency could have been conducted; mode and median. This is because using mean would have skewed the results to the middle point of the 7 point scale and in this case the score would have been 4 and not 3.1 as given. For mode, the doctor could have looked at the most repeated score. With median, the doctor could easily picked the value that is at the middle which would have been 4 and not 3.1 as given.
Q3 (a)
Table 3: Tabulations of statistics (Please see the attached Excel doc.)
Mel |
Syd |
|
Count |
70.0 |
70.0 |
Mean |
65.8 |
67.1 |
Median |
65.8 |
66.5 |
Standard Deviation |
4.0 |
6.1 |
Minimum, |
55.7 |
55.8 |
Maximum, |
77.0 |
81.8 |
Range, |
21.3 |
26.0 |
1st Quartile, |
63.9 |
62.3 |
3rd Quartile |
68.5 |
71.5 |
IQR |
4.6 |
9.1 |
Coefficient of Variation |
6.1 |
9.1 |
Q3 (b) (i)
Table 3: A percentage frequency table (Please see the attached Excel doc.)
Syd |
Percentage Value |
Mel |
Percentage Value |
52.5 |
0% |
52.5 |
0% |
55 |
0% |
55 |
0% |
57.5 |
4% |
57.5 |
3% |
60 |
4% |
60 |
6% |
62.5 |
21% |
62.5 |
7% |
65 |
14% |
65 |
23% |
67.5 |
7% |
67.5 |
29% |
70 |
14% |
70 |
19% |
72.5 |
14% |
72.5 |
10% |
75 |
7% |
75 |
3% |
77.5 |
7% |
77.5 |
1% |
Q3 (b) (ii)
The average completion time for Sydney factory is 62.4 minutes whereas for Melbourne factory is 67.5 minutes.
Q4 (a)
Minimum-10
Maximum-200
Range-190
Median-45
1st Quartile (Q1)-30
3rd Quartile (Q1)-60
IQR-30
Q4 (b)
-The shape of the graph is skewed to the right.
-The highest sales was realized in 1997 i.e. 3500 which is the highest point of the graph.
-Sales declined in year 2001.
References
Berenson, M. L. (2014). Basic Business Statistics (5th ed.). Pearson Education Limited.
Gupta, B. (2016). Introduction to Basic Statistics. Interview Questions in Business Analytics, 6(2), 23-35. doi:10.1007/978-1-4842-0599-0_3