Professional Documents
Culture Documents
19.956% 19.956%
21.175% 19.900%
19.013%
cumulative percentage
100%
number of candies
370
80%
360
60%
350
40%
340
330 20%
320 0%
Green Purple Red Orange Yellow
Color
I observed from the pie chart and pareto chart that green skittle was the highest
colored skittle in the sample size and yellow skittles was the least. Comparing this
data with the data of Elian F, the combined data of the whole class contradicts that
personal data because green was rather the least colored skittles meanwhile purple
was the highest as shown below
Red Orange Yellow green Purple
class data 360 359 343 382 360
class data percent 19.956 19.9 19.013 21.175 0.19.956
my data (EF) 14 12 6 9 23
my data percent 21.875 18.75 9.375 14.062 35.938
Organizing and Displaying Quantitative Data: The Number of Candies per Bag
Mean 60.1
Standard Deviation 3.02
The five-number summary is shown below.
Minimum 51
Quartile 1 58
Median 60.5
Quartile 3 62
Maximum 65
Frequency histogram for total candies per bag
16
14
12
10
frequency
0
40 45 50 55 60 65 70 75
Reflection
Quantitative data are data which measurable and can be quantified such as mass,
number of skittles in a bag etc. while categorical data are data which are grouped
according to common characteristics and properties such as male, female, color
etc.
For categorical data, graphs like bar charts, pie charts because they convey
information with regards to the relative size of groups, they also show percentages
of different categories and allow comparison between categories. Thus, making it
ideal graphs for categorical data.
On the other hand, graphs for quantitative data includes histograms, box plots,
stem and leaf, and scatter plots because you can count the number of values in each
interval which is the class frequency. Also, with these graphs, you can count and
allocate percentages of each individuals in that group making them good
quantitative graphs.
we are 99% confidence that the interval between 0.166 and 0.214 does contain the
true value of the population proportion p. This means that if we were to select
many different samples of size 1804 and construct the corresponding confidence
intervals, 99% of them would actually contain the value of the population
proportion p.
Construct a 95% confidence interval estimate for the true mean number
of candies per bag.
n=30
x=60.13
s=3.07
t/2=2.045
df=29
E=1.15
Interval = 58.98<<61.28
we are 95% confident that the interval between 58.98 and 61.28 does contain the
true value of . This means that if we were to select many different samples of the
same size and construct the corresponding confidence intervals, overall 95% of
them would actually contain the value of .
Construct a 98% confidence interval estimate for the standard
deviation of the number of candies per bag.
/2=0.01
n=30
s=3.07
d.f=29
^2R=38.932
^2L=8.897
=(2.650, 5.543)
we have 98% confidence that the limits of 2.650 and 5.543 contain the true value
of .
Hypothesis test.
Use a 0.05 significance level to test the claim that 20% of all Skittles candies
are red.
Claim: 20% of all Skittles candies are red, with a 0.05 significance level
H: P=0.20
H: P0.20 (two tailed test)
=0.05
P=0.1996
q=0.8004
p=0.2
n=1804
z= -0.04
p value = 0.968
From the p value method, Since p value> it implies do not reject H
Use a 0.01 significance level to test the claim that the mean number of
candies in a bag of Skittles is 55.
Claim: The mean number of Skittles in a 2.17 oz bag is 55. (p=55)
H: p = 55
H: p 55 (two tailed test)
=0.01
/2=0.005
x=60.13
s=3.07
=55
df=29
t/2=2.756
Reject H
There is sufficient evidence to warrant the rejection of the claim that the mean
number of candies in 2.17oz bag of Skittles is 55.
Reflection.
The conditions needed for doing interval estimate and hypothesis thesis was that
the sample should be a simple random sample and should be normally distribute,
the samples should be independent of each other. Our samples met this conditions
because firstly, the values are independent of each other and secondly it was a
normal distribution because n>30, with our sample size being 1804. That
notwithstanding, errors made using this data include counting half skittles and
mistaking orange skittles for red. The sampling methods can in increase by
increasing the sample size to lower the margin of error and increase the percentage
confidence interval I have also drawn the conclusion that each color of Skittle is
evenly proportioned from one bag to the next and the overall mean of the skittle
bag is almost the same as the individual mean of the skittles.
REFLECTION
During our math class we were given a project which was to get data on the colors
of candy from a 2.17oz skittle bag. The project was divided into three parts, the
first part was data collection and analysis: quantitatively and qualitatively. The
second part is the confidence interval estimates and the last part being the
hypothesis testing.
Doing this project has helped me apply what I learnt in class theoretically into a
something practical and realistic. I learnt how to collect skittle data from scratch
from organizing them into different colors (categorical statistics), analyze these
data, categorize it and quantitatively represent it using a histogram. And I further
went to determine how accurate was the data collected by determining the margin
of error and constructing confidence intervals at 99%, 98% and 95% confidence
interval.
The math skills I learned from his project will help me go further in class and has
actually relieve the myth I had about mathematics being super difficult and
statistics in particular. That is because before I took this class, a friend of mind told
me its a miracle to have an A in math statistics but after learning what I did in
class and applying it in real life, I have come to a conclusion that math is lively and
I even plan on changing from biology major to math major and explore more about
math and real life and how much fun math has to offer.
In a nutshell, this project has helped me have a wide horizon on what math is and
not only that, it has helped me to know math is not all about solving for x and y
which we are never going to see x in our day to day lives, but it helps me to
broaden my brain and see life in a different perspective not only in further math
classes but throughout my career ladder.