The following table shows the details for some of the car models.
TABLE BELOW
Car
Horsepower
MPG
Chevrolet Chevelle Malibu
130.0
18.0
Buick Skylark 320
165.0
15.0
Plymouth Satellite
150.0
18.0
AMC Rebel SST
150.0
16.0
Ford Torino
140.0
17.0
Ford Galaxie 500
198.0
15.0
Chevrolet Impala
220.0
14.0
Plymouth Fury
215.0
14.0
Pontiac Catalina
225.0
14.0
AMC Ambassador DPL
190.0
15.0
Citroen DS-21 Pallas
115.0
0
Chevrolet Chevelle Concours
165.0
0
Ford Torino
153.0
0
Plymouth Satellite
175.0
0
AMC Rebel SST
175.0
0
Dodge Challenger SE
170.0
15.0
Plymouth ‘Cuda 340
160.0
14.0
Ford Mustang Boss 302
140.0
0
Chevrolet Monte Carlo
150.0
15.0
Buick Estate Wagon
225.0
14.0
Toyota Corolla Mark
95.00
24.0
Plymouth Duster
95.00
22.0
AMC Hornet
97.00
18.0
Ford Maverick
85.00
21.0
You are required to find out the following for the ‘Horsepower’ and ‘MPG’ column:
a. Mean
b. Median
c. Mode
d. Variance
e. Standard deviation
f. Skewness
g. Kurtosis
Why do we use correlation analysis? What can be the maximum and minimum values for correlation? Identify the correlation between Horsepower and MPG.
Instructions – Use MS-Excel to show your output. Write the formulae in the answer that you use.
Answers
Answer:
This are all the car companies
(i) For Horsepower:
a. Mean = 157.54
b. Median = 160
c. Mode = 150
d. Variance = 1727.65
e. Std Deviation = 41.56
f. Skewness = -0.045
g. Kurtosis = -0.6926
(ii) For MPG:
a. Mean = 12.45
b. Median = 16.5
c. Mode = 0
d. Variance = 60.78
e. Std Deviation = 7.79
f. Skewness = -0.78
g. Kurtosis = -0.69
Correlation analysis for a dataset or data table is used to better understand the patterns in data and analyse how the data set changes based on different factors.
Mean = sum of data value ÷ number of data
Median = The data value which is the midpoint of all given values
Mode = The data value that arises with the most frequency.
Variance = It is the expectation of the squared deviation of a random variable from its population mean or sample mean.
Standard Deviation = The standard deviation of a random variable, sample, statistical population, data set, or probability distribution is the square root of its variance.
Skewness = Skewness refers to a distortion or asymmetry that deviates from the symmetrical bell curve, or normal distribution, in a set of data.
Kurtosis = kurtosis is a measure of the "tailedness" of the probability distribution of a real-valued random variable.