CHAPTER FIFTEEN

# MULTIPLE REGRESSION

## MULTIPLE CHOICE QUESTIONS

In the following multiple choice questions, circle the correct answer.

1. For a multiple regression model, SSR = 600 and SSE = 200. The multiple coefficient of determination is
2. 0.333
3. 0.275
4. 0.300
5. 0.75

1. In a multiple regression analysis involving 15 independent variables and 200 observations, SST = 800 and SSE = 240. The coefficient of determination is
2. 0.300
3. 0.192
4. 0.500
5. 0.700

1. A regression model involved 5 independent variables and 136 observations. The critical value of t for testing the significance of each of the independent variable’s coefficients will have
2. 121 degrees of freedom
3. 135 degrees of freedom
4. 120 degrees of freedom
5. 4 degrees of freedom

1. In order to test for the significance of a regression model involving 3 independent variables and 47 observations, the numerator and denominator degrees of freedom (respectively) for the critical value of F are
2. 47 and 3
3. 3 and 47
4. 2 and 43
5. 3 and 43

1. In regression analysis, an outlier is an observation whose
2. mean is larger than the standard deviation
3. residual is zero
4. mean is zero
5. residual is much larger than the rest of the residual values

1. A variable that cannot be measured in terms of how much or how many but instead is assigned values to represent categories is called
2. an interaction
3. a constant variable
4. a category variable
5. a qualitative variable

1. A variable that takes on the values of 0 or 1 and is used to incorporate the effect of qualitative variables in a regression model is called
2. an interaction
3. a constant variable
4. a dummy variable
5. None of these alternatives is correct.

1. In a multiple regression model, the error term e is assumed to be a random variable with a mean of
2. zero
3. -1
4. 1
5. any value

1. In regression analysis, the response variable is the
2. independent variable
3. dependent variable
4. slope of the regression function
5. intercept

1. The multiple coefficient of determination is
2. MSR/MST
3. MSR/MSE
4. SSR/SST
5. SSE/SSR

1. A multiple regression model has the form

= 7 + 2 x1 + 9 x2

As x1 increases by 1 unit (holding x2 constant), y is expected to

1. increase by 9 units
2. decrease by 9 units
3. increase by 2 units
4. decrease by 2 units

1. A multiple regression model has
2. only one independent variable
3. more than one dependent variable
4. more than one independent variable
5. at least 2 dependent variables

1. A measure of goodness of fit for the estimated regression equation is the
2. multiple coefficient of determination
3. mean square due to error
4. mean square due to regression
5. sample size

1. The numerical value of the coefficient of determination
2. is always larger than the coefficient of correlation
3. is always smaller than the coefficient of correlation
4. is negative if the coefficient of determination is negative
5. can be larger or smaller than the coefficient of correlation

1. The correct relationship between SST, SSR, and SSE is given by
2. SSR = SST + SSE
3. SSR = SST – SSE
4. SSE = SSR – SST
5. None of these alternatives is correct.

Exhibit 15-1

In a regression model involving 44 observations, the following estimated regression equation was obtained.

= 29 + 18X1 +43X2 + 87X3

For this model SSR = 600 and SSE = 400.

1. Refer to Exhibit 15-1. The coefficient of determination for the above model is
2. 0.667
3. 0.600
4. 0.336
5. o.400

1. Refer to Exhibit 15-1. MSR for this model is
2. 200
3. 10
4. 1,000
5. 43

1. Refer to Exhibit 15-1. The computed F statistics for testing the significance of the above model is
2. 1.500
3. 20.00
4. 0.600
5. 0.6667

1. In a multiple regression analysis SSR = 1,000 and SSE = 200. The F statistic for this model is
2. 5.0
3. 1,200
4. 800
5. Not enough information is provided to answer this question.

1. The ratio of MSE/MSR yields
2. SST
3. the F statistic
4. SSR
5. None of these alternatives is correct.

1. In a multiple regression model, the variance of the error term e is assumed to be
2. the same for all values of the dependent variable
3. zero
4. the same for all values of the independent variable
5. -1

2. the number of dependent variables
3. the number of independent variables
4. the number of equations
5. detrimental situations

1. In multiple regression analysis, the correlation among the independent variables is termed
2. homoscedasticity
3. linearity
4. multicollinearity

1. In a multiple regression model, the values of the error term ,e, are assumed to be
2. zero
3. dependent on each other
4. independent of each other
5. always negative

1. In multiple regression analysis,
2. there can be any number of dependent variables but only one independent variable
3. there must be only one independent variable
4. the coefficient of determination must be larger than 1
5. there can be several independent variables, but only one dependent variable

1. In a multiple regression model, the error term e is assumed to
2. have a mean of 1
3. have a variance of zero
4. have a standard deviation of 1
5. be normally distributed

1. In a multiple regression analysis involving 12 independent variables and 166 observations, SSR = 878 and SSE = 122. The coefficient of determination is
2. 0.1389
3. 0.1220
4. 0.878
5. 0.7317

1. A regression analysis involved 8 independent variables and 99 observations. The critical value of t for testing the significance of each of the independent variable’s coefficients will have
2. 98 degrees of freedom
3. 97 degrees of freedom
4. 90 degrees of freedom
5. 7 degrees of freedom

1. In order to test for the significance of a regression model involving 14 independent variables and 255 observations, the numerator and denominator degrees of freedom (respectively) for the critical value of F are
2. 14 and 255
3. 255 and 14
4. 13 and 240
5. 14 and 240

Exhibit 15-2

A regression model between sales (Y in \$1,000), unit price (X1 in dollars) and television advertisement (X2 in dollars) resulted in the following function:

= 7 – 3X1 + 5X2

For this model SSR = 3500, SSE = 1500, and the sample size is 18.

1. Refer to Exhibit 15-2. The coefficient of the unit price indicates that if the unit price is
2. increased by \$1 (holding advertising constant), sales are expected to increase by \$3
3. decreased by \$1 (holding advertising constant), sales are expected to decrease by \$3
4. increased by \$1 (holding advertising constant), sales are expected to increase by \$4,000
5. increased by \$1 (holding advertising constant), sales are expected to decrease by \$3,000

1. Refer to Exhibit 15-2. The coefficient of X2 indicates that if television advertising is increased by \$1 (holding the unit price constant), sales are expected to
2. increase by \$5
3. increase by \$12,000
4. increase by \$5,000
5. decrease by \$2,000

1. Refer to Exhibit 15-2. To test for the significance of the model, the test statistic F is
2. 2.33
3. 0.70
4. 17.5
5. 1.75

1. Refer to Exhibit 15-2. To test for the significance of the model, the p-value is
2. less than 0.01
3. between 0.01 and 0.025
4. between 0.025 and 0.05
5. between 0.05 and 0.10

1. Refer to Exhibit 15-2. The multiple coefficient of correlation for this problem is
2. 0.70
3. 0.8367
4. 0.49
5. 0.2289

Exhibit 15-3

In a regression model involving 30 observations, the following estimated regression equation was obtained:

= 17 + 4X1 – 3X2 + 8X3 + 8X4

For this model SSR = 700 and SSE = 100.

1. Refer to Exhibit 15-3. The coefficient of determination for the above model is approximately
2. -0.875
3. 0.875
4. 0.125
5. 0.144

1. Refer to Exhibit 15-3. The computed F statistic for testing the significance of the above model is
2. 43.75
3. 0.875
4. 50.19
5. 7.00

1. Refer to Exhibit 15-3. The critical F value at 95% confidence is
2. 2.53
3. 2.69
4. 2.76
5. 2.99

1. Refer to Exhibit 15-3. The conclusion is that the
2. model is not significant
3. model is significant
4. slope of X1 is significant
5. slope of X2 is significant

Exhibit 15-4

1. Y = b0 + b1X1 + b2X2 + e

1. E(Y) = b0 + b1X1 + b2X2 + e

1.  = bo + b1 X1 + b2 X2

1. E(Y) = b0 + b1X1 + b2X2

1. Refer to Exhibit 15-4. Which equation describes the multiple regression model?

a

b

c

d

1. Refer to Exhibit 15-4. Which equation gives the estimated regression line?

a

b

c

d

1. Refer to Exhibit 15-4. Which equation describes the multiple regression equation?

a

b

c

d

Exhibit 15-5

Below you are given a partial Minitab output based on a sample of 25 observations.

Coefficient     Standard Error

Constant                145.321              48.682

X1                       25.625                9.150

X2                        ‑5.720                3.575

X3                         0.823                0.183

1. Refer to Exhibit 15-5. The estimated regression equation is
2. Y = b0 + b1X1 + b2X2 + b3X3 + e
3. E(Y) = b0 + b1X1 + b2X2 + b3X3
4.  = 145.321 + 25.625X1 – 5.720X2 + 0.823X3
5.  = 48.682 + 9.15X1 + 3.575X2 + 0.183X3

1. Refer to Exhibit 15-5. The interpretation of the coefficient on X1 is that
2. a one unit change in X1 will lead to a 25.625 unit change in Y
3. a one unit change in X1 will lead to a 25.625 unit increase in Y when all other variables are held constant
4. a one unit change in X1 will lead to a 25.625 unit increase in X2 when all other variables are held constant
5. It is impossible to interpret the coefficient.

1. Refer to Exhibit 15-5. We want to test whether the parameter b1 is significant.  The test statistic equals
2. 0.357
3. 2.8
4. 14
5. 1.96

1. Refer to Exhibit 15-5. The t value obtained from the table to test an individual parameter at the 5% level is
2. 2.06
3. 2.069
4. 2.074
5. 2.080

1. Refer to Exhibit 15-5. Carry out the test of significance for the parameter b1 at the 5% level.  The null hypothesis should be
2. rejected
3. not rejected
4. revised
5. None of these alternatives is correct.

Exhibit 15-6

Below you are given a partial computer output based on a sample of 16 observations.

Coefficient     Standard Error

Constant                12.924                  4.425

X1                      -3.682                  2.630

X2                      45.216                12.560

Analysis of Variance

Source of         Degrees            Sum of         Mean

Variation       of Freedom       Squares       Square          F

Regression                                   4,853          2,426.5

Error                                                                  485.3

1. Refer to Exhibit 15-6. The estimated regression equation is
2. Y = b0 + b1X1 + b2X2 + e
3. E(Y) = b0 + b1X1 + b2X2
4.  = 12.924 ‑ 3.682 X1 + 45.216 X2
5.  = 4.425 + 2.63 X1 + 12.56 X2

1. Refer to Exhibit 15-6. The interpretation of the coefficient of X1 is that
2. a one unit change in X1 will lead to a 3.682 unit decrease in Y
3. a one unit increase in X1 will lead to a 3.682 unit decrease in Y when all other variables are held constant
4. a one unit increase in X1 will lead to a 3.682 unit decrease in X2 when all other variables are held constant
5. It is impossible to interpret the coefficient.

1. Refer to Exhibit 15-6. We want to test whether the parameter b1 is significant.  The test statistic equals
2. -1.4
3. 1.4
4. 3.6
5. 5

1. Refer to Exhibit 15-6. The t value obtained from the table which is used to test an individual parameter at the 1% level is
2. 2.65
3. 2.921
4. 2.977
5. 3.012

1. Refer to Exhibit 15-6. Carry out the test of significance for the parameter b1 at the 1% level.  The null hypothesis should be
2. rejected
3. not rejected
4. revised
5. None of these alternatives is correct.

1. Refer to Exhibit 15-6. The degrees of freedom for the sum of squares explained by the regression (SSR) are
2. 2
3. 3
4. 13
5. 15

1. Refer to Exhibit 15-6. The sum of squares due to error (SSE) equals
2. 37.33
3. 485.3
4. 4,853
5. 6,308.9

1. Refer to Exhibit 15-6. The test statistic used to determine if there is a relationship among the variables equals
2. -1.4
3. 0.2
4. 0.77
5. 5

1. Refer to Exhibit 15-6. The F value obtained from the table used to test if there is a relationship among the variables at the 5% level equals
2. 3.41
3. 3.63
4. 3.81
5. 19.41

1. Refer to Exhibit 15-6. Carry out the test to determine if there is a relationship among the variables at the 5% level.  The null hypothesis should
2. be rejected
3. not be rejected
4. revised
5. None of these alternatives is correct.

1. A regression model in which more than one independent variable is used to predict the dependent variable is called
2. a simple linear regression model
3. a multiple regression model
4. an independent model
5. None of these alternatives is correct.

1. A term used to describe the case when the independent variables in a multiple regression model are correlated is
2. regression
3. correlation
4. multicollinearity
5. None of the above answers is correct.

1. A variable that cannot be measured in numerical terms is called
2. a nonmeasurable random variable
3. a constant variable
4. a dependent variable
5. a qualitative variable

1. A multiple regression model has the form

= 5 + 6X + 7W

As X increases by 1 unit (holding W constant), Y is expected to

1. increase by 11 units
2. decrease by 11 units
3. increase by 6 units
4. decrease by 6 units

Exhibit 15-7

A regression model involving 4 independent variables and a sample of 15 periods resulted in the following sum of squares.

SSR = 165

SSE = 60

1. Refer to Exhibit 15-7. The coefficient of determination is
2. 0.3636
3. 0.7333
4. 0.275
5. 0.5

1. Refer to Exhibit 15-7. If we want to test for the significance of the model at 95% confidence, the critical F value (from the table) is
2. 3.06
3. 3.48
4. 3.34
5. 3.11

1. Refer to Exhibit 15-7. The test statistic from the information provided is
2. 2.110
3. 3.480
4. 4.710
5. 6.875

Exhibit 15-8

The following estimated regression model was developed relating yearly income (Y in \$1,000s) of 30 individuals with their age (X1) and their gender (X2) (0 if male and 1 if female).

= 30 + 0.7X1 + 3X2

Also provided are SST = 1,200 and SSE = 384.

1. Refer to Exhibit 15-8. From the above function, it can be said that the expected yearly income of
2. males is \$3 more than females
3. females is \$3 more than males
4. males is \$3,000 more than females
5. females is \$3,000 more than males

1. Refer to Exhibit 15-8. The yearly income of a 24-year-old female individual is
2. \$19.80
3. \$19,800
4. \$49.80
5. \$49,800

1. Refer to Exhibit 15-8. The yearly income of a 24-year-old male individual is
2. \$13.80
3. \$13,800
4. \$46,800
5. \$49,800

1. Refer to Exhibit 15-8. The multiple coefficient of determination is
2. 0.32
3. 0.42
4. 0.68
5. 0.50

1. Refer to Exhibit 15-8. If we want to test for the significance of the model, the critical value of F at 95% confidence is
2. 3.33
3. 3.35
4. 3.34
5. 2.96

1. Refer to Exhibit 15-8. The test statistic for testing the significance of the model is
2. 0.73
3. 1.47
4. 28.69
5. 5.22

1. Refer to Exhibit 15-8. The model
2. is significant
3. is not significant
4. would be significant is the sample size was larger than 30
5. None of these alternatives is correct.

1. Refer to Exhibit 15-8. The estimated income of a 30-year-old male is
2. \$51,000
3. \$5,100
4. \$510
5. \$51

1. For a multiple regression model, SST = 200 and SSE = 50. The multiple coefficient of determination is
2. 0.25
3. 4.00
4. 250
5. 0.75

1. In a multiple regression analysis involving 10 independent variables and 81 observations, SST = 120 and SSE = 42. The coefficient of determination is
2. 0.81
3. 0.11
4. 0.35
5. 0.65

1. A regression model involved 18 independent variables and 200 observations. The critical value of t for testing the significance of each of the independent variable’s coefficients will have
2. 18 degrees of freedom
3. 200 degrees of freedom
4. 199 degrees of freedom
5. 181 degrees of freedom

1. In order to test for the significance of a regression model involving 8 independent variables and 121 observations, the numerator and denominator degrees of freedom (respectively) for the critical value of F are
2. 8 and 121
3. 7 and 120
4. 8 and 112
5. 7 and 112

1. In a multiple regression analysis involving 5 independent variables and 30 observations, SSR = 360 and SSE = 40. The coefficient of determination is
2. 0.80
3. 0.90
4. 0.25
5. 0.15

1. A regression analysis involved 6 independent variables and 27 observations. The critical value of t for testing the significance of each of the independent variable’s coefficients will have
2. 27 degrees of freedom
3. 26 degrees of freedom
4. 21 degrees of freedom
5. 20 degrees of freedom

78        In order to test for the significance of a regression model involving 4 independent variables and 36 observations, the numerator and denominator degrees of freedom (respectively) for the critical value of F are

1. 4 and 36
2. 3 and 35
3. 4 and 31
4. 4 and 32

1. In logistic regression,
2. there can only be two independent variables
3. there are two dependent variables
4. the dependent variable only assumes two discrete values
5. the dependent variable only assumes two continuous values

1. In a situation where the dependent variable can assume only one of the two possible discrete values,
2. we must use multiple regression
3. there can only be two independent variables
4. logistic regression should be applied
5. all the independent variables must have values of either zero or one

PROBLEMS

1. Multiple regression analysis was used to study how an individual’s income (Y in thousands of dollars) is influenced by age (X1 in years), level of education (X2 ranging from 1 to 5), and the person’s gender (X3 where 0 =female and 1=male). The following is a partial result of a computer program that was used on a sample of 20 individuals.

Coefficient        Standard Error

X1                0.6251                     0.094

X2                0.9210                     0.190

X3               -0.510                       0.920

Analysis of Variance

Source of              Degrees             Sum of            Mean

Variation            of Freedom         Squares          Square             F

Regression                                            84

Error                                                   112

1. Compute the coefficient of determination.
2. Perform a t test and determine whether or not the coefficient of the variable “level of education” (i.e., X2) is significantly different from zero.  Let a = 0.05.
3. At a = 0.05, perform an F test and determine whether or not the regression model is significant.
4. As you note the coefficient of X3 is -0.510.  Fully interpret the meaning of this coefficient.

1. A multiple regression analysis between yearly income (Y in \$1,000s), college grade point average (X1), age of the individuals (X2), and the gender of the individual (X3; zero representing female and one representing male) was performed on a sample of 10 people, and the following results were obtained.

Coefficient           Standard Error

Constant              4.0928                      1.4400

X1                   10.0230                      1.6512

X2                     0.1020                      0.1225

X3                   -4.4811                      1.4400

Analysis of Variance

Source of            Degrees               Sum of               Mean

Variation         of Freedom            Squares             Square                F

Regression                                       360.59

Error                                                  23.91

1. Write the regression equation for the above.
2. Interpret the meaning of the coefficient of X3.
3. Compute the coefficient of determination.
4. Is the coefficient of X1 significant?  Use a = 0.05.
5. Is the coefficient of X2 significant?  Use a = 0.05.
6. Is the coefficient of X3 significant?  Use a = 0.05.
7. Perform an F test and determine whether or not the model is significant.

1. The following results were obtained from a multiple regression analysis.

Source of              Degrees                Sum of            Mean

Variation          of Freedom             Squares          Square            F

Regression                                             384                  48

Error                                                                              20

Total                                                      704

1. How many independent variables were involved in this model?
2. How many observations were involved?
3. Determine the F statistic.

1. Shown below is a partial computer output from a regression analysis.

Coefficient              Standard Error

Constant                  10.00                            2.00

X1                        -2.00                            1.50

X2                         6.00                            2.00

X3                        -4.00                            1.00

Analysis of Variance

Source of           Degrees             Sum of            Mean

Variation       of Freedom          Squares          Square            F

Regression                                         60

Error

Total                       19                     140

1. Use the above results and write the regression equation.
2. Compute the coefficient of determination and fully interpret its meaning.
3. At a = 0.05, test to see if there is a relation between X1 and Y.
4. At a = 0.05, test to see if there is a relation between X3 and Y.
5. Is the regression model significant?  Perform an F test and let a = 0.05.

1. In order to determine whether or not the sales volume of a company (Y in millions of dollars) is related to advertising expenditures (X1 in millions of dollars) and the number of salespeople (X2), data were gathered for 10 years. Part of the regression results is shown below.

Coefficient           Standard Error

Constant              7.0174                      1.8972

X1                   8.6233                      2.3968

X2                   0.0858                      0.1845

Analysis of Variance

Source of               Degrees                Sum of          Mean

Variation            of Freedom             Squares         Square         F

Regression                                            321.11

Error                                                       63.39

1. Use the above results and write the regression equation that can be used to predict sales.
2. Estimate the sales volume for an advertising expenditure of 3.5 million dollars and 45 salespeople.  Give your answer in dollars.
3. At a = 0.05, test to determine if the fitted equation developed in Part a represents a significant relationship between the independent variables and the dependent variable.
4. At a = 0.05, test to see if b1 is significantly different from zero.
5. Determine the multiple coefficient of determination.
6. Compute the adjusted coefficient of determination.

1. In order to determine whether or not the number of automobiles sold per day (Y) is related to price (X1 in \$1,000), and the number of advertising spots (X2), data were gathered for 7 days.  Part of the regression results is shown below.

Coefficient           Standard Error

Constant                 0.8051

X1                      0.4977                      0.4617

X2                      0.4733                      0.0387

Analysis of Variance

Source of              Degrees                Sum of            Mean

Variation          of Freedom             Squares          Square            F

Regression                                           40.700

Error                                                      1.016

1. Determine the least squares regression function relating Y to X1 and X2.
2. If the company charges \$20,000 for each car and uses 10 advertising spots, how many cars would you expect them to sell in a day?
3. At a = 0.05, test to determine if the fitted equation developed in Part a represents a significant relationship between the independent variables and the dependent variable.
4. At 95% confidence, test to see if price is a significant variable.
5. At 95% confidence, test to see if the number of advertising spots is a significant variable.
6. Determine the multiple coefficient of determination.

1. The following is part of the results of a regression analysis involving sales (Y in millions of dollars), advertising expenditures (X1 in thousands of dollars), and number of salespeople (X2) for a corporation. The regression was performed on a sample of 10 observations.

Coefficient        Standard Error

Constant            -11.340                   20.412

X1                     0.798                     0.332

X2                     0.141                     0.278

1. Write the regression equation.
2. Interpret the coefficients of the estimated regression equation found in Part (a).
3. At a =0.05, test for the significance of the coefficient of advertising.
4. At a =0.05, test for the significance of the coefficient of number of salespeople.

1. The following is part of the results of a regression analysis involving sales (Y in millions of dollars), advertising expenditures (X1 in thousands of dollars), and number of sales people (X2) for a corporation:

ANALYSIS OF VARIANCE

Source of              Degrees of            Sum of             Mean

Variation               Freedom             Squares            Square          F

Regression                    2                    822.088

Error                             7                    736.012

1. At a = 0.05 level of significance, test to determine if the model is significant.  That is, determine if there exists a significant relationship between the independent variables and the dependent variable.
2. Determine the multiple coefficient of determination.
3. Determine the adjusted multiple coefficient of determination.
4. What has been the sample size for this regression analysis?

1. Below you are given a partial computer output based on a sample of 12 observations relating the number of personal computers sold by a computer shop per month (Y), unit price (X1 in \$1,000) and the number of advertising spots (X2) used on a local television station.

Coefficient     Standard Error

Constant             17.145                  7.865

X1                  -0.104                  3.282

X2                   1.376                  0.250

1. Use the output shown above and write an equation that can be used to predict the monthly sales of computers.
2. Interpret the coefficients of the estimated regression equation found in Part a.
3. If the company charges \$2,000 for each computer and uses 10 advertising spots, how many computers would you expect them to sell?
4. At a = 0.05, test to determine if the price is a significant variable.
5. At a = 0.05, test to determine if the number of advertising spots is a significant variable.

1. Below you are given a partial computer output based on a sample of 12 observations relating the number of personal computers sold by a computer shop per month (Y), unit price (X1 in \$1,000) and the number of advertising spots (X2) they used on a local television station.

ANALYSIS OF VARIANCE

Source of            Sum of             Degrees of              Mean

Variation          Squares               Freedom               Square             F

Regression                   655.955                     2

Error                                                     9

Total                   838.917

1. At a = 0.05 level of significance, test to determine if the model is significant.  That is, determine if there exists a significant relationship between the independent variables and the dependent variable.
2. Determine the multiple coefficient of determination.
3. Determine the adjusted multiple coefficient of determination.

1. Below you are given a partial computer output based on a sample of 30 days of the price of a company’s stock (Y in dollars), the Dow Jones industrial average (X1), and the stock price of the company’s major competitor (X2 in dollars).

Coefficient     Standard Error

Constant                20.000                  5.455

X1                        0.006                  0.002

X2                      -0.70                    0.200

1. Use the output shown above and write an equation that can be used to predict the price of the stock.
2. If the Dow Jones Industrial Average is 10,000 and the price of the competitor is \$50, what would you expect the price of the stock to be?
3. At a = 0.05, test to determine if the Dow Jones average is a significant variable.
4. At a = 0.05, test to determine if the stock price of the major competitor is a significant variable.

1. Below you are given a partial computer output relating the price of a company’s stock (Y in dollars), the Dow Jones industrial average (X1), and the stock price of the company’s major competitor (X2 in dollars).

ANALYSIS OF VARIANCE

Source of            Degrees of             Sum of               Mean

Variation             Freedom               Squares             Square               F

Regression

Error                          20                          40

Total                                                      800

1. What has been the sample size for this regression analysis?
2. At a = 0.05 level of significance, test to determine if the model is significant.  That is, determine if there exists a significant relationship between the independent variables and the dependent variable.
3. Determine the multiple coefficient of determination.

1. A regression was performed on a sample of 16 observations. The estimated equation is = 23.5 – 14.28X1 + 6.72X2 + 15.68X3.  The standard errors for the coefficients are Sb1 = 4.2, Sb2 = 5.6, and Sb3 = 2.8.  For this model, SST = 3809.6 and SSR = 3285.4.

1. Compute the appropriate t ratios.
2. Test for the significance of b1, b2, and b3 at the 5% level of significance.
3. Do you think that any of the variables should be dropped from the model?  Explain.
4. Compute R2 and Ra2.  Interpret R2.
5. Test the significance of the relationship among the variables at the 5% level of significance.

1. The following results were obtained from a multiple regression analysis of supermarket profitability. The dependent variable, Y, is the profit (in thousands of dollars) and the independent variables, X1 and X2, are the food sales and nonfood sales (also in thousands of dollars).

Coefficient           Standard Error

Constant            -15.062

X1                     0.0972                    0.054

X2                     0.2484                    0.092

Analysis of Variance

Degrees of                  Sum of

Source                  Freedom                   Squares               F

Regression                  2                           562.363           11.23

Error                           9                           225.326

Coefficient of determination = 0.7139

1. Write the estimated regression equation for the relationship between the variables.
2. What can you say about the strength of this relationship?
3. Carry out a test of whether Y is significantly related to the independent variables.  Use a 0.05 level of significance.
4. Carry out a test of whether X1 and Y are significantly related.  Use a .05 level of significance.
5. How many supermarkets are in the sample used here?

1. A regression was performed on a sample of 20 observations. Two independent variables were included in the analysis, X and Z.  The relationship between X and Z is Z = X2.  The following estimated equation was obtained.

= 23.72 + 12.61X + 0.798Z

The standard errors for the coefficients are Sb1 = 4.85 and Sb2 = 0.21

For this model, SSR = 520.2 and SSE = 340.6

1. Estimate the value of Y when X = 5.
2. Compute the appropriate t ratios.
3. Test for the significance of the coefficients at the 5% level.  Which variable(s) is (are) significant?
4. Compute the coefficient of determination and the adjusted coefficient of determination.  Interpret the meaning of the coefficient of determination.
5. Test the significance of the relationship among the variables at the 5% level of significance.

1. A student used multiple regression analysis to study how family spending (Y) is influenced by income (X1), family size (X2), and additions to savings (X3). The variables Y, X1, and X3 are measured in thousands of dollars.  The following results were obtained.

Degrees of                   Sum of

Source                 Freedom                      Squares                     F

Regression                 3                              45.9634                 64.28

Error                        11                                2.6218

Predictor                    Coefficient                    Standard Error

Constant                          0.0136

X1                               0.7992                                  0.074

X2                               0.2280                                  0.190

X3                              -0.5796                                  0.920

Coefficient of determination = 0.946

1. Write out the estimated regression equation for the relationship between the variables.
2. What can you say about the strength of this relationship?
3. Carry out a test of whether Y is significantly related to the independent variables.  Use a .05 level of significance.
4. Carry out a test to see if X3 and Y are significantly related.  Use a .05 level of significance.

1. A regression model involving 3 independent variables for a sample of 20 periods resulted in the following sum of squares.

Sum of Squares

Regression                             90

Residual (Error)                   100

1. Compute the coefficient of determination and fully explain its meaning.
2. At a = 0.05 level of significance, test to determine whether or not there is a significant relationship between the independent variables and the dependent variable.

1. A regression model involving 8 independent variables for a sample of 69 periods resulted in the following sum of squares.

SSE = 306

SST = 1800

1. Compute the coefficient of determination.
2. At a = 0.05, test to determine whether or not the model is significant.

1. In a regression model involving 46 observations, the following estimated regression equation was obtained.

= 17 + 4X1 – 3X2 + 8X3 + 5X4 + 8X5

For this model, SST = 3410 and SSE = 510.

1. Compute the coefficient of determination.
2. Perform an F test and determine whether or not the regression model is significant.

1. A microcomputer manufacturer has developed a regression model relating his sales (Y in \$10,000s) with three independent variables. The three independent variables are price per unit (Price in \$100s), advertising (ADV in \$1,000s) and the number of product lines (Lines).  Part of the regression results is shown below.

Coefficient                    Standard Error

Intercept                          1.0211                               22.8752

Price                              -0.1524                                 0.1411

Lines                              -0.1463                                 1.5340

Analysis of Variance

Source of                    Degrees                      Sum of

Variation                  of Freedom                  Squares

Regression                                                      2708.61

Error (Residuals)              14                          2840.51

1. Use the above results and write the regression equation that can be used to predict sales.
2. If the manufacturer has 10 product lines, advertising of \$40,000, and the price per unit is \$3,000, what is your estimate of their sales?  Give your answer in dollars.
3. Compute the coefficient of determination and fully interpret its meaning.
4. At a = 0.05, test to see if there is a significant relationship between sales and unit price.
5. At a = 0.05, test to see if there is a significant relationship between sales and the number of product lines.
6. Is the regression model significant?  (Perform an F test.)
7. Fully interpret the meaning of the regression (coefficient of price) per unit that is, the slope for the price per unit.
8. What has been the sample size for this analysis?

1. The following is part of the results of a regression analysis involving sales (Y in millions of dollars), advertising expenditures (X1 in thousands of dollars), and number of salespeople (X2) for a corporation. The regression was performed on a sample of 10 observations.

Coefficient            Standard Error

Intercept               40.00                            7.00

X1                      8.00                            2.50

X2                      6.00                            3.00

2. At a = 0.05, test for the significance of the coefficient of advertising.
3. At a = 0.05, test for the significance of the coefficient of the number of salespeople.

1. Sherri Cola Company has developed a regression model relating its sales (Y in \$10,000s) with four independent variables. The four independent variables are price per unit (PRICE, in dollars), competitor’s price (COMPRICE, in dollars), advertising (ADV, in \$1,000s) and type of container used

(CONTAIN; 1 = Cans and 0 = Bottles).  Part of the regression results is shown below.  (Assume n = 25)

Coefficient                 Standard Error

Intercept                       443.143

PRICE                           -57.170                             20.426

COMPRICE                   27.681                             19.991

CONTAIN                    -95.353                             91.027

1. If the manufacturer uses can containers, his price is \$1.25, advertising \$200,000, and his competitor’s price is \$1.50, what is your estimate of his sales?  Give your answer in dollars.
2. Test to see if there is a significant relationship between sales and unit price. Let a = 0.05.
3. Test to see if there is a significant relationship between sales and advertising.  Let a = 0.05.
4. Is the type of container a significant variable?

Let a = 0.05.

1. Test to see if there is a significant relationship between sales and competitor’s price.  Let a = 0.05.

1. The Brock Juice Company has developed a regression model relating sales (Y in \$10,000s) with four independent variables. The four independent variables are price per unit (X1, in dollars), competitor’s price (X2, in dollars), advertising (X3, in \$1,000s) and type of container used (X4) (1 = Cans and 0 = Bottles).  Part of the regression results are shown below:

Analysis of Variance

Source of                  Degrees of                   Sum of

Variation                   Freedom                     Squares

Regression                        4                          283,940.60

Error (Residuals)             18                         621,735.14

1. Compute the coefficient of determination and fully interpret its meaning.
2. Is the regression model significant? Explain what your answer implies.  Let a = 0.05.
3. What has been the sample size for this analysis?

1. The following regression model has been proposed to predict sales at a furniture store.

= 10 – 4X1 + 7X2 + 18X3

where

X1 = competitor’s previous day’s sales (in \$1,000s)

X2 = population within 1 mile (in 1000s)

X3 = 1 if any form of advertising was used, 0 if otherwise

= sales (in \$1,000s)

1. Fully interpret the meaning of the coefficient of X3.
2. Predict sales (in dollars) for a store with competitor’s previous day’s sale of \$3,000, a population of 10,000 within 1 mile, and six radio advertisements.

1. A sample of 30 houses that were sold in the last year was taken. The value of the house (Y) was estimated.  The independent variables included in the analysis were the number of rooms (X1), the size of the lot (X2), the number of bathrooms (X3), and a dummy variable (X4), which equals 1 if the house has a garage and equals 0 if the house does not have a garage.  The following results were obtained:

Coefficient               Standard Error

Constant                          15,232.5                          8,462.5

X1                                 2,178.4                             778.0

X2                                        7.8                                 2.2

X3                                 2,675.2                          2,229.3

X4                                 1,157.8                             463.1

Analysis of Variance

Source of                     Degrees                     Sum of                        Mean

Variation                   of Freedom                Squares                   Squares

Regression                                                    204,242.88               51,060.72

Error (Residuals)                                          205,890.00                 8,235.60

1. Write out the estimated equation.
2. Interpret the coefficient on the number of rooms (X1).
3. Interpret the coefficient on the dummy variable (X4).
4. What are the degrees of freedom for the sum of squares explained by the regression (SSR) and the sum of squares due to error (SSE)?
5. Test whether or not there is a significant relationship between the value of a house and the independent variables.  Use a .05 level of significance.  Be sure to state the null and alternative hypotheses.
6. Test the significance of b1 at the 5% level.  Be sure to state the null and alternative hypotheses.
7. Compute the coefficient of determination and interpret its meaning.
8. Estimate the value of a house that has 9 rooms, a lot with an area of 7,500, 2 bathrooms, and a garage.

1. A sample of 25 families was taken. The objective of the study was to estimate the factors that determine the monthly expenditure on food for families.  The independent variables included in the analysis were the number of members in the family (X1), the number of meals eaten outside the home (X2), and a dummy variable (X3) that equals 1 if a family member is on a diet and equals 0 if there is no family member on a diet.  The following results were obtained.

Coefficient                      Standard Error

Constant                         450.08                                    53.6

X1                                49.92                                      9.6

X2                                10.12                                      2.2

X3                                   -.60                                    12.0

Analysis of Variance

Source of                    Degrees of                  Sum of                        Mean

Variation                     Freedom                   Squares                     Squares

Regression                                                     3,078.39                    1,026.13

Error                                                              2,013.90                         95.90

1. Write out the estimated regression equation.
2. Interpret all coefficients.
3. Test for the significance of b1, b2, and b3 at the 1% level of significance.
4. What are the degrees of freedom for the sum of squares explained by the regression (SSR) and the sum of squares due to error (SSE)?
5. Test whether of not there is a significant relationship between the monthly expenditure on food and the independent variables.  Use a .01 level of significance.  Be sure to state the null and alternative hypotheses.
6. Compute the coefficient of determination and explain its meaning.
7. Estimate the monthly expenditure on food for a family that has 4 members, eats out 3 times, and does not have any member of the family on a diet.

1. The following regression model has been proposed to predict sales at a fast food outlet.

= 18 – 2X1 + 7X2 + 15X3

where

X1 = the number of competitors within 1 mile

X2 = the population within 1 mile (in 1,000s)

X3 = 1 if drive-up windows are present, 0 otherwise

= sales (in \$1,000s)

1. What is the interpretation of 15 (the coefficient of X3) in the regression equation?
2. Predict sales for a store with 2 competitors, a population of 10,000 within one mile, and one drive-up window (give the answer in dollars).
3. Predict sales for the store with 2 competitors, a population of 10,000 within one mile, and no drive-up window (give the answer in dollars).

1. The following regression model has been proposed to predict sales at a computer store.

= 50 – 3X1 + 20X2 + 10X3

where

X1 = competitor’s previous day’s sales (in \$1,000s)

X2 = population within 1 mile (in 1,000s)

= sales (in \$1000s)

Predict sales (in dollars) for a store with the competitor’s previous day’s sale of \$5,000, a population of 20,000 within 1 mile, and nine radio advertisements.

1. The following regression model has been proposed to predict monthly sales at a shoe store.

= 40 – 3X1 + 12X2 + 10X3

where

X1 = competitor’s previous month’s sales (in \$1,000s)

X2 = Stores previous month’s sales (in \$1,000s)

= sales (in \$1000s)

1. Predict sales (in dollars) for the shoe store if the competitor’s previous month’s sales were \$9,000, the store’s previous month’s sales were \$30,000, and no radio advertisements were run.
2. Predict sales (in dollars) for the shoe store if the competitor’s previous month’s sales were \$9,000, the store’s previous month’s sales were \$30,000, and 10 radio advertisements were run.