2. Multiple Regression
• MR is an intermediate prediction method, allowing:
• 2 or more (usually continuous) IVs
• 1 Continuous DV
• Want IVs relatively uncorrelated
• Want IVs correlated with DV
• Focus is on weights for IVs
3. Multiple Regression
• A regression model specifies a relation between a dependent
variable Y and certain independent variables X1, …,XK.
– Here “independence” is not in the sense of random variables; rather, it
means that the value of Y depends on - or is determined by - the Xi
variables.)
• A linear model sets
Y = β1 + β1X1 + … + βkXK + ε,
where ε is the error term.
• To use such a model, we need to have data on values of Y
corresponding to values of the Xi's.
– selling prices for various house features, past growth values for various
economic conditions
4. When to Use MR?
o Standard: Examines how whole set of IVs relates to DV
o Combines all IVs at once to find multiple correlation
o Hierarchical: Examines several sets of IVs based on theory
o Researcher chooses order of variables entered in steps
o Stepwise: Examines IVs most highly correlated with DV
o Computer selects best IVs related to DV
o Conduct any of above in stand-alone MR analysis
o Conduct set of MRs as follow-up to significant Canonical
4
Correlation
5. Example
• Suppose we have data on sales of houses in some area.
– For each house, we have complete information about its size,
the number of bedrooms, bathrooms, total rooms, the size of
the lot, the corresponding property tax, etc., and also the price
at which the house was eventually sold.
– Can we use this data to predict the selling price of a house
currently on the market?
– The first step is to postulate a model of how the various
features of a house determine its selling price.
6. Example
– A linear model would have the following form:
selling price = β0 + β1(sq.ft.) + β2 (no. bedrooms) + β3 (no. bath)
+ β4 (no. acres) + β5 (taxes) + error
• In this expression, β1 represents the increase in selling price for each
additional square foot of area: it is the marginal cost of additional area.
• β2 and β3 are the marginal costs of additional bedrooms and bathrooms,
and so on.
• The intercept β0 could in theory be thought of as the price of a house for
which all the variables specified are zero; of course, no such house could
exist, but including β0 gives us more flexibility in picking a model.
7. Example
– The error reflects the fact that two houses with exactly the same
characteristics need not sell for exactly the same price.
• There is always some variability left over, even after we specify the value of a large
number variables.
• This variability is captured by an error term, which we will treat as a random
variable.
• Regression analysis is a technique for using data to identify
relationships among variables and use these relationships to make
predictions.
8. Levels of advertising
• Determine appropriate levels of advertising and promotion for a
particular market segment.
• Consider the problem of managing sales of beer at large college
campuses.
– Sales over, say, one semester might be influenced by ads in the college
paper, ads on the campus radio station, sponsorship of sports-related
events, sponsorship of contests, etc.
• Use data on advertising and promotional expenditures at many
different campuses to tell us the marginal value of dollars spent in
each category.
• A marketing strategy is designed accordingly.
• Set up a model of the following type:
sales = β0 + β1(print budget) + β2(radio budget)
+ β3(sports promo budget) + β4(other promo) + error
9. General Research Questions:
• How do consumers make decisions about
the foods that they eat?
• How do these decisions vary across
cultures?
9
10. More Specific Research Questions:
• What factors influence consumers’
willingness to purchase genetically modified
food products?
• Does the influence of these factors vary
between U.S. and U.K. consumers?
11. Descriptive Statistics
U.S. and U.K. Students
US UK
N 44 33
Willingness to Purchase 4.86 4.60
General Trust 3.51* 3.22*
Cognitive Trust 5.39* 4.65*
Affective Trust 5.02* 4.41*
Technology 5.17* 4.70*
All student data are on a 7 point scale except general trust which is on a 5 point scale.
†p < .10
*p < .05
12. Multiple Regression Results
U.S. and U.K. Students
Dependent Variable: WTP
U.S. U.K. Combined
β β β
General .465† .645† .520*
Cognitive -.422† .121 -.167
Affective .892* .271 .649*
Technology .179 .000 .000
Country na na -1.09
N 44 31 75
R2 .54 .34 .46
†p < .10
*p < .05