[go: up one dir, main page]

0% found this document useful (0 votes)
253 views14 pages

Multidimensional Scaling

Multidimensional scaling (MDS) is a technique used to represent perceptions and preferences spatially using geometric relationships in a multidimensional space. MDS involves collecting similarity or preference judgments and using statistical procedures to map both stimuli and respondents in a shared perceptual space with interpretable dimensions. The quality of MDS solutions is assessed using goodness-of-fit measures like stress and R-square.
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
253 views14 pages

Multidimensional Scaling

Multidimensional scaling (MDS) is a technique used to represent perceptions and preferences spatially using geometric relationships in a multidimensional space. MDS involves collecting similarity or preference judgments and using statistical procedures to map both stimuli and respondents in a shared perceptual space with interpretable dimensions. The quality of MDS solutions is assessed using goodness-of-fit measures like stress and R-square.
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 14

Multidimensional Scaling (MDS)

Multidimensional scaling (MDS) is a class of procedures for representing perceptions and preferences of respondents spatially by means of a visual display. Perceived or psychological relationships among stimuli are represented as geometric relationships among points in a multidimensional space. These geometric representations are often called spatial maps. The axes of the spatial map are assumed to denote the psychological bases or underlying dimensions respondents use to form perceptions and preferences for stimuli. Statistics and Terms Associated with MDS Similarity judgments. Similarity judgments are ratings on all possible pairs of brands or other stimuli in terms of their similarity using a Likert-type scale. Preference rankings. Preference rankings are rank orderings of the brands or other stimuli from the most preferred to the least preferred. They are normally obtained from the respondents. Stress. This is a lack-of-fit measure; higher values of stress indicate poorer fits. R-square. R-square is a squared correlation index that indicates the proportion of variance of the optimally scaled data that can be accounted for by the MDS procedure. This is a goodness-of-fit measure. Spatial map. Perceived relationships among brands or other stimuli are represented as geometric relationships among points in a multidimensional space called a spatial map. Coordinates. Coordinates indicate the positioning of a brand or a stimulus in a spatial map. Unfolding. The representation of both brands and respondents as points in the same space is referred to as unfolding. Conducting Multidimensional Scaling

Conducting Multidimensional Scaling Formulate the Problem Specify the purpose for which the MDS results would be used. Select the brands or other stimuli to be included in the analysis. The number of brands or stimuli selected normally varies between 8 and 25. The choice of the number and specific brands or stimuli to be included should be based on the statement of the marketing research problem, theory, and the judgment of the researcher.

Input Data for Multidimensional Scaling

Conducting Multidimensional Scaling: Obtain Input Data Perception Data: Direct Approaches. In direct approaches to gathering perception data, the respondents are asked to judge how similar or dissimilar the various brands or stimuli are, using their own criteria. These data are referred to as similarity judgments. Very Very Dissimilar Similar Crest vs. Colgate 1 2 3 4 5 6 7 Aqua-Fresh vs. Crest 1 2 3 4 5 6 7 Crest vs. Aim 1 2 3 4 5 6 7 . . . Colgate vs. Aqua-Fresh 1 2 3 4 5 6 7 The number of pairs to be evaluated is n (n -1)/2, where n is the number of stimuli.

Similarity Rating Of Toothpaste Brands

Aqua-Fresh Aqua-Fresh Crest Colgate Aim Gleem Plus White Ultra Brite Close-Up Pepsodent Sensodyne 5 6 4 2 3 2 2 2 1

Crest

Colgate

Aim

Gleem

Plus White Ultra Brite

Close-Up Pepsodent Sensodyne

7 6 3 3 2 2 2 2

6 4 4 2 2 2 4

5 4 3 2 2 2

5 5 6 6 4

5 5 6 3

6 7 3

6 4

Perception Data: Derived Approaches. Derived approaches to collecting perception data are attribute-based approaches requiring the respondents to rate the brands or stimuli on the identified attributes using semantic differential or Likert scales. Whitens Does not teeth ___ ___ ___ ___ ___ ___ ___ ___ ___ ___ whiten teeth Prevents tooth Does not prevent decay ___ ___ ___ ___ ___ ___ ___ ___ ___ ___ tooth decay . . . Pleasant Unpleasant tasting ___ ___ ___ ___ ___ ___ ___ ___ ___ ___ tasting If attribute ratings are obtained, a similarity measure (such as Euclidean distance) is derived for each pair of brands. Obtain Input Data Direct Vs. Derived Approaches The direct approach has the following advantages and disadvantages: The researcher does not have to identify a set of salient attributes. The disadvantages are that the criteria are influenced by the brands or stimuli being evaluated. Furthermore, it may be difficult to label the dimensions of the spatial map. The attribute-based approach has the following advantages and disadvantages: It is easy to identify respondents with homogeneous perceptions. The respondents can be clustered based on the attribute ratings. It is also easier to label the dimensions. A disadvantage is that the researcher must identify all the salient attributes, a difficult task. The spatial map obtained depends upon the attributes identified. It may be best to use both these approaches in a complementary way. Direct similarity judgments may be used for obtaining the spatial map, and attribute ratings may be used as an aid to interpreting the dimensions of the perceptual map.

Preference Data
Preference data order the brands or stimuli in terms of respondents' preference for some property. A common way in which such data are obtained is through preference rankings. Alternatively, respondents may be required to make paired comparisons and indicate which brand in a pair they prefer. Another method is to obtain preference ratings for the various brands. The configuration derived from preference data may differ greatly from that obtained from similarity data. Two brands may be perceived as different in a similarity map yet similar in a preference map, and vice versa.

Select an MDS Procedure


Selection of a specific MDS procedure depends upon: Whether perception or preference data are being scaled, or whether the analysis requires both kinds of data. The nature of the input data is also a determining factor.

Non-metric MDS procedures assume that the input data are ordinal, but they result in metric output. Metric MDS methods assume that input data are metric. Since the output is also metric, a stronger relationship between the output and input data is maintained, and the metric (interval or ratio) qualities of the input data are preserved. The metric and non-metric methods produce similar results. Another factor influencing the selection of a procedure is whether the MDS analysis will be conducted at the individual respondent level or at an aggregate level.

Decide on the Number of Dimensions


A priori knowledge - Theory or past research may suggest a particular number of dimensions. Interpretability of the spatial map - Generally, it is difficult to interpret configurations or maps derived in more than three dimensions. Elbow criterion - A plot of stress versus dimensionality should be examined. Ease of use - It is generally easier to work with two-dimensional maps or configurations than with those involving more dimensions. Statistical approaches - For the sophisticated user, statistical approaches are also available for determining the dimensionality. Plot of Stress Versus Dimensionality

Label the Dimensions and Interpret the Configuration


Even if direct similarity judgments are obtained, ratings of the brands on researcher-supplied attributes may still be collected. Using statistical methods such as regression, these attribute vectors may be fitted in the spatial map. After providing direct similarity or preference data, the respondents may be asked to indicate the criteria they used in making their evaluations. If possible, the respondents can be shown their spatial maps and asked to label the dimensions by inspecting the configurations. If objective characteristics of the brands are available (e.g., horsepower or miles per gallon for automobiles), these could be used as an aid in interpreting the subjective dimensions of the spatial maps.

A Spatial Map of Toothpaste Brands

Using Attribute Vectors to Label Dimensions

Assess Reliability and Validity The index of fit, or R-square is a squared correlation index that indicates the proportion of variance of the optimally scaled data that can be accounted for by the MDS procedure. Values of 0.60 or better are considered acceptable. Stress values are also indicative of the quality of MDS solutions. While R-square is a measure of goodness-of-fit, stress measures badness-of-fit, or the proportion of variance of the optimally scaled data that is not accounted for by the MDS model. Stress values of less than 10% are considered acceptable. If an aggregate-level analysis has been done, the original data should be split into two or more parts. MDS analysis should be conducted separately on each part and the results compared. Stimuli can be selectively eliminated from the input data and the solutions determined for the remaining stimuli. A random error term could be added to the input data. The resulting data are subjected to MDS analysis and the solutions compared. The input data could be collected at two different points in time and the test-retest reliability determined.

Assessment of Stability by Deleting One Brand

External Analysis of Preference Data

Assumptions and Limitations of MDS It is assumed that the similarity of stimulus A to B is the same as the similarity of stimulus B to A. MDS assumes that the distance (similarity) between two stimuli is some function of their partial similarities on each of several perceptual dimensions. When a spatial map is obtained, it is assumed that inter point distances are ratio scaled and that the axes of the map are multidimensional interval scaled. A limitation of MDS is that dimension interpretation relating physical changes in brands or stimuli to changes in the perceptual map is difficult at best.

Scaling Preference Data


In internal analysis of preferences, a spatial map representing both brands or stimuli and respondent points or vectors is derived solely from the preference data. In external analysis of preferences, the ideal points or vectors based on preference data are fitted in a spatial map derived from perception (e.g., similarities) data. The representation of both brands and respondents as points in the same space, by using internal or external analysis, is referred to as unfolding. External analysis is preferred in most situations.

Correspondence Analysis
Correspondence analysis is an MDS technique for scaling qualitative data in marketing research. The input data are in the form of a contingency table, indicating a qualitative association between the rows and columns. Correspondence analysis scales the rows and columns in corresponding units, so that each can be displayed graphically in the same low-dimensional space. These spatial maps provide insights into (1) similarities and differences within the rows with respect to a given column category; (2) similarities and differences within the column categories with respect to a given row category; and (3) relationship among the rows and columns. The advantage of correspondence analysis, as compared to other multidimensional scaling techniques, is that it reduces the data collection demands imposed on the respondents, since only binary or categorical data are obtained. The disadvantage is that between set (i.e., between column and row) distances cannot be meaningfully interpreted. Correspondence analysis is an exploratory data analysis technique that is not suitable for hypothesis testing.

Relationship Among MDS, Factor Analysis, and Discriminant Analysis If the attribute-based approaches are used to obtain input data, spatial maps can also be obtained by using factor or discriminant analysis. By factor analyzing the data, one could derive for each respondent, factor scores for each brand. By plotting brand scores on the factors, a spatial map could be obtained for each respondent. The dimensions would be labeled by examining the factor loadings, which are estimates of the correlations between attribute ratings and underlying factors. To develop spatial maps by means of discriminant analysis, the dependent variable is the brand rated and the independent or predictor variables are the attribute ratings. A spatial map can be obtained by plotting the discriminant scores for the brands. The dimensions can be labeled by examining the discriminant weights, or the weightings of attributes that make up a discriminant function or dimension.

Conjoint Analysis
Conjoint analysis attempts to determine the relative importance consumers attach to salient attributes and the utilities they attach to the levels of attributes. The respondents are presented with stimuli that consist of combinations of attribute levels and asked to evaluate these stimuli in terms of their desirability. Conjoint procedures attempt to assign values to the levels of each attribute, so that the resulting values or utilities attached to the stimuli match, as closely as possible, the input evaluations provided by the respondents. Statistics and Terms Associated with Conjoint Analysis Part-worth functions. The part-worth functions, or utility functions, describe the utility consumers attach to the levels of each attribute. Relative importance weights. The relative importance weights are estimated and indicate which attributes are important in influencing consumer choice. Attribute levels. The attribute levels denote the values assumed by the attributes. Full profiles. Full profiles, or complete profiles of brands, are constructed in terms of all the attributes by using the attribute levels specified by the design. Pairwise tables. In pairwise tables, the respondents evaluate two attributes at a time until all the required pairs of attributes have been evaluated. Cyclical designs. Cyclical designs are designs employed to reduce the number of paired comparisons. Fractional factorial designs. Fractional factorial designs are designs employed to reduce the number of stimulus profiles to be evaluated in the full profile approach. Orthogonal arrays. Orthogonal arrays are a special class of fractional designs that enable the efficient estimation of all main effects. Internal validity. This involves correlations of the predicted evaluations for the holdout or validation stimuli with those obtained from the respondents.

Conducting Conjoint Analysis

Formulate the Problem Identify the attributes and attribute levels to be used in constructing the stimuli. The attributes selected should be salient in influencing consumer preference and choice and should be actionable. A typical conjoint analysis study involves six or seven attributes. At least three levels should be used, unless the attribute naturally occurs in binary form (two levels). The researcher should take into account the attribute levels prevalent in the marketplace and the objectives of the study. Construct the Stimuli In the pairwise approach, also called two-factor evaluations, the respondents evaluate two attributes at a time until all the possible pairs of attributes have been evaluated. In the full-profile approach, also called multiple-factor evaluations, full or complete profiles of brands are constructed for all the attributes. Typically, each profile is described on a separate index card. In the pairwise approach, it is possible to reduce the number of paired comparisons by using cyclical designs. Likewise, in the full-profile approach, the number of stimulus profiles can be greatly reduced by means of fractional factorial designs Sneaker Attributes and Levels

Pairwise Approach to Conjoint Data

A special class of fractional designs, called orthogonal arrays, allow for the efficient estimation of all main effects. Orthogonal arrays permit the measurement of all main effects of interest on an uncorrelated basis. These designs assume that all interactions are negligible. Generally, two sets of data are obtained. One, the estimation set, is used to calculate the partworth functions for the attribute levels. The other, the holdout set, is used to assess reliability and validity.

Decide on the Form of Input Data


For non-metric data, the respondents are typically required to provide rank-order evaluations. In the metric form, the respondents provide ratings, rather than rankings. In this case, the judgments are typically made independently. In recent years, the use of ratings has become increasingly common. The dependent variable is usually preference or intention to buy. However, the conjoint methodology is flexible and can accommodate a range of other dependent variables, including actual purchase or choice. In evaluating sneaker profiles, respondents were required to provide preference.

Sneaker Profiles and Ratings

Conjoint Analysis Model The basic conjoint analysis model may be represented by the following formula:

Where: U(X) = overall utility of an alternative = the part-worth contribution or utility associated with the j th level (j, j = 1, 2, . . . ki) of the i th attribute (i, i = 1, 2, . . . m) = 1 if the j th level of the i th attribute is present = 0 otherwise = number of levels of attribute i = number of attributes

xij ki m

Relative Importance The importance of an attribute, Ii, is defined in terms of the range of the part-worths, , across the levels of that attribute: The attribute's importance is normalized to ascertain its importance relative to other attributes, Wi: So that

The simplest estimation procedure, and one which is gaining in popularity, is dummy variable regression (see Chapter 17). If an attribute has ki levels, it is coded in terms of ki - 1 dummy variables (see Chapter 14). Other procedures that are appropriate for non-metric data include LINMAP, MONANOVA, and the LOGIT model. Estimating the Conjoint Model The model estimated may be represented as: U = b0 + b1X1 + b2X2 + b3X3 + b4X4 + b5X5 + b6X6 Where: X1, X2 = dummy variables representing Sole X3, X4 = dummy variables representing Upper X5, X6 = dummy variables representing Price For Sole, the attribute levels were coded as follows: X1 1 0 0 X2 0 1 0

Level 1 Level 2 Level 3

Sneaker Data Coded for Dummy Variable Regression

Estimation of Parameters The levels of the other attributes were coded similarly. The parameters were estimated as follows: b0 = 4.222 b1 = 1.000 b2 = -0.333 b3 = 1.000 b4 = 0.667 b5 = 2.333 b6 = 1.333 Given the dummy variable coding, in which level 3 is the base level, the coefficients may be related to the part-worths:

Calculation of Part-Worths To solve for the part-worths, an additional constraint is necessary.

These equations for the first attribute, Sole, are:

Solving these equations, we get: = 0.778 = -0.556 = -0.222 The part-worths for other attributes reported in Table 21.6 can be estimated similarly. For Upper, we have:

For the third attribute, Price, we have:

Calculation of Relative Importance The relative importance weights were calculated based on ranges of part-worths, as follows: Sum of ranges of part-worths = (0.778 - (-0.556)) + (0.445-(-0.556)) + (1.111-(-1.222)) = 4.668 = 1.334/4.668 = 0.286 = 1.001/4.668 = 0.214 = 2.333/4.668 = 0.500

Relative importance of Sole Relative importance of Upper Relative importance of Price

Interpret the Results For interpreting the results, it is helpful to plot the part-worth functions. The utility values have only interval scale properties, and their origin is arbitrary. The relative importance of attributes should be considered. Assessing Reliability and Validity The goodness of fit of the estimated model should be evaluated. For example, if dummy variable regression is used, the value of R2 will indicate the extent to which the model fits the data. Test-retest reliability can be assessed by obtaining a few replicated judgments later in data collection. The evaluations for the holdout or validation stimuli can be predicted by the estimated partworth functions. The predicted evaluations can then be correlated with those obtained from the respondents to determine internal validity. If an aggregate-level analysis has been conducted, the estimation sample can be split in several ways and conjoint analysis conducted on each subsample. The results can be compared across subsamples to assess the stability of conjoint analysis solutions. Part-Worth Functions

Assumptions and Limitations of Conjoint Analysis Conjoint analysis assumes that the important attributes of a product can be identified. It assumes that consumers evaluate the choice alternatives in terms of these attributes and make tradeoffs. The tradeoff model may not be a good representation of the choice process. Another limitation is that data collection may be complex, particularly if a large number of attributes are involved and the model must be estimated at the individual level. The part-worth functions are not unique. Hybrid Conjoint Analysis Hybrid models have been developed to serve two main purposes: 1. Simplify the data collection task by imposing less of a burden on each respondent, and 2. Permit the estimation of selected interactions (at the subgroup level) as well as all main (or simple) effects at the individual level. In the hybrid approach, the respondents evaluate a limited number, generally no more than nine, conjoint stimuli, such as full profiles. These profiles are drawn from a large master design, and different respondents evaluate different sets of profiles, so that over a group of respondents, all the profiles of interest are evaluated. In addition, respondents directly evaluate the relative importance of each attribute and desirability of the levels of each attribute. By combining the direct evaluations with those derived from the evaluations of the conjoint stimuli, it is possible to estimate a model at the aggregate level and still retain some individual differences.

You might also like