How can I convert different point Likert scales for all questionnaires in a survey so that they are the same?

Farimah Dokoushkani @Farimah_Dokoushkani2

01 January 2014

In my study, I use six questionnaires on different types of Likert scales. Two of them are on 7-point Likert scale, two 5-point and the last two are on 4-point Likert scale. I heard that it is better to modify all of them on the same point Likert scale to simplify data analysis, however I could not find any source for this assumption. Moreover, for some questionnaires, there is no literature that shows they have been used on different Likert scales before. Would you please share your experience with me?

Kamal Karkonasasi Popular answer

Dear Farimah,

I find this link helpful.

http://www-01.ibm.com/support/docview.wss?uid=swg21482329

All the best on your research.

Regards.


Antje Cockrill

There are different opinions on this. Some researchers suggest standardising them all; others prefer to have a range of different scales in one project. The key is your data analysis. If you use different scales, you need to standardise your data analysis. Most statistics programmes will give you an option to standardise variables and to save them as new, standardised variables, with adjusted data range and/or data variability. In SPSS you can use the 'descriptives' menu to save your variables as standardised variables; the new variable will be saved as 'z + variable name' because it is based on the z-score. A z-score tells us how many standard deviations a value lies from the mean of its variable (the standardised variable has a mean of 0 and a standard deviation of 1), so scores from different scales can be compared in that way. The assumption is that the data are normally distributed.
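As a rough illustration of the z-score idea described above, here is a minimal Python sketch. The item names and responses are made up for the example, and the use of the sample standard deviation (n - 1) is an assumption; check which convention your own software uses before comparing results.

```python
from statistics import mean, stdev

def z_scores(values):
    """Return z-scores using the sample standard deviation (n - 1)."""
    m = mean(values)
    s = stdev(values)
    if s == 0:
        raise ValueError("z-scores are undefined when all values are equal")
    return [(v - m) / s for v in values]

# Hypothetical responses from the same six respondents
item_5pt = [1, 2, 3, 4, 5, 3]   # item answered on a 5-point scale
item_7pt = [2, 4, 4, 6, 7, 1]   # item answered on a 7-point scale

z5 = z_scores(item_5pt)
z7 = z_scores(item_7pt)
# Both standardised items now have mean 0 and standard deviation 1.
```

Note that the result depends on the sample: the same raw answer gets a different z-score in a different data set.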

Farimah Dokoushkani

Thank you very much for your reply.

It should be mentioned that in this study, SPSS will be utilized for descriptive statistics and t-test analysis. Meanwhile, SEM-AMOS will be applied to analyze the relationships and predictions.

Jerzy A. Sobanski

In some cases e.g. incidne of symptom you may omit different levels (Likert points other than 0) and analyse 0 vs non-0. This also helps with some logistic regression, where 0-1 coding is welcome. Additionally, the maximum level (last point in every scale) may be analysed. All other modes are somewhat doubtful because people filled in different scales. Standardization and z-scores are acceptable but are in that case, hmm... an approximating approach, more coherent when different populations use exactly the same scale and we are in doubt whether males vs females see and report things differently. Best regards.
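The 0 vs non-0 recoding and the "maximum level" indicator mentioned above could be sketched as follows in Python. The function names and the example scores are hypothetical, and the sketch assumes that 0 is the code for "symptom absent".

```python
def dichotomise(scores, absent=0):
    """Code each response as 0 (symptom absent) or 1 (any other level)."""
    return [0 if s == absent else 1 for s in scores]

def is_maximum(scores, scale_max):
    """Flag responses that sit at the last point of their scale."""
    return [1 if s == scale_max else 0 for s in scores]

# Hypothetical symptom ratings on a 0-3 scale
ratings = [0, 1, 3, 0, 2, 3]
present = dichotomise(ratings)        # usable as a 0-1 outcome
at_ceiling = is_maximum(ratings, 3)   # top-category indicator
```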

Jerzy A. Sobanski

I mean incidence (or occurrence) of symptom

sorry for typing error

Jerzy

Patrick S Malone

Farimah,

Since you're using AMOS for some things, I'd use it for all for this situation. Categorical data analysis -- and specifically factors with categorical indicators -- in AMOS should work just fine without any rescaling.

Pat

Farimah Dokoushkani

Thank you so much for your replies. I am studying immigrant couples. I believe that converting the points in the Likert scale for all questionnaires in a survey so that they are the same not only facilitates the analysis procedure, but also makes the survey easier for respondents to understand and answer. Imagine, for example, that they have to answer some questions on a 7-point Likert scale and some on a 4-point Likert scale: they will be confused. I have 180 questions in my survey and, as you know, that is too many and it cannot be shortened. I should find a way to decrease the time needed for responding to the questions, for data analysis, ...

So, the problem is that: I do not have any scientific justification for it!

Indeed, I want to know whether, if I make these changes, I will face any problem in my research procedure or my viva session.

Thank you very much for your valuable times.

All the best

Thom S Baguley

If you are using published scales it is not a good idea to deviate from the original presentation (including response options) as this throws some doubt on the psychometric properties. If you have constructed your own scales there is value to harmonizing the response options before assessing its psychometric properties.

In terms of analysis there are numerous options - but one that is often overlooked is percentage of maximum performance (POMP) scaling - proposed by Cohen et al.
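For readers unfamiliar with POMP scaling, the usual formula is (observed - minimum) / (maximum - minimum) x 100, where minimum and maximum are the lowest and highest possible scores on the scale. A minimal Python sketch, with a hypothetical function name and example values:

```python
def pomp(score, scale_min, scale_max):
    """Percentage of maximum possible: 0 at the scale minimum, 100 at the maximum."""
    if scale_max <= scale_min:
        raise ValueError("scale_max must be greater than scale_min")
    return 100.0 * (score - scale_min) / (scale_max - scale_min)

# The midpoint of a 5-point and of a 7-point scale both map to 50
mid_5pt = pomp(3, 1, 5)
mid_7pt = pomp(4, 1, 7)
# The top of a 4-point scale maps to 100
top_4pt = pomp(4, 1, 4)
```

Because the transformation uses the theoretical scale bounds rather than sample statistics, a given raw answer always receives the same POMP score.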

Farimah Dokoushkani

Thank you very much for your reply.

David L Morgan

If you want to rely on the claims of reliability and validity for published scales, then you do indeed need to use them as originally written.

As for the 180 questions, I agree with you that this poses a serious issue of "respondent burden" -- especially if these respondents are not familiar with participating in survey interviews. I would recommend two things. First, go over your intended analysis and ask where every set of items fits into your analysis plan. Does each set of items serve a unique and absolutely necessary purpose? If not, you should drop sets of items so that you have usable data rather than perfect measurement of everything that might conceivably matter.

My second recommendation would be to look at survey scales that originate in either sociology or applied research. There is a strong tendency in both those fields to work with shorter scales that will fit into briefer interviews and still show high levels of reliability and validity. In particular, a scale with as many as 15 items would be considered to be long in that school of thought. (A different set of assumptions comes from clinicians, who often prefer long "batteries" of items, without evidence that the increased length of the scale is truly necessary for adequate reliability and validity.)

I also understand your concern that respondents who are less familiar with surveys could be confused by using two different response scales. Assuming that the respondents are relatively literate and that this is an in-person interview, then a common practice is to write the different response scales on cards and show those cards to the respondent for each corresponding scale.

In other words, you inform the respondent that you will be using a different set of options to answer the next set of questions, then hand the respondent the card that shows those categories, and verbally go over the available responses with the respondent.

Farimah Dokoushkani

Thank you very much for your valuable time dear professor.

Kerri Gayle O'Donnell

Hello Farimah,

I have been facing the same problems with my PhD survey in recent weeks, so I am grateful to you for posting your question, and to the respondents for their valuable advice. It is important to point out that I am a beginner at this, and the following is just to let you know you're not alone: It is not a recommendation - what I did may not be right.

In addition to my own questions, I used three existing sub-tests with different scale systems, which I adapted to fit a 5-point Likert scale. With pilot data, I ran Cronbach's alpha tests on the sub-tests before and after the scale change, and as the results based on standardized items were 0.741 before and 0.736 after, I gathered that I had not significantly damaged the reliability of the sub-scales.
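For anyone wanting to repeat this kind of before/after check outside SPSS, here is a minimal Python sketch of raw Cronbach's alpha, k / (k - 1) x (1 - sum of item variances / variance of total scores). It is not the "standardized items" variant reported above, and the pilot data below are invented for illustration.

```python
from statistics import variance

def cronbach_alpha(items):
    """Raw Cronbach's alpha.

    items: list of item columns, each a list of scores from the same
    respondents in the same order.
    """
    k = len(items)
    if k < 2:
        raise ValueError("at least two items are required")
    n = len(items[0])
    if any(len(col) != n for col in items):
        raise ValueError("all items need the same number of respondents")
    totals = [sum(resp) for resp in zip(*items)]   # total score per respondent
    item_var = sum(variance(col) for col in items)
    total_var = variance(totals)
    return k / (k - 1) * (1 - item_var / total_var)

# Hypothetical pilot data: three items, five respondents
pilot = [
    [4, 3, 5, 2, 4],
    [5, 3, 4, 2, 4],
    [4, 2, 5, 3, 3],
]
alpha = cronbach_alpha(pilot)
```

Running the function once on the original responses and once on the adapted ones gives the two values to compare.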

I don't have any further information yet though, because I'm still gathering data.

A book that's been very helpful to me is de Vaus, 2002, Surveys in Social Research, Allen & Unwin, Sydney - Especially chapter 11: Building Scales.

Good luck!

Farimah Dokoushkani

Dear Kerri,

Thank you so much for your kind words and thank you for sharing your valuable experience with me.

Wish you all the best dear,

Farimah

Dick Sobsey

If these are your own scales and you have not already accumulated some of the data, you can simply switch out the scales so they are all the same. If they are previously published scales, switching may be a problem because you will not be able to compare your results to results that others have obtained using different scales. If you have already collected some data using one kind of scale, it may be a problem to combine it with data using a different scale even if you convert to standard scores such as Z-scores. In addition, there is a conceptual difference between Likert scales with even numbers of choices and odd numbers since there is no neutral middle choice with the even numbers.

Farimah Dokoushkani

Thank you for your reply, dear professor. I have not collected data yet. Further, I will use the standard scales. Do you mean that I will face difficulties if I use different point Likert scales?

Han Ping Fung

Hi Farimah,

I suggest you standardize them into a higher-point Likert scale, e.g. a 7-point Likert scale, because of some advantages of higher-point Likert scales. At the same time, you need to take note of the disadvantages of higher-point Likert scales. For both advantages and disadvantages, you might want to refer to my post at this link:

https://www.researchgate.net/post/When_should_one_switch_to_the_10-item_Likert_scale

Regards,

Fung

Farimah Dokoushkani

Thank you very much for your reply.

David L Morgan

I assume you want to add together all of the items to create a single scale. If so, then they all need to be on the same scoring system, and the most common solution is to convert each item to a Z-score. That way they will all have a mean of 0 and a standard deviation of 1.

Farimah Dokoushkani

Thank you very much for your valuable time dear Professor. I have already collected my data and now I am preparing them to start analysis process via AMOS.

Sorry Prof, so, I should calculate Z-scores for all variables before running SEM. Am I right?

Regards,

David L Morgan

If you are doing SEM, there is no need to standardize the data.

Farimah Dokoushkani

Thank you very much dear professor.

Regards,

Kamal Karkonasasi

Dear Farimah,

I find this link helpful.

http://www-01.ibm.com/support/docview.wss?uid=swg21482329

All the best on your research.

Regards.


Farimah Dokoushkani

Thank you sir,

Regards,

Payal Anand

Hi, by any chance, did you manage to convert the different point scales into one? Could you please suggest the process to me? Regards,

Farimah Dokoushkani

Dear Payal

Following consultations with several experts, some of whom have replied here, I made no changes: I did not convert the scales to a common one. I analyzed my data and did not face any problem.

Hope it helps,

Good luck!

Najmul Hasan

Thanks a lot, Mr. Kamal Asasi, for sharing the link.

It's very helpful for me.

Suchandra Bose

I have a question. In my study I am using 5 questionnaires: 4 of them are on a 5-point Likert scale and one is on a 7-point Likert scale. All my questionnaires are pre-determined questionnaires. I am interested in doing SEM analysis. Kindly guide me on whether I need to standardise the 7-point Likert scale to a 5-point scale, or whether I can use the scales as they are.

Farimah Dokoushkani

Dear Suchandra,

Based on my understanding, you can use them as they are. No change is needed.

Good luck with your study,

Farimah

Suchandra Bose

Thanks a ton Farimah.

Arunima Naithani

Thank you, Farimah, for raising this question. I had the same confusion about response scales in my study. However, I would like to ask: if we have neo-literate respondents, how can we modify the response scales on cards? I read somewhere that different smileys were used for responses. Is it possible to use the same approach?

Thanks

Farimah Dokoushkani

I am sorry Arunima. I am not sure.

Hope the expert people following this post see your question and help you.

Best,

Luca Fumarco

I believe you could use the maximum scale percentage. See COMPREHENSIVE QUALITY OF LIFE SCALE – ADULT, by Robert A. Cummins, School of Psychology, Deakin University. Here is the link to the open access document: http://www.acqol.com.au/instruments/comqol-scale/comqol-a5.pdf

If you want to have a first glance at the formula, go to page 28 of such document.

I see though that this is the method suggested by Kamal already; Cummins' document provides additional details though.

Farimah Dokoushkani

Thank you for your suggestion dear Dr. Fumarco

All the best,

Farimah Dokoushkani

Hi Darren

Yes, they were different but there was no need to standardize them.

Hope it helps,

Nicholas E Rowe

This may be overly simple (maths dullard):

If you have mean values for both 5 & 7 point scales, could you not multiply them up to a common # (e.g. 35), then view them as a % of this number? This would indicate if one mean was greater than another mean, and proportionally by how much.

e.g.

5-point Likert mean: x̄ = 2; 7-point Likert mean: x̄ = 3; common number = 35

2 × 7 = 14; 14 ÷ 35 × 100 = 40%

3 × 5 = 15; 15 ÷ 35 × 100 = 42.86%

Doesn't the number show that the 7-point scale value was 2.86 percentage points greater than the response for the 5-point scale? What this means in context is open to interpretation (and not very mathematical), but if you simply want to say that 'respondents in survey Y were slightly more in agreement than the respondents in survey X' (equivalent Likert mean 3L7:2L5), then perhaps this gives you the supporting evidence?

For producing a formally common scale you can't avoid complex maths, but if you are just looking for a way of seeing proportional difference to support a qualitative observation or idea, then would this not work? The original scales are preserved in the text, so you have not changed or misrepresented any data.
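The arithmetic above amounts to expressing each mean as a percentage of its scale's top value. A minimal Python sketch of that calculation follows; the function name is made up. One caveat worth stating: unlike rescalings that subtract the scale minimum first, this version maps the lowest answer to 20% on a 5-point scale but to about 14.3% on a 7-point scale, so the two percentages do not share a common zero.

```python
def percent_of_scale_max(mean_score, n_points):
    """Express a mean as a percentage of the highest scale point."""
    return 100.0 * mean_score / n_points

p5 = percent_of_scale_max(2, 5)   # mean of 2 on the 5-point scale
p7 = percent_of_scale_max(3, 7)   # mean of 3 on the 7-point scale
difference = p7 - p5              # gap in percentage points
```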

Davit Gogilashvili

Hi,

I would like to conduct factor analysis and then run regression. I have data measured on a scale from 0 = strongly disagree to 10 = strongly agree. Respondents evaluated 24 statements for 5 different brands. So, I have 5*24=120 variables in my data set. Now, my goal is to combine these variables into one set of variables so that they represent valuations of all brands together. So I want to get 24 items which include valuations of all brands. I need to combine these statements. Also, one item is negatively formulated. Do I have to reverse the codes?

What would you suggest? How should I transform and combine these variables, so that I get one set of variables to conduct factor analysis (which takes into account all scores for all brands)

I would much appreciate your quick response

Best regards,

Davit

Nighat Parveen

hello,

I have selected an instrument for data collection which was developed using 5 factors of a model, but the author has combined these 5 factors into 3 new factors and defined them. I want to use the 5 factors of the original model by distributing the items accordingly. Will this affect the validation of the original tool?

Farimah Dokoushkani

Dear Noora

In my opinion and based on my experience, there is no need for standardizing the data..

BTW, sorry for late response dear..

Best,

Farimah

Farimah Dokoushkani

Dear Davit

If I am correct, you have 120 items, and every 24 of them measure (for example) the level of satisfaction with one brand. So, you measured 5 different things that cannot be combined!

Regarding the negative item, yes, you should reverse its score..
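Reverse scoring is usually done as minimum + maximum - score. A minimal Python sketch for the 0 to 10 scale described in the question, with a hypothetical function name:

```python
def reverse_code(score, scale_min=0, scale_max=10):
    """Reverse-score a negatively worded item: min + max - score."""
    if not scale_min <= score <= scale_max:
        raise ValueError("score lies outside the scale range")
    return scale_min + scale_max - score

# On a 0-10 scale, 0 becomes 10, 7 becomes 3, and the midpoint 5 stays 5
reversed_scores = [reverse_code(s) for s in [0, 7, 5, 10]]
```

The same function works for a 1 to 5 scale by passing scale_min=1 and scale_max=5.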

Hope it helps..

Sorry for late response.

Best,

Farimah

Youssef Boudribila

Dear Farimah and all interested in this subject.

There is a quite misunderstanding of what you want to do and how to do it. This subject was treated with experts in psychometric and statistics in a rigorous manner.

I do believe that, depending on how you want to combine your scores, you need to bring them to the same scale, either by normalization or by using equating methodologies such as equipercentile equating method(s). The R software has interesting and easy-to-use packages for this. The simplest way is to normalize to the zero-one interval, and the formula is easy, since it uses only the min and max values: it is like the z-statistic, except that the minimum is subtracted instead of the mean and the denominator is the difference between the max and min values. I hope this will solve the problem for all.

Farimah Dokoushkani

Dear Dr. Boudribila

Thank you for your response. However, when we are applying AMOS, it normalizes our data automatically; hence, there is no need to normalize it before running the model.

Moreover, I compared the results of analyzing the normalized data and non-normalized data. They were the same!

Best,

Farimah

Salam Jassim Hmood

Dear Farimah, I agree with David L Morgan, and I would like to add that the questionnaire form should be coordinated, disaggregated, similar and far from complex in terms of design from the outset, in order to save time, effort and costs.

Sulaiman Umar Musa

I'm happy to come across this issue of standardizing scales because I have a related problem. Pls, I need your suggestions.

In my case, I'm using PLS-SEM for my survey data analysis. I have one variable that is not latent. The variable has a single (proxy) amount in each questionnaire (range in all questionnaires: 30,000 to 400,00). It is on a ratio scale. All other variables are on a 5-point Likert scale.

Pls refer me to the process and justification for converting the ratio data to interval scale (5-point).

Thank you

Farimah Dokoushkani

Dear Sulaiman

As far as I know, it is okay to run your model with both observed and latent variables. SEM needs at least one latent variable and all other variables can be observed. There is no need to convert your ratio scale to interval one!

Hope it helps,

Farimah

Dela Aghnia Maraya

Hello, my name is Dela. Now, I'm doing data processing for my thesis.

My research is about the quality of marriage by Norton (1983). In this questionnaire, there are 5 items on a 7-point Likert scale and 1 item on a 10-point Likert scale.

In this case, I'm confused about how to process data with different Likert scales in one questionnaire. Can I sum the items directly into a total score, or should they be standardized first?

The manual that I found does not explain the statistical processing any further. It only explains that the total score obtained from this questionnaire illustrates how the quality of the marriage is perceived by the individual.

I would appreciate an explanation of how to convert data into standardized data, especially in SPSS. Thank you for your help and information; it means a lot to me.

Paul Hubert Vossen

@ Dela Aghnia Maraya : There are two ways to do it systematically: either you take the time and trouble of learning about statistical Rasch modeling or its generalization called Item Response Theory or you apply the non-statistical but more straightforward approach I have worked out under the assumption that your respondents give roughly reliable and stable responses, i.e. without too much noise.

In the latter case, and if you opt for that approach, I can give you the simple formula to standardize your k-point Likert scales (for any integer k) into standard bipolar Likert scores ranging between -1 and +1. Once you have standardized, there is another simple formula to calculate weighted averages or means, either for a single person (over all scales) or over all persons (per scale or using all scales simultaneously). You don't need SPSS for that, an Excel-sheet is all you need. Depending upon the number of data you have, it will take just a few days to get the answers you need.

Of course, if your supervisors insist on a statistical approach, there is currently no way around some sort of Rasch modelling, but that will take some months to understand and apply.

I am no fan of Rasch and IRT, because they tacitly assume that your users' responses may be thought of as lying on the real line, i.e. neither bounded below nor bounded above, because only then can you add scores in the usual way. But bounded scores behave differently: they have to be added in a completely different way. If not, the results will be inflated or the conclusions biased, although SPSS and similar statistical packages won't tell you this (unless you read the "small print", of course).

Noor Atikah Zainal Abidin

Sample-Likert-Scales.pdf

Paul Hubert Vossen

[last update: 2018-03-12]

Thanks Noor Atikah Zainal Abidin for sharing with us these 5-point and 7-point Likert scales. Some remarks for those not so much acquainted with the meaning and use of Likert scales:

A Likert scale is a qualitative (ordinal) scale. The symbols "1", "2", ... attached to the points are not numbers, they are just anchors ("points") chosen in the same way that rulers do: they indicate some points ("anchors") on the underlying scale usually assumed to be spread out more or less evenly.
There is no intrinsic restriction to 5 or 7 points. In fact, you may set up your own n-point Likert scales where n is different from 5 or 7. For obvious pragmatic reasons, however, Likert scale experts favor scales with not too many points, i.e. anchors, and to use *only* the anchors when rating, but obviously this is just a convention.
Mostly, a Likert scale will have an uneven number of points: (3), 5, 7, 9, (...). The argument is, that there should be a "neutral" point in between, so that people get the option of not deciding between negative or positive scores. Some Likert scale in use, however, force people to decide between a positive or a negative score: these scales have an even number of "points" or anchors.
From the foregoing it follows that there are two different types of Likert scales: either the underlying numeric scale is the bounded interval between -1 and +1 (with neutral anchor) or the underlying numeric scale is the bounded interval between 0 and +1 (without neutral anchor).
It is best to think of the leftmost anchor (i.e., mostly "1") as the lowest possible score and the rightmost anchor (e.g., "5" on a 5-point scale) as the highest possible score. Thus it makes no sense at all to assign a rating below the leftmost anchor or a rating above the rightmost anchor. In fact, they should be treated as -infinity and +infinity, respectively. Mathematically there is a simple way to transform Likert scale ratings to real (decimal) numbers such that the above remarks get a "real" sense.
Where the rest of the anchors will be positioned on this numeric Likert scale depends upon your assumptions: if you assume (without verifying) that the anchors are evenly spread out, the anchor positions are fixed once you fix the number of anchors; if you don't assume the principle of even-spread, you have to do some empirical research (see the literature) to find out where the anchor positions are best located. In the last case you will plausibly find that indeed the anchors are not evenly spread out, but cluster more and more to both endpoints of the scale.
Once you have the scale points of the anchors (Likert-points), you can apply the Likert algebra (see my previous contribution) to calculate sums, differences, means or whatever you need to get aggregated scores over test items and/or respondents.

Awa Njie

Interesting: the previous conversation gave me some insight into my current problem. However, it is not clear for my case. In my study questionnaire, the variables I want to factor-analyse for my new measure, 'uncertainty avoidance', are on a 6-point Likert scale, a 9-point scale and a 4-point scale.

I understand that it is advisable for all the variables to be similar in scale length, so that they contribute equally to the new scale. And I saw in the conversations here that a z-score can be used before performing factor analysis. If so, should all variables be standardized or just the different ones? For example, most of my variables are on a 6-point Likert scale; only two are on 4-point and 9-point Likert scales. Any experience or way forward for this?

I would appreciate a timely response. Thanks in advance!

Ali Farooq

Thanks for asking, Awa.

z-scores are calculated for all the variables to make them standardized, that is, having a mean of 0 and a standard deviation of 1 (the values themselves are not restricted to the range -1 to 1). If you calculate z-scores for the 4-point and 9-point items only, it will be difficult to interpret the results of the factor analysis.

Awa Njie

Thank you for the timely response, Ali. So all of them have to be standardized. Which Likert scale should be chosen for all the variables? The 6-point one, since most items use it? How is the scale determined anyway?

Also, I read in a few posts that converting previously used scales (switching scales in general) may create the problem of not being able to compare your results to other results that were obtained with the old scale. How do I deal with this? I forgot to mention that I will run some regressions after my factor analysis.

Frank M. Schneider

Two citable sources are:

Preston, C. C., & Colman, A. M. (2000). Optimal number of response categories in rating scales: Reliability, validity, discriminating power, and respondent preferences. Acta Psychologica, 104, 1–15. https://doi.org/10.1016/S0001-6918(99)00050-5

They used the following formula to match scales with different numbers of response categories: (rating-1)/(number of response categories-1)*100

Dawes, J. (2008). Do data characteristics change according to the number of scale points used? An experiment using 5-point, 7-point and 10-point scales. International Journal of Market Research, 50, 61–77. https://doi.org/10.1177/147078530805000106

He described also an alternative way by anchoring the scale end points.

Correlation-based analyses won't differ, but if you have to use mean differences (like in a pairwise t test), you will get biased results if variables aren't rescaled. Of course, both variables should be interval-scaled and measure the same construct (e.g., positive affect with a 5-point or a 7-point Likert-type scale).
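To make the last point concrete, here is a minimal Python sketch that applies the Preston and Colman formula quoted above, (rating - 1) / (number of response categories - 1) * 100, to made-up paired ratings. It shows that the correlation is unchanged by the rescaling while the mean difference is not. The data and function names are hypothetical.

```python
from statistics import mean

def rescale_0_100(rating, n_categories):
    """(rating - 1) / (number of response categories - 1) * 100"""
    return (rating - 1) / (n_categories - 1) * 100

def pearson_r(x, y):
    """Pearson correlation computed from sums of squares and cross-products."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

# Hypothetical ratings of the same construct by the same six respondents
five = [2, 3, 4, 4, 5, 1]     # 5-point version
seven = [3, 4, 6, 5, 7, 2]    # 7-point version

five_r = [rescale_0_100(v, 5) for v in five]
seven_r = [rescale_0_100(v, 7) for v in seven]

r_raw = pearson_r(five, seven)
r_rescaled = pearson_r(five_r, seven_r)          # same as r_raw
raw_diff = mean(seven) - mean(five)              # mixes two different units
rescaled_diff = mean(seven_r) - mean(five_r)     # both on a 0-100 metric
```

The correlation survives because each rescaling is a positive linear transformation; a mean difference has no such protection.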

Paul Hubert Vossen

@ Youssef Boudribila: "I do believe that, depending on how you want to combine your scores, you need to bring them to the same scal[e] either by normalization, or using equating methodologies such as equipercentile equating method(s)."

These two options represent the popular wisdom coming from elementary statistics. Unfortunately, this is not enough and may lead to (gross) errors. Indeed, plain normalization and equating are simple to do, but their simplicity is misleading.
The problem is that those elementary statistical approaches don't respect the inherent structure of (ordinal or interval) n-point scales. Those scales possess a structure of their own (sic!), which can be modelled in a rigorous mathematical way.
Knowing the rules, it is not difficult to work out the optimal way of merging any number of items on n-point scales, where n may be different for different items/scales. I have done this even for the problem of peer assessment, where you have to merge two completely distinct scales/scores, each based on ten or more n-point items.
Statistics may be used to figure out how many items to use, which items to use, and how many points on a scale are reasonable. But if you don't take the underlying scale structure into account, your results and conclusions will be false, anyway.

My offer: if you have an example of a questionnaire or test with different anchor points for different items, and want to know how to correctly "equate" them without any statistical hocus-pocus, please send me the data and I will quickly show you the correct solution. Don't forget to tell me how you would like to report the conclusion, i.e. which format of the merged scales you would like to use, e.g. a 5-point scale or a 7-point scale or even a percentage scale (101 points).

Youssef Boudribila

Paul Hubert Vossen : Thank you for sharing your point of view. You may have good experience in this field from the mathematical point of view. I respect it and would like to learn about it. I can give a simple example, not from the education system but from the labour market, where the task is to score the skills or abilities required for a given occupation. There are two scales, from 1 to 5 and from 0 to 7, for the same items: the first for the importance of the skill and the second for its level. If you are interested in converting from one to the other, I will send you the data. Thanks.

Paul Hubert Vossen

Youssef Boudribila : Yes, I would like to share my knowledge with you. It will also be a good opportunity for me to see whether I have overlooked an important aspect which didn't occur in the educational context (up to now). From what you told me, I infer the following:

You have two scales each measured by the same number of items N. Let's call them scale A for skills and scale B for skill level. On face value I would say that scale A is the more important one, so I would suggest to take scale A as the primary scale and scale B as the secondary scale. This matches quite well with the inherent structure of Peer Assessment, so we may as well adopt the same approach/method.
Probably you are using some sort of 5-point Likert scale for A and an 8-point Likert scale for B. I assume (until you tell me otherwise) that both scales are ordered from low to high. If not, we may reverse the order during analysis, although it is known that there may be small numerical differences in scaling when negatively worded items are used.
Unclear is still, whether there is a sense of "negative answers" versus "positive answers" for both scales or for one scale, but clearly scale A has a midpoint (anchor 3) and scale B not (somewhere between anchor points 3 and 4).
Let's assume that the items are just graded from low to high, with an absolute zero at the left end of the scale. Then I suggest to convert all "points" to the standard scale from 0 to 1 (representing percentages, if you like).
Are the scale points ("anchors") spread out more or less evenly along the scales (case "symmetry") or do you feel (and did the respondents feel too) that distances between the lower anchors are larger (clearer) than distances between the higher anchors (case "non-symmetry")? It may be possible (I have to check that on the basis of your data) to infer which case (symmetry or not) applies for scale A and which for scale B. For the moment, let's assume that we have a case of non-symmetry for both scales, which is the more common case too in an educational context.
Now we have uniquely defined which model to apply: it is the model with standard scale [0,1] and where it is not true that distances between the anchor points are evenly spread out. This model comes with some fairly simple scoring formulae, i.e. how to calculate averages and how to calculate a single final score for each respondent from his score on scale A and his rating on scale B.

So, my approach not only "equates" the scales (in the sense of mapping both to the same standard scale [0,1]), but - more importantly - gives you a nice standard way of merging the separate scores A and B into a single final score between 0 and 1.

Of course, if you like, you may "translate" those numerical scores back to 5-point, or 7-point scale, or even to a percentage scale with 101 anchor points. That's up to you.

Naomi Podber

Hi Paul, thanks for all of your input. Can you give me your opinion on whether it is possible to compare 2 sets of items, one with the anchors "strongly disagree," "disagree," "agree," and "strongly agree," and the other with the same anchors, but with a mid-point "neither agree nor disagree" added? It seems that any math I run, from a simple equating, to modeling the spacing of the responses for each item, to using IRT for each item, would ignore that identical anchors exist between the 2. Is this permissible because the anchors themselves are not as important as how they function within the structure of the scale? Or does it make more sense to code the 4-point scale with the 1st, 2nd, 4th, and 5th points of the 5-point scale?

Paul Hubert Vossen

Hi Naomi: This is an interesting situation which I didn't think about before. In fact there are two strongly related issues: (a) how to map the categories (verbal anchor points) onto scale numbers? (b) how to assure that your respondents mentally associated the categories in the same way you did (or intended to do)?

Issue (a)

Can be boiled down to the question: Is the first scale intended to be like the second scale, i.e. with an implicit midpoint in between "disagree" and "agree" such that we get "equally-spaced scale points", BUT with the added restriction not to use that midpoint (i.e., forced choice)?

If so, then, yes, you should equate all 4 common categories and see to it that distances between numerical scores "respect" this invisible/dropped midpoint.
If not, don't equate in a numerical sense and accept that both scales have equally spaced anchors so that e.g. the answer category "agree" gets a different score on the first scale as compared with the second scale.

Issue (b)

Whether you opt for case 1 or case 2 is not enough. Your respondents should be aware of how you intended the categories to be interpreted and used. In particular, they should know that the intended distance between "disagree" and "agree" on the first scale is twice the distance between the answer categories on the second scale. BUT: respondents on the first scale have no way to let you know how strong their opinion on "disagree" versus "agree" was before they made their "forced choice"!

Recommendation

It's a tricky situation. Adopting afterwards one or the other assumption on how your respondents thought about the difference between both scales seems to me to be very risky. In order to find out if you have to worry about the differently framed scales I advise the following. Perform THREE analyses (assuming that it won't take too much time to do and would substantially guard you against objections later on):

(distinct anchors): take both scales as ordinary ordinal scales and discard the fact of "equally phrased categories" (which we would also do when using ordinal numbers instead of verbal categories)

(5 anchors): Put a dummy category in between "disagree" and "agree" on the first scale so that you have two 5-point scales and go ahead as usual/expected.

(4 anchors): Assume that the response category "neither agree nor disagree" on the second scale might be interpreted as "partly agree, partly disagree" or "undecided", and split all those responses evenly between "disagree" and "agree". In this way you are left with two 4-point scales which you handle as usual.

Then compare the results of all three analyses. Any unexpected or strange differences, which you can't explain away?

Of course I hope that you will find that all three analyses lead by and large to the same conclusions, which may be taken as evidence that the respondents behaved more or less in the way you intended.
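The "(5 anchors)" analysis above, where a dummy midpoint is placed between "disagree" and "agree", comes down to recoding the 4-point responses onto positions 1, 2, 4 and 5 of the 5-point scale. A minimal Python sketch, assuming the 4-point answers are coded 1 to 4 from "strongly disagree" to "strongly agree" (the coding and names are assumptions for illustration):

```python
# 4-point code -> position on the 5-point scale; position 3 (the midpoint) stays unused
FOUR_TO_FIVE = {1: 1, 2: 2, 3: 4, 4: 5}

def recode_4pt_to_5pt(responses):
    """Map forced-choice 4-point codes onto the matching 5-point anchors."""
    return [FOUR_TO_FIVE[r] for r in responses]

# Hypothetical answers on the 4-point version
recoded = recode_4pt_to_5pt([1, 2, 3, 4, 3])
```

After recoding, "agree" and "strongly agree" carry the same numbers on both versions of the scale.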

Hope this helps.

Naomi Podber

Thank you *very* much for the thorough response! I will perform all 3 analyses on the data.

Robin Lynn Nelson

I am using two valid and reliable surveys for my data collection; however, I want to make major changes to one item. One survey is a 5-point Likert scale ranging from 1 "Strongly agree" to 5 "Strongly disagree." The other is a 7-point Likert scale ranging from 1 "Strongly DISAGREE" to 7 "Strongly AGREE". Is it possible to create a single survey that uses a 5-point scale that converts the 7-point scale to 1 "Strongly agree" to 5 "Strongly disagree" scale?

Paul Hubert Vossen

Dear Robin

Did you read my suggestions on a similar question some months ago, just a few posts upward?