Social Research Week Four

Due Before Class on February 18.  MethodsQuizFour will be available in SAKAI., covering Chapter Five in Babbie.  You may take the test twice and the highest score will count.  SAKAI will tell you which ones you got wrong and it is highly recommended that you take advantage of this feature and allow yourself some time between the first and second times you take the test.  There will be no make-up for this quiz, but late quizzes will be accepted until 4:30 on Feb 18.  Once you start a SAKAI quiz you have to finish it, clicking "save and exit" does not stop the clock.

Due by 5 p.m. on Wednesday February 20.  Multiple Bivariate Table (click on the link for instructions).   Late assignments will lose 10% after 5 p.m. Wednesday, 20% after 5 a.m. Thursday, 30% after 5 p.m. Thursday, 40% after 5 a.m. Friday and will not be accepted after 5 p.m. Friday.



Reading:  Babbie, Chapter Five, Conceptualization and Measurement

Class on Friday Feb 15 will be online.  Sign on to SAKAI and click on the Methods class and then click on "Chat Room".  If you are on campus, you can sign on at any campus computer lab.  You can also sign on from home or anyplace on the internet.  Have the Babbie book available and have the SDA software open in another window of your computer.  Go to the SDA Frequencies/Crosstabulation program.   If you miss this class, you can read the transcript of the chat at any time.


Class notes:  The Powerpoint shown in class this week is available in SAKAI.

Here are some key terms.  If you need definitions, check the book:
conceptualization - Thinking through what our conceptions mean, defining them.
dimensions of a concept
indicators of a concept
operationalization
operational definitions
interchangeability of indicators
reliability
  -  test-retest method
  -  split-half method
  -  internal consistency method (best for questionnaires with lots of items, Cronbach's alpha can be used or item-whole correlations
validity
 -  face validity
 -  convergent validity (very similar to internal-consistency tests of reliability)
 -  discriminant validity -  indicatoers of each dimension of the concept should correlate more than indicators of other dimensions
 -  criterion (or predictive) validity is best if you have a good criterion
 -  content validity
 -  construct validity, this is the most subtle and difficult to understand.  An example:  a study of UFO Abduction Status.


  Conceptualization means thinking through what we mean by the words we use (concepts are expressed with words).  Some are fairly obvious, e.g., suicide.  Others are not at all, e.g., race or poverty.  Is there a difference between "sex" and "gender"   Babbie distinguished between real definitions (which he does not believe in, but some philosophers do, e.g, Plato) , nominal definitions, operational definitions.  Babbie follows a nominalist philosophy as opposed to realist, concepts are things we make up, not things that really "exist".  This is a useful way to approach empirical research.

 An example:  the measurement of romantic love.
defined three dimensions affiliative and dependent need, a predisposition to help, and an orientation of exclusiveness and absorption.

A particularly controversial topic has been the concept of "intelligence"  What does this mean?  Is it one thing or does it have multiple dimensions: 
Example:     Multiple Intelligence

Operationalizing the Concepts.  A lot of effort goes into this.  Quantitative  research means you have to measure your variables and a lot depends on having good measurement.  Sometimes this is difficult, e.g., measuring "intelligence" or "liberalism-conservatism" or "mental illness" or "crime rates (various kinds)".  Often we use standard measures created by the government agencies that collect statistics.


The point of conceptualization and operationalization is to measure things.  This means putting things into categories that correspond to attributes of variables, e.g., men and women, upper class, middle class, lower class (or UU, LU, UM, LM, UL, LL),  annual income $37,541.23.   The critical thing in social research is how we do this and what our measures apply.  We can understand this in terms of

Levels of Measurement

The first and most important question is:   is the measure continuous or categorical?   This is important because continuous variables are required for the use of statistics such as the mean, standard deviation, correlation and regression.  With continuous measurement we have precise distances between the items measured, with categorical we just have them sorted into discrete categories.

If a variable is continuous, we can ask whether it is "interval" or "ratio".    Both of these have precise distance measurement between points.  In addition, ratio measures have a logically meaningful zero point.  With ratio measures, we can talk about ratios between variables, e.g., say that $50 is twice as much money as $25.   With interval variables, such as fahrenheit temperatures, we cannot make such statement.

If a variable is categorical, we can ask whether it is "dichotomous,"  "nominal" or "ordinal"

Dichotomous variables have only two categories.  These can be two natural categories such as "male' and "female"  or they can be artificial "dummy" variables, such as:   are you a Catholic or not;.  With dichotomies you can use regression and correlation.

Nominal variables have more than two categories, but not in any order or with a measured distance between them.  We can do percentages and chi-square significance tests with nominal variables.
Nominal Measurement.  Categories that could be put in any order.
      Catholic, Protestant, Jewish, Moslem, LDS, Buddhist, Episcopalian, Baptist
                       variable one, category of religion, variable two denomination.
            Mental illnesses (DSMIV) e.g.,  adjustment disorder, borderline personality disorder, paranoid schizophrenic
               Crimes:   burglary, assault, murder.  What do these terms mean?  Look at the US Criminal Code.

  Each individual should go into one and only one category on a variable, one value on a variable.   For example:  What is your favorite food, we have a long list, but each person is allowed only one.

 Sorting people into categories must be as reliable and accurate or valid as possible.  One of the things we do is evaluate how accurate our measurement is. 

Ordinal variables have the categories in a logical order  (from "lower" to "higher").  We can compute a median and a range.

Ordinal Measurement.   Here we have categories in a logical order.       Very short, short, medium, very tall, tall .  Often we take continuous variables and make them ordinal.    Income:   Under $20,000   $20 to 40,000  $40 to 60,000   $60000 plus.

Interval Measurement:   TEMPERATURE IN FAHRENHEIT OR CENTIGRADE, 0 degrees is not the absence of heat.  How about the day that the "temperature doubled" in New York City?

Ratio Measurement:    Income in dollars:  a continous numerical value PLUS a meaningful zero point.  Height in inches.

In answering questions about measurement, give the highest or best level of measurement that is justified.  Any variable that meets the criteria for a ratio variable also meets the criteria for an interval variable, but the criteria for a ratio variable are more stringent so we would say that it is ratio measurement.  Any ordinal variable also meets the criteria for a nominal variable, but if it meets the criteria for ordinal we say it is ordinal.

It is important to understand that many variables can be measured at different levels.  Thus I could take height and put it into categories such as short, medium, tall in which case I would be using ordinal measurement because they are in order.  I could also measure it in inches or centimeters, which would be ratio measurement.  It is also important to understand that each of the statistics is appropriate for variables measured in some ways but not others.  Doing percentages and cross-tabulations makes sense for nominal or ordinal data. Chisquare is for nominal or ordinal data. Doing correlation or regression or means and standard deviations requires interval or ratio data.  We can make a broad distinction between categorical (nominal or ordinal) or continuous (ratio or interval) data.  The dichotomy is a special case because we can use correlation and regression with dichotomies, but we can also do percentages, cross tabulations and chisquares.

Quality of Measurement   -   Reliability and Validity. 
 
Reliability -  you get the same thing over and over.  Consistency.

         inter-rater - two different raters get the same answer.
         test-retest, if you take it twice the answers are the same.
           internal consistency - are the items on a test consistent?  This can be calculated by looking at the inter-item correlations.  Chronbach's alpha is a statistic that measure inter-item reliability.  Example, correlate the ABORT variables in the GSS data file.  We see that all the correlations are positive and significant.  We can then make an index of them by adding up scores on the six variables.

Validity  is it "really" measuring what it is supposed to measure.
          Face Validity - does it look right?   This is often related to fairness, people will object to the use of measures that do not have face validity even though they may have predictive validity, e.g., using the frequency of moving as a criterion for loaning money.
          Predictive or criterion validity - does it predict what we want to predict, some "true" measure.  SAT test predicts college or law or medical school grades.
          Convergent validity -  do several measures give the same result.
             
          Construct validity - does the measure perform as our theory says it should.  We use this when we have no criterion.
  
This is the most difficult, it is used when things are inherently difficult to measure.  Essentially, it asks whether the results are consistent with what we would expect based on theory and past experience.    Camden schools reportBrim school report, see pdf page 14 for tables.  Story on Brim with graph