ANSWERS WITH COMPLETE SOLUTIONS GRADED A++
Difficulty index
Indicates % of individuals who answered each item correctly. Test creators strive to create items that
have a value of 0.50, because this yields assessments with the greatest variation in test scores.
Item analysis
The process of examining each question on a test to see how it is related to the objectives being tested.
The goal is to assess the test items and the test as a whole.
T/F test has
dichotomous recognition items
Ipsative measure
compares traits within same individual (within-person analysis); does NOT compare person to others,
but rather is measured in response to her own standard. Allows person being tested to compare items,
but does NOT reveal absolute strengths.. (ex: Kuder Career Planning instruments) "I" in Ipsative =
i/individual
client who takes a norm-referenced test
CAN legitimately be compared to others who have taken the test. The individual's score is evaluated by
comparing it to others' scores. (ex: percentile rank--score of 82 on a normed test corresponds to
percentile rank of 60 --> 60% of individuals who took the test scored 82 or less)
,client who takes an ipsative test
canNOT legitimately be compared to others who have taken the test.
test battery
horizontal test, which measures various factors during the same testing procedure; several measures
used to produce results that could be more accurate than those derived from merely using a single
source.
vertical test
has versions for various age brackets or levels of education
parallel tests
two versions or forms of tests that are interchangeable; must have the same mean, standard error, &
other statistical components.
#1 factor in the construction of a test
validity
content validity
Ddoes the test examine or sample the behavior under scrutiny? Ex: IQ test must sample entire range of
intelligence (not just memory & math).
construct validity
test's ability to measure an abstract trait or psychological notion like intelligence, self-esteem, etc.
predictive validity / empirical validity
reflects test's ability to predict future behavior according to established criteria.
,predictive validity
Examines the relationship between an instrument's results collected now and a criterion collected in the
future. For example, using an assessment to measure distress due to depression in the present, and
determining how well that predicts rates of hospitalization in the future due to suicidal ideation.
concurrent validity
The degree to which the measures gathered from one tool agree with the measures gathered from other
assessment techniques at the same time. For example, using an assessment to gather data on
depression and examining that alongside hospital records on suicidal ideation.
What is one difference between concurrent validity and predictive validity?
In assessing concurrent validity, data from outside sources is gathered at the same time data is gathered
in an assessment in order to determine how well the two sets of data correlate. In predictive validity, the
data in the assessment is compared to criterion data that will be collected in the future.
consequential validity
tries to determine the social implications of using tests
ceiling/limit
reliability places a _______ on validity, but validity does NOT set limits on reliability.
physical measurements
are more reliable than psychological ones.
construct
any trait that you cannot "directly" measure or observe
, face validity
refers to extent that a test looks or appears to measure the intended attribute.
correlation coeffecients
expresses evidence for reliability and validity (closer to 1.00 the better)
convergent validity
Relationship or correlation of a test to an independent measure/trait. This is a method used to assess a
test's construct/criterion validity by correlating test scores w/ an outside source. Convergent validity
shows the assessment is related to what it should be.
For example, does a new assessment on depression correlate with the BDI-2? If so, this new assessment
can be said to have convergent validity.
discriminant validity
test will not reflect unrelated variables. Ex: If phobias are unrelated to IQ, then when one correlates
clients' IQ scores to their scores on a phobia test, this should produce a near zero correlation.
upper level
reliability determines the ______ of validity.
test-retest reliability
method which tests for stability; giving the same test to the same group of people 2 times and then
correlates the scores; only valid for traits like IQ which remain stable over time and are not influenced by
mood, memory, or practice effects.
equivalent or alternate forms of reliability