mean_true =100var_true =100var_error =25sd_error =sqrt(var_error)var_obs = var_true + var_errorsd_obs =sqrt(var_obs)mean_obs = mean_true # because mean error is zero#This corresponds to a reliability of .80 rxx = var_true/var_obsprint(rxx)
[1] 0.8
Now, let’s imagine Bob who has a true score of 90. Then, we create 1,000,000 observed scores for Bob.
For each of Bob’s 1,000,000 observed scores we created a confidence interval (centered on the observed score). Then, we determined the percentage of the 1,000,000 confidence intervals that included Bob’s true score. We found that 94.98% of them captured his true score. Thus, a 95% SEM confidence interval, centered on the estimated true scores (\(\hat{y}_{true}\)), DOES capture the test taker’s true score at the specified probability.
7.3 Myth 2: Standard error of measurement captures a test taker’s future observed scores
Be sure you have run all of the code in Myth Simulation Setup prior to the code below.
We started with Bob having an observed score of 80 even though his true score is 90. We used Bob’s observed score (80) as the center for a 95% SEM [70.20, 89.80]. Then we examined the extent to which this interval captured Bob’s observed scores. We found the interval captured only 48.36% of Bob’s observed scores. Thus, a 95% SEM interval, centered on an observed score, does NOT capture the specified percentage of future observed scores.
Furthermore, recall that Bob’s observed scores ranged from 66 to 113. We can re-run the simulation was using cur_obs_for_bob = 66 or cur_obs_for_bob = 113. If re-run the above simulation using these values we find capture rates of 0.22% and 0.40%, respectively. Thus, the further an observed score is from the true score the worse the interval performs at capturing future observed scores.
7.4 Myth 3: Using an Estimated True Score Provides an Interval that Capture Future Observed Scores
Be sure you have run all of the code in Myth Simulation Setup prior to the code below.
We started with Bob having an observed score of 80 even though his true score is 90. We calculated an estimated true score (i.e., \(\hat{y}_{true} = 84.00\)). This value is the estimated mean true score for all test takers with an observed score of 80.00. We use \(\hat{y}_{true} = 84.00\)) as the center for a 95% SEM [74.20, 93.80]. Then we examined the extent to which this interval captured Bob’s observed scores. We found the interval captured only 77.55% of Bob’s observed scores. Thus, a 95% SEM interval, centered on the estimated true score, does NOT capture the specified percentage of future observed scores.
Once again, recall that Bob’s observed scores ranged from 66 to 113. We can re-run the simulation was using cur_obs_for_bob = 66 or cur_obs_for_bob = 113. If re-run the above simulation using these values we find capture rates of 6.97% and 1.71%, respectively. Thus, the further an observed score is from the true score the worse the interval performs at capturing future observed scores.
7.5 Myth 4: Using a Estimated True Score Provides an Interval that Captures True Scores
Be sure you have run all of the code in Myth Simulation Setup prior to the code below.
For each of Bob’s 1,000,000 observed scores we calculated a predicted true score (\(\hat{y}_{true}\)). Recall, this predicted true score value is the mean true score for all test takers that share Bob’s observed score. We used this predicted mean true score as the center for a 95% SEM confidence interval. Then, we determined the percentage of the 1,000,000 confidence intervals that included Bob’s true score. We found that 97.26% of them captured his true score. Thus, a 95% SEM confidence interval, centered on the predicted mean true scores (\(\hat{y}_{true}\)), does NOT capture the test taker’s true score at the specified probability. Consequently, a Standard Error of Measurement interval for a single test taker should be centered on the test taker’s observed score not a predicted true score.