Replicating the Monte Carlo split-half #13
-
I've just read your paper on split-half reliability, and so I wanted to try out the Monte Carlo approach you mentioned in it. It's not clear to me how to do it correctly, and I could not find the right way in the splithalfr package source code. I tried interpreting the descriptions given by you and Williams & Kaufmann, but the two interpretations I've implemented both give incorrect reliabilities. What do you suggest? the interpretations I've tried are as follows:
Kind regards.
|
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 1 reply
-
Hi @Spiritspeak, Your first interpretation of the splitting method is correct, i.e. "randomly sample (with replacement) two sets from the same dataset, the two sets being of equal size to the original dataset, and it is thus permitted that a single row in the data can end up in both sets.". In this vignette (included in the package) I provide a concrete example of this approach. With regard to your concerns about Monte Carlo estimates being too high, I've got two suggestions:
Cheers T |
Beta Was this translation helpful? Give feedback.
-
Given that there hasn't been any activity in this thread for about a week, I'll close this thread as "answered". However, @Spiritspeak, feel free to reopen it if you've got further questions or comments. |
Beta Was this translation helpful? Give feedback.
-
Hey @Spiritspeak, I spotted a preprint making a strong case against the using the Monte Carlo method because it overestimates reliability. They based their argument on simulated data. This nicely lines up with our preprint (now published here), which makes the same argument, but based on empirical data. |
Beta Was this translation helpful? Give feedback.
Hi @Spiritspeak,
Your first interpretation of the splitting method is correct, i.e. "randomly sample (with replacement) two sets from the same dataset, the two sets being of equal size to the original dataset, and it is thus permitted that a single row in the data can end up in both sets.". In this vignette (included in the package) I provide a concrete example of this approach.
With regard to your concerns about Monte Carlo estimates being too high, I've got two suggestions: