Machine learning to detect invalid text responses: Validation and comparison to existing detection methods

Published in Behavior Research Methods, 2022

Recommended citation: Yeung, R. C., & Fernandes, M. A. (2022). Machine learning to detect invalid text responses: Validation and comparison to existing detection methods. Behavior Research Methods, 1–16. https://doi.org/10.3758/s13428-022-01801-y

We propose and implement a supervised machine learning approach that can mimic the accuracy of human coding, but without the need to hand-code entire text datasets. Using autobiographical memory texts, we accurately detected invalid texts with performance near human coding, significantly outperforming existing data quality indicators.
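As a rough illustration of the general idea (hand-code a small subset, train a classifier on it, then label the remaining texts automatically), here is a minimal sketch. It is not the authors' model or pipeline: the choice of a multinomial Naive Bayes classifier, the `NaiveBayesTextClassifier` class, the `tokenize` helper, and the toy training texts are all assumptions made for this example only.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (hypothetical helper)."""
    return re.findall(r"[a-z']+", text.lower())


class NaiveBayesTextClassifier:
    """Multinomial Naive Bayes with add-one smoothing (illustrative only)."""

    def fit(self, texts, labels):
        self.classes = sorted(set(labels))
        self.doc_counts = Counter(labels)
        self.total_docs = len(labels)
        # Per-class word frequencies from the hand-coded subset.
        self.word_counts = {c: Counter() for c in self.classes}
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = set()
        for counts in self.word_counts.values():
            self.vocab.update(counts)
        return self

    def predict(self, text):
        # Words never seen in training carry no evidence, so drop them.
        tokens = [t for t in tokenize(text) if t in self.vocab]
        best_class, best_score = None, -math.inf
        for c in self.classes:
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(self.doc_counts[c] / self.total_docs)
            denom = sum(self.word_counts[c].values()) + len(self.vocab)
            for t in tokens:
                score += math.log((self.word_counts[c][t] + 1) / denom)
            if score > best_score:
                best_class, best_score = c, score
        return best_class


# Hypothetical hand-coded subset: a few texts labeled by a human coder.
train_texts = [
    "I remember my grandmother's kitchen on the day we baked bread together",
    "We drove to the lake last summer and I fell off the dock",
    "My first day at school I got lost looking for the classroom",
    "nothing",
    "I don't know",
    "asdf asdf",
    "none nothing to say",
]
train_labels = ["valid", "valid", "valid",
                "invalid", "invalid", "invalid", "invalid"]

clf = NaiveBayesTextClassifier().fit(train_texts, train_labels)

# The remaining, uncoded texts would then be labeled automatically.
print(clf.predict("I remember the day we drove to school"))
print(clf.predict("asdf nothing"))
```

In practice, a labeled subset this small would be far too little to train on, and predictions would need to be checked against held-out human codes before being trusted.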


Ryan Yeung
