Automatically Detecting Failures in Natural Language Processing Tools for Online Community Text.
Albert Park, Andrea Lisabeth Hartzler, Jina Huh-Yoo, David W McDonald, Wanda Pratt
Published in: Journal of Medical Internet Research (2015)
We illustrate the challenges of processing patient-generated text from online health communities, characterize the failures of NLP tools on this text, and demonstrate the feasibility of our low-cost approach to detecting those failures automatically. Our approach shows potential for scalable, effective solutions that automatically assess the constantly evolving NLP tools and source vocabularies used to process patient-generated text.