Login / Signup

Mining Health-Related Issues in Consumer Product Reviews by Using Scalable Text Analytics.

Manabu ToriiSameer S TilakSon DoanDaniel S ZisookJung-Wei Fan
Published in: Biomedical informatics insights (2016)
In an era when most of our life activities are digitized and recorded, opportunities abound to gain insights about population health. Online product reviews present a unique data source that is currently underexplored. Health-related information, although scarce, can be systematically mined in online product reviews. Leveraging natural language processing and machine learning tools, we were able to mine 1.3 million grocery product reviews for health-related information. The objectives of the study were as follows: (1) conduct quantitative and qualitative analysis on the types of health issues found in consumer product reviews; (2) develop a machine learning classifier to detect reviews that contain health-related issues; and (3) gain insights about the task characteristics and challenges for text analytics to guide future research.
Keyphrases