Evaluating large language models for health-related text classification tasks with public social media data.
Yuting GuoAnthony OvadjeMohammed Ali Al-GaradiAbeed SarkerPublished in: Journal of the American Medical Informatics Association : JAMIA (2024)
By leveraging the data augmentation strategy, we can harness the power of LLMs to develop smaller, more effective domain-specific NLP models. Using LLM-annotated data without human guidance for training lightweight supervised classification models is an ineffective strategy. However, LLM, as a zero-shot classifier, shows promise in excluding false negatives and potentially reducing the human effort required for data annotation.