Clinical Text Data in Machine Learning: Systematic Review.
Irena SpasićGoran NenadicPublished in: JMIR medical informatics (2020)
We identified the data annotation bottleneck as one of the key obstacles to machine learning approaches in clinical NLP. Active learning and distant supervision were explored as a way of saving the annotation efforts. Future research in this field would benefit from alternatives such as data augmentation and transfer learning, or unsupervised learning, which do not require data annotation.