Comparative study of ChatGPT and human evaluators on the assessment of medical literature according to recognised reporting standards.
Richard H R RobertsStephen R AliHayley A HutchingsThomas D DobbsIain S WhitakerPublished in: BMJ health & care informatics (2023)
LLMs like ChatGPT can help automate appraisal of medical literature, aiding in the identification of accurately reported research. Possible applications of ChatGPT include integration within medical databases for abstract evaluation. Current limitations include the token limit, restricting its usage to abstracts. As AI technology advances, future versions like GPT4 could offer more reliable, comprehensive evaluations, enhancing the identification of high-quality research and potentially improving patient outcomes.