Login / Signup

The Use of Generative AI for Scientific Literature Searches for Systematic Reviews: ChatGPT and Microsoft Bing AI Performance Evaluation.

Yong-Nam GwonJae Heon KimHyun Soo ChungEun Jee JungJoey ChunSerin LeeSung-Ryul Shim
Published in: JMIR medical informatics (2024)
This is the first study to compare AI and conventional human systematic review methods as a real-time literature collection tool for evidence-based medicine. The results suggest that the use of ChatGPT as a tool for real-time evidence generation is not yet accurate and feasible. Therefore, researchers should be cautious about using such AI. The limitations of this study using the generative pre-trained transformer model are that the search for research topics was not diverse and that it did not prevent the hallucination of generative AI. However, this study will serve as a standard for future studies by providing an index to verify the reliability and consistency of generative AI from a user's point of view. If the reliability and consistency of AI literature search services are verified, then the use of these technologies will help medical research greatly.
Keyphrases
  • systematic review
  • artificial intelligence
  • meta analyses
  • healthcare
  • machine learning
  • deep learning
  • randomized controlled trial
  • mental health
  • current status
  • mass spectrometry
  • clinical evaluation