Reply to Hu et al.: Applying different evaluation standards to humans vs. Large Language Models overestimates AI performance.

Evelina Leivada, Fritz Günther, Vittoria Dentella
Published in: Proceedings of the National Academy of Sciences of the United States of America (2024)
Keyphrases
  • artificial intelligence
  • autism spectrum disorder
  • machine learning
  • clinical evaluation