A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations
- Md Tahmid Rahman Laskar*
- , Sawsan Alqahtani
- , M. Saiful Bari*
- , Mizanur Rahman
- , Mohammad Abdullah Matin Khan
- , Haidar Khan
- , Israt Jahan
- , Md Amran Hossen Bhuiyan
- , Chee Wei Tan
- , Md Rizwan Parvez
- , Enamul Hoque
- , Shafiq Joty*
- , Jimmy Xiangji Huang*
*Corresponding author for this work
- York University Toronto
- Dialpad Canada Inc.
- Princess Nourah Bint Abdulrahman University
- National Center for AI
- Royal Bank of Canada
- Nanyang Technological University
- Salesforce.com, Inc.
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
27
Link opens in a new tab
Citations
(Scopus)