Three papers were accepted at The First Workshop on Large Language Models (LLMs) for Evaluation in Information Retrieval, also known as LLM4Eval:
- Chuan Meng, Negar Arabzadeh, Arian Askari, Mohammad Aliannejadi and Maarten de Rijke. Query Performance Prediction using Relevance Judgments Generated by Large Language Models
- Zahra Abbasiantaeb, Chuan Meng, Leif Azzopardi and Mohammad Aliannejadi. Can We Use Large Language Models to Fill Relevance Judgment Holes?
- Weijia Zhang, Mohammad Aliannejadi, Jiahuan Pei, Yifei Yuan, Jia-Hong Huang and Evangelos Kanoulas. Towards Fine-Grained Citation Evaluation in Generated Text: A Comparative Analysis of Faithfulness Metrics