LLM-as-a-Judge Boosts Legal Document Recommendation Evaluation
LLM-as-a-Judge scoring aligns with experts, reaching Gwet's AC2 0.80, suggest Wilcoxon Signed-Rank testing with Benjamini-Hochberg correction for RAG comparison. Paper submitted September 2025. getnews.me/llm-as-a-judge-boosts-le... #llmasajudge #legalai
0
0
0
0