2024-12-14Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain本文是LLM系列文章,针对《TowardsRobustEvaluation:AComprehensiveTaxonomyofDatasetsandMetricsforOpenDomainQuestionAnsweringintheEraofLargeLanguageModels》的翻译。迈向稳健评估:大语言模型时代开放领域问答的数据集和指标的综合分类摘要1