Rethinking with Retrieval: Faithful Large Language Model Inference

Date: 2023-07-20 20:46:05

Tags: Rethinking, Inference, Faithful, laptop, LLM, Aristotle, invented, was, first

Overview
Rethinking with retrieval (RR)
Code

He H., Zhang H. and Roth D. Rethinking with retrieval: faithful large language model inference. arXiv preprint arXiv:2301.00303, 2023.

Overview

LLM (Large Language Model) + retrieval.

Rethinking with retrieval (RR)

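CoT (chain of thought) has been shown to be an effective way to draw out an LLM's capabilities, but it still leaves the LLM prone to making things up. RR aims to give the LLM a rational reference while it reasons, so that it reaches better and more accurate results (panel (c) of the original figure).
Let us walk through the process using the question "Did Aristotle use a laptop?".
First, CoT asks the LLM to decompose the question and solve it step by step. The ideal steps are: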
1. "Aristotle died in 322 BC.";
2. "The first laptop was invented in 1980.";
3. "So the answer is no.".
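However, these steps rely on a lot of 'common knowledge'. If the LLM has not learned this knowledge well, it may answer incorrectly even when the overall procedure is right. In practice it may produce responses such as the following (R1 gets the year of Aristotle's death wrong, and R2 gets the year the laptop was invented wrong):
1. [R1] "Aristotle died in 2000. The first laptop was invented in 1980. Thus, Aristotle used a laptop. So the answer is yes."
2. [R2] "Aristotle died in 322BC. The first laptop was invented in 2000. Thus, Aristotle did not use a laptop. So the answer is no."
3. [R3] "Aristotle died in 322BC. The first laptop was invented in 1980. Thus, Aristotle did not use a laptop. So the answer is no."
Sometimes the answer is even correct while the fact it rests on is wrong (R2).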
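To compare responses like these, each one first has to be separated into its reasoning steps and its final answer. The following is a minimal sketch of that split, assuming every response ends with a sentence of the form "So the answer is X."; the helper name `split_chain` and the regular expressions are illustrative choices, not taken from the paper or its code.

```python
import re
from typing import List, Optional, Tuple


def split_chain(response: str) -> Tuple[List[str], Optional[str]]:
    """Split a chain-of-thought response into reasoning steps and a final answer.

    Assumption: sentences end with a period, and the answer is stated as
    "So the answer is <word>.".
    """
    # Break the response at whitespace that follows a period.
    sentences = [s for s in re.split(r"(?<=\.)\s+", response.strip()) if s]
    steps: List[str] = []
    answer: Optional[str] = None
    for sentence in sentences:
        match = re.fullmatch(r"So the answer is (\w+)\.", sentence)
        if match:
            answer = match.group(1).lower()
        else:
            steps.append(sentence)
    return steps, answer


responses = {
    "R1": "Aristotle died in 2000. The first laptop was invented in 1980. "
          "Thus, Aristotle used a laptop. So the answer is yes.",
    "R2": "Aristotle died in 322BC. The first laptop was invented in 2000. "
          "Thus, Aristotle did not use a laptop. So the answer is no.",
    "R3": "Aristotle died in 322BC. The first laptop was invented in 1980. "
          "Thus, Aristotle did not use a laptop. So the answer is no.",
}

parsed = {name: split_chain(text) for name, text in responses.items()}
for name, (steps, answer) in parsed.items():
    print(name, answer, steps)
```

R2 and R3 yield the same answer here, so the answer alone cannot distinguish a chain built on a wrong fact from one built on correct facts.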
We therefore need an external knowledge base such as Wikipedia. For the question above, two pieces of knowledge are involved:
1. [K1] Aristotle (384–322 BC) was a Greek philosopher and polymath during the Classical period in Ancient Greece. ...
2. [K2] The Epson HX-20, the first laptop computer, was invented in 1980. ...
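Looking up the relevant passage for each reasoning step can be sketched as follows. This is a toy example: the knowledge base holds only the two passages K1 and K2 quoted above (truncated as in the text), and plain token overlap stands in for a real retriever. The names `KB`, `tokens`, and `retrieve` are hypothetical.

```python
import re
from typing import Dict, Set

# Toy knowledge base with the two passages quoted in the text.
KB: Dict[str, str] = {
    "K1": "Aristotle (384–322 BC) was a Greek philosopher and polymath "
          "during the Classical period in Ancient Greece.",
    "K2": "The Epson HX-20, the first laptop computer, was invented in 1980.",
}


def tokens(text: str) -> Set[str]:
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(query: str, kb: Dict[str, str]) -> str:
    """Return the key of the passage that shares the most tokens with the query.

    Assumption: token overlap is only a stand-in for a proper retriever.
    """
    query_tokens = tokens(query)
    return max(kb, key=lambda key: len(query_tokens & tokens(kb[key])))


print(retrieve("Aristotle died in 322 BC.", KB))
print(retrieve("The first laptop was invented in 1980.", KB))
```

Each reasoning step is used as its own query, so a chain with two factual claims is checked against two separately retrieved passages.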
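The faithful inference that RR relies on can be defined as follows. For a question \(Q\), suppose the LLM produces a set of chains of thought \(R_1, R_2, \ldots, R_N\) together with their corresponding answers \(P_1, P_2, \ldots, P_N\). Faithful inference is then defined as: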

\[R^* := \mathop{\arg\max} \limits_{R_i, i=1,2,\ldots, N} \mathbb{I}[P_i = P] f_{\mathcal{KB}}(R_i). \]
where \(f_{\mathcal{KB}}: R \rightarrow \mathbb{R}^+\) measures how well the chain of thought \(R\) agrees with the external knowledge base.
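The selection rule can be sketched in a few lines. The text does not say how the answer \(P\) is fixed, so this sketch assumes \(P\) is the answer whose chains have the largest total faithfulness score, and then returns the highest-scoring chain among those with \(P_i = P\). The function name `faithful_inference` and the scores in the example are made up for illustration; they are not outputs of any real \(f_{\mathcal{KB}}\).

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def faithful_inference(
    chains: List[str], answers: List[str], scores: List[float]
) -> Tuple[str, str]:
    """Pick the chain R* and the answer P from N scored chains of thought.

    Assumption: P is the answer with the largest summed score f_KB(R_i);
    R* is the chain with the largest f_KB(R_i) among chains with P_i = P.
    """
    # Faithfulness-weighted vote over the candidate answers.
    totals: Dict[str, float] = defaultdict(float)
    for answer, score in zip(answers, scores):
        totals[answer] += score
    best_answer = max(totals, key=lambda answer: totals[answer])

    # argmax over chains whose answer matches P (the indicator in the formula).
    matching = [i for i, answer in enumerate(answers) if answer == best_answer]
    best_index = max(matching, key=lambda i: scores[i])
    return chains[best_index], best_answer


# Hypothetical scores: R1 contradicts K1, R2 contradicts K2, R3 agrees with both.
chains = ["R1", "R2", "R3"]
answers = ["yes", "no", "no"]
scores = [0.2, 0.5, 0.9]

best_chain, best_answer = faithful_inference(chains, answers, scores)
print(best_chain, best_answer)
```

With these made-up scores, R2 and R3 both support "no", but R3 is preferred because its reasoning agrees better with the knowledge base.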

Code

[official]

From: https://www.cnblogs.com/MTandHJ/p/17569614.html
