Semantic Kernel 学习笔记：通过 Kernel Memory 初步体验 Retrieval Augmented Generation

时间：2024-03-03 10:34:09浏览次数：33

标签：info Kernel Semantic Handlers Generation 博客园 Handler KernelMemory Microsoft

学习材料：Quick intro to Kernel Memory: install, upload a doc, ask a question

创建控制台项目

dotnet new console
dotnet add package Microsoft.KernelMemory.Core

创建 IKernelMemory 实例

var memory = new KernelMemoryBuilder()
    .WithOpenAIDefaults(OPENAI_API_KEY)
    .Build<MemoryServerless>();

注：默认大模型用的是 gpt-3.5-turbo-16k

运行控制台程序，通过日志可以看到加载了哪些 handler

info: Microsoft.KernelMemory.Handlers.TextExtractionHandler[0]
      Handler 'extract' ready
info: Microsoft.KernelMemory.Handlers.TextPartitioningHandler[0]
      Handler 'partition' ready
info: Microsoft.KernelMemory.Handlers.SummarizationHandler[0]
      Handler 'summarize' ready
info: Microsoft.KernelMemory.Handlers.GenerateEmbeddingsHandler[0]
      Handler 'gen_embeddings' ready, 1 embedding generators
info: Microsoft.KernelMemory.Handlers.SaveRecordsHandler[0]
      Handler save_records ready, 1 vector storages
info: Microsoft.KernelMemory.Handlers.DeleteDocumentHandler[0]
      Handler 'private_delete_document' ready
info: Microsoft.KernelMemory.Handlers.DeleteIndexHandler[0]
      Handler 'private_delete_index' ready
info: Microsoft.KernelMemory.Handlers.DeleteGeneratedFilesHandler[0]
      Handler 'delete_generated_files' ready

导入 PDF 文件

PDF 文件是博客园鼠标垫.pdf，内容来自这篇博文

导入 PDF 文件的代码

await memory.ImportDocumentAsync("博客园鼠标垫.pdf", documentId: "doc001");

对应上面这行代码的控制台日志输出

info: Microsoft.KernelMemory.Pipeline.BaseOrchestrator[0]
      Queueing upload of 1 files for further processing [request doc001]
info: Microsoft.KernelMemory.Pipeline.BaseOrchestrator[0]
      File uploaded: 博客园鼠标垫.pdf, 174013 bytes
info: Microsoft.KernelMemory.Pipeline.BaseOrchestrator[0]
      Handler 'extract' processed pipeline 'default/doc001' successfully
info: Microsoft.KernelMemory.Pipeline.BaseOrchestrator[0]
      Handler 'partition' processed pipeline 'default/doc001' successfully
info: Microsoft.KernelMemory.Pipeline.BaseOrchestrator[0]
      Handler 'gen_embeddings' processed pipeline 'default/doc001' successfully
info: Microsoft.KernelMemory.Pipeline.BaseOrchestrator[0]
      Handler 'save_records' processed pipeline 'default/doc001' successfully
info: Microsoft.KernelMemory.Pipeline.BaseOrchestrator[0]
      Pipeline 'default/doc001' complete

从日志看，在 import document 的过程中就完成了 embeddings 的生成并保存至向量数据库。

接下来，基于内存向量数据库中的 embeddings 数据，向 gpt-3.5-turbo-16k 模型提问，Kernel Memory 会自动根据提示词检索对应的 embeddings 然后一起发给大模型，这就是 RAG(Retrieval Augmented Generation)

var question = "博客园鼠标垫在哪买";
var answer = await memory.AskAsync(question);
Console.WriteLine($"Question: {question}\n\nAnswer: {answer.Result}");

运行程序，看看 AI 的回答：

Question: 博客园鼠标垫在哪买

Answer: 博客园鼠标垫可以在淘宝上购买。购买链接为https://item.taobao.com/item.htm?id=761724714914。另外，如果不想在淘宝上购买，也可以通过博客园的企业微信购买。

如果不使用 embedding，ChatGPT 的回答一看就是编出来的

博客园鼠标垫可以在博客园的官方网站上购买，也可以在其他在线购物平台或者实体店中找到。你可以在博客园网站上搜索他们的商店或者联系客服询问购买渠道

RAG 的效果果然明显。

标签：info,Kernel,Semantic,Handlers,Generation,博客园,Handler,KernelMemory,Microsoft
From： https://www.cnblogs.com/dudu/p/18037412

万字长文学会对接 AI 模型：Semantic Kernel 和 Kernel Memory，工良出品，超简单的教程
万字长文学会对接AI模型：SemanticKernel和KernelMemory，工良出品，超简单的教程目录万字长文学会对接AI模型：SemanticKernel和KernelMemory，工良出品，超简单的教程配置环境部署one-api配置项目环境模型划分和应用场景聊天提示词引导AI回复指定AI回复特定格式模板化提示......
记一次WPF集成SemanticKernel+OneAPI+讯飞星火认知大模型实践
开启OneAPI服务OneAPI介绍OpenAI接口管理&分发系统，支持Azure、AnthropicClaude、GooglePaLM2&Gemini、智谱ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360智脑以及腾讯混元，可用于二次分发管理key，仅单可执行文件，已打包好Docker镜像，一键部署，开箱即用.Ope......
Semantic Kernel 学习笔记：初步体验用 Semantic Memory 生成 Embedding 并进行语义搜索
SemanticKernel的Memory有两种实现，一个是SemanticKernel内置的SemanticMemory，一个是独立的KernelMemory，KernelMemory是从SemanticKernel进化而来。关于SemanticMemory的介绍（来源）：SemanticMemory(SM)isalibraryforC#,Python,andJavathatwrapsdir......
Semantic Kernel 学习笔记：体验基于 prompt function 实现的 Plugin
在一个SemanticKernelplugin中可以创建两种类型的function，分别是nativefunction与promptfunction（之前叫semanticfunction）。下面这款plugin中给C#method添加了[KernelFunction]attribute，就是nativefunctionpublicclassLightPlugin{publicboolIsOn......
旁门左道：借助 HttpClientHandler 拦截请求，体验 Semantic Kernel 插件
前天尝试通过one-api+dashscope(阿里云灵积)+qwen(通义千问)运行SemanticKernel插件（Plugin），结果尝试失败，详见前天的博文。今天换一种方式尝试，选择了一个旁门左道走走看，看能不能在不使用大模型的情况下让SemanticKernel插件运行起来，这个旁门左道就是从StephenToub那......
基于Microsoft SemanticKernel和GPT4实现一个智能翻译服务
今年.NETConfChina2023技术大会，我给大家分享了.NET应用国际化-AIGC智能翻译+代码生成的议题.NETConfChina2023分享-.NET应用国际化-AIGC智能翻译+代码生成今天将详细的代码实现和大家分享一下。一、前提准备1.新建一个Console类的Project2.引用SK的Nuget包，SK的最新N......
Semantic Kernel + 通义千问：借助 one-api 调用阿里云灵积 DashScope api
one-api相当于是一个兼容OpenAIapi的api网关（针对api的反向代理），借助one-api可以通过已有的OpenAI客户端调用非OpenAI大模型的api，比如通义千问。DashScope是阿里云提供的模型服务灵积的英文名称，这里通过调用DashScopeapi使用通义千问qwen-max大模型。以容器......
体光伏效应和二次谐波产生的微观理论（Photogalvanic effect 、bulk photovoltaic effec
此领域较易入门，经典文献为：1.综述：https://www.nature.com/articles/s41563-021-00992-72.Sipe大佬的论文：开创领域的两篇最经典论文，值得全部重复：https://journals.aps.org/prb/abstract/10.1103/PhysRevB.61.5337https://journals.aps.org/prb/abstract/10.1103/PhysRevB.52.146......
实现阿里云模型服务灵积 DashScope 的 Semantic Kernel Connector
SemanticKernel内置的IChatCompletionService实现只支持OpenAI与AzureOpenAI，而我却打算结合DashScope(阿里云模型服务灵积)学习SemanticKernel。于是决定自己动手实现一个支持DashScope的SemanticKernelConnector——DashScopeChatCompletionService，实现......
初步体验通过 Semantic Kernel 与自己部署的通义千问开源大模型进行对话
春节之前被SemanticKernel所吸引，开始了解它，学习它。在写这篇博文之前读了一些英文博文，顺便在这里分享一下：IntrotoSemanticKernel–PartOneIntrotoSemanticKernel–PartTwoBuildacustomCopilotexperiencewithyourprivatedatausingandKernelMemory......

Semantic Kernel 学习笔记：通过 Kernel Memory 初步体验 Retrieval Augmented Generation

创建控制台项目

创建 IKernelMemory 实例

导入 PDF 文件

相关文章

赞助商

阅读排行