以bert为例,了解Lora是如何添加到模型中的

时间：2024-06-13 21:33:24浏览次数：20

标签：bert 为例 torch 添加可视化 tensorboard Lora

以bert为例,了解Lora是如何添加到模型中的

一.效果图
二.复现步骤
- 1.生成配置文件(num_hidden_layers=1)
- 2.运行测试脚本

本文以bert为例,对比了添加Lora模块前后的网络结构图
说明:

1.为了加快速度,将bert修改为一层
2.lora只加到intermediate.dense,方便对比
3.使用了几种不同的可视化方式(onnx可视化,torchviz图,torch.fx可视化,tensorboard可视化)

可参考的点:

1.peft使用
2.几种不同的pytorch模型可视化方法

一.效果图

1.torch.fx可视化

A.添加前

在这里插入图片描述

B.添加后

在这里插入图片描述

2.onnx可视化

A.添加前

在这里插入图片描述

B.添加后

在这里插入图片描述

3.tensorboard可视化

A.添加前

在这里插入图片描述

B.添加后

在这里插入图片描述

二.复现步骤

1.生成配置文件(num_hidden_layers=1)

tee ./config.json <<-'EOF'
{
  "architectures": [
    "BertForMaskedLM"
  ],
  "attention_probs_dropout_prob": 0.1,
  "directionality": "bidi",
  "hidden_act": "gelu",
  "hidden_dropout_prob": 0.1,
  "hidden_size": 768,
  "initializer_range": 0.02,
  "intermediate_size": 3072,
  "layer_norm_eps": 1e-12,
  "max_position_embeddings": 512,
  "model_type": "bert",
  "num_attention_heads": 12,
  "num_hidden_layers": 1,
  "pad_token_id": 0,
  "pooler_fc_size": 768,
  "pooler_num_attention_heads": 12,
  "pooler_num_fc_layers": 3,
  "pooler_size_per_head": 128,
  "pooler_type": "first_token_transform",
  "type_vocab_size": 2,
  "vocab_size": 21128
}
EOF

2.运行测试脚本

tee bert_lora.py <<-'EOF'
import time
import os
import torch
import torchvision.models as models
import torch.nn as nn
import torch.nn.init as init
import time
import numpy as np
from peft import get_peft_config, get_peft_model, get_peft_model_state_dict, LoraConfig, TaskType
from torchviz import make_dot
from torch.utils.tensorboard import SummaryWriter
from torch._functorch.partitioners import draw_graph

def onnx_infer_shape(onnx_path):
    import onnx
    onnx_model  = onnx.load_model(onnx_path)
    new_onnx= onnx.shape_inference.infer_shapes(onnx_model)
    onnx.save_model(new_onnx, onnx_path)

def get_model():
    torch.manual_seed(1)
    from transformers import AutoModelForMaskedLM,BertConfig
    config=BertConfig.from_pretrained("./config.json")
    model = AutoModelForMaskedLM.from_config(config)
    return model,config

def my_compiler(fx_module: torch.fx.GraphModule, _):
    draw_graph(fx_module, f"bert.{time.time()}.svg")
    return fx_module.forward

if __name__ == "__main__":

    model,config=get_model()
    model.eval()
    input_tokens=torch.randint(0,config.vocab_size,(1,128))
    
    # 一.原始模型
    # 1.onnx可视化
    torch.onnx.export(model,input_tokens,
                  "bert_base.onnx",
                  export_params=False,
                  opset_version=11,
                  do_constant_folding=True)
    onnx_infer_shape("bert_base.onnx")
    
    # 2.torchviz图
    output = model(input_tokens)
    logits = output.logits
    viz = make_dot(logits, params=dict(model.named_parameters()))
    viz.render("bert_base", view=False)
    
    # 3.torch.fx可视化
    compiled_model = torch.compile(model, backend=my_compiler)
    output = compiled_model(input_tokens)

    # 4.tensorboard可视化
    writer = SummaryWriter('./runs')
    writer.add_graph(model, input_to_model = input_tokens,use_strict_trace=False)
    writer.close()
    
    # 二.Lora模型
    peft_config = LoraConfig(
        task_type=TaskType.CAUSAL_LM,
        inference_mode=True,
        r=8,
        lora_alpha=32,
        target_modules=['intermediate.dense'],
        lora_dropout=0.1,
    )
    lora_model = get_peft_model(model, peft_config)
    lora_model.eval()
    torch.onnx.export(lora_model,input_tokens,
                      "bert_base_lora_inference_mode.onnx",
                      export_params=False,
                      opset_version=11,
                      do_constant_folding=True)
    onnx_infer_shape("bert_base_lora_inference_mode.onnx")

    compiled_model = torch.compile(lora_model, backend=my_compiler)
    output = compiled_model(input_tokens)

    writer = SummaryWriter('./runs_lora')
    writer.add_graph(lora_model, input_to_model = input_tokens,use_strict_trace=False)
    writer.close()
EOF

# 安装依赖
apt install graphviz -y
pip install torchviz
pip install pydot

# 运行测试程序
python bert_lora.py

标签：bert,为例,torch,添加,可视化,tensorboard,Lora
From： https://blog.csdn.net/m0_61864577/article/details/139662717

【esp32 学习笔记】入门使用u8g2库（以OLED驱动芯片SSD1306为例）
一、常用APIU8g2库提供了丰富的API，用于控制各种显示器并在屏幕上绘制文本、图形等元素。以下是U8g2库中一些常用的API：1.初始化-------U8G2U8G2(display,rotation,[,reset[,clock,data,cs,dc,reset,cs1,cs2,cs3]]) 初始化U8g2对象，其中display表示所使用的显示器......
以 ZGC 为例，谈一谈 JVM 是如何实现 Reference 语义的
本文基于OpenJDK17进行讨论1.Reference相关概念及其应用场景总览Reference（引用）是JVM中非常核心且重要的一个概念，垃圾回收器判断一个对象存活与否都是围绕着这个Reference来的，JVM将Reference又细分为几种具体的引用类型，它们分别是：StrongReference，SoftReference，Weak......
ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models
本文是LLM系列文章，针对《ALoRA:AllocatingLow-RankAdaptationforFine-tuningLargeLanguageModels》的翻译。ALoRA：为微调大型语言模型分配低秩自适应摘要1引言2相关工作3方法4实验5结论摘要参数有效微调（PEFT）在大语言模型时代因其有效性和效率而......
NLP实战入门——文本分类任务（TextRNN，TextCNN，TextRNN_Att，TextRCNN，FastText，DPCNN，BERT，ERN
本文参考自https://github.com/649453932/Chinese-Text-Classification-Pytorch?tab=readme-ov-file，https://github.com/leerumor/nlp_tutorial?tab=readme-ov-file，https://zhuanlan.zhihu.com/p/73176084，是为了进行NLP的一些典型模型的总结和尝试。中文数据集从THUCNews......
大模型高效微调-LoRA原理详解和训练过程深入分析
博客首发于我的知乎，详见：https://zhuanlan.zhihu.com/p/702629428一、LoRA原理LoRA(Low-RankAdaptationofLLMs)，即LLMs的低秩适应，是参数高效微调最常用的方法。LoRA的本质就是用更少的训练参数来近似LLM全参数微调所得的增量参数，从而达到使用更少显存占用的高效微调。1.1问......
以sqlilabs靶场为例，讲解SQL注入攻击原理【54-65关】
【Less-54】与前面的题目不同是，这里只能提交10次，一旦提交超过十次，数据会重新刷新，所有的步骤需要重来一次。解题步骤：根据测试，使用的是单引号闭合。#判断字段的数量?id=1'orderby3--aaa#获取数据库的名字?id=-1'unionselect1,2,database()--aa#获取数据......
【YOLOv5进阶】——修改网络结构（以C2f模块为例）
一、站在巨人的肩膀上这里我们借鉴YOLOv8源码：上期说到，对于网络模块定义详情在common.py这个文件，如Conv、CrossConv、C3f等。本期要修改的需要参考YOLOv8里的C2f模块，它定义在YOLOv8的module文件夹的block.py文件里（与common.py一样），源码链接如下：YOLOv8源码https://github.com/u......
【简单讲解下Fine-tuning BERT，什么是Fine-tuning BERT？】
......
基于ESP32+arduino+platformIO驱动小米模组接入米家app（以温湿度传感器为例）
1.选择开发板以及开发环境1.ESP32-C3-DevKitC-02作为主控（以下称为ESP32模块）相关文档：ESP32-C3-DevKitC-02-ESP32-C3-—ESP-IDF编程指南latest文档https://docs.espressif.com/projects/esp-idf/zh_CN/latest/esp32c3/hw-reference/esp32c3/user-guide-devkitc-02.ht......
RainBond 制作应用并上架【以ElasticSearch为例】
文章目录安装ElasticSearch集群第1步：添加组件第2步：查看组件第3步：访问组件制作ElasticSearch组件准备工作ElasticSearch集群原理尝试Helm安装ES集群RainBond制作ES思路源代码Dockerfiledocker-entrypoint.shelasticsearch.yml......

以bert为例,了解Lora是如何添加到模型中的

以bert为例,了解Lora是如何添加到模型中的

一.效果图

1.torch.fx可视化

A.添加前

B.添加后

2.onnx可视化

A.添加前

B.添加后

3.tensorboard可视化

A.添加前

B.添加后

二.复现步骤

1.生成配置文件(num_hidden_layers=1)

2.运行测试脚本

相关文章

赞助商

阅读排行