8 Innovative BERT Knowledge Distillation Papers That Have Changed The Landscape of NLP

Date: 2023-10-31 13:01:39
Tags: BERT Knowledge NLP text distillation model pruning

Contemporary state-of-the-art NLP models are difficult to use in production. Knowledge distillation offers tools for tackling this problem, along with several others, but it has its quirks.
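
To make the idea concrete, here is a minimal sketch of the classic soft-target distillation loss, in which a small student model is trained to match the temperature-softened output distribution of a large teacher. It is an illustration under stated assumptions, not the method of any particular paper discussed here: the function names, the `temperature` and `alpha` parameters, and the example logits are all hypothetical, and the code uses plain Python instead of a deep learning framework.

```python
import math


def softmax(logits, temperature=1.0):
    """Softmax over a list of logits, softened by a temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, true_label,
                      temperature=2.0, alpha=0.5):
    """Weighted sum of a soft-target term and a hard-label term.

    temperature and alpha are illustrative defaults (assumptions),
    not values taken from the article.
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)

    # KL(teacher || student) on the softened distributions.
    soft = sum(pt * math.log(pt / ps)
               for pt, ps in zip(p_teacher, p_student) if pt > 0)

    # Ordinary cross-entropy of the student against the true label.
    hard = -math.log(softmax(student_logits)[true_label])

    # The T^2 factor keeps the soft term's gradient scale comparable
    # across different temperatures.
    return alpha * temperature ** 2 * soft + (1 - alpha) * hard


# Hypothetical logits for a 3-class problem.
teacher = [4.0, 1.0, 0.5]
student = [2.5, 1.5, 0.2]
loss = distillation_loss(student, teacher, true_label=0)
```

With `alpha=1.0` the student learns only from the teacher's soft targets; with `alpha=0.0` the loss reduces to ordinary supervised cross-entropy.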


BERT’s inefficient nature has not gone unnoticed. Many researchers have pursued ways to reduce its cost and size. Some of the most active research is in model compression techniques such as smaller architectures (structured pruning), distillation, quantization, and unstructured pruning. A few of the more impactful papers include:




This post is about text classification problems with a limited number of samples.


From: https://blog.51cto.com/emanlee/8103775

