Deep Variational Information Bottleneck

时间：2022-11-19 14:34:59浏览次数：79

标签：Information Bottleneck frac log int bm dz theta Variational

概
本文内容

Alemi A. A., Fischer I., Dillon J. V. and Murphy K. Deep variational information bottleneck. In International Conference on Learning Representations (ICLR), 2017.

概

本文介绍了 Information Bottleneck 理论如何用在一般的特征建模上.

本文内容

假设我们拥有数据 \(X\) 和目标 \(Y\), 我们希望通过 \(p(\bm{z}| \bm{x}; \theta)\) 来建模隐变量 \(Z\). 自然地, 我们会希望 \(Z\) 和我们的目标 \(Y\) 之间有一个紧密的联系, 换言之它们之间的互信息足够大

\[\tag{1} I(Z, Y;\theta) = \int dz dy \: p(\bm{z}, \bm{y}|\theta) \log \frac{p(\bm{z, y}|\theta)}{p(\bm{z}|\theta)p(\bm{y}|\theta)}. \]
但是, 仅仅应用 (1) 通常会导致一个平凡解, 即 \(Z = X\). 而我们通常所希望的 \(Z\) 能够将 \(X\) 中与 \(Y\) 无关的部分的杂质去掉, 换言之我们还需要添加约束

\[\tag{2} I(X, Z) \le I_c \]
以保证 \(Z\) 不会直接复制 \(X\).
(1), (2) 可以转换为一个共同的优化问题:

\[\max_{\theta} \: I(Z, Y; \theta) - \beta I(Z, X; \theta). \]
我们首先假设

\[\tag{3} p(X, Y, Z) = p(Z|X, Y)p(Y|X) p(X) = p(Z|X)p(Y|X)p(Z), \]
即 \(Y \leftrightarrow X \leftrightarrow Z\).
让我们来计算 \(I(Z, Y), I(X, Z)\), 注意到

\[I(Z, Y) =\int dz dy \: p(\bm{z}, \bm{y}) \log \frac{p(\bm{z, y})}{p(\bm{z})p(\bm{y})} =\int dz dy \: p(\bm{z}, \bm{y}) \log \frac{p(\bm{y}|\bm{z})}{p(\bm{y})}, \]
由于无法知晓 \(p(\bm{y}|\bm{z})\), 我们可以采用 [here] 中的方式, 用 \(q(\bm{y}|\bm{z}; \theta)\) 来变分近似, 得到

\[I(Z, Y) \ge \int dy dz p(\bm{z}, \bm{y}) \log q(\bm{y}|\bm{z}) + H(Y) \ge \int dy dz p(\bm{z}, \bm{y}) \log q(\bm{y}|\bm{z}). \]
对于 \(I(X, Z)\), 通过 \(r(\bm{z})\) 来近似 \(p(\bm{z})\) 可以得到如下的一个上界 (参考 [here]):

\[I(X, Z) \le \int dxdz \: p(\bm{x}, \bm{z}) \log \frac{p(\bm{z}|\bm{x})}{r(\bm{z})}. \]
凭借假设 (3), 可得

\[\begin{array}{ll} I(Z, Y) - \beta I(X, Z) &\ge \int dx dy dz \: p(\bm{z}) p(\bm{y|x}) p(\bm{z|x}) \log q(\bm{y}|\bm{z}; \theta) \\ &\quad - \beta \int dx dz \: p(\bm{x}) p(\bm{z}|\bm{x}) \log \frac{p(\bm{z|x})}{r(\bm{z})}\\ &=: L. \end{array} \]
用经验分布 \(\frac{1}{N}\sum_{n=1}^N \delta_{x_n}(\bm{x}) \delta_{y_n}(\bm{y})\) 来近似 \(p(x, y)\), \(p(\bm{z}|\bm{x})\) 用 encoder 近似, 记为 \(p(\bm{z|x}; \phi)\), 可得

\[L \approx \frac{1}{N} \sum_{n=1}^N [\int dz \: p(z|x_n; \phi) \log q(\bm{y}_n |\bm{z}; \theta) - \beta p(\bm{z}|\bm{x}_n; \phi)\log \frac{p(\bm{z}|\bm{x}_n; \phi)}{r(\bm{z})}]. \]

标签：Information,Bottleneck,frac,log,int,bm,dz,theta,Variational
From： https://www.cnblogs.com/MTandHJ/p/16906051.html

论文笔记 - PRISM: A Rich Class of Parameterized Submodular Information Measures
Motivation与ActiveLearning类似，TargetLearning致力于挑选外卖更“感兴趣”的数据，即人为为更重要的数据添加bias。例如我们当前的任务目标是增强自动驾驶算法的夜......
论文笔记 - SIMILAR: Submodular Information Measures Based Active Learning In Rea
motivationActiveLearning存在的重要问题：现实数据极度不平衡，有许多类别很少见（rare），又有很多类别是冗余的（redundancy），又有些数据是OOD的（out-of-distribution）。1.不同的......
谣言检测(RDCL)——《Towards Robust False Information Detection on Social Network
论文信息论文标题：TowardsRobustFalseInformationDetectiononSocialNetworkswithContrastiveLearning论文作者：ChunyuanYuan,QianwenMa,WeiZhou,Jizhong......
git pull提示当前branch没有跟踪信息 There is no tracking information for the cur
gitpull提示当前branch没有跟踪信息Thereisnotrackinginformationforthecurrentbranch使用第二种方法，设置本地repository和远程repository关联在执行git......
git pull报错:There is no tracking information for the current branch
gitpull报错:Thereisnotrackinginformationforthecurrentbranch报错：Thereisnotrackinginformationforthecurrentbranch.Pleasespecifywhichb......
L10U4-3 Presenting information
VocabularyMorebusinesspresentationsDialogue[JOAN]Asyouknow,I'vebeenspendingalotoftimeatSunset'sheadquarters.AndI'vebeenveryimpressed.It's......
Improving Item Cold-start Recommendation via Model-agnostic Conditional Variatio
动机本文是2022年SIGIR上的一篇论文。解决推荐系统中冷启动问题通常有两种方法：1.挖掘历史数据中的分布模式，例如学习一个辅助信息到id的映射。2.在交互物品有限的情况下提......
Learning from the Best: Rationalizing Prediction by Adversarial Information Cali
最近看了一些关于Rationale的方法，选取其中一篇写个笔记Motivation之前的rationale的方法中，选择器和预测器的结果来自于预测对真实答案的比较，这样的探索空间非常大。通......
Multi-view Denoising Graph Auto-Encoders on Heterogeneous Information Networks f
动机本文是2021年KDD上的一篇文章。最近有不少工作利用异构图去解决推荐系统冷启动问题，但是这些方法都忽略了在冷启动场景下训练和推理的差异。针对以上问题，本文提出了MvD......
hive 建表报错Execution failed with exit status: 137 Obtaining error information
如图所示，大小表关联，默认mapjoin,申请本地内存巨大，导致报错退出关闭mapjoin即可sethive.auto.convert.join=false; ......

Deep Variational Information Bottleneck

概

本文内容

相关文章

赞助商

阅读排行