
Intel's deep learning framework BigDL: a distributed deep learning library for Apache Spark

Date: 2023-06-01 14:34:29
Tags: BigDL, distributed, deep, learning, Spark, data, out

BigDL: Distributed Deep Learning on Apache Spark

What is BigDL?

BigDL is a distributed deep learning library for Apache Spark; with BigDL, users can write their deep learning applications as standard Spark programs, which can directly run on top of existing Spark or Hadoop clusters.

  • Rich deep learning support. Modeled after Torch, BigDL provides comprehensive support for deep learning, including numeric computing (via Tensor) and high-level neural networks; in addition, users can load pre-trained Caffe or Torch models into Spark programs using BigDL.
  • Extremely high performance. To achieve high performance, BigDL uses Intel MKL and multi-threaded programming in each Spark task. Consequently, it is orders of magnitude faster than out-of-the-box open source Caffe, Torch or TensorFlow on a single-node Xeon (i.e., comparable with a mainstream GPU).
  • Efficient scale-out. BigDL can efficiently scale out to perform data analytics at "Big Data scale" by leveraging Apache Spark (a lightning-fast distributed data processing framework), as well as efficient implementations of synchronous SGD and all-reduce communications on Spark.
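To make the "synchronous SGD and all-reduce" idea above concrete, here is a minimal single-process sketch in plain Python. It is not BigDL code and does not use the BigDL or Spark APIs; the function names (`local_gradient`, `all_reduce_mean`, `sync_sgd`), the toy linear model, and the equal-sized shards are all assumptions made for illustration. Each "worker" computes a gradient on its own data shard, the gradients are averaged (the all-reduce step), and every worker applies the same update in lockstep.

```python
from typing import List, Tuple

Sample = Tuple[float, float]  # (x, y) pair


def local_gradient(w: float, b: float, shard: List[Sample]) -> Tuple[float, float]:
    """Gradient of the mean squared error of y ~ w*x + b on one worker's shard."""
    n = len(shard)
    gw = sum(2.0 * (w * x + b - y) * x for x, y in shard) / n
    gb = sum(2.0 * (w * x + b - y) for x, y in shard) / n
    return gw, gb


def all_reduce_mean(grads: List[Tuple[float, float]]) -> Tuple[float, float]:
    """Simulated all-reduce: average the per-worker gradients.

    In a real cluster every worker would receive this same result;
    here we simply return it once.
    """
    k = len(grads)
    return (sum(g[0] for g in grads) / k, sum(g[1] for g in grads) / k)


def sync_sgd(shards: List[List[Sample]], lr: float = 0.1, steps: int = 1000) -> Tuple[float, float]:
    """Synchronous SGD: all workers step together using the averaged gradient."""
    w, b = 0.0, 0.0
    for _ in range(steps):
        grads = [local_gradient(w, b, shard) for shard in shards]  # one per worker
        gw, gb = all_reduce_mean(grads)                            # synchronization point
        w -= lr * gw
        b -= lr * gb
    return w, b


# Toy data generated from y = 3x + 1, split evenly across four hypothetical workers.
data = [(i / 4.0, 3.0 * (i / 4.0) + 1.0) for i in range(8)]
shards = [data[i:i + 2] for i in range(0, 8, 2)]
w, b = sync_sgd(shards)
print(f"w={w:.4f}, b={b:.4f}")
```

Because the shards are the same size, averaging the per-shard gradients gives the full-batch gradient, so the result matches what a single machine would compute on all the data. On a real cluster the averaging step is where the communication cost lies.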

Why BigDL?

You may want to write your deep learning programs using BigDL if:

  • You want to analyze a large amount of data on the same Big Data (Hadoop/Spark) cluster where the data are stored (in, say, HDFS, HBase, Hive, etc.).
  • You want to add deep learning functionalities (either training or prediction) to your Big Data (Spark) programs and/or workflow.
  • You want to leverage existing Hadoop/Spark clusters to run your deep learning applications, which can then be dynamically shared with other workloads (e.g., ETL, data warehouse, feature engineering, classical machine learning, graph analytics, etc.).

How to use BigDL?

From: https://blog.51cto.com/u_11908275/6393796
