首页 > 其他分享 >为了发布而发布,占位符2023

为了发布而发布,占位符2023

时间:2023-12-02 21:33:27浏览次数:43  
标签:up generation memory AWS Amazon 占位 instances 发布 2023

为了发布而发布,占位符2023

Amazon Elastic Compute Cloud (Amazon EC2) R8g instances, powered by the latest generation AWS Graviton4 processors, provide the best price performance in Amazon EC2 for memory-optimized workloads. R8g instances are ideal for memory-intensive workloads, such as databases, in-memory caches, and real-time big data analytics. R8g instances offer up to 30% better performance, and larger instance sizes with up to 3x more vCPUs and memory than the seventh-generation AWS Graviton3-based R7g instances.

 

https://aws.amazon.com/ec2/instance-types/r8g/

 

Amazon Web Services Inc. today unveiled two next-generation chips from its AWS-designed silicon families for generalized cloud computing and high-efficiency artificial intelligence training with the release of the Graviton4 and Trainium2 during an AWS re:Invent conference keynote.

The Graviton family of Arm-based processors is used by AWS to deliver high-performance and reduced costs for customers for a broad range of cloud compute workloads in the Amazon Elastic Compute Cloud, or EC2. According to Amazon, Graviton4 provides up to 30% better computing power with 50% more cores and 75% more memory bandwidth than the current-generation Graviton3 processors.

“Graviton4 marks the fourth generation we’ve delivered in just five years, and is the most powerful and energy-efficient chip we have ever built for a broad range of workloads,” said David Brown, vice president of compute and networking at AWS. “Silicon underpins every customer workload, making it a critical area of innovation for AWS.”

Amazon has been building its own custom silicon since 2018 with the Graviton1, which powered the A1 EC2 instance. Each successive generation of Graviton has brought with it significant increases in performance, lower costs and efficiency. In 2021, Brown told theCube, SiliconANGLE Media’s livestreaming studio, that Graviton’s availability brought about major ecosystem growth for AWS as customers saw immediate improvements in their workloads.

As of today, AWS offers more than 150 different Graviton-powered Amazon EC2 instance types globally and has rolled out more than 2 million Graviton processors.

Graviton4 processors will be available in a new memory-optimized Amazon EC2 R8g instance, which will allow customers to run improved execution for high-performance databases, in-memory caches and big data analytics workloads at scale. R8g instances will provide large sizes for up to three times more virtual central processing units and three times more memory than the current R7g instances. Amazon said that the new R8g instances are currently available in preview today, with general availability planned in the coming months.

Trainium2: next-gen chip designed for AI training in the cloud

As AI foundation models and large language models behind today’s generative AI applications get larger, they require the processing of massive datasets, which means ever-increasing time and costs to train them. The largest and most advanced models can scale from hundreds of billions to trillions of data points and can generate text, images, audio, video and software code.

Today, AWS announced Trainium2, a purpose-built high-performance chip for training FM and LLMs with up to trillions of parameters that can deliver up to four times the training performance and three times more memory capacity of the first-generation chip. The company also said that it improved the energy efficiency of the chip by two times the first generation.

“With the surge of interest in generative AI, Trainium2 will help customers train their ML models faster, at a lower cost, and with better energy efficiency,” said Brown.

Trainium chips act as AI accelerators for deep-learning algorithms for high-performance AI and ML workloads. They are also optimized for training natural language processing, computer vision and recommender models used in AI applications, such as text summarization, code generation, question answering, image and video generation.

The Trainium2 will be available in new Amazon EC2 Trn2 instances, which include 16 Trainium2 chips in a single instance. Customers will be able to scale these instances up to 10,000 Trianum2 chips in the next generation of EC2 UltraClusters, interconnected with AWS Elastic Fabric Adapter petabit-scale networking, capable of delivering up to 65 exaflops of compute. At that scale, Amazon said that customers will be able to train up to 300 billion-parameter LLMs in a week versus months.

标签:up,generation,memory,AWS,Amazon,占位,instances,发布,2023
From: https://www.cnblogs.com/cloudrivers/p/17872260.html

相关文章

  • swpuctf 2023
    web0x01If_else注释掉然后构造payload就可以了。payload:url?check=1){eval(system("cat/flag"));}/*0x02RCE-PLUSrce没有回显,那就重定向到文件访问就可以了。payload:url?cmd=cat/f*>1.txt0x03Pingpingpingphp特性,“]”后面的“.”不会被转换。payload:url?Ping[ip.......
  • CTT2023游记
    NOI2023之后状态一直不咋地,原本要写的CSP/NOIP阿克游记也都咕掉了。看看老年退役选手能不能苟进CTS!12.1启程。9:28启动的火车,原本我们是\(6\)个人的:徐老师,zak,sjy,我,cmll,dx。结果徐老师记错日期了没来。然后就打车去了酒店签到,一到就是先发参赛证书(落款日期还是12.5)、胸......
  • 2023-2024-1 20231402《计算机基础与程序设计》第十周学习总结
    2023-2024-120231402《计算机基础与程序设计》第十周学习总结作业信息这个作业属于哪个课程2023-2024-1-计算机基础与程序设计这个作业要求在哪里2023-2024-1计算机基础与程序设计第10周作业这个作业的目标自学计算机科学概论第12,13,14章,《C语言程序设计》第9章......
  • The 2023 ICPC Asia Hefei Regional Contest Test I. Linguistics Puzzle
    Preface这题yysy真不难,但比赛的时候想出做法后没时间写了,只能遗憾地看着倒计时结束Solution直接上爆搜复杂度肯定会爆,考虑有哪些数是可以不用搜直接推出来的首先样例启发我们\(0,1\)这两个数很好确定,因为\(0\)对应的字母单独出现的次数肯定最多,而\(1\)作为两位的开头出现的次......
  • 表达式-C语言-2023/12/2
    首先介绍表达式:......
  • 语句-C1-2023/12/2
    ......
  • 2023.12.2——每日总结
    学习所花时间(包括上课):9h代码量(行):0行博客量(篇):1篇今天,上午学习,下午学习;我了解到的知识点:1.jfinal明日计划:学习......
  • 2023-12-02:用go语言,如何求模立方根? x^3=a mod p, p是大于等于3的大质数, a是1到p-1范围
    2023-12-02:用go语言,如何求模立方根?x^3=amodp,p是大于等于3的大质数,a是1到p-1范围的整数常数,x也是1到p-1范围的整数,求x。p过大,x不能从1到p-1遍历。答案2023-12-02:灵捷3.5大体步骤如下:1.判断是否存在模立方根。有0,1,3个根这三种情况。1.1.求p-1和3的最大公约数gcd(p-1,3)......
  • 88th 2023/12/2 模拟赛总结57
    本次排名稍微考前,但依然丢脸,赛时先冷静地把题目翻了一遍,T1手推了一下,没有任何思路,却也不觉得这题难赛后事实说明是赛时考虑的太少了,光可行路径就至少可以卡到\(21^{20}\)条正解是预处理出两两之间路径最少需要的点数,然后DP,用优先队列优化然后就翻到T2,觉得这题非常有意思,就画......
  • 2023/12/2软件工程日报
    使用deepspeech做音频特征提取时报错AttributeError:module'numpy'hasnoattribute'float'.`np.float`wasadeprecatedaliasforthebuiltin`float`.Toavoidthiserrorinexistingcode,use`float`byitself.Doingthiswillnotmodifyanybehav......