< Back to 68k.news US front page

The Next Platform - The Next Platform

Original source (on modern site) | Article images: [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16]

Control

It has been quite a week for Hashi Corp, the company behind the open source Hashi Stack of systems software for creating and running modern, distributed applications. First, on Monday the company was in the middle of transforming is business model with the Hashi Stack, and had a big event …

Not many devices in the datacenter have been etched with the Intel 4 process, which is the chip maker's spin on 7 nanometer extreme ultraviolet immersion lithography. But Intel's Loihi 2 neuromorphic processor is one of them, and Sandia National Laboratories is firing up a supercomputer with 1,152 of them …

Compute

Here is a paradox for you: Spending on infrastructure to support generative AI is apparently booming, as clearly evidenced by the skyrocketing revenues and profits of Nvidia. But spending on datacenter hardware is not changing all that much, and where spending is now forecast to be higher, it is in …

Edge

The Internet of Things (IoT) has shown significant growth and promise, with data generated by IoT devices alone expected to reach 73.1 zettabytes by 2025. Moving this data away from its point of creation to a centralized data center or cloud would contradict the application's purpose. Thus, edge computing was …

With large language models, bigger is better (and faster) but better is also better. And one of the key insights that the Meta AI research team had with the Llama family of models is that you want to optimize for the lowest cost, highest performance AI inference with any model …

Compute

Everyone is in a big hurry to get the latest and greatest GPU accelerators to build generative AI platforms. Those who can't get GPUs, or have custom devices that are better suited to their workloads than GPUs, deploy other kinds of accelerators. The companies designing these AI compute engines have …

Compute

We have a long-standing joke that dates from the early 2000s, when the hyperscalers - there were not yet cloud builders as we now know them - started having hundreds of millions of users and millions of servers and storage arrays to run applications for them at the same time …

More Analysis

Power Efficiency, Customization Will Drive Arm's Role In AI

More than a decade ago, executives at Arm Ltd saw the energy costs in datacenters soaring and sensed an opportunity to extend the low-power architecture of its eponymous systems-on-a-chip that has dominated the mobile phone markets from the get-go and took over the embedded device market from PowerPC into enterprise …

Compute

Ampere Readies 256-Core CPU Beast, Awaits The AI Inference Wave

How many cores is enough for server CPUs? All that we can get, and then some. For the past two decades, the game in compute engines has been to try to pack as many cores and additional functionality as possible into a socket and make the overall system price/performance come …

HPC

Los Alamos Pushes The Memory Wall With "Venado" Supercomputer

Today is the ribbon-cutting ceremony for the "Venado" supercomputer, which was hinted at back in April 2021 when Nvidia announced its plans for its first datacenter-class Arm server CPU and which was talked about in some detail - but not really enough to suit our taste for speeds and feeds …

Compute

AWS Hedges Its Bets With Nvidia GPUs And Homegrown AI Chips

There was a time - and it doesn't seem like that long ago - that the datacenter chip market was a big-money but relatively simple landscape, with CPUs from Intel and AMD and Arm looking to muscle its way in and GPUs mostly from Nvidia with some from AMD and …

Looking To Adopt Generative AI Within Your Organization?

April 11, 2024

SPONSORED POST: Generative AI is the subject of significant interest from enterprises busy looking to use it to help them improve business processes and build innovative new applications and services which can attract new customers and grow revenue. Research firm Gartner expects that spending on GenAI software specifically will expand …

With MTIA v2 Chip, Meta Can Do AI Inference, But Not Training

If you control your code base and you have only a handful of applications that run at massive scale - what some have called hyperscale - then you, too, can win the Chip Jackpot like Meta Platforms and a few dozen companies and governments in the world have. If you …

Gelsinger: With Gaudi 3 and Xeon 6, AI Workloads Will Come Our Way

The steady rise of AI over the past several years - and the accelerated growth with the introduction generative AI since OpenAI's launch of ChatGPT in November 2022 - has shifted Intel's status as a challenger in a chip market that it long had dominated. For sure, Intel still commands …

Compute

Google Joins The Homegrown Arm Server CPU Club

If you are wondering why Intel chief executive officer Pat Gelsinger has been working so hard to get the company's foundry business not only back on track but utterly transformed into a merchant foundry that, by 2030 or so can take away some business from archrival Taiwan Semiconductor Manufacturing Co, …

With Gaudi 3, Intel Can Sell AI Accelerators To The PyTorch Masses

We have said it before, and we will say it again right here: If you can make a matrix math engine that runs the PyTorch framework and the Llama large language model, both of which are open source and both of which come out of Meta Platforms and both of …

< Back to 68k.news US front page