Groq LPU (language processor)

Product

Developers:	Groq
Date of the premiere of the system:	February 2024
Branches:	Electrical and Microelectronics
Technology:	Processors

2024: Product Announcement

At the end of February 2024 startup Groq , he presented a specialized processor LPU (Language Processing Unit), designed to speed up the work of large language models (LLM). The product is expected to revolutionize the market. artificial intelligence

Groq LPU is based on the tensor stream processor (TSP) architecture. The solution is endowed with 230 MB local SRAM with 80 TB/s bandwidth. It is claimed that the performance on INT8 operations reaches 750 TOPS, on FP16 operations - 188 Tflops. When working with the Mixtral 8x7B model, the Groq LPU accelerator provides an interference rate of up to 480 tokens per second, which is one of the best indicators in the industry as of the end of February 2024. In models such as Llama 2 70B with a context length of 4096 tokens, the new chip demonstrates performance at 300 tokens per second, while in the smaller Llama 2 7B model with 2048 context tokens, the rate of interference reaches 750 tokens per second.

Startup Groq unveils dedicated processor designed to speed up large language models

In general, as noted, the Groq LPU accelerator outperforms competing products from NVIDIA, AMD and Intel. In fact, we are talking about rethinking the efficiency of AI computing. The Groq LPU product isn't just a chip: it's a harbinger of a new era where AI can easily integrate into everyday life, overcoming existing delay barriers that make it difficult for systems to interact with the user in real time.

Unlike GPUs, LPUs use a simplified approach that eliminates the need for complex scheduling hardware and provides constant latency and high throughput. In addition, the new product has high energy efficiency, which reduces the total cost of maintaining AI systems.^[1]

Notes

↑ 'Feels like magic!': Groq's ultrafast LPU could well be the first LLM-native processor — and its latest demo may well convince Nvidia and AMD to get out their checkbooks

Источник — «https://tadviser.com/index.php/Product:Groq_LPU_(language_processor)»

The site content is translated by machine translation software powered by PROMT. The machine-translated articles are not always perfect and may contain errors in vocabulary, syntax or grammar. Read original article
If you find inaccuracies or errors in the results of machine translation, please write to editor@tadviser.ru. We will make every effort to correct them as soon as possible.

Simple Link

How to create a "smart plant": Key characteristics of a modern digital enterprise 10500

Model Studio CS: How to use BIM to give new impetus to the development of the fuel and energy complex 11000