Intel Says Its Gaudi 2 Accelerator is Nvidia H100's Only Benchmarked Alternative for Generative AI

Intel Gaudi 2 continues to shine as the sole benchmarked alternative to Nvidia's H100 in the realm of generative AI (GenAI) performance. Intel's ambitious goal to deliver competitive AI solutions across its portfolio is evident through the latest MLPerf v4.0 benchmark results. "The Intel Gaudi 2 AI accelerator remains the only benchmarked alternative to Nvidia H100 for generative AI (GenAI) performance and provides strong performance-per-dollar," said Intel, in an official news release. 

In a recent MLPerf GPT-J inference benchmark, Intel's Gaudi 2 achieved near-parity performance with Nvidia's H100, reinforcing its position as a formidable alternative.

The Intel® Gaudi® 2 accelerator is a heavyweight contender in the AI accelerator arena, designed specifically for deep learning training and inference. It delivers two times the performance of A100 on Computer Vision, NLP, and large scale models.

Gaudi 2 shrinks the process from 16nm to 7nm, increases the number of Al-customized Tensor Processor Cores from 8 to 24, adds FP8 support, and integrates a media compression engine.

Additionally, Intel remains the exclusive server CPU vendor to submit MLPerf results. The 5th Gen Intel Xeon processors have shown an impressive 1.42x improvement compared to their 4th Gen counterparts in MLPerf Inference v3.1.

Intel Says Its Gaudi 2 Accelerator is Nvidia H100's Only Benchmarked Alternative for Generative AI
Intel Gaudi 2 

Developed by Habana Labs, now part of Intel, the Gaudi 2 is equipped with a whopping 96GB of HBM2E memory offering ample space to store and process massive datasets. Additionally, the high bandwidth of 2.45 TB/s ensures smooth data flow during training and inference processes, minimizing bottlenecks.

Habana Labs is considered the center of excellence for AI solutions at Intel, and Intel acquired the company in 2019 for approximately $2 billion.

It is said that Habana Labs is now working on Gaudi 3, which is expected to offer a significant performance boost.

Intel Gaudi 2 Vs NVIDIA H100 / A100

Gaudi 2 vs. NVIDIA H100

Performance: Gaudi 2 remains the only benchmarked alternative to Nvidia H100 for generative AI (GenAI) performance.

Price-to-Performance: It provides strong performance-per-dollar.

MLPerf Results: Intel is the exclusive server CPU vendor to submit MLPerf results, showcasing Gaudi 2's capabilities.

Scalability: Gaudi 2 integrates 24 100-gigabit RDMA over Converged Ethernet (RoCE2) ports, making it cost-effective and easy to scale out training capacity.

Networking: These ports enable efficient communication within the server, enhancing throughput.

Intel Gaudi 2 vs. NVIDIA A100

Inference: Gaudi 2 matches the latency of Nvidia H100 systems on decoding and outperforms the Nvidia A100 in large language model (LLM) inference.

Performance: Gaudi 2 is about twice faster than the NVIDIA A100 80GB for both training and inference. 

Available on the Intel Developer Cloud for easy access.

Memory Bandwidth: Gaudi 2 achieves higher memory bandwidth utilization than H100 and A100.

In summary, Intel Gaudi 2 offers compelling performance-per-dollar and can be a respected alternative to Nvidia's offerings. While Nvidia A100 remains a powerhouse, Gaudi 2 provides better value in many scenarios.

Advertisements

Post a Comment

Comment

Previous Post Next Post