Google has just unveiled Trillium, its sixth-generation artificial intelligence accelerator chip, promising performance improvements that could fundamentally change the economics of AI development while pushing the boundaries of what is possible in machine learning.
The custom processor, which powered the training of Google's newly announced Gemini 2.0 AI model, delivers four times the training performance of its predecessor while using significantly less energy. The breakthrough comes at a critical moment, as technology companies race to build increasingly sophisticated AI systems that demand massive computing resources.
“TPUs support 100% of Gemini 2.0 training and inference,” Google CEO Sundar Pichai said in an announcement post, underscoring the chip's central role in the company's AI strategy. The scale of the deployment is unprecedented: Google has connected more than 100,000 Trillium chips in a single network fabric, creating one of the most powerful AI supercomputers in the world.
How Trillium's 4x performance improvement is transforming AI development
Trillium's specifications represent significant advances across several dimensions. The chip delivers a 4.7x increase in peak compute performance per chip compared with its predecessor, while doubling both high-bandwidth memory (HBM) capacity and inter-chip interconnect bandwidth. Perhaps most significantly, it is 67% more energy efficient, a critical metric as data centers strain under the enormous power demands of AI training.
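A note on reading that last figure: "67% more energy efficient" most naturally means 1.67x performance per watt, which reduces, rather than eliminates, the energy needed for a fixed workload. A quick back-of-the-envelope check, under that assumed interpretation:

```python
# Assumption: "67% more energy efficient" means 1.67x performance per watt.
prev_perf_per_watt = 1.0        # previous generation, normalized
trillium_perf_per_watt = 1.67   # 67% improvement

# Energy for a fixed amount of training work scales inversely with perf/watt.
relative_energy = prev_perf_per_watt / trillium_perf_per_watt
print(f"Energy per unit of work: {relative_energy:.0%} of the previous gen")
# -> about 60%, i.e. roughly a 40% energy reduction for the same workload
```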
“When training the Llama-2-70B model, our testing shows that Trillium achieves near-linear scaling from a 4-slice Trillium 256 chip pod to a 36-slice Trillium 256 chip pod with scaling efficiency of 99%,” said Mark Lohmeyer, vice president of compute and AI infrastructure at Google Cloud. This level of scaling efficiency is particularly notable given the challenges typically associated with distributed computing at this scale.
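For readers unfamiliar with the metric, scaling efficiency is simply the ratio of achieved speedup to the ideal linear speedup. A sketch of the calculation, with illustrative throughput numbers rather than Google's own:

```python
# Scaling efficiency = achieved speedup / ideal (linear) speedup.
# The throughput figures below are hypothetical, chosen only to show
# how a 99% number like Lohmeyer's falls out of the arithmetic.
def scaling_efficiency(chips_small, tput_small, chips_large, tput_large):
    ideal = chips_large / chips_small   # 9x the chips -> ideal 9x speedup
    actual = tput_large / tput_small
    return actual / ideal

eff = scaling_efficiency(
    chips_small=4 * 256, tput_small=1.0,    # 4 pods of 256 chips (normalized)
    chips_large=36 * 256, tput_large=8.91,  # 36 pods, ~8.91x the throughput
)
print(f"Scaling efficiency: {eff:.0%}")  # -> 99%
```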
The economics of innovation: Why Trillium is changing the game for AI startups
Trillium's business impact goes beyond raw performance metrics. Google claims the chip delivers up to a 2.5x improvement in training performance per dollar compared with the previous generation, potentially reshaping the economics of AI development. Taken at face value, that would cut the cost of an identical training run by roughly 60%.
This cost-effectiveness could prove particularly important for companies and startups developing large language models. AI21 Labs, an early Trillium customer, has already reported significant improvements. “The advances in scale, speed and cost-effectiveness are significant,” noted Barak Lenz, CTO of AI21 Labs, in the announcement.
Reaching new heights: Google's 100,000-chip AI supernetwork
Google's deployment of Trillium within its AI Hypercomputer architecture demonstrates the company's integrated approach to AI infrastructure. The system combines more than 100,000 Trillium chips with a Jupiter network fabric delivering 13 petabits per second of bisection bandwidth, allowing a single distributed training job to scale across hundreds of thousands of accelerators.
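For a sense of what a single job scaling across many accelerators looks like in practice, here is a minimal data-parallel sketch using JAX, one of the frameworks Google supports on TPUs. The model and numbers are toy stand-ins, not anything from Gemini's (non-public) training stack:

```python
# Minimal data-parallel training step with JAX's sharding API.
# Illustrative only: the "model" is a single linear layer.
import jax
import jax.numpy as jnp
import numpy as np
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P

# On a TPU pod slice this lists one entry per chip; on CPU it still runs.
devices = np.array(jax.devices())
mesh = Mesh(devices, axis_names=("data",))

# Shard the batch across chips along the "data" axis; replicate the params.
batch = jax.device_put(
    jnp.ones((8 * len(devices), 512)), NamedSharding(mesh, P("data", None)))
params = jax.device_put(jnp.ones((512, 512)), NamedSharding(mesh, P()))

def loss_fn(p, x):
    return jnp.mean((x @ p) ** 2)  # toy objective

@jax.jit
def train_step(p, x):
    # The compiler inserts the cross-chip all-reduce for the gradients here;
    # this communication is what interconnect bandwidth figures govern.
    loss, grads = jax.value_and_grad(loss_fn)(p, x)
    return p - 1e-3 * grads, loss

params, loss = train_step(params, batch)
print(loss)
```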
“The growth in Flash usage has exceeded 900%, which is incredible to see,” noted Logan Kilpatrick, product manager on Google's AI Studio team, during the developer conference, citing the rapidly growing demand for AI computing resources.
Beyond Nvidia: Google's bold move in the AI chip wars
Trillium's release intensifies competition in AI hardware, where Nvidia has dominated with its GPU-based solutions. While Nvidia's chips remain the industry standard for many AI applications, Google's custom silicon approach could offer advantages for certain workloads, particularly in training very large models.
Industry analysts suggest that Google's massive investment in custom chip development reflects a strategic bet on the growing importance of AI infrastructure. The company's decision to make Trillium available to cloud customers signals a desire to compete more aggressively in the cloud AI market, where it faces strong competition from Microsoft Azure and Amazon Web Services.
Powering the future: What Trillium means for the AI of tomorrow
The implications of Trillium's capabilities extend beyond immediate performance gains. The chip's ability to handle mixed workloads efficiently, from training large models to running inference for production applications, suggests a future in which AI computing becomes more accessible and cost-effective.
For the broader tech industry, Trillium's release signals that the race for AI hardware dominance is entering a new phase. As companies push the boundaries of what is possible with artificial intelligence, the ability to design and deploy specialized hardware at scale could become an increasingly decisive competitive advantage.
“We are still in the early stages of what is possible with AI,” Demis Hassabis, CEO of Google DeepMind, wrote in a company blog post. “The right infrastructure, both hardware and software, will be critical as we continue to push the boundaries of what AI can do.”
As the industry moves toward more sophisticated AI models that can act autonomously and reason across multiple modalities, the demands on the underlying hardware will only grow. With Trillium, Google has shown that it intends to remain at the forefront of this evolution, investing in the infrastructure that will power the next generation of AI advances.