AI

AI chip startup Groq lands $640M to challenge Nvidia

Comment

concept of quantum computing or big data, graphic of microchip with futuristic cube and technology elements presented in isometric
Image Credits: Jackie Niam (opens in a new window) / Getty Images

Groq, a startup developing chips to run generative AI models faster than conventional processors, said on Monday that it has raised $640 million in a new funding round led by Blackrock. Neuberger Berman, Type One Ventures, Cisco, KDDI and Samsung Catalyst Fund also participated.

The tranche, which brings Groq’s total raised to over $1 billion and values the company at $2.8 billion, is a major win for Groq, which reportedly was originally looking to raise $300 million at a slightly lower ($2.5 billion) valuation. It more than doubles Groq’s previous valuation (~$1 billion) in April 2021, when the company raised $300 million in a round led by Tiger Global Management and D1 Capital Partners.

Meta chief AI scientist Yann LeCun will serve as a technical advisor to Groq and Stuart Pann, the former head of Intel’s foundry business and ex-CIO at HP, will join the startup as chief operating officer, Groq also announced today. LeCun’s appointment is a bit unexpected, given Meta’s investments in its own AI chips — but it undoubtedly gives Groq a powerful ally in a cutthroat space.

Groq, which emerged from stealth in 2016, is creating what it calls an LPU (language processing unit) inference engine. The company claims its LPUs can run existing generative AI models similar in architecture to OpenAI’s ChatGPT and GPT-4o at 10x the speed and one-tenth the energy.

Groq CEO Jonathan Ross’ claim to fame is helping to invent the tensor processing unit (TPU), Google’s custom AI accelerator chip used to train and run models. Ross teamed up with Douglas Wightman, an entrepreneur and former engineer at Google parent company Alphabet’s X moonshot lab, to co-found Groq close to a decade ago.

Groq provides an LPU-powered developer platform called GroqCloud that offers “open” models like Meta’s Llama 3.1 family, Google’s Gemma, OpenAI’s Whisper and Mistral’s Mixtral, as well as an API that allows customers to use its chips in cloud instances. (Groq also hosts a playground for AI-powered chatbots, GroqChat, that it launched late last year.) As of July, GroqCloud had more than 356,000 developers; Groq says that a portion of the proceeds from the round will be used to scale capacity and add new models and features.

“Many of these developers are at large enterprises,” Stuart Pann, Groq’s COO, told TechCrunch. “By our estimates, over 75% of the Fortune 100 are represented.”

Groq
A close-up look at Groq’s LPU, which is designed to accelerate certain AI workloads.
Image Credits: Groq

As the generative AI boom continues, Groq faces increasing competition from both rival AI chip upstarts and Nvidia, the formidable incumbent in the AI hardware sector.

Nvidia controls an estimated 70% to 95% of the market for AI chips used to train and deploy generative AI models, and the firm’s taking aggressive steps to maintain its dominance.

Nvidia has committed to releasing a new AI chip architecture every year, rather than every other year as was the case historically. And it’s reportedly establishing a new business unit focused on designing bespoke chips for cloud computing firms and others, including AI hardware.

Beyond Nvidia, Groq competes with Amazon, Google and Microsoft, all of which offer — or will soon offer — custom chips for AI workloads in the cloud. Amazon has its Trainium, Inferentia and Graviton processors, available through AWS; Google Cloud customers can use the aforementioned TPUs and, in time, Google’s Axion chip; and Microsoft recently launched Azure instances in preview for its Cobalt 100 CPU, with Maia 100 AI Accelerator instances to come in the next several months.

Groq could consider Arm, Intel, AMD and a growing number of startups as rivals, too, in an AI chip market that could reach $400 billion in annual sales in the next five years, according to some analysts. Arm and AMD in particular have blossoming AI chip businesses, thanks to soaring capital spending by cloud vendors to meet the capacity demand for generative AI.

D-Matrix late last year raised $110 million to commercialize what it’s characterizing as a first-of-its-kind inference compute platform. In June, Etched emerged from stealth with $120 million for a processor custom-built to speed up the dominant generative AI model architecture today, the transformer. SoftBank’s Masayoshi Son is reportedly looking to raise $100 billion for a chip venture to compete Nvidia. And OpenAI is said to be in talks with investment firms to launch an AI chip-making initiative.

To carve out its niche, Groq is investing heavily in enterprise and government outreach.

In March, Groq acquired Definitive Intelligence, a Palo Alto-based firm offering a range of business-oriented AI solutions, to form a new business unit called Groq Systems. Within Groq Systems’ purview is serving organizations, including U.S. government agencies and sovereign nations, that wish to add Groq’s chips to existing data centers or build new data centers using Groq processors.

More recently, Groq partnered with Carahsoft, a government IT contractor, to sell its solutions to public sector clients through Carahsoft’s reseller partners, and the startup has a letter of intent to install tens of thousands of its LPUs at European firm Earth Wind & Power’s Norway data center.

Groq is also collaborating with Saudi Arabian consulting firm Aramco Digital to install LPUs in future data centers in the Middle East.

At the same time it’s establishing customer relationships, Mountain View, California-based Groq is marching toward the next generation of its chip. Last August, the company announced that it would contract with Samsung’s foundry business to manufacture 4nm LPUs, which are expected to deliver performance and efficiency gains over Groq’s first-gen 13nm chips.

Groq says it plans to deploy more than 108,000 LPUs by the end of Q1 2025.

More TechCrunch

Ola Electric, India’s largest electric two-wheeler maker, saw its shares rise as much as 20% on its public debut on Friday, making it the biggest listing among Indian firms in…

Ola Electric surges in India’s biggest listing in two years

Rocket Lab surpassed $100 million in quarterly revenue for the first time, a 71% increase from the same quarter of last year. This is just one of several shiny accomplishments…

Rocket Lab’s sunny outlook bodes well for future constellation plans 

In 1996, two companies, Patersons HR and Payroll Solutions, formed a venture called CloudPay to provide payroll and payments services to enterprise clients. CloudPay grew quietly over the next several…

CloudPay, a payroll services provider, lands $120M in new funding

The vulnerabilities allowed one security researcher to peek inside the leak sites without having to log in.

Security bugs in ransomware leak sites helped save six companies from paying hefty ransoms

Featured Article

A comprehensive list of 2024 tech layoffs

The tech layoff wave is still going strong in 2024. Following significant workforce reductions in 2022 and 2023, this year has already seen 60,000 job cuts across 254 companies, according to independent layoffs tracker Layoffs.fyi. Companies like Tesla, Amazon, Google, TikTok, Snap and Microsoft have conducted sizable layoffs in the…

A comprehensive list of 2024 tech layoffs

A new “beta rabbit” mode adds some conversational AI chops to the Rabbit r1, particularly in more complex or multi-step instructions.

Rabbit’s r1 refines chats and timers, but its app-using ‘action model’ is still MIA

Los Angeles is notorious for its back-to-back traffic. Three events that promise to bring in millions of spectators from around the world — the 2026 World Cup, the Super Bowl…

Archer to set up air taxi network in LA by 2026 ahead of World Cup

Featured Article

Amazon is fumbling in India

Amazon’s decision to overlook quick-commerce in India is now looking like a significant misstep.

Amazon is fumbling in India

OpenAI’s GPT-4o, the generative AI model that powers the recently launched alpha of Advanced Voice Mode in ChatGPT, is the company’s first trained on voice as well as text and…

OpenAI finds that GPT-4o does some truly bizarre stuff sometimes

On Thursday, Box filled in a missing piece on its AI platform when it bought automated metadata extracting startup, Alphamoon.

Box adds crucial piece to its AI platform with Alphamoon acquisition

OpenAI has announced a new appointment to its board of directors: Zico Kolter. Kolter, a professor and director of the machine learning department at Carnegie Mellon, predominantly focuses his research…

OpenAI adds a Carnegie Mellon professor to its board of directors

Count Spotify and Epic Games among the Apple critics who are not happy with the iPhone maker’s newly revised compliance plan for the European Union’s Digital Markets Act (DMA). Shortly…

Spotify and Epic Games call Apple’s revised DMA compliance plan ‘confusing,’ ‘illegal’ and ‘unacceptable’

Thursday seeks to shake up conventional online dating in a crowded market. The app, which recently expanded to San Francisco, fosters intentional dating by restricting user access to Thursdays. At…

Thursday, the dating app that you can use only on Thursdays, expands to San Francisco

AI companies are gobbling up investor money and securing sky-high valuations early in their life cycle. This dynamic has many calling the AI industry a bubble. Nick Frosst, a co-founder…

Cohere co-founder Nick Frosst thinks everyone needs to be more realistic about what AI can and cannot do

Instagram is rolling out the ability for users to add up to 20 photos or videos to their feed carousels, as the platform embraces the trend of “photo dumps.” Back…

Instagram is embracing the ‘photo dump’

Welcome back to TechCrunch Mobility — your central hub for news and insights on the future of transportation. Sign up here for free — just click TechCrunch Mobility! Anyone paying…

Lyft ‘opens a can of whoop ass’ on surge pricing, Tesla’s Dojo explained and Saudi Arabia pumps $1.5B into Lucid

Flint Capital just closed its third fund at $160 million. Its has a unique strategy for finding its limited partner investors. 

Flint Capital raises a $160M through an unusual fund-raising strategy

Earlier this week it emerged that the DPC had instigated court proceedings seeking an injunction against X over the data processing without consent.

Elon Musk’s X agrees to pause EU data processing for training Grok

During testing, Google DeepMind’s table tennis bot was able to beat all of the beginner-level players it faced.

Google DeepMind develops a ‘solidly amateur’ table tennis robot

The X account announced that its Premium+ subscription would now be “fully” ad-free, leading some to question how this change would affect creator earnings.

As X sues advertisers over boycott, the app ditches all ads from its top subscription tier

Apple has further revised its compliance plan for the European Union’s Digital Markets Act (DMA) rulebook, which, since March, has forced it to give iOS developers more freedom over how…

Apple revises DMA compliance for App Store link-outs, applying fewer restrictions and a new fee structure

The rise of neobanks has been fascinating to witness, as a number of companies in recent years have grown from merely challenging traditional banks to being massive players in and…

Chime and Dave execs are coming to TechCrunch Disrupt 2024

If you visited the Wikipedia website on mobile this week, you might have seen a pop-up indicating that dark mode is ready for prime time.

How to enable Wikipedia’s dark mode

The home security company says attackers accessed databases containing customer home addresses, email addresses, and phone numbers.

Home security giant ADT says it was hacked

The Looking Glass Pro has a 6-inch display and a foldable base. It shows spatial images like those created with the Apple Vision Pro and iPhone 15 Pro.

Looking Glass’ new lineup includes a $300 phone-sized holographic display

TikTok’s latest offering is capitalizing on the app’s ability to serve as a discovery engine for other media — something its users already take advantage of by sharing short clips…

TikTok partners with Warner Bros. to become a discovery engine for TV and movies

Cocoon is a new startup built on the belief that greener steel production and the creation of concrete slag doesn’t have to be an either/or proposition.

Cocoon is transforming steel production runoff into a greener cement alternative

SoundHound, an AI company that makes voice interface tech used by car companies, restaurants and tech firms, is doubling down on enterprise services by playing consolidator in a crowded market.…

SoundHound acquires Amelia AI for $80M after it raised $189M+

Seeking mental health support is a complex process, but some founders believe that using AI to formalize techniques like cognitive behavioral therapy (CBT) can help folks who might not have…

Feeling Great’s new therapy app translates its psychiatrist co-founder’s experience into AI

The U.K.’s antitrust regulator has confirmed that it’s carrying out a formal antitrust investigation into Amazon’s ties with Anthropic, after Amazon recently completed a $4 billion investment into the AI startup.…

UK launches formal probe into Amazon’s ties with AI startup Anthropic