Sponsored Content by Microsoft Azure

Confidential GPUs for AI are the future of secure computing

Efficiency and innovation are often touted as hallmark attributes of generative AI. But as more enterprise businesses look to integrate the technology into their workflows, confidentiality — in data processing and sharing — is of utmost importance. 

The recent introduction of AI-specific policies, such as the U.S. Executive Order on the Safe, Secure and Trustworthy AI and the European Union’s AI Act, is a regulatory step forward for developers and users alike. These policies set compliance standards for AI developers to ensure that sensitive, proprietary, or confidential data is protected. They also nod to the inherent value of AI models as intellectual property, wherein training data, algorithms, model architecture, and weights should be secured against unauthorized access.

How confidential computing protects data at scale

Cloud services providers (CSPs) have been helping their customers keep their sensitive code and data secure in transit on the network using TLS and HTTPS encryption, and secure at rest on disk using encryption with customer managed keys. However, one area of data protection that has not been addressed until more recently is the protection of data in use in server memory. This changed in 2019 when Microsoft and other industry leaders founded the Confidential Computing Consortium (CCC), a project community at the Linux Foundation, to accelerate the development and adoption of confidential computing. The CCC defines confidential computing as the protection of data in use by performing computations in a hardware-based and attested Trusted Execution Environment (TEE). 

As a pioneer in this space, Microsoft Azure became one of the first CSPs to introduce confidential virtual machines, which are virtual machines running on confidential computing enabled CPUs. With confidential VMs, only the CPU hardware and the contents of the confidential VM are trusted—all other components of the software stack, including the hypervisor and host OS, are considered outside of this trust boundary and can be breached without exposing sensitive data in memory. And, in compliance with the CCC definition of confidential computing, Microsoft provides attestation tools to allow the user to verify the good state of the CPU and their VM before disk encryption keys are released and sensitive data is loaded into the VM.

The need for confidential GPUs

“We’ve worked very closely with customers to get their feedback on what types of AI models they hope to run, what security posture they are looking for, what use cases they want to enable,” said Vikas Bhatia, Head of Product for Azure Confidential Computing. “With answers including AI models such as Stable Diffusion, Zephyr, Llama2, and GPT2, it became very clear that GPU-enhanced confidential computing would be needed. Our introduction of Azure confidential VMs with NVIDIA H100 Tensor Core GPUs is our first step at addressing this market.”

“Our collaboration with NVIDIA has been a multi-year effort,” said Bhatia, “but this has been necessary to ensure that the TEE of the confidential VM can be securely extended to include the GPU and the communications channel that connects the two. Any AI applications uploaded, built, and deployed on this stack will remain protected from end to end.”

With these new GPU-enhanced confidential VMs, existing Azure customers can redeploy their CUDA models and the code that they’ve written already in an AI ML space in a confidential GPU environment to achieve what Bhatia calls a “unified confidentiality.” This establishes a secure channel with the GPU, wherein all subsequent data transfers between the VM and GPU are protected. Furthermore, the attestation process will verify that the VMs and GPUs are running a correctly configured TEE before any sensitive applications are launched.

The diverse applications of confidential GPUs

The effectiveness of generative AI models hinges on two factors: quality and quantity in training data. Despite training progress made with publicly available datasets, access to proprietary data is essential to leveraging the full potential of enterprise models. Through confidential GPUs computing, businesses can securely authorize the use of specialized data to perform more complex and targeted tasks, such as private data analysis, joint modeling, secure voting, or multi-party computation.

Bhatia identified three major use-cases for confidential GPUs: 

  • Confidential multi-party computation: Organizations can collaborate to train and run inferences on models without sharing proprietary data. Only the final result of a computation would be revealed to the participants.
  • Confidential inferencing: Inferencing occurs when a query or input is sent to a machine learning model to obtain a prediction or response. Confidential GPUs protect data in all stages of the inferencing process from clients, the model developer, service operations, and cloud providers.
  • Confidential training: Model algorithms and weights won’t be visible outside of TEEs set up by AI developers. Models can be securely trained on encrypted, distributed datasets that remain confidential to each party within a hardware-enforced boundary.

Azure’s healthcare customers, for example, are interested in employing confidential inferencing to analyze medical images, like X-rays, CT scans, and MRIs, without disclosing sensitive patient data or proprietary algorithms. Advanced image processing can improve the likelihood of diagnosis and treatment in identifying tumors, fractures, or anomalies in scans — without placing patient data at risk.

As an example, confidential GPUs are valuable in scenarios where data privacy is crucial but collaborative computation is still necessary. Researchers can run simulations of sensitive data (e.g. government data, scientific data) without sharing datasets or code to unauthorized parties. In the finance sector, confidential multi-party computation can be useful in fraud prevention work. Finance institutions can perform analyses or computations in a protected data clean room without disclosing individual financial details.

“Before confidential computing, companies struggled to securely implement this kind of data-sharing technology,” Bhatia said. “While in preview, clients have tested the VMs and found that the security enhancements help to address some of the challenges they’re facing with respect to compliance, governance and security.”

A new security standard for the AI era

As a leader in confidential computing, Azure’s robust security platform caters to the privacy needs of businesses worldwide. Innovative hardware is essential to maintaining a confidential GPU ecosystem of applications and AI models, which Azure is building towards. Bhatia’s hope is that this level of confidentiality will one day be standard across all industries. Data privacy and AI confidentiality should be a convention of everyday computing. 

“Our initial offering is best suited for use with smaller language models,” Bhatia said. “And while work is underway to scale this technology to support LLMs, we know customers will benefit from the current version by discovering the possibilities this technology will bring.”

Similar to how the early internet was once run on unsecure HTTP sites, security standards are always evolving. With more organizations processing sensitive data for AI models, there’s a great need for confidential NVIDIA GPU-powered AI. Azure’s latest VMs are a necessary, innovative introduction to secure GPU computing, which Azure is working to scale up to multiple GPUs.

“We want to set a new security standard with our confidential VMs,” Bhatia said. “We build from the mindset that a rising tide lifts all boats.”

Curious about Azure confidential VMs with NVIDIA H100 Tensor Core GPUs? Sign up to preview Azure’s hardware-based security enhancements and protect your GPU data-in-use.


This article is presented by TC Brand Studio. This is paid content, TechCrunch editorial was not involved in the development of this article. Reach out to learn more about partnering with TC Brand Studio.

More TechCrunch

Ola Electric, India’s largest electric two-wheeler maker, saw its shares rise as much as 20% on its public debut on Friday, making it the biggest listing among Indian firms in…

Ola Electric surges in India’s biggest listing in two years

Rocket Lab surpassed $100 million in quarterly revenue for the first time, a 71% increase from the same quarter of last year. This is just one of several shiny accomplishments…

Rocket Lab’s sunny outlook bodes well for future constellation plans 

In 1996, two companies, Patersons HR and Payroll Solutions, formed a venture called CloudPay to provide payroll and payments services to enterprise clients. CloudPay grew quietly over the next several…

CloudPay, a payroll services provider, lands $120M in new funding

The vulnerabilities allowed one security researcher to peek inside the leak sites without having to log in.

Security bugs in ransomware leak sites helped save six companies from paying hefty ransoms

The tech layoff wave is still going strong in 2024. Following significant workforce reductions in 2022 and 2023, this year has already seen 60,000 job cuts across 254 companies, according…

A comprehensive list of 2024 tech layoffs

A new “beta rabbit” mode adds some conversational AI chops to the Rabbit r1, particularly in more complex or multi-step instructions.

Rabbit’s r1 refines chats and timers, but its app-using ‘action model’ is still MIA

Los Angeles is notorious for its back-to-back traffic. Three events that promise to bring in millions of spectators from around the world — the 2026 World Cup, the Super Bowl…

Archer to set up air taxi network in LA by 2026 ahead of World Cup

Featured Article

Amazon is fumbling in India

Amazon’s decision to overlook quick-commerce in India is now looking like a significant misstep.

Amazon is fumbling in India

OpenAI’s GPT-4o, the generative AI model that powers the recently launched alpha of Advanced Voice Mode in ChatGPT, is the company’s first trained on voice as well as text and…

OpenAI finds that GPT-4o does some truly bizarre stuff sometimes

On Thursday, Box filled in a missing piece on its AI platform when it bought automated metadata extracting startup, Alphamoon.

Box adds crucial piece to its AI platform with Alphamoon acquisition

OpenAI has announced a new appointment to its board of directors: Zico Kolter. Kolter, a professor and director of the machine learning department at Carnegie Mellon, predominantly focuses his research…

OpenAI adds a Carnegie Mellon professor to its board of directors

Count Spotify and Epic Games among the Apple critics who are not happy with the iPhone maker’s newly revised compliance plan for the European Union’s Digital Markets Act (DMA). Shortly…

Spotify and Epic Games call Apple’s revised DMA compliance plan ‘confusing,’ ‘illegal’ and ‘unacceptable’

Thursday seeks to shake up conventional online dating in a crowded market. The app, which recently expanded to San Francisco, fosters intentional dating by restricting user access to Thursdays. At…

Thursday, the dating app that you can use only on Thursdays, expands to San Francisco

AI companies are gobbling up investor money and securing sky-high valuations early in their life cycle. This dynamic has many calling the AI industry a bubble. Nick Frosst, a co-founder…

Cohere co-founder Nick Frosst thinks everyone needs to be more realistic about what AI can and cannot do

Instagram is rolling out the ability for users to add up to 20 photos or videos to their feed carousels, as the platform embraces the trend of “photo dumps.” Back…

Instagram is embracing the ‘photo dump’

Welcome back to TechCrunch Mobility — your central hub for news and insights on the future of transportation. Sign up here for free — just click TechCrunch Mobility! Anyone paying…

Lyft ‘opens a can of whoop ass’ on surge pricing, Tesla’s Dojo explained and Saudi Arabia pumps $1.5B into Lucid

Flint Capital just closed its third fund at $160 million. Its has a unique strategy for finding its limited partner investors. 

Flint Capital raises a $160M through an unusual fund-raising strategy

Earlier this week it emerged that the DPC had instigated court proceedings seeking an injunction against X over the data processing without consent.

Elon Musk’s X agrees to pause EU data processing for training Grok

During testing, Google DeepMind’s table tennis bot was able to beat all of the beginner-level players it faced.

Google DeepMind develops a ‘solidly amateur’ table tennis robot

The X account announced that its Premium+ subscription would now be “fully” ad-free, leading some to question how this change would affect creator earnings.

As X sues advertisers over boycott, the app ditches all ads from its top subscription tier

Apple has further revised its compliance plan for the European Union’s Digital Markets Act (DMA) rulebook, which, since March, has forced it to give iOS developers more freedom over how…

Apple revises DMA compliance for App Store link-outs, applying fewer restrictions and a new fee structure

The rise of neobanks has been fascinating to witness, as a number of companies in recent years have grown from merely challenging traditional banks to being massive players in and…

Chime and Dave execs are coming to TechCrunch Disrupt 2024

If you visited the Wikipedia website on mobile this week, you might have seen a pop-up indicating that dark mode is ready for prime time.

How to enable Wikipedia’s dark mode

The home security company says attackers accessed databases containing customer home addresses, email addresses, and phone numbers.

Home security giant ADT says it was hacked

The Looking Glass Pro has a 6-inch display and a foldable base. It shows spatial images like those created with the Apple Vision Pro and iPhone 15 Pro.

Looking Glass’ new lineup includes a $300 phone-sized holographic display

TikTok’s latest offering is capitalizing on the app’s ability to serve as a discovery engine for other media — something its users already take advantage of by sharing short clips…

TikTok partners with Warner Bros. to become a discovery engine for TV and movies

Cocoon is a new startup built on the belief that greener steel production and the creation of concrete slag doesn’t have to be an either/or proposition.

Cocoon is transforming steel production runoff into a greener cement alternative

SoundHound, an AI company that makes voice interface tech used by car companies, restaurants and tech firms, is doubling down on enterprise services by playing consolidator in a crowded market.…

SoundHound acquires Amelia AI for $80M after it raised $189M+

Seeking mental health support is a complex process, but some founders believe that using AI to formalize techniques like cognitive behavioral therapy (CBT) can help folks who might not have…

Feeling Great’s new therapy app translates its psychiatrist co-founder’s experience into AI

The U.K.’s antitrust regulator has confirmed that it’s carrying out a formal antitrust investigation into Amazon’s ties with Anthropic, after Amazon recently completed a $4 billion investment into the AI startup.…

UK launches formal probe into Amazon’s ties with AI startup Anthropic