Safety

OpenAI finds that GPT-4o does some truly bizarre stuff sometimes

15 hours ago

OpenAI’s GPT-4o, the generative AI model that powers the recently launched alpha of Advanced Voice Mode in ChatGPT, is the company’s first trained on voice as well as text and…

OpenAI pledges to give U.S. AI Safety Institute early access to its next model

Kyle Wiggers

10:48 pm PDT • July 31, 2024

OpenAI CEO Sam Altman says that OpenAI is working with the U.S. AI Safety Institute, a federal government body that aims to assess and address risks in AI platforms, on…

Google releases new ‘open’ AI models with a focus on safety

Kyle Wiggers

3:25 pm PDT • July 31, 2024

Google has released a trio of new, “open” generative AI models that it’s calling “safer,” “smaller” and “more transparent” than most — a bold claim, to be sure. They’re additions…

WitnessAI is building guardrails for generative AI models

Kyle Wiggers

9:19 am PDT • May 21, 2024

Generative AI makes stuff up. It can be biased. Sometimes it spits out toxic text. So can it be “safe”? Rick Caccia, the CEO of WitnessAI, believes it can. “Securing…

Apps

Hinge adds a way to mute requests containing words you specify

Ivan Mehta

5:00 am PDT • April 24, 2024

Hinge is adding a “Hidden Words” feature to its app, which will filter out likes with comments containing those phrases or words. It pretty much works like a mute filter…

Apps

Life360 launches flight landing notifications to alert friends and family

Ivan Mehta

4:00 am PDT • April 9, 2024

Algorithms can detect takeoff and landing times, and alert family members when you connect to the network post-landing.

Gaming

k-ID launches a solution that helps game developers comply with ever-changing child safety regulations

Sarah Perez

6:47 am PST • March 6, 2024

Making a video game successful is already hard. Doing so while complying with the growing number of child safety laws and regulations around the world is an almost insurmountable task.…

Google DeepMind forms a new org focused on AI safety

Kyle Wiggers

7:00 am PST • February 21, 2024

If you ask Gemini, Google’s flagship GenAI model, to write deceptive content about the upcoming U.S. presidential election, it will, given the right prompt. Ask about a future Super Bowl…

Anthropic researchers find that AI models can be trained to deceive

Kyle Wiggers

8:30 am PST • January 13, 2024

Most humans learn the skill of deceiving other humans. So can AI models learn the same? Yes, the answer seems — and terrifyingly, they’re exceptionally good at it. A recent…

Distributional wants to develop software to reduce AI risk

Kyle Wiggers

6:00 am PST • December 14, 2023

Companies are increasingly curious about AI and the ways in which it can be used to (potentially) boost productivity. But they’re also wary of the risks. In a recent Workday…

Apps

Google Play tightens up rules for Android app developers to require testing, increased app review

Sarah Perez

9:00 am PST • November 9, 2023

Google today is announcing strengthened protections for Android developers publishing apps to its Google Play store. The changes are a part of Google’s broader efforts at keeping low-quality and unsafe…

Apps

Snapchat adds new teen safety features, cracks down on age-inappropriate content

Sarah Perez

7:27 am PDT • September 7, 2023

Snapchat today is announcing a series of new safeguards for its app, aimed at better protecting teen users, similar to other efforts introduced earlier by other social apps, like Facebook…

Apps

Pinterest rolls out new teen safety features, including wiping followers from users 15 and under

Sarah Perez

8:13 am PDT • August 21, 2023

Pinterest today introduced a series of new safety features aimed at better protecting teens using its service. The features — which include things like private profiles, more control over followers…

Apps

Match Group’s background check provider Garbo ends its partnership

Sarah Perez

2:00 pm PDT • August 17, 2023

Tech nonprofit Garbo announced today it’s ending its formal partnership with Match Group, the dating app giant behind Tinder, Plenty of Fish, Match and other apps. The two companies first…

Featured Article

The other DWI: Driving while immersed

I believe that putting virtual reality headsets in cars will kill people. VR is the most distracting medium ever invented.

Jeremy Bailenson

10:30 am PDT • May 23, 2023

Apps

Increased oversight: Discord tests new parental controls for teens

Sarah Perez

11:11 am PDT • May 22, 2023

New usernames aren’t the only change coming to the popular chat app Discord, now used by 150 million people every month. The company is also testing a suite of parental…

Transportation

Qualcomm acquires Autotalks to boost Snapdragon’s automotive safety technology, reportedly for $350-400M

Ingrid Lunden

2:36 am PDT • May 8, 2023

Qualcomm’s longer term bet on the automotive sector as a lucrative customer base for its chips and related communications technology is getting a significant push today: The company announced that…

Hardware

Apple and Google team up on industry spec to make Bluetooth tracking devices, like AirTag, safer

Sarah Perez

7:50 am PDT • May 2, 2023

After numerous cases of Bluetooth trackers like Apple’s AirTag being used for stalking or other criminal apps, Apple and Google today released a joint announcement saying they will work together…

Social

After an investigation exposes its dangers, Pinterest announces new safety tools and parental controls

Sarah Perez

7:58 am PDT • April 12, 2023

Following last month’s NBC News investigation into Pinterest that exposed how pedophiles had been using the service to curate image boards of young girls, the company on Tuesday announced further…

Apps

TikTok introduces a strike system for violations, tests a feature to ‘refresh’ the For You feed

Sarah Perez

8:54 am PST • February 2, 2023

TikTok today is announcing several changes to its service, including what it claims will be increased enforcement against bad actors as well as tests of new user-facing tools that will…

Apps

Twitter disperses the Trust & Safety Council after key members resigned

Ivan Mehta

8:40 pm PST • December 12, 2022

Twitter today dispersed the Trust & Safety Council, which was an advisory group consisting of roughly 100 independent researchers and human rights activists. The group, formed in 2016, gave the…

Startups

Ring launches pilot program to let local agencies share updates and ‘safety information’

Kyle Wiggers

12:46 pm PST • November 15, 2022

Ring today announced that local government agencies will be able to have an official presence on the company’s Neighbors app. Beginning with the City of North Port and Pinellas County…

Apps

Meta, TikTok, YouTube and Twitter dodge questions on social media and national security

Taylor Hatmaker

4:01 pm PDT • September 14, 2022

Executives from four of the biggest social media companies testified before the Senate Homeland Security Committee Wednesday, defending their platforms and their respective safety, privacy and moderation failures in recent…

Featured Article

A huge Chinese database of faces and vehicle license plates spilled online

A massive Chinese database storing millions of faces and vehicle license plates was left exposed on the internet for months before it quietly disappeared in August. While its contents might seem unremarkable for China, where facial recognition is routine and state surveillance is ubiquitous, the sheer size of the exposed…

Zack Whittaker

10:00 am PDT • August 30, 2022

Transportation

Uber partners with ADT to let riders get in touch with a live safety agent

Lauren Forristal

8:09 am PDT • August 30, 2022

Uber is introducing a new option to its safety toolkit, a section of Uber’s app where users can contact emergency services, report a safety issue to the company, verify rides…

Security

US unmasks alleged Conti ransomware operative, offers $10M for intel

Carly Page

5:22 am PDT • August 12, 2022

The U.S. government said it will offer up to $10 million for information related to five people believed to be high-ranking members of the notorious Russia-backed Conti ransomware gang. The…

Startups

Fort is working to keep humans safe from industrial robots

Brian Heater

6:00 am PDT • July 12, 2022

Industrial robots are big, hulking things. They are, at once, designed to operate alongside humans, while also posing potential bodily risk to our soft, fleshy exterior. It’s precisely for this…

Microsoft and Meta join Google in using AI to help run their data centers

Kyle Wiggers

6:00 am PDT • June 18, 2022

Data centers, which drive the apps, websites and services that billions of people use every day, can be hazardous places for the workers that build and maintain them. Workers sometimes…

Startups

Behavioral cybersecurity platform CybSafe raises $28M Series B led by Evolution Equity Partners

Mike Butcher

12:01 am PDT • June 9, 2022

Last year, U.K. cybersecurity startup CybSafe, a “behavioral security” platform, raised a $7.9 million Series A. This SaaS product with a per-user-based, subscription licensing model has a “behavior-led” platform that…

Startups

To better manage cybersecurity risk, extend zero-trust principles to third parties

Saket Modi

2:43 pm PDT • June 3, 2022

Today’s cybersecurity landscape requires an agile and data-driven risk management strategy to deal with the ever-expanding third-party attack surface.
