How DeepMind thinks it can make chatbots safer

To receive The Algorithm in your inbox every Monday, sign up here.

Welcome to the Algorithm!

Some technologists hope that one day we will develop a superintelligent AI system that people will be able to have conversations with. Ask it a question, and it will offer an answer that sounds like something composed by a human expert. You could use it to ask for medical advice, or to help plan a holiday. Well, that’s the idea, at least.

In reality, we’re still a long way away from that. Even the most sophisticated systems of today are pretty dumb. I once got Meta’s AI chatbot BlenderBot to tell me that a prominent Dutch politician was a terrorist. In experiments where AI-powered chatbots were used to offer medical advice, they told pretend patients to kill themselves. Doesn’t fill you with a lot of optimism, does it?

That’s why AI labs are working hard to make their conversational AIs safer and more helpful before turning them loose in the real world. I just published a story about Alphabet-owned AI lab DeepMind’s latest effort: a new chatbot called Sparrow.

DeepMind’s new trick for making a good AI-powered chatbot was to have humans tell it how to behave, and to force it to back up its claims using Google search. Human participants were then asked to evaluate how plausible the AI system’s answers were. The idea is to keep training the AI using dialogue between humans and machines.

In reporting the story, I spoke to Sara Hooker, who leads Cohere for AI, a nonprofit AI research lab.

She told me that one of the biggest hurdles in safely deploying conversational AI systems is their brittleness: they perform brilliantly until they are taken into unfamiliar territory, where they behave unpredictably.

“It is also a difficult problem to solve because any two people might disagree on whether a conversation is inappropriate. And even if we agree that something is appropriate right now, this may change over time, or rely on shared context that can be subjective,” Hooker says.

Despite that, DeepMind’s findings underline that AI safety can’t be achieved with a technical fix alone. You need humans in the loop.

In the long term, DeepMind hopes, having people steer the chatbot through dialogue could be a helpful tool for supervising machines.

“We might have a discussion about what a machine is doing in a way that allows us to communicate what we actually want and not miss subtle things,” says Geoffrey Irving, a safety researcher at DeepMind.

DeepMind’s chatbot combines many different strands of safety research into one model, with impressive results. You can read about it here.

But let’s be real. Nobody is building these systems purely because they want customer service bots to have better tools to help you rebook your canceled flight.

AI chatbots are powered by large language models, which learn to produce human-sounding text by training on vast amounts of writing scraped from the internet. They could be a powerful tool for an entirely new form of online search.

There’s a lot of money to be made in improving search, which has really lost its mojo. Google search has become overpersonalized and overcommercialized. It’s also riddled with hidden scams and malware.

Deeper Learning

This startup’s AI is smart enough to drive different types of vehicles

Wayve, a driverless car startup based in London, has made a single machine-learning model that can drive two different types of vehicles, a passenger car and a delivery truck—a first for the industry.

Watch out, Tesla: The breakthrough suggests that Wayve’s approach to autonomous vehicles might just scale up faster than the technology of mainstream companies like Cruise, Waymo, and Tesla. My colleague Will Heaven visited Wayve’s offices in London to check out the company’s new vehicle. Read more here.
