Put Humans at the Center of AI
As the director of Stanford’s AI Lab and now as a chief scientist of Google Cloud, Fei-Fei Li is helping to spur the AI revolution. But it’s a revolution that needs to include more people. She spoke with MIT Technology Review senior editor Will Knight about why everyone benefits if we emphasize the human side of the technology.
Why did you join Google?
Researching cutting-edge AI is very satisfying and rewarding, but we’re seeing this great awakening, a great moment in history. For me it’s very important to think about AI’s impact in the world, and one of the most important missions is to democratize this technology. The cloud is this gigantic computing vehicle that delivers computing services to every single industry.
What have you learned so far?
We need to be much more human-centered. If you look at where we are in AI, I would say it’s the great triumph of pattern recognition. It is very task-focused, it lacks contextual awareness, and it lacks the kind of flexible learning that humans have. We also want to make technology that makes humans’ lives better, our world safer, our lives more productive and better. All this requires a layer of human-level communication and collaboration.
How can we make AI more human-centered?
There’s a great phrase, written in the ’70s: “the definition of today’s AI is a machine that can make a perfect chess move while the room is on fire.” It really speaks to the limitations of AI. In the next wave of AI research, if we want to make more helpful and useful machines, we’ve got to bring back the contextual understanding. We’ve got to bring knowledge abstraction and reasoning. These are all the most important steps.
At Stanford you created Visual Genome, a database of images that are extensively labeled so they can be used for AI systems. Is this interplay of vision and language necessary for the next leap forward?
Absolutely. Vision is a cornerstone of intelligence, and language understanding is a cornerstone of intelligence. What makes humans unique is that evolution gave us the most incredible and sophisticated vision system, motor system, and language system, and they all work together. Visual Genome is exactly the kind of project that’s pushing the boundaries of language understanding and visual understanding. And eventually we’re going to connect with the world of robotics as well.
You’ve talked about the need to have more women involved in AI. Why?
More jobs will be related to artificial intelligence, so we need a huge workforce, and we need a more inclusive base. That’s an economic argument. There are also tons of studies that have shown that when a diverse group of workers come together, the solutions they find in their work are more innovative and more creative. That drives innovation. But it’s also moral and ethical.
When you are making a technology this pervasive and this important for humanity, you want it to carry the values of the entire humanity, and serve the needs of the entire humanity. If the developers of this technology do not represent all walks of life, it is very likely that this will be a biased technology. I say this as a technologist, a researcher, and a mother. And we need to be speaking about this clearly and loudly.
Deep Dive
Artificial intelligence
Google DeepMind used a large language model to solve an unsolved math problem
They had to throw away most of what it produced but there was gold among the garbage.
Unpacking the hype around OpenAI’s rumored new Q* model
If OpenAI's new model can solve grade-school math, it could pave the way for more powerful systems.
Finding value in generative AI for financial services
Financial services firms have started to adopt generative AI, but hurdles lie in their path toward generating income from the new technology.
Google DeepMind’s new Gemini model looks amazing—but could signal peak AI hype
It outmatches GPT-4 in almost all ways—but only by a little. Was the buzz worth it?
Stay connected
Get the latest updates from
MIT Technology Review
Discover special offers, top stories, upcoming events, and more.