She's an important figure behind today's artificial intelligence boom, but not all computer scientists thought Fei-Fei Li was on the right track when she came up with the idea for a giant visual database called ImageNet that took years to build. Li, now a founding director of Stanford University's Institute for Human-Centered Artificial Intelligence, is out with a new memoir that recounts her pioneering work in curating the dataset that accelerated the computer vision branch of AI.
The book, “The World I See,” also portrays her formative years that abruptly shifted from China to New Jersey and follows her through academia, Silicon Valley and the halls of Congress as growing commercialization of AI technology brought public attention and a backlash. She spoke with The Associated Press about the book and the current AI moment. The interview has been edited for length and clarity.
A: ImageNet really is the quintessential story of identifying the North Star of an AI problem and then finding a way to get there. The North Star for me was to really rethink how we can solve the problem of visual intelligence. One of the most fundamental problems in visual intelligence is understanding, or seeing, objects because the world is made of objects. Human vision is grounded in our understanding of objects. And there are many, many, many of them. ImageNet is really an attempt to define the problem of object recognition and also to provide a path to solve it, which is the big data path.
A: What does not surprise me is that everything you mention — DALL-E, ChatGPT, Gemini — is large-data based. They are pretrained on a large amount of data. That's exactly what I was hoping for. What surprised me is we got to generative AI faster than most of
Read more on tech.hindustantimes.com