AmazonLast week, Amazon unveiled a slew of products revamped with generative AI models running everything from the Echo to the Fire Stick. Amazon head scientist Rohit Prasad is leading the effort to bring generative AI into people’s homes. Q: You stepped Alexa up to this LLM-style, more conversational experience. Does it hallucinate like some of the other foundation models? A: LLMs are a waypoint towards generalized intelligence. They aren’t the endpoint. What you are seeing is still early days. All LLMs still hallucinate a bit. What we have done is taken extreme steps on how to ground them. If you said, ‘Alexa, it’s hot in here,’ a browser to an LLM-based system will say, ‘go to the beach.’ But if you asked Alexa on your home device, it should be grounded by knowing you have a thermostat connected to it. It should ask, ‘Do you want me to lower the temperature by five degrees?’ That’s the kind of important grounding that we have done. In addition, hallucinations are common because these are built on a token predictor. So you have to do a lot of fine tuning, as well as aligning so that it doesn’t hallucinate. Over the years of working on Alexa in the consumer domain, we have learned a lot on how to align these models. Q: Is the model a continuation of the Alexa model or is it an entirely new model? A: This is a new model we have built. But we’ve had years of experience with encoder-decoder models, which we were using for Alexa to learn new languages, new domains. And then we also had a visual language model, so that you can ask about product features. And some of the visual language models are also useful for things like visual processing. So we have had an immense amount of experience with that. But this is a new model. Very large. And then it’s fusing in real-time devices and services, your personal context of what you watch, what you listen to, what’s your favorite teams, are you a vegetarian. And of course, how to align these models the right way, especially when you connect it to the real world. Q: One of the big ideas in generative AI is the agent concept — an AI that can shop for you or buy your airline tickets. But there’s also some disillusionment that these agents are not going to work as well as we thought. You demonstrated one way it can work. Do you see this as a service within Amazon’s ecosystem? A: Alexa is that super intelligence that works on your behalf. Today, it already controls your home. It assists you on your tasks. It’s being used by two-year-olds to 100-year-olds. It doesn’t take huge imagination to say, ‘if I’m running out of milk, you can order it.’ Then there will be many vertical agents. They’re specialized at a certain task. So if you have a law firm, you have an agent that you can build for your work. If you are in the travel industry, you may have a travel agent. But there will also be these super agents or the super AI like Alexa. Both are possible now. For the rest of the conversation, read here. → |
|