Google has finally taken the covers off its project Gemini, after almost a year-long secrecy, and the world now gets to take a look at its capabilities. Google Gemini is the company's largest AI model and is a multimodal AI system capable of producing outputs in images, video, and audio formats in its most powerful version. The AI model will be competing with OpenAI's GPT-4 directly, and the first shots have already been fired by Google. At its launch, Google, without really looking to do a comparison, claimed that its Gemini AI model beats any other models out there in most of the benchmarks. So, how different is Google Gemini compared to GPT-4, and can it surpass the ChatGPT maker? Let us take a look.
The Gemini model's problem-solving skills are being touted by Google as being especially adept in math and physics, fueling hopes among AI optimists that it may lead to scientific breakthroughs that improve life for humans.
“This is a significant milestone in the development of AI, and the start of a new era for us at Google,” said Demis Hassabis, CEO of Google DeepMind, the AI division behind Gemini.
Google claimed that Gemini is its most flexible model yet and able to efficiently run on everything from data centers to mobile devices. Its state-of-the-art capabilities will significantly enhance the way developers and enterprise customers build and scale with AI. It is available in three variants — Gemini Nano, the basic model, Gemini Pro, and its most advanced model Gemini Ultra which can generate results in images, video, and audio.
Google has also tested its benchmarks against those of GPT-4, and the company claims that its AI modal has defeated OpenAI's LLM in 30 out of 32 benchmarks. The blog post said, “We've been
Read more on tech.hindustantimes.com