Google introduced Gemini back at Google I/O 2023, and today the company has finally launched Gemini 1.0 in three sizes. The model is starting to roll out to Bard. Google shared many details about how the new model works, and some of them are impressive.
Google claims that Gemini 1.0 is the most capable and general model it has built so far. According to the company, the standard approach to multimodal models has been to train separate components for various modalities and then stitch them together, and models built that way struggle with anything that is more conceptual and requires more complex reasoning.
As for training, Google says Gemini was “pre-trained from the start on different modalities” using TPU v4 and TPU v5e. Google has also announced TPU v5p today and claims that it is its “most powerful, efficient, and scalable” TPU.
To show what Gemini can do, Google demoed the model working through 200,000 scientific research papers, filtering out the ones that are relevant, and then summarizing the data in an hour. Coding is another area Google has paid attention to: the company says the model can understand, explain, and generate high-quality code in multiple languages, including Python, Java, C++, and Go.
At the time of writing, Gemini 1.0 is available in three sizes (Ultra, Pro, and Nano) that span from data centers to phones.
To back up its claims, Google also shared benchmark results for Gemini.
Gemini 1.0 is also very capable in terms of multimodality. Google compares Gemini Ultra against GPT-4V across image, video, and audio tests. Google DeepMind has even shared a