On December 6, Google introduced Gemini, which it describes as its ‘largest’ and ‘most capable’ large language model (LLM).
The AI model is designed to be multimodal, meaning it can learn from data beyond just text, including audio, video, and images.
Gemini is built to reason seamlessly across various types of input and output, such as generating code based on different inputs, generating text and images combined, and reasoning visually across languages.
Key features of Gemini include:
Multimodality: Gemini is designed to work with various types of data, including text, images, video, audio, and code.
Reasoning: The AI model can reason across different types of input and output, such as generating code based on given inputs or generating text and images combined.
Multilingual: Gemini can reason visually across languages. At first, the AI will work only in English worldwide, although Google executives assured reporters during a briefing that the technology will have no problem eventually expanding into other languages.
Gemini is not a single language model but a family of three: Gemini Nano, Gemini Pro, and Gemini Ultra. Gemini Ultra is the largest and most capable of the three, designed for highly complex tasks and suited to data centers and enterprise applications.
Gemini Pro is smaller than Ultra but still a robust model that can scale across a wide range of tasks, and it will be used to power many Google AI services.
Finally, Gemini Nano is a lighter model that can run natively and offline on Android devices. It is the most efficient of the three and is suited to specific on-device tasks on mobile hardware.
Bard Advanced, an upgraded version of Google's Bard chatbot, will also be infused with Gemini. Gemini is expected to provide significant improvements in AI multitasking and will eventually be used in Google's dominant search engine, although the timing of that transition hasn't been specified yet.
Gemini is expected to be the most powerful AI model Google has built so far. It is expected to offer sophisticated multimodal capabilities, handle human-style conversation, language, and content, understand and interpret images, write code prolifically and effectively, drive data and analytics, and serve developers building new AI apps and APIs. Soon, Gemini can be expected to appear in, or even power, most of Google's products and services.