HomeNewsGoogle's Gemini presents more powerful technology, but we're still nowhere near superhuman...

Google's Gemini presents more powerful technology, but we're still nowhere near superhuman AI

In December 2023, Google has announced the launch its latest Large Language Model (LLM). Twins. Gemini now provides the foundations of artificial intelligence (AI) for Google products; it’s also a direct rival to GPT-4 by OpenAI.

But why does Google consider Gemini such a very important milestone and what does it mean for users of Google services? And what does that mean usually within the context of the present hyper-fast developments in AI?

AI all over the place

Google is betting that Gemini will transform most of its products by improving current functionality and creating latest ones for services like search, Gmail, YouTube and its Office productivity suite. This would also enable improvements to its internet advertising business – its principal income – in addition to its software for Android phones, with stripped-down versions of Gemini running on hardware with limited capability.

A video from Google highlights Gemini's capabilities.

For users, Gemini means latest features and improved capabilities that will make it harder to avoid Google services and strengthen an already dominant position in areas resembling search engines like google. The potential and opportunity for Google is important as most of their software consists of easily upgradable cloud services.

But the massive one and unexpected success by ChatGPT attracted quite a lot of attention and increased the credibility of OpenAI. Gemini will allow Google to re-establish itself in the general public eye as a significant player in AI. Google is a powerhouse in AI, with large and powerful research teams which have been at the foundation of lots of the foremost advances of the last decade.

There is a public debate about these latest technologies, each in regards to the advantages they provide and the disruption they cause in areas resembling education, design and healthcare.

Strengthen AI

At its core, Gemini relies on Transformer networks. The same technology was originally developed by a research team at Google and can also be used for other LLMs resembling GPT-4.

A special feature of Gemini is its ability to handle different data modalities: text, audio, image and video. This gives the AI ​​model the power to perform tasks across multiple modalities, resembling answering questions on the content of a picture or performing keyword searches for specific sorts of content discussed in podcasts.

But more importantly, the incontrovertible fact that the models can handle different modalities enables the training of worldwide superior AI models in comparison with different models trained independently for every modality. In fact, such multimodal models are considered stronger because they’re exposed to different perspectives of the identical concepts.

For example, the concept of birds could be higher understood by learning from a combination of text descriptions, vocalizations, images, and videos of birds. This idea of ​​multimodal transformer models was explored in previous research that GoogleGemini is the primary full-fledged industrial implementation of the approach.

An AI would higher understand the concept of birds using a combination of textual descriptions, vocalizations, images and videos of the birds.

Such a model is seen as a step towards stronger generalist AI models, also often called Artificial general intelligence (AGI).

Risks of AGI

Given the speed at which AI is advancing, the expectation that AGI with superhuman capabilities shall be developed within the near future is sparking debate within the research community and society at large.

On the one hand, some anticipate and demand the chance of catastrophic events if a strong AGI falls into the hands of malicious groups Developments are slowed down.

Others claim that we’re still very removed from such an actionable AGI, that current approaches allow for superficial modeling of intelligence, Mimicking the information they’re trained onthey usually lack effective—an in depth understanding of actual reality—required to attain human-level intelligence.

a digital representation of a brain
More technological breakthroughs are required to create artificial general intelligence.

On the opposite hand, one could argue that focusing the conversation on existential risks distracts attention from more immediate implications caused by recent advances in AI perpetuating prejudicesproduce false and misleading content – causing Google to pause its Gemini image generator, increasing environmental impact And Asserting Big Tech’s dominance.

The line to follow lies somewhere between all these considerations. We are still a good distance from adopting actionable AGI – further breakthroughs are needed, including the introduction of stronger symbolic modeling and reasoning capabilities.

In the meantime, we should always not be distracted from the essential ethical and societal implications of recent AI. These considerations are essential and ought to be addressed by individuals with diverse expertise in technological and social science fields.

While not a short-term threat, the belief of AI with superhuman abilities is a priority. It is significant that we’re prepared together to responsibly manage the emergence of AGI when this essential milestone is reached.


Please enter your comment!
Please enter your name here

Must Read