Unlock Editor's Digest free of charge
FT editor Roula Khalaf selects her favourite stories on this weekly newsletter.
Elon Musk's AI startup xAI has released a brand new chatbot that it says rivals OpenAI, Google and Anthropic, catapulting the 18-month-old company into the highest 5 AI developers.
xAI on Wednesday unveiled a preview of the model, which is ranked among the many top five chatbots on the planet in accordance with independent AI benchmark sites, closely followed by Google Gemini and OpenAI ChatGPT.
Grok-2, the corporate's latest large-scale language model, will probably be available to paying subscribers of Musk's social media platform X. xAI also plans to release the model to developers this month in order that they can construct enterprise applications.
Ethan Mollick, professor at Wharton Business School and AI creator, published to X: “There at the moment are five models of the GPT-4 class: GPT-4o, Claude 3.5, Gemini 1.5, Llama 3.1 and now Grok-2.”
He added: “All the labs say there continues to be room for further tremendous improvement, but we haven't seen a model that basically goes beyond GPT-4… yet.”
Musk is attempting to meet up with OpenAI, the AI ​​research lab he co-founded in 2015 but left three years later after disagreements over the direction of research. The Tesla and SpaceX boss filed a brand new lawsuit this month against OpenAI and its CEO Sam Altman, claiming he was “manipulated” into investing in a “fake humanitarian mission.” Microsoft-backed OpenAI has previously dismissed Musk's claims as “incoherent and frivolous.”
Founded in March last yr, xAI has rapidly expanded the capabilities of its technology because of significant investment.
This yr, xAI closed a $6 billion funding round at an $18 billion valuation, while Musk recently said he was looking for approval from the board of Tesla, where he’s CEO, to take a position $5 billion in the corporate, bringing the startup's investment nearly to the extent of OpenAI's $13 billion and surpassing Anthropic's nearly $9 billion.
However, xAI's use of knowledge from Musk's X platform has proved controversial. Earlier this month, the corporate agreed to a partial suspension of knowledge processing in Europe after Ireland's data protection regulator challenged a move to make use of X posts to coach its AI systems without first obtaining users' explicit consent – a possible breach of EU data protection rules.
xAI said Grok-2 is a “significant step forward” and is “more intuitive, controllable, and versatile for a wide selection of tasks, whether you're looking for answers, collaborating on writing, or solving coding problems.”
According to a rating on LMSYS, a number one website for comparing or benchmarking the capabilities of AI models, Grok-2's performance is taken into account to be higher than one of the best models from Meta and Anthropic. However, a recent update to OpenAI's latest model, GPT-4o, brought it back to the highest of the rankings, ahead of Google's Gemini Pro.
The company says its internal evaluation of the model's performance focused on ensuring that the system followed instructions and provided “accurate, factual information.”
Its predecessor was criticized by experts for “hallucinations,” through which the AI ​​presented false information as fact. Hallucinations were seen as an obstacle to the introduction of AI systems in firms.