GPT-5 Has Had a Rocky Start but Remains an Extraordinary Achievement

B. Valle

Summary Bullets:

  • OpenAI released GPT-5 on August 7, 2025, a multimodal large language model (LLM) with agentic capabilities.
  • This is the latest iteration of the famous chatbot, and the most important upgrade since the release of the previous generation, GPT-4, in 2023.

As it happens sometimes when a product is thrust with such force into the realm of popular culture, the release of GPT-5 sparked a veritable PR crisis, leading CEO Sam Altman to make a public apology and backtrack on the decision to remove access to all previous AI models in ChatGPT. Unlike enterprise customers, which received advanced warnings of such movements, consumer ChatGPT users did not know their preferred models would disappear so suddenly. The ensuing kerfuffle highlighted the strange co-dependency relationship that some people have developed with the technology, creating no end of background noise surrounding this momentous release.

In truth, OpenAI handled this launch rather clumsily. But GPT-5 remains an extraordinary achievement, in terms of writing, research, analysis, coding, and problem-solving capabilities. The bête noire of generative AI (GenAI), hallucination, has been addressed (to a limited degree, of course), and GPT-5 is significantly less likely to hallucinate than previous generations, according to OpenAI. With web search enabled on anonymized prompts representative of ChatGPT production traffic, GPT-5’s responses are around 45% less likely to contain a factual error than GPT-4o. The startup claims that across several benchmarks, GPT-5 shows a sharp drop in hallucinations, about six times fewer than o3.

However, safety remains a concern. OpenAI has a patchy record in this area: Altman famously lobbied against the US California Senate Bill SB 1047 (SB 1047), which aimed to hold AI developers liable for catastrophic harm caused by their models if appropriate safety measures weren’t taken. In 2024, members of OpenAI’s safety team quit after voicing concerns about the company’s record in this area.

Meanwhile, there has been talk in industry circles and trade media outlets of artificial general intelligence (AGI) and GPT-5’s position in this regard. However, the AI landscape remains so dynamic that this is missing the point. Google’s announcement on August 5, 2025 (in limited research preview) of Google DeepMind’s Genie 3 frontier world models, which help users train AI agents in simulation environments, positions the company against AI behemoth Nvidia in the realm of world AI. World AI in this context means technologies that integrate so-called “world models,” i.e., simulations of how the world works from a physics, causality, or behavior perspective. It could be argued that this is where true AGI resides: in real-world representations and in the trenches of the simulation realm.

On the other hand, Google’s latest salvo in the enterprise space has involved a fierce onslaught of partnerships, with several deals announced in the last 48 hours. Oracle will sell Google Gemini models via Oracle’s cloud computing services and business applications through Google’s developer platform Vertex AI, an important step to boost its capillarity in corporate accounts. With Wipro, Google Cloud is going to launch 200 agentic AI solutions in different verticals that are production-ready and accessible via Google Cloud Marketplace. And with NTT Data, Google is launching industry-specific cloud and AI solutions, with joint go-to-market investments to support this important launch.

The AI market is advancing at rapid speed, including applications of agentic AI in enterprise environments. This includes a variety of AI-driven applications and platforms that are transforming business processes and interactions. The release of GPT-5 is simply another tool in this direction.

Leave a Reply