OpenAI Launches GPT 5.5 With Advanced Coding And Computer Use Capabilities

Published:

OpenAI has introduced GPT 5.5, the latest version of its flagship artificial intelligence model, designed to deliver stronger reasoning capabilities, improved coding performance, and enhanced ability to execute multi step tasks across applications and digital tools. The release represents an evolution in how AI systems interact with users, with a stronger focus on understanding intent and independently completing complex workflows. According to OpenAI, GPT 5.5 is engineered to reduce dependency on step by step instructions and instead allow the system to plan, execute, verify, and continue tasks with minimal user intervention. The model is being positioned for use across software engineering, office productivity, scientific research, and computer automation environments.

The rollout of GPT 5.5 has begun for Plus, Pro, Business, and Enterprise users within ChatGPT and Codex, with a more advanced GPT 5.5 Pro version also being made available for Pro, Business, and Enterprise tiers. API access is expected to be released in the near future, expanding integration possibilities for developers and enterprise systems. One of the most notable improvements in this version is its performance in coding related tasks. OpenAI reports that GPT 5.5 outperforms GPT 5.4 while using fewer computational tokens, which can help reduce operational costs and improve efficiency in large scale deployments. On the Terminal Bench 2.0 benchmark, which evaluates complex command line workflows, GPT 5.5 achieved 82.7 percent compared to 75.1 percent for GPT 5.4. In internal Expert SWE evaluations focused on long engineering tasks, the model scored 73.1 percent against GPT 5.4’s 68.5 percent, demonstrating improved capability in handling large codebases, debugging issues, and applying changes across multiple files.

Beyond software development, GPT 5.5 is also designed to support broader workplace productivity tasks such as research, spreadsheet creation, document drafting, and navigation of digital tools. On the GDPval benchmark, which measures performance across professional tasks in multiple occupations, GPT 5.5 scored 84.9 percent, slightly ahead of GPT 5.4 at 83.0 percent. In OSWorld Verified testing, which evaluates a model’s ability to operate computer environments autonomously, the new model achieved 78.7 percent compared to 75.0 percent previously. OpenAI also noted that more than 85 percent of its internal employees are already using Codex on a weekly basis across departments including engineering, finance, communications, marketing, and product management, reflecting growing internal reliance on AI driven workflows.

In research focused benchmarks, GPT 5.5 showed further improvements in scientific reasoning and multi step analysis tasks. The model recorded 25.0 percent on GeneBench, up from 19.0 percent for GPT 5.4, and 80.5 percent on BixBench compared to 74.0 percent. On FrontierMath Tier 4, it reached 35.4 percent, improving over GPT 5.4’s 27.1 percent. OpenAI stated that GPT 5.5 maintains similar response speed to its predecessor while delivering higher output quality, supported by infrastructure based on NVIDIA GB200 and GB300 NVL72 systems. Internal upgrades have also increased token generation speeds by more than 20 percent. The model includes enhanced safety measures covering cybersecurity and biology related risks, along with expanded context handling of 400K tokens in Codex and an upcoming API version supporting up to 1 million tokens, priced at $5 per 1 million input tokens and $30 per 1 million output tokens.

Follow the SPIN IDG WhatsApp Channel for updates across the Smart Pakistan Insights Network covering all of Pakistan’s technology ecosystem. 

Related articles

spot_img