Brief IA

OpenAI Dramatically Reduces Access Costs for ChatGPT

💡 Use Cases·Tom Levy·

OpenAI Dramatically Reduces Access Costs for ChatGPT

OpenAI Dramatically Reduces Access Costs for ChatGPT
Key Takeaways
1OpenAI has reduced inference costs for ChatGPT by over 50%.
2This reduction is due to optimizations that have decreased the usage of Nvidia GPUs.
3The number of GPUs required has sometimes dropped to just a few hundred.
💡Why it mattersThis decrease in costs could make ChatGPT more accessible and competitive, influencing the generative AI market.
Le brief IA que lisent les pros

Le brief IA que les pros lisent chaque soir

Les 7 actus IA du jour, décryptées en 5 min. Gratuit.

Inclus dès l'inscription : notre sélection des meilleurs guides & comparatifs IA.

Choisis ton rythme

Gratuit · Pas de spam · Désabonnement en 1 clic

📄
Full Analysis

OpenAI Drastically Reduces Access Costs for ChatGPT

OpenAI has reportedly cut response costs for guest users of ChatGPT by more than half. Engineers at OpenAI informed their colleagues earlier this month that they had managed to reduce inference costs—the expense associated with running existing AI models—by over 50%. This was reported by a person familiar with the discussions, according to The Information.

OpenAI has applied these new optimizations to ChatGPT, specifically for visitors who do not have an account. The number of Nvidia GPUs required to serve these users has dropped to just a few hundred. It is unclear how many were needed previously or what techniques OpenAI used to achieve this. Guest users only have access to a very limited set of ChatGPT features, so it remains to be seen whether these gains will translate to the full product.

Deepseek has also recently launched a new open-source method that can accelerate inference requests by 60 to 85%. The freed-up resources could be used for service expansion, better models, faster responses, or larger profit margins. However, given that data center builds are progressing slowly, such gains will likely give labs more leeway rather than reduce the demand for chips.

Brief IA — L'actualité IA en français

L'essentiel de l'actualité de l'intelligence artificielle, décrypté et expliqué chaque jour.