GPT-4o mini: ChatGPT's New Default Model and Why It Matters for Your Team
At $0.15 per million tokens, GPT-4o mini is 15x cheaper than GPT-4o and more capable than GPT-3.5 Turbo. Here is the task routing framework that maximizes both models.
What matters today
At $0.15 per million tokens, GPT-4o mini is 15x cheaper than GPT-4o and more capable than GPT-3.5 Turbo. Here is the task routing framework that maximizes both models.
Key points
- The Cost Reset
- The Task Routing Framework
- The Batch Processing Pattern
- Access Points
- The Bottom Line
What You'll Learn
- Why GPT-4o mini replacing GPT-3.5 Turbo changes the economics of team AI usage
- Which executive tasks belong on mini versus GPT-4o
- The batch processing pattern that cuts per-task AI cost by 80%
The Director of Customer Success at a regional bank ran AI-assisted ticket summarization for three months. With GPT-4o, the cost was manageable for 50 tickets per day. At 500 tickets per day, the math stopped working. The team went back to manual processing.
GPT-4o mini arrived July 18 and changed that calculation entirely. At $0.15 per million input tokens, the same 500-ticket workflow now costs $0.15 per day instead of $2.25. For organizations processing high volumes of text, the practical constraint just disappeared.
GPT-4o mini is now the default model for all ChatGPT users (free and Plus). It scores 82% on MMLU, outperforms GPT-3.5 Turbo on every published benchmark, and runs significantly faster. For most everyday AI tasks, it is a strict upgrade at no additional cost.
SUBSCRIBER BREAK -- Premium Content Below
The Cost Reset
GPT-4o mini outperforms GPT-3.5 Turbo on every published benchmark while costing less per token. On MMLU (general knowledge): 82.0% vs. 70.0%. On HumanEval (coding): 87.2% vs. 48.1%. On MATH: 70.2% vs. 34.1%. GPT-4o mini is cheaper than GPT-3.5 Turbo while being substantially more capable. For teams building on the API, this is a free upgrade.
The Task Routing Framework
Tasks for GPT-4o mini (volume, consistency): email summarization, customer feedback categorization, document classification, invoice data extraction, FAQ response drafting, meeting transcript key-point extraction, sentiment tagging, standard contract clause identification.
Tasks that still warrant GPT-4o or Claude 3.5 Sonnet (complexity, nuance): strategic memo drafting, complex contract analysis with risk assessment, competitive analysis synthesis, board presentation narrative, sensitive communications requiring judgment.
The Batch Processing Pattern
Stop running one item per prompt. Batch 10 to 20 items in a single call to GPT-4o mini. This reduces API overhead, cuts cost further, and is faster than sequential calls. For ChatGPT interface users (no API): paste a batch of 5 to 10 items in a single message. 4 to 8 times faster than individual queries.
You are a [role] processing a batch of [document type]. For each item below, return a table row with these columns: [Number] | [Classification] | [Key Extract] | [Action]. Use these classifications: [LIST YOUR CATEGORIES]. Items: Item 1: [text] Item 2: [text] ... Return the full table only. No explanations.
Access Points
- ChatGPT Free: GPT-4o mini is now the default. No action required. Rate limits are significantly higher than GPT-3.5 Turbo offered.
- ChatGPT Plus: Available alongside GPT-4o. Switch using the model selector in the new conversation screen.
- API: Model ID: gpt-4o-mini. $0.15/1M input tokens, $0.60/1M output tokens. Available immediately at platform.openai.com.
The Bottom Line
GPT-4o mini is the most significant cost reduction in commercial AI since the category emerged. Tasks that previously could not be automated because per-query economics did not pencil out now have a different calculation. The single most valuable action this week: identify one repetitive text task that absorbs manual time and build a batch processing prompt for it.
Three deep dives. Four useful moves. One email worth opening.
PromptHacker turns the AI firehose into practical next steps for work, health, family, and everything time keeps trying to steal.