PH PROMPTHACKER.AI

GPT-4o mini: ChatGPT's New Default Model and Why It Matters for Your Team

At $0.15 per million tokens, GPT-4o mini is 15x cheaper than GPT-4o and more capable than GPT-3.5 Turbo. Here is the task routing framework that maximizes both models.

July 31, 2024 3 min read
gpt4o mini chatgpt default model
Quick Scan

What matters today

At $0.15 per million tokens, GPT-4o mini is 15x cheaper than GPT-4o and more capable than GPT-3.5 Turbo. Here is the task routing framework that maximizes both models.

Format TOP UPDATE
Audience Executives using AI at work
Time 3 min read
Topic Chatgpt

Key points

  • The Cost Reset
  • The Task Routing Framework
  • The Batch Processing Pattern
  • Access Points
  • The Bottom Line

What You'll Learn

  • Why GPT-4o mini replacing GPT-3.5 Turbo changes the economics of team AI usage
  • Which executive tasks belong on mini versus GPT-4o
  • The batch processing pattern that cuts per-task AI cost by 80%

The Director of Customer Success at a regional bank ran AI-assisted ticket summarization for three months. With GPT-4o, the cost was manageable for 50 tickets per day. At 500 tickets per day, the math stopped working. The team went back to manual processing.

GPT-4o mini arrived July 18 and changed that calculation entirely. At $0.15 per million input tokens, the same 500-ticket workflow now costs $0.15 per day instead of $2.25. For organizations processing high volumes of text, the practical constraint just disappeared.

GPT-4o mini is now the default model for all ChatGPT users (free and Plus). It scores 82% on MMLU, outperforms GPT-3.5 Turbo on every published benchmark, and runs significantly faster. For most everyday AI tasks, it is a strict upgrade at no additional cost.

SUBSCRIBER BREAK -- Premium Content Below

The Cost Reset

GPT-4o mini outperforms GPT-3.5 Turbo on every published benchmark while costing less per token. On MMLU (general knowledge): 82.0% vs. 70.0%. On HumanEval (coding): 87.2% vs. 48.1%. On MATH: 70.2% vs. 34.1%. GPT-4o mini is cheaper than GPT-3.5 Turbo while being substantially more capable. For teams building on the API, this is a free upgrade.

The Task Routing Framework

Tasks for GPT-4o mini (volume, consistency): email summarization, customer feedback categorization, document classification, invoice data extraction, FAQ response drafting, meeting transcript key-point extraction, sentiment tagging, standard contract clause identification.

Tasks that still warrant GPT-4o or Claude 3.5 Sonnet (complexity, nuance): strategic memo drafting, complex contract analysis with risk assessment, competitive analysis synthesis, board presentation narrative, sensitive communications requiring judgment.

The Batch Processing Pattern

Stop running one item per prompt. Batch 10 to 20 items in a single call to GPT-4o mini. This reduces API overhead, cuts cost further, and is faster than sequential calls. For ChatGPT interface users (no API): paste a batch of 5 to 10 items in a single message. 4 to 8 times faster than individual queries.

You are a [role] processing a batch of [document type]. For each item below, return a table row with these columns: [Number] | [Classification] | [Key Extract] | [Action]. Use these classifications: [LIST YOUR CATEGORIES]. Items: Item 1: [text] Item 2: [text] ... Return the full table only. No explanations.

Access Points

  • ChatGPT Free: GPT-4o mini is now the default. No action required. Rate limits are significantly higher than GPT-3.5 Turbo offered.
  • ChatGPT Plus: Available alongside GPT-4o. Switch using the model selector in the new conversation screen.
  • API: Model ID: gpt-4o-mini. $0.15/1M input tokens, $0.60/1M output tokens. Available immediately at platform.openai.com.

The Bottom Line

GPT-4o mini is the most significant cost reduction in commercial AI since the category emerged. Tasks that previously could not be automated because per-query economics did not pencil out now have a different calculation. The single most valuable action this week: identify one repetitive text task that absorbs manual time and build a batch processing prompt for it.

Bottom line

The useful move with GPT-4o mini: ChatGPT's New Default Model and Why It Matters for Your Team is to run one narrow test this week, then keep only the workflow that saves time, improves a decision, or gives your team clearer output. Treat the announcement as raw material, not the win itself.

About the author

Pierre Bradshaw Founder, PromptHacker.ai

Pierre has spent 25+ years building growth systems across fintech, real estate, lending, campaigns, and AI workflows, with machine-learning work dating back to 2012.

If you have any questions or comments about GPT-4o mini: ChatGPT's New Default Model and Why It Matters for Your Team feel free to reach out. I'd love to hear from you.

Contact Pierre
Free weekly briefing

Three deep dives. Four useful moves. One email worth opening.

PromptHacker turns the AI firehose into practical next steps for work, health, family, and everything time keeps trying to steal.