Model Efficiency: Cost Per Overall Token by Model and Max Mode (Usage-based)
This chart displays the cost per overall token for 'Usage-based' events, broken down by model and whether 'max_mode' was enabled. The models are sorted by efficiency, with the lowest cost per token at the top. The 'gpt-5' model without max mode is highlighted as the most efficient.