global-batch-limits.md

---
title: Azure OpenAI Global Batch Limits
titleSuffix: Azure OpenAI Service
description: Azure OpenAI model global batch limits
manager: nitinme
ms.service: azure-ai-openai
ms.topic: include
ms.date: 02/12/2025
---

Batch limits

| Limit Name | Limit Value |
|---|---|
| Max files per resource | 500 |
| Max input file size | 200 MB |
| Max requests per file | 100,000 |
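
As a rough illustration, the limits above could be checked on the client before a file is uploaded. This is a minimal sketch, not part of any Azure SDK: the function `check_batch_input` and its parameters are hypothetical, and it assumes that 1 MB means 1024 × 1024 bytes, which the table does not specify.

```python
# Limits taken from the table above.
MAX_FILES_PER_RESOURCE = 500
MAX_INPUT_FILE_BYTES = 200 * 1024 * 1024  # assumption: 1 MB = 1024 * 1024 bytes
MAX_REQUESTS_PER_FILE = 100_000


def check_batch_input(size_bytes, request_count, files_on_resource=0):
    """Return a list of limit violations for a candidate batch input file.

    Hypothetical helper: `files_on_resource` is the number of files that
    already exist on the resource before this upload.
    """
    problems = []
    if size_bytes > MAX_INPUT_FILE_BYTES:
        problems.append("input file exceeds 200 MB")
    if request_count > MAX_REQUESTS_PER_FILE:
        problems.append("file contains more than 100,000 requests")
    if files_on_resource >= MAX_FILES_PER_RESOURCE:
        problems.append("resource already holds 500 files")
    return problems
```

An empty list means the file fits within all three limits under the stated assumptions.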

Batch quota

The tables below show the batch quota limits. Quota values for global batch are expressed in enqueued tokens. When you submit a file for batch processing, the tokens in the file are counted. Until the batch job reaches a terminal state, those tokens count against your total enqueued token limit.
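
The accounting described above can be sketched in a few lines. This is an illustration only: the helper names are hypothetical, and the set of terminal status names is an assumption rather than something this page defines.

```python
# Assumption: these status names mark a batch job as finished.
TERMINAL_STATES = {"completed", "failed", "expired", "cancelled"}


def enqueued_tokens(jobs):
    """Sum the tokens of all jobs that have not reached a terminal state.

    `jobs` is an iterable of (token_count, status) pairs.
    """
    return sum(tokens for tokens, status in jobs if status not in TERMINAL_STATES)


def can_submit(new_tokens, jobs, quota):
    """Return True if a new file of `new_tokens` fits in the remaining quota."""
    return enqueued_tokens(jobs) + new_tokens <= quota


# Example: a 200 M enqueued token quota with two jobs still in flight.
jobs = [
    (120_000_000, "in_progress"),
    (50_000_000, "completed"),   # terminal, no longer counted
    (60_000_000, "validating"),
]
```

In this example 180 M tokens are still enqueued, so a further 20 M token file fits within a 200 M quota but anything larger does not.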

Global batch

| Model | Enterprise agreement | Default | Monthly credit card based subscriptions | MSDN subscriptions | Azure for Students, Free Trials |
|---|---|---|---|---|---|
| gpt-4o | 5 B | 200 M | 50 M | 90 K | N/A |
| gpt-4o-mini | 15 B | 1 B | 50 M | 90 K | N/A |
| gpt-4-turbo | 300 M | 80 M | 40 M | 90 K | N/A |
| gpt-4 | 150 M | 30 M | 5 M | 100 K | N/A |
| gpt-35-turbo | 10 B | 1 B | 100 M | 2 M | 50 K |
| o3-mini | 15 B | 1 B | 50 M | 90 K | N/A |

B = billion | M = million | K = thousand
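
If the quota values need to be compared programmatically, the shorthand can be expanded into plain integers. The following is a small sketch with a hypothetical helper, `parse_quota`, which assumes the "number, space, suffix" layout used in these tables and treats `N/A` as "no quota available".

```python
# Multipliers from the legend: B = billion, M = million, K = thousand.
_MULTIPLIERS = {"K": 10**3, "M": 10**6, "B": 10**9}


def parse_quota(text):
    """Convert a value such as '1.5 B' into a token count, or None for 'N/A'."""
    text = text.strip()
    if text.upper() == "N/A":
        return None
    number, suffix = text.split()
    return int(round(float(number) * _MULTIPLIERS[suffix.upper()]))
```

For instance, `parse_quota("200 M")` gives `200000000`, which can then be compared directly with a running enqueued token count.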

Data zone batch

| Model | Enterprise agreement | Default | Monthly credit card based subscriptions | MSDN subscriptions | Azure for Students, Free Trials |
|---|---|---|---|---|---|
| gpt-4o | 500 M | 30 M | 30 M | 90 K | N/A |
| gpt-4o-mini | 1.5 B | 100 M | 50 M | 90 K | N/A |
| o3-mini | 1.5 B | 100 M | 50 M | 90 K | N/A |