Gemini CLI: Quotas and Pricing
Your Gemini CLI quotas and pricing depends on the type of account you use to authenticate with Google. Additionally, both quotas and pricing may may be calculated differently based on the model version, requests, and tokens used. A summary of model usage is available through the /stats
command and presented on exit at the end of a session. See privacy and terms for details on Privacy policy and Terms of Service. Note: published prices are list price; additional negotiated commercial discounting may apply.
This article outlines the specific quotas and pricing applicable to the Gemini CLI when using different authentication methods.
1. Log in with Google (Gemini Code Assist Free Tier)
For users who authenticate by using their Google account to access Gemini Code Assist for individuals:
- Quota:
- 60 requests per minute
- 1000 requests per day
- Token usage is not applicable
- Cost: Free
- Details: Gemini Code Assist Quotas
- Notes: A specific quota for different models is not specified; model fallback may occur to preserve shared experience quality.
2. Gemini API Key (Unpaid)
If you are using a Gemini API key for the free tier:
- Quota:
- Flash model only
- 10 requests per minute
- 250 requests per day
- Cost: Free
- Details: Gemini API Rate Limits
3. Gemini API Key (Paid)
If you are using a Gemini API key with a paid plan:
- Quota: Varies by pricing tier.
- Cost: Varies by pricing tier and model/token usage.
- Details: Gemini API Rate Limits, Gemini API Pricing
4. Login with Google (for Workspace or Licensed Code Assist users)
For users of Standard or Enterprise editions of Gemini Code Assist, quotas and pricing are based on a fixed price subscription with assigned license seats:
- Standard Tier:
- Quota: 120 requests per minute, 1500 per day
- Enterprise Tier:
- Quota: 120 requests per minute, 2000 per day
- Cost: Fixed price included with your Gemini for Google Workspace or Gemini Code Assist subscription.
- Details: Gemini Code Assist Quotas, Gemini Code Assist Pricing
- Notes:
- Specific quota for different models is not specified; model fallback may occur to preserve shared experience quality.
- Members of the Google Developer Program may have Gemini Code Assist licenses through their membership.
5. Vertex AI (Express Mode)
If you are using Vertex AI in Express Mode:
- Quota: Quotas are variable and specific to your account. See the source for more details.
- Cost: After your Express Mode usage is consumed and you enable billing for your project, cost is based on standard Vertex AI Pricing.
- Details: Vertex AI Express Mode Quotas
6. Vertex AI (Regular Mode)
If you are using the standard Vertex AI service:
- Quota: Governed by a dynamic shared quota system or pre-purchased provisioned throughput.
- Cost: Based on model and token usage. See Vertex AI Pricing.
- Details: Vertex AI Dynamic Shared Quota
7. Google One and Ultra plans, Gemini for Workspace plans
These plans currently apply only to the use of Gemini web-based products provided by Google-based experiences (for example, the Gemini web app or the Flow video editor). These plans do not apply to the API usage which powers the Gemini CLI. Supporting these plans is under active consideration for future support.