Login
Back to Blog
EnglishTutorial

Gemini 2.5 Pro and Gemini 3 Pro API Integration Guide

Complete guide to integrating Google's Gemini 2.5 Pro, Gemini 2.5 Flash, and Gemini 3 Pro models via API. Includes native format and OpenAI-compatible examples.

C
Crazyrouter Team
January 22, 2026 / 366 views
Share:
Gemini 2.5 Pro and Gemini 3 Pro API Integration Guide

Google's Gemini models offer exceptional multimodal capabilities, advanced reasoning, and competitive pricing. This guide covers how to integrate Gemini 2.5 Pro, Gemini 2.5 Flash, and the latest Gemini 3 Pro through Crazyrouter's API.

Supported Gemini Models#

ModelInput ($/1M)Output ($/1M)Best For
gemini-3-pro-preview$1.25$7.50Latest generation
gemini-3-pro-preview-thinking$1.25$7.50Extended reasoning
gemini-3-pro-image-preview$1.25$7.50Image generation
gemini-3-flash-preview$0.15$0.60Fast responses
gemini-2.5-pro$1.25$10.00Long documents
gemini-2.5-pro-thinking$1.25$10.00Complex reasoning
gemini-2.5-flash$0.30$2.50Production workloads
gemini-2.5-flash-lite$0.10$0.40Cost-effective
gemini-2.5-flash-image$0.30$2.50Image generation

Quick Start with OpenAI-Compatible API#

The easiest way to use Gemini models is through the OpenAI-compatible endpoint:

python
from openai import OpenAI

client = OpenAI(
    api_key="sk-your-api-key",
    base_url="https://crazyrouter.com/v1",
    default_headers={
        "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36"
    }
)

response = client.chat.completions.create(
    model="gemini-2.5-pro",
    messages=[
        {"role": "user", "content": "Explain the theory of relativity in simple terms."}
    ]
)

print(response.choices[0].message.content)

Native Gemini API Format#

For advanced features, use the native Gemini API format:

Text Generation#

bash
curl -X POST "https://crazyrouter.com/v1beta/models/gemini-2.5-pro:generateContent" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-your-api-key" \
  -H "User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36" \
  -d '{
    "contents": [{
      "parts": [{"text": "Write a poem about artificial intelligence."}]
    }]
  }'

Streaming Text Generation#

bash
curl -X POST "https://crazyrouter.com/v1beta/models/gemini-3-pro-preview:streamGenerateContent" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-your-api-key" \
  -H "User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36" \
  -d '{
    "contents": [{
      "parts": [{"text": "Explain quantum computing step by step."}]
    }]
  }'

Extended Thinking with Gemini#

Gemini models support extended thinking for complex reasoning:

python
# Gemini 2.5 Pro with thinking
response = client.chat.completions.create(
    model="gemini-2.5-pro-thinking",
    messages=[
        {"role": "user", "content": "Solve this complex math problem step by step: Find all prime numbers p such that p^2 + 2 is also prime."}
    ]
)

# Gemini 3 Pro with thinking
response = client.chat.completions.create(
    model="gemini-3-pro-preview-thinking",
    messages=[
        {"role": "user", "content": "Analyze this algorithm's time complexity."}
    ]
)

Image Generation with Gemini#

Gemini 2.5 Flash Image#

bash
curl -X POST "https://crazyrouter.com/v1beta/models/gemini-2.5-flash-image:generateContent" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-your-api-key" \
  -H "User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36" \
  -d '{
    "contents": [{
      "parts": [{"text": "Generate an image of a futuristic city at sunset"}]
    }],
    "generationConfig": {
      "responseModalities": ["image", "text"]
    }
  }'

Gemini 3 Pro Image (with aspect ratio control)#

bash
curl -X POST "https://crazyrouter.com/v1beta/models/gemini-3-pro-image-preview:generateContent" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-your-api-key" \
  -H "User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36" \
  -d '{
    "contents": [{
      "parts": [{"text": "A serene Japanese garden with cherry blossoms"}]
    }],
    "generationConfig": {
      "responseModalities": ["image"],
      "aspectRatio": "16:9"
    }
  }'

Google Search Integration#

Gemini models can access real-time information via Google Search:

bash
curl -X POST "https://crazyrouter.com/v1beta/models/gemini-2.5-flash:generateContent" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-your-api-key" \
  -H "User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36" \
  -d '{
    "contents": [{
      "parts": [{"text": "What are the latest developments in AI as of today?"}]
    }],
    "tools": [{
      "googleSearch": {}
    }]
  }'

Flash Lite Models (Cost-Effective)#

For high-volume, cost-sensitive applications:

python
# Standard Flash Lite
response = client.chat.completions.create(
    model="gemini-2.5-flash-lite",
    messages=[{"role": "user", "content": "Summarize this text briefly."}]
)

# Flash Lite with thinking
response = client.chat.completions.create(
    model="gemini-2.5-flash-lite-thinking",
    messages=[{"role": "user", "content": "Solve this logic puzzle."}]
)

Python SDK Example#

python
from openai import OpenAI

client = OpenAI(
    api_key="sk-your-api-key",
    base_url="https://crazyrouter.com/v1",
    default_headers={
        "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36"
    }
)

# Gemini 3 Pro for complex reasoning
response = client.chat.completions.create(
    model="gemini-3-pro-preview",
    messages=[
        {"role": "system", "content": "You are a helpful AI assistant with expertise in science."},
        {"role": "user", "content": "Explain CRISPR gene editing and its potential applications."}
    ],
    temperature=0.7,
    max_tokens=2000
)

print(response.choices[0].message.content)

Pricing Comparison#

ModelInput ($/1M)Output ($/1M)
gemini-3-pro-preview$1.25$7.50
gemini-2.5-pro$1.25$10.00
gemini-2.5-flash$0.30$2.50
gemini-2.5-flash-lite$0.10$0.40
gemini-flash-latest$0.30$2.50

Pricing Disclaimer: The prices shown in this article are for demonstration purposes only and may change at any time. Actual billing will be based on the real-time prices displayed when you make your request.

Best Practices#

  1. Use Flash for production: Gemini 2.5 Flash offers excellent performance at lower cost
  2. Enable thinking for complex tasks: Use thinking-enabled models for math and reasoning
  3. Leverage multimodal capabilities: Gemini excels at image and video understanding
  4. Use streaming for long responses: Better user experience with real-time output

Model Selection Guide#

Use CaseRecommended Model
General chatgemini-2.5-flash
Complex reasoninggemini-2.5-pro-thinking or gemini-3-pro-preview-thinking
Image generationgemini-2.5-flash-image or gemini-3-pro-image-preview
Cost-sensitivegemini-2.5-flash-lite
Latest featuresgemini-3-pro-preview

Getting Started#

  1. Sign up at Crazyrouter
  2. Create an API key in the console
  3. Choose between OpenAI-compatible or native Gemini format
  4. Start building with Gemini models

For questions, contact support@crazyrouter.com

Implementation Guides

Topics

Related Posts

Qwen2.5-Omni Complete Guide: Alibaba's Multimodal AI Model for DevelopersTutorial

Qwen2.5-Omni Complete Guide: Alibaba's Multimodal AI Model for Developers

"Complete developer guide to Qwen2.5-Omni — Alibaba's multimodal AI model that processes text, images, audio, and video. Includes API setup, code examples, and pricing."

Feb 19
How to Reduce AI API Costs by 80% - Complete Developer Guide 2026Tutorial

How to Reduce AI API Costs by 80% - Complete Developer Guide 2026

Learn proven strategies to reduce AI API costs by up to 80%. Includes model selection, caching, prompt optimization, and batch processing techniques.

Jan 22
Sora API: The Complete Guide to Building with OpenAI Video GenerationTutorial

Sora API: The Complete Guide to Building with OpenAI Video Generation

OpenAI's current Sora API is asynchronous and tier-based, not a fire-and-forget video button. The official guide recommends polling every 10 to 20 seconds, and Sora access is not available on the F...

Mar 26
MCP (Model Context Protocol) Complete Guide: The New Standard for AI Tool IntegrationTutorial

MCP (Model Context Protocol) Complete Guide: The New Standard for AI Tool Integration

Everything developers need to know about MCP (Model Context Protocol). Covers what it is, how it works, how to build MCP servers, and why it matters for AI application development.

Feb 23
Kimi K2 Thinking Model: Complete Developer Guide for Reasoning WorkflowsTutorial

Kimi K2 Thinking Model: Complete Developer Guide for Reasoning Workflows

"Complete guide to Moonshot's Kimi K2 Thinking model. Learn chain-of-thought reasoning, benchmark comparisons, API integration, and cost optimization for production."

May 5
Pika 2.2 API Integration Guide: Build Video Generation Pipelines in 2026Tutorial

Pika 2.2 API Integration Guide: Build Video Generation Pipelines in 2026

"Step-by-step guide to integrating Pika 2.2's API into production video pipelines. Covers text-to-video, image-to-video, effects, pricing, and multi-model fallback strategies."

Apr 13