Models that support context caching (min 2048 tokens required)
Context caching reduces costs for repeated API calls with similar contexts
Reference: https://ai.google.dev/gemini-api/docs/caching
Models that support code execution (Python)
Code execution allows models to write and execute Python code
Reference: https://ai.google.dev/gemini-api/docs/code-execution
Models that support thinking/reasoning capability with configurable thinking_level
Based on: https://ai.google.dev/gemini-api/docs/gemini-3
Gemini 3 Pro/Flash: supports low, high (default)
Gemini 3 Flash only: also supports minimal, medium