Official Endpoint - v1beta

Thinking with Gemini 2.5

POST
/v1beta/models/{model_name}:generateContent

Request

Path Params
model_name (string, required): the Gemini model to call in the URL path, e.g. gemini-2.5-flash

Header Params
x-goog-api-key (string, required): your API key
Content-Type (string, required): application/json

Body Params application/json
contents (array, required): the conversation turns; each entry carries a parts array of text segments
generationConfig.thinkingConfig.thinkingBudget (integer, optional): token budget for the model's internal reasoning

Example
{
    "contents": [
      {
        "parts": [
          {
            "text": "How does AI work?"
          }
        ]
      }
    ],
    "generationConfig": {
      "thinkingConfig": {
        "thinkingBudget": 0
      }
    }
  }
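In the example, thinkingBudget is 0, which disables thinking for gemini-2.5-flash; a positive value lets the model spend up to that many tokens reasoning before it answers. A minimal sketch of both request bodies in Python (the 1024 budget is illustrative, not a value from this page):

Python
# Same prompt with thinking disabled and enabled.
# The 1024 budget is illustrative; check the Gemini docs for model-specific limits.
PROMPT = {"contents": [{"parts": [{"text": "How does AI work?"}]}]}

no_thinking = {
    **PROMPT,
    "generationConfig": {"thinkingConfig": {"thinkingBudget": 0}},
}
with_thinking = {
    **PROMPT,
    "generationConfig": {"thinkingConfig": {"thinkingBudget": 1024}},
}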

Request Example
Shell
# Replace YOUR_API_KEY with your key and prepend your YEScale base URL to the path.
curl --location --request POST '/v1beta/models/gemini-2.5-flash:generateContent' \
--header 'x-goog-api-key: YOUR_API_KEY' \
--header 'Content-Type: application/json' \
--data-raw '{
    "contents": [
      {
        "parts": [
          {
            "text": "How does AI work?"
          }
        ]
      }
    ],
    "generationConfig": {
      "thinkingConfig": {
        "thinkingBudget": 0
      }
    }
  }'
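For comparison, a Python equivalent of the curl call above, as a sketch: the base URL and the YESCALE_API_KEY environment variable are placeholders for your own gateway host and key, not values from this page.

Python
import os

import requests

# Placeholder base URL; substitute your YEScale gateway host.
BASE_URL = "https://api.yescale.io"
MODEL = "gemini-2.5-flash"

resp = requests.post(
    f"{BASE_URL}/v1beta/models/{MODEL}:generateContent",
    headers={"x-goog-api-key": os.environ["YESCALE_API_KEY"]},  # placeholder env var
    json={
        "contents": [{"parts": [{"text": "How does AI work?"}]}],
        "generationConfig": {"thinkingConfig": {"thinkingBudget": 0}},
    },
    timeout=60,
)
resp.raise_for_status()
data = resp.json()
print(data["candidates"][0]["content"]["parts"][0]["text"])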

Responses

🟢 200 OK
application/json
Body

Example
{
    "candidates": [
        {
            "content": {
                "parts": [
                    {
                        "text": "That's a fantastic question, and one that many people are curious about! The truth is, \"AI\" is a very broad term, and there are many different types and approaches. However, I can give you a good overview of the core concepts and how they generally work.\n\nAt its heart, **AI works by training computer programs to recognize patterns and make decisions, often by learning from data, much like humans do (but in a very different way).**\n\nHere's a breakdown of the key components and processes:\n\n---\n\n## The Core Components of AI\n\n1.  **Data:** This is the fuel for AI. It can be anything from images, text, numbers, audio, videos, or sensor readings. The more relevant and diverse the data, the better an AI can learn.\n\n2.  **Algorithms:** These are the step-by-step instructions or mathematical formulas that the AI uses to process data, identify patterns, and make decisions. Think of them as the \"brains\" of the AI.\n\n3.  **Computational Power:** AI, especially modern AI, requires a lot of processing power (CPUs and especially GPUs) to handle large datasets and complex algorithms.\n\n---\n\n## The General Process: Learning and Inference\n\nMost AI systems follow a two-phase process:\n\n### Phase 1: Training (Learning)\n\nThis is where the AI \"learns\" from the data.\n\n1.  **Input Data:** You feed the algorithm a vast amount of data. For example:\n    *   **Image Recognition:** Thousands of images labeled \"cat\" or \"dog.\"\n    *   **Language Translation:** Millions of pairs of sentences in two languages.\n    *   **Recommendation System:** User purchase history and ratings.\n\n2.  **Feature Extraction:** The algorithm identifies relevant characteristics (features) within the data. For images, these might be edges, shapes, colors. For text, it could be word frequencies or grammatical structures.\n\n3.  **Pattern Recognition:** The algorithm looks for correlations and patterns between the input features and the desired output. It tries to build a mathematical model that explains these relationships.\n\n4.  **Model Adjustment (Learning):** The AI makes predictions based on its current model. It then compares its predictions to the actual \"correct\" answers (if available, this is called \"supervised learning\"). If there's a discrepancy, the algorithm adjusts its internal parameters (weights and biases) to reduce the error. This process is repeated thousands or millions of times.\n\n    *   **Analogy:** Imagine a child learning to distinguish between apples and oranges. You show them many examples, tell them \"this is an apple,\" \"this is an orange.\" When they make a mistake, you correct them. Over time, they learn the distinguishing features. The AI does something similar, but with complex mathematical adjustments.\n\n### Phase 2: Inference (Prediction/Action)\n\nOnce the AI is trained, it can be put to work.\n\n1.  **New Input:** You provide the trained AI with new, unseen data.\n2.  **Feature Extraction (again):** The AI extracts features from this new data using the same methods it learned during training.\n3.  
**Prediction/Decision:** Using its learned model, the AI makes a prediction or takes an action based on the new input.\n    *   **Image Recognition:** Given a new image, it predicts \"cat.\"\n    *   **Language Translation:** Given a new sentence, it translates it.\n    *   **Recommendation System:** Suggests a new product to a user.\n\n---\n\n## Key Paradigms and Techniques\n\nWhile the above is a general framework, different AI fields employ specific techniques:\n\n1.  **Machine Learning (ML):** This is the broadest subset of AI. It focuses on building systems that can learn from data without being explicitly programmed for every possible scenario.\n    *   **Supervised Learning:** Learning from labeled data (e.g., \"this is a picture of a dog\").\n    *   **Unsupervised Learning:** Finding patterns in unlabeled data (e.g., grouping similar news articles).\n    *   **Reinforcement Learning:** Learning through trial and error, receiving rewards or penalties for actions (e.g., an AI learning to play a game).\n\n2.  **Deep Learning (DL):** A subfield of ML that uses **Artificial Neural Networks (ANNs)** with many layers (hence \"deep\"). These networks are inspired by the structure of the human brain.\n    *   **How it works:** Each \"neuron\" in a layer receives input, processes it, and passes it to the next layer. Deeper layers learn more complex and abstract features.\n    *   **Why it's powerful:** Deep learning excels at tasks like image recognition, natural language processing, and speech recognition because it can automatically learn incredibly complex and hierarchical features from raw data, reducing the need for human-engineered feature extraction.\n\n3.  **Natural Language Processing (NLP):** Enables computers to understand, interpret, and generate human language (e.g., chatbots, translation, sentiment analysis). Many modern NLP models use deep learning (like Transformers, which power models like ChatGPT).\n\n4.  **Computer Vision (CV):** Enables computers to \"see\" and interpret visual information from images and videos (e.g., facial recognition, self-driving cars, medical image analysis). Also heavily reliant on deep learning.\n\n5.  **Robotics:** Integrates AI with physical machines to perform tasks in the real world.\n\n---\n\n## Analogy with the Human Brain (and why it's a loose one)\n\nPeople often compare AI to the human brain. While inspiration is drawn, it's crucial to understand the differences:\n\n*   **Human Brain:** Billions of neurons, incredible plasticity, energy-efficient, learns from limited data, understands context, common sense, emotions.\n*   **AI (Neural Networks):** Millions/billions of *parameters* (not actual neurons), requires vast amounts of data, computational power, lacks true understanding or common sense. It's a highly sophisticated pattern-matching machine.\n\n---\n\n## In Summary\n\nAI works by **training algorithms on large datasets to identify patterns and make predictions or decisions.** The more sophisticated the AI (especially deep learning), the more complex the patterns it can learn. It's not magic, but rather highly advanced statistics, mathematics, and computer science, building systems that can perform tasks that traditionally required human intelligence."
                    }
                ],
                "role": "model"
            },
            "finishReason": "STOP",
            "index": 0
        }
    ],
    "usageMetadata": {
        "promptTokenCount": 6,
        "candidatesTokenCount": 1353,
        "totalTokenCount": 1359,
        "promptTokensDetails": [
            {
                "modality": "TEXT",
                "tokenCount": 6
            }
        ]
    },
    "modelVersion": "gemini-2.5-flash",
    "responseId": "GOG6aO3iCMuumtkPl8L8wAQ"
}
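The generated text sits at candidates[0].content.parts[0].text and token accounting under usageMetadata, as the example shows. A small helper for pulling both out of the parsed JSON (field names taken from the response above):

Python
def summarize(response: dict) -> None:
    # Concatenate the text parts of the first candidate.
    parts = response["candidates"][0]["content"]["parts"]
    text = "".join(part.get("text", "") for part in parts)
    usage = response["usageMetadata"]
    print(text[:200])
    print("finish reason:", response["candidates"][0]["finishReason"])
    print("prompt tokens:", usage["promptTokenCount"])
    print("total tokens:", usage["totalTokenCount"])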
Modified at 2025-09-05 13:12:00