Official Endpoint - v1beta

Text generation

POST /v1beta/models/{model_name}:generateContent

Request

Path Params

model_name (string, required): the Gemini model to call, e.g. gemini-2.5-flash. The full path then becomes /v1beta/models/gemini-2.5-flash:generateContent.

Header Params

x-goog-api-key (string, required): your API key.
Content-Type (string, required): application/json.

Body Params application/json

contents (array, required): the prompt, as a list of content objects; each object carries a parts array whose items hold a text field, as in the example below.

Example
{
    "contents": [
      {
        "parts": [
          {
            "text": "How does AI work?"
          }
        ]
      }
    ]
  }

Request Code Samples

Shell
curl --location --request POST '/v1beta/models/gemini-2.5-flash:generateContent' \
--header 'x-goog-api-key: YOUR_API_KEY' \
--header 'Content-Type: application/json' \
--data-raw '{
    "contents": [
      {
        "parts": [
          {
            "text": "How does AI work?"
          }
        ]
      }
    ]
  }'
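
For reference, the same call in Python: a minimal sketch, not an official sample. It assumes the requests library, plus YESCALE_BASE_URL and YESCALE_API_KEY environment variables as placeholders for your own gateway host and API key; gemini-2.5-flash is the model shown in the sample response below.

Python
# Minimal sketch of POST /v1beta/models/{model_name}:generateContent.
# YESCALE_BASE_URL and YESCALE_API_KEY are placeholders, not names defined by this page.
import os

import requests

BASE_URL = os.environ["YESCALE_BASE_URL"]   # your gateway host (placeholder)
API_KEY = os.environ["YESCALE_API_KEY"]     # your API key (placeholder)
MODEL_NAME = "gemini-2.5-flash"             # model used in the sample response below

url = f"{BASE_URL}/v1beta/models/{MODEL_NAME}:generateContent"
headers = {
    "x-goog-api-key": API_KEY,
    "Content-Type": "application/json",
}
body = {"contents": [{"parts": [{"text": "How does AI work?"}]}]}

resp = requests.post(url, headers=headers, json=body, timeout=60)
resp.raise_for_status()
print(resp.json())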

Responses

🟢 200 OK
application/json
Body

Example
{
    "candidates": [
        {
            "content": {
                "parts": [
                    {
                        "text": "AI, or Artificial Intelligence, fundamentally works by **enabling machines to learn from data, identify patterns, make decisions, and solve problems in ways that simulate human intelligence.** It's not magic, but a combination of complex algorithms, vast amounts of data, and powerful computational resources.\n\nHere's a breakdown of the core concepts and how it generally works:\n\n### The Core Idea: Learning from Data\n\nAt its heart, most modern AI (especially Machine Learning) operates on the principle of **learning from examples rather than being explicitly programmed for every possible scenario.**\n\nImagine teaching a child:\n*   You show them many pictures of cats and dogs, telling them \"This is a cat,\" \"This is a dog.\"\n*   Eventually, they learn to distinguish between the two on their own, even with new pictures they haven't seen before.\n\nAI works similarly.\n\n### Key Components of AI\n\n1.  **Data:** This is the fuel for AI. AI models learn by processing massive amounts of data. This data can be text, images, audio, video, numbers, etc., and often needs to be cleaned, labeled, and prepared.\n2.  **Algorithms (Models):** These are the sets of rules and statistical techniques that AI uses to learn from the data. Different tasks require different types of algorithms.\n3.  **Computational Power:** Training complex AI models requires immense processing power (CPUs and especially GPUs) to handle the vast amounts of data and calculations.\n\n### The General Process of How AI Works\n\n1.  **Data Collection & Preparation:**\n    *   Gathering relevant data (e.g., images for object recognition, text for language translation).\n    *   Cleaning the data (removing errors, inconsistencies).\n    *   Labeling the data (e.g., manually identifying objects in images, or categorizing text). This is crucial for *supervised learning*.\n\n2.  **Choosing a Model/Algorithm:**\n    *   Depending on the problem (e.g., prediction, classification, generation), an appropriate AI model (e.g., neural network, decision tree, support vector machine) is selected.\n\n3.  **Training the Model:**\n    *   The prepared data is fed into the algorithm.\n    *   The algorithm processes this data, looking for patterns and relationships.\n    *   During training, the model makes predictions and compares them to the actual correct answers (if available).\n    *   It then adjusts its internal parameters (often called \"weights\" and \"biases\" in neural networks) to minimize the difference between its predictions and the correct answers. This is an iterative process that can happen millions or billions of times.\n    *   **Analogy:** It's like a student practicing for an exam, getting feedback on wrong answers, and adjusting their understanding.\n\n4.  **Evaluation:**\n    *   After training, the model is tested with *new, unseen data* to assess how well it generalizes and performs on real-world inputs.\n    *   If it performs well, it's ready for deployment. If not, the process might involve further training, more data, or a different algorithm.\n\n5.  **Inference (Deployment/Prediction):**\n    *   Once trained and evaluated, the AI model can be used to make predictions or decisions on new, real-world data.\n    *   For example, a trained image recognition AI can now identify objects in new photos instantly.\n\n### Key Branches of AI\n\nWhile \"AI\" is an umbrella term, modern AI is largely driven by these subfields:\n\n1.  
**Machine Learning (ML):** This is the most common approach today. It focuses on building systems that learn from data.\n    *   **Supervised Learning:** Learns from labeled data (input-output pairs).\n        *   **Classification:** Categorizing data (e.g., spam/not-spam, cat/dog).\n        *   **Regression:** Predicting a continuous value (e.g., house prices, stock values).\n    *   **Unsupervised Learning:** Learns from unlabeled data, finding hidden patterns or structures.\n        *   **Clustering:** Grouping similar data points together (e.g., customer segmentation).\n        *   **Dimensionality Reduction:** Simplifying data while retaining important information.\n    *   **Reinforcement Learning (RL):** An agent learns by performing actions in an environment, receiving rewards for good actions and penalties for bad ones, gradually optimizing its behavior (e.g., AI playing games like Go or chess, robotics).\n\n2.  **Deep Learning (DL):** A *subset* of Machine Learning that uses artificial **neural networks** with many layers (hence \"deep\").\n    *   Inspired by the structure of the human brain (though a very simplified model).\n    *   Each layer in a deep neural network processes the input in a hierarchical way, extracting increasingly complex features.\n    *   Deep learning has revolutionized fields like:\n        *   **Image Recognition (CNNs - Convolutional Neural Networks):** Identifying faces, objects, medical conditions in scans.\n        *   **Natural Language Processing (RNNs, Transformers):** Understanding and generating human language (chatbots, translation, summarization).\n        *   **Speech Recognition:** Converting spoken words to text.\n\n### Examples of AI in Action\n\n*   **Image Recognition:** Your phone unlocking with your face, self-driving cars identifying pedestrians and traffic signs.\n*   **Natural Language Processing:** ChatGPT, Google Translate, spam filters, virtual assistants like Siri or Alexa.\n*   **Recommendation Systems:** Netflix suggesting movies, Amazon recommending products.\n*   **Medical Diagnosis:** AI helping doctors analyze medical images for diseases.\n*   **Financial Fraud Detection:** Identifying suspicious transactions.\n*   **Robotics:** Robots performing complex tasks in manufacturing or exploration.\n\nIn essence, AI works by using sophisticated computational methods to sift through vast amounts of information, learn patterns, and then apply that learned knowledge to new situations to make intelligent decisions or perform specific tasks."
                    }
                ],
                "role": "model"
            },
            "finishReason": "STOP",
            "index": 0
        }
    ],
    "usageMetadata": {
        "promptTokenCount": 6,
        "candidatesTokenCount": 1265,
        "totalTokenCount": 2472,
        "promptTokensDetails": [
            {
                "modality": "TEXT",
                "tokenCount": 6
            }
        ],
        "thoughtsTokenCount": 1201
    },
    "modelVersion": "gemini-2.5-flash",
    "responseId": "kuC6aPShNKWAqtsPgfKR6A8"
}
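
The generated text sits at candidates[0].content.parts[0].text, and token usage (including the thinking tokens reported as thoughtsTokenCount) is under usageMetadata. A short sketch of reading those fields, continuing from the resp object in the Python sketch above:

Python
# Pull the generated text and token usage out of the generateContent response.
data = resp.json()

candidate = data["candidates"][0]
text = "".join(part.get("text", "") for part in candidate["content"]["parts"])
print(text)
print("finish reason:", candidate["finishReason"])

usage = data["usageMetadata"]
print("prompt tokens:  ", usage["promptTokenCount"])
print("output tokens:  ", usage["candidatesTokenCount"])
print("thinking tokens:", usage.get("thoughtsTokenCount", 0))
print("total tokens:   ", usage["totalTokenCount"])
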
Modified at 2025-09-05 13:11:49