Gemini API Reference

Generate Content

Endpoint

POST https://api.rockapi.ru/google-ai-studio/v1beta/{model=models/*}:generateContent

Path Parameters

Name	Type	Description
model	string	Required. The name of the Model to use for generating the completion. Format: `name=models/{model}`.

Request Body

The request body contains data with the following structure:

Fields

Name	Type	Description
contents[]	object	Required. The content of the current conversation with the model. For single-turn queries, this is a single instance. For multi-turn queries, this is a repeated field that contains conversation history + latest request.
tools[]	object	Optional. A list of Tools the model may use to generate the next response. A Tool is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside of knowledge and scope of the model. The only supported tool is currently Function.
toolConfig	object	Optional. Tool configuration for any Tool specified in the request.
safetySettings[]	object	Optional. A list of unique SafetySetting instances for blocking unsafe content. This will be enforced on the `GenerateContentRequest.contents` and `GenerateContentResponse.candidates`. There should not be more than one setting for each SafetyCategory type. The API will block any contents and responses that fail to meet the thresholds set by these settings. This list overrides the default settings for each SafetyCategory specified in the safetySettings. If there is no SafetySetting for a given SafetyCategory provided in the list, the API will use the default safety setting for that category. Supported harm categories: HARM_CATEGORY_HATE_SPEECH, HARM_CATEGORY_SEXUALLY_EXPLICIT, HARM_CATEGORY_DANGEROUS_CONTENT, HARM_CATEGORY_HARASSMENT.
systemInstruction	object	Optional. Developer set system instruction. Currently, text only.
generationConfig	object	Optional. Configuration options for model generation and outputs.
cachedContent	string	Optional. The name of the cached content used as context to serve the prediction. Note: only used in explicit caching, where users can have control over caching (e.g., what content to cache) and enjoy guaranteed cost savings. Format: `cachedContents/{cachedContent}`.

Example Request

import google.generativeai as genai
from google.api_core.client_options import ClientOptions

genai.configure(
    api_key='$ROCKAPI_API_KEY',
    transport='rest',
    client_options=ClientOptions(api_endpoint='https://api.rockapi.ru/google-ai-studio')
)

model = genai.GenerativeModel("gemini-1.5-flash")
response = model.generate_content("Write a story about a magic backpack.")
print(response.text)

Response Body

If successful, the response body contains an instance of GenerateContentResponse.

Stream Generate Content

Endpoint

POST https://api.rockapi.ru/google-ai-studio/v1beta/{model=models/*}:streamGenerateContent

Path Parameters

Name	Type	Description
model	string	Required. The name of the Model to use for generating the completion. Format: `name=models/{model}`.

Request Body

The request body contains data with the following structure:

Fields

Name	Type	Description
contents[]	object	Required. The content of the current conversation with the model. For single-turn queries, this is a single instance. For multi-turn queries, this is a repeated field that contains conversation history + latest request.
tools[]	object	Optional. A list of Tools the model may use to generate the next response. A Tool is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside of knowledge and scope of the model. The only supported tool is currently Function.
toolConfig	object	Optional. Tool configuration for any Tool specified in the request.
safetySettings[]	object	Optional. A list of unique SafetySetting instances for blocking unsafe content. This will be enforced on the `GenerateContentRequest.contents` and `GenerateContentResponse.candidates`. There should not be more than one setting for each SafetyCategory type. The API will block any contents and responses that fail to meet the thresholds set by these settings. This list overrides the default settings for each SafetyCategory specified in the safetySettings. If there is no SafetySetting for a given SafetyCategory provided in the list, the API will use the default safety setting for that category. Supported harm categories: HARM_CATEGORY_HATE_SPEECH, HARM_CATEGORY_SEXUALLY_EXPLICIT, HARM_CATEGORY_DANGEROUS_CONTENT, HARM_CATEGORY_HARASSMENT.
systemInstruction	object	Optional. Developer set system instruction. Currently, text only.
generationConfig	object	Optional. Configuration options for model generation and outputs.
cachedContent	string	Optional. The name of the cached content used as context to serve the prediction. Note: only used in explicit caching, where users can have control over caching (e.g., what content to cache) and enjoy guaranteed cost savings. Format: `cachedContents/{cachedContent}`.

Example Request

import google.generativeai as genai
from google.api_core.client_options import ClientOptions

genai.configure(
    api_key='$ROCKAPI_API_KEY',
    transport='rest',
    client_options=ClientOptions(api_endpoint='https://api.rockapi.ru/google-ai-studio')
)

model = genai.GenerativeModel("gemini-1.5-flash")
response = model.generate_content("Write a story about a magic backpack.", stream=True)
for chunk in response:
    print(chunk.text)
    print("_" * 80)

Response Body

If successful, the response body contains a stream of GenerateContentResponse instances.

GenerateContentResponse

Response from the model supporting multiple candidates.

JSON Representation

Note on safety ratings and content filtering. They are reported for both prompt in GenerateContentResponse.prompt_feedback and for each candidate in finishReason and in safetyRatings. The API contract is that: - either all requested candidates are returned or no candidates at all - no candidates are returned only if there was something wrong with the prompt (see promptFeedback) - feedback on each candidate is reported on finishReason and safetyRatings.

{
  "candidates": [
    {
      object (Candidate)
    }
  ],
  "promptFeedback": {
    object (PromptFeedback)
  },
  "usageMetadata": {
    object (UsageMetadata)
  }
}

Fields

Name	Type	Description
candidates[]	object	Candidate responses from the model.
promptFeedback	object	Returns the prompt's feedback related to the content filters.
usageMetadata	object	Metadata on the generation requests' token usage.

PromptFeedback

A set of the feedback metadata the prompt specified in GenerateContentRequest.content.

JSON Representation

{
  "blockReason": "enum (BlockReason)",
  "safetyRatings": [
    {
      "object": "SafetyRating"
    }
  ]
}

Fields

Name	Type	Description
blockReason	enum	Optional. If set, the prompt was blocked and no candidates are returned. Rephrase your prompt.
safetyRatings[]	object	Ratings for safety of the prompt. There is at most one rating per category.

BlockReason

Specifies what was the reason why the prompt was blocked.

Enums

Name	Description
BLOCK_REASON_UNSPECIFIED	Default value. This value is unused.
SAFETY	Prompt was blocked due to safety reasons. You can inspect safetyRatings to understand which safety category blocked it.
OTHER	Prompt was blocked due to unknown reasons.

UsageMetadata

Metadata on the generation request's token usage.

JSON Representation

{
  "promptTokenCount": "integer",
  "cachedContentTokenCount": "integer",
  "candidatesTokenCount": "integer",
  "totalTokenCount": "integer"
}

Fields

Name	Type	Description
promptTokenCount	integer	Number of tokens in the prompt. When cachedContent is set, this is still the total effective prompt size. I.e., this includes the number of tokens in the cached content.
cachedContentTokenCount	integer	Number of tokens in the cached part of the prompt, i.e., in the cached content.
candidatesTokenCount	integer	Total number of tokens across the generated candidates.
totalTokenCount	integer	Total token count for the generation request (prompt + candidates).

Candidate

A response candidate generated from the model.

JSON Representation

{
  "content": {
    "object": "Content"
  },
  "finishReason": "enum (FinishReason)",
  "safetyRatings": [
    {
      "object": "SafetyRating"
    }
  ],
  "citationMetadata": {
    "object": "CitationMetadata"
  },
  "tokenCount": "integer",
  "groundingAttributions": [
    {
      "object": "GroundingAttribution"
    }
  ],
  "index": "integer"
}

Fields

Name	Type	Description
content	object	Generated content returned from the model.
finishReason	enum	Optional. The reason why the model stopped generating tokens.
safetyRatings[]	object	List of ratings for the safety of a response candidate. There is at most one rating per category.
citationMetadata	object	Citation information for model-generated candidate.
tokenCount	integer	Token count for this candidate.
groundingAttributions[]	object	Attribution information for sources that contributed to a grounded answer.
index	integer	Index of the candidate in the list of candidates.

FinishReason

Defines the reason why the model stopped generating tokens.

Enums

Name	Description
FINISH_REASON_UNSPECIFIED	Default value. This value is unused.
STOP	Natural stop point of the model or provided stop sequence.
MAX_TOKENS	The maximum number of tokens as specified in the request was reached.
SAFETY	The candidate content was flagged for safety reasons.
RECITATION	The candidate content was flagged for recitation reasons.
LANGUAGE	The candidate content was flagged for using an unsupported language.
OTHER	Unknown reason.

GroundingAttribution

Attribution for a source that contributed to an answer.

JSON Representation

{
  "sourceId": {
    "object": "AttributionSourceId"
  },
  "content": {
    "object": "Content"
  }
}

Fields

Name	Type	Description
sourceId	object	Identifier for the source contributing to this attribution.
content	object	Grounding source content that makes up this attribution.

AttributionSourceId

Identifier for the source contributing to this attribution.

JSON Representation

{
  "groundingPassage": {
    "object": "GroundingPassageId"
  },
  "semanticRetrieverChunk": {
    "object": "SemanticRetrieverChunk"
  }
}

Fields

Name	Type	Description
groundingPassage	object	Identifier for an inline passage.
semanticRetrieverChunk	object	Identifier for a Chunk fetched via Semantic Retriever.

GroundingPassageId

Identifier for a part within a GroundingPassage.

JSON Representation

{
  "passageId": "string",
  "partIndex": "integer"
}

Fields

Name	Type	Description
passageId	string	ID of the passage matching the `GenerateAnswerRequest`'s GroundingPassage.id.
partIndex	integer	Index of the part within the `GenerateAnswerRequest`'s GroundingPassage.content.

SemanticRetrieverChunk

Identifier for a Chunk retrieved via Semantic Retriever specified in the GenerateAnswerRequest using SemanticRetrieverConfig.

JSON Representation

{
  "source": "string",
  "chunk": "string"
}

Fields

Name	Type	Description
source	string	Name of the source matching the request's `SemanticRetrieverConfig.source`. Example: `corpora/123` or `corpora/123/documents/abc`.
chunk	string	Name of the Chunk containing the attributed text. Example: `corpora/123/documents/abc/chunks/xyz`.

CitationMetadata

A collection of source attributions for a piece of content.

JSON Representation

{
  "citationSources": [
    {
      "object": "CitationSource"
    }
  ]
}

Fields

Name	Type	Description
citationSources[]	object	Citations to sources for a specific response.

CitationSource

A citation to a source for a portion of a specific response.

JSON Representation

{
  "startIndex": "integer",
  "endIndex": "integer",
  "uri": "string",
  "license": "string"
}

Fields

Name	Type	Description
startIndex	integer	Optional. Start of the segment of the response that is attributed to this source. Index indicates the start of the segment, measured in bytes.
endIndex	integer	Optional. End of the attributed segment, exclusive.
uri	string	Optional. URI that is attributed as a source for a portion of the text.
license	string	Optional. License for the GitHub project that is attributed as a source for the segment. License info is required for code citations.

GenerationConfig

Configuration options for model generation and outputs. Not all parameters may be configurable for every model.

JSON Representation

{
  "stopSequences": [
    "string"
  ],
  "responseMimeType": "string",
  "responseSchema": {
    "object": "Schema"
  },
  "candidateCount": "integer",
  "maxOutputTokens": "integer",
  "temperature": "number",
  "topP": "number",
  "topK": "integer"
}

Fields

Name	Type	Description
stopSequences[]	string	Optional. The set of character sequences (up to 5) that will stop output generation. If specified, the API will stop at the first appearance of a stop sequence. The stop sequence will not be included as part of the response.
responseMimeType	string	Optional. Output response MIME type of the generated candidate text. Supported MIME types: `text/plain` (default), `application/json` (JSON response in the candidates).
responseSchema	object	Optional. Output response schema of the generated candidate text when response MIME type can have schema. Schema can be objects, primitives, or arrays and is a subset of OpenAPI schema. If set, a compatible responseMimeType must also be set. Compatible MIME types: `application/json` (Schema for JSON response).
candidateCount	integer	Optional. Number of generated responses to return. Currently, this value can only be set to 1. If unset, this will default to 1.
maxOutputTokens	integer	Optional. The maximum number of tokens to include in a candidate. Note: The default value varies by model, see the `Model.output_token_limit` attribute of the Model returned from the `getModel` function.
temperature	number	Optional. Controls the randomness of the output. Note: The default value varies by model, see the `Model.temperature` attribute of the Model returned from the `getModel` function. Values can range from [0.0, 2.0].
topP	number	Optional. The maximum cumulative probability of tokens to consider when sampling. The model uses combined Top-k and nucleus sampling. Tokens are sorted based on their assigned probabilities so that only the most likely tokens are considered. Top-k sampling directly limits the maximum number of tokens to consider, while Nucleus sampling limits the number of tokens based on the cumulative probability. Note: The default value varies by model, see the `Model.top_p` attribute of the Model returned from the `getModel` function.
topK	integer	Optional. The maximum number of tokens to consider when sampling. Models use nucleus sampling or combined Top-k and nucleus sampling. Top-k sampling considers the set of topK most probable tokens. Models running with nucleus sampling don't allow topK setting. Note: The default value varies by model, see the `Model.top_k` attribute of the Model returned from the `getModel` function. An empty topK field in Model indicates the model doesn't apply top-k sampling and doesn't allow setting topK on requests.

HarmCategory

The category of a rating. These categories cover various kinds of harms that developers may wish to adjust.

Enums

Name	Description
HARM_CATEGORY_UNSPECIFIED	Category is unspecified.
HARM_CATEGORY_DEROGATORY	Negative or harmful comments targeting identity and/or protected attribute.
HARM_CATEGORY_TOXICITY	Content that is rude, disrespectful, or profane.
HARM_CATEGORY_VIOLENCE	Describes scenarios depicting violence against an individual or group, or general descriptions of gore.
HARM_CATEGORY_SEXUAL	Contains references to sexual acts or other lewd content.
HARM_CATEGORY_MEDICAL	Promotes unchecked medical advice.
HARM_CATEGORY_DANGEROUS	Dangerous content that promotes, facilitates, or encourages harmful acts.
HARM_CATEGORY_HARASSMENT	Harassment content.
HARM_CATEGORY_HATE_SPEECH	Hate speech and content.
HARM_CATEGORY_SEXUALLY_EXPLICIT	Sexually explicit content.
HARM_CATEGORY_DANGEROUS_CONTENT	Dangerous content.

SafetyRating

Safety rating for a piece of content. The safety rating contains the category of harm and the harm probability level in that category for a piece of content. Content is classified

for safety across a number of harm categories and the probability of the harm classification is included here.

JSON Representation

{
  "category": "enum (HarmCategory)",
  "probability": "enum (HarmProbability)",
  "blocked": "boolean"
}

Fields

Name	Type	Description
category	enum	Required. The category for this rating.
probability	enum	Required. The probability of harm for this content.
blocked	boolean	Was this content blocked because of this rating?

HarmProbability

The probability that a piece of content is harmful. The classification system gives the probability of the content being unsafe. This does not indicate the severity of harm for a piece of content.

Enums

Name	Description
HARM_PROBABILITY_UNSPECIFIED	Probability is unspecified.
NEGLIGIBLE	Content has a negligible chance of being unsafe.
LOW	Content has a low chance of being unsafe.
MEDIUM	Content has a medium chance of being unsafe.
HIGH	Content has a high chance of being unsafe.

SafetySetting

Safety setting, affecting the safety-blocking behavior. Passing a safety setting for a category changes the allowed probability that content is blocked.

JSON Representation

{
  "category": "enum (HarmCategory)",
  "threshold": "enum (HarmBlockThreshold)"
}

Fields

Name	Type	Description
category	enum	Required. The category for this setting.
threshold	enum	Required. Controls the probability threshold at which harm is blocked.

HarmBlockThreshold

Block at and beyond a specified harm probability.

Enums

Name	Description
HARM_BLOCK_THRESHOLD_UNSPECIFIED	Threshold is unspecified.
BLOCK_LOW_AND_ABOVE	Content with NEGLIGIBLE will be allowed.
BLOCK_MEDIUM_AND_ABOVE	Content with NEGLIGIBLE and LOW will be allowed.
BLOCK_ONLY_HIGH	Content with NEGLIGIBLE, LOW, and MEDIUM will be allowed.
BLOCK_NONE	All content will be allowed.

Gemini API Reference

Generate Content​

Endpoint​

Path Parameters​

Request Body​

Fields​

Example Request​

Response Body​

Stream Generate Content​

Endpoint​

Path Parameters​

Request Body​

Fields​

Example Request​

Response Body​

GenerateContentResponse​

JSON Representation​

Fields​

PromptFeedback​

JSON Representation​

Fields​

BlockReason​

Enums​

UsageMetadata​

JSON Representation​

Fields​

Candidate​

JSON Representation​

Fields​

FinishReason​

Enums​

GroundingAttribution​

JSON Representation​

Fields​

AttributionSourceId​

JSON Representation​

Fields​

GroundingPassageId​

JSON Representation​

Fields​

SemanticRetrieverChunk​

JSON Representation​

Fields​

CitationMetadata​

JSON Representation​

Fields​

CitationSource​

JSON Representation​

Fields​

GenerationConfig​

JSON Representation​

Fields​

HarmCategory​

Enums​

SafetyRating​

JSON Representation​

Fields​

HarmProbability​

Enums​

SafetySetting​

JSON Representation​

Fields​

HarmBlockThreshold​

Enums​

Generate Content

Endpoint

Path Parameters

Request Body

Fields

Example Request

Response Body

Stream Generate Content

Endpoint

Path Parameters

Request Body

Fields

Example Request

Response Body

GenerateContentResponse

JSON Representation

Fields

PromptFeedback

JSON Representation

Fields

BlockReason

Enums

UsageMetadata

JSON Representation

Fields

Candidate

JSON Representation

Fields

FinishReason

Enums

GroundingAttribution

JSON Representation

Fields

AttributionSourceId

JSON Representation

Fields

GroundingPassageId

JSON Representation

Fields

SemanticRetrieverChunk

JSON Representation

Fields

CitationMetadata

JSON Representation

Fields

CitationSource

JSON Representation

Fields

GenerationConfig

JSON Representation

Fields

HarmCategory

Enums

SafetyRating

JSON Representation

Fields

HarmProbability

Enums

SafetySetting

JSON Representation

Fields

HarmBlockThreshold

Enums