Image Generation

Learn how to generate or manipulate images with DALL·E in the API.

Introduction

The Images API provides three methods for interacting with images:

Creating images from scratch based on a text prompt (DALL·E 3 and DALL·E 2)
Creating edited versions of images by having the model replace some areas of a pre-existing image, based on a new text prompt (DALL·E 2 only)
Creating variations of an existing image (DALL·E 2 only)

This guide covers the basics of using these three API endpoints with useful code samples. To try DALL·E 3, head to RockChat.

Usage

The image generations endpoint allows you to create an original image given a text prompt. When using DALL·E 3, images can have a size of 1024x1024, 1024x1792, or 1792x1024 pixels.

By default, images are generated at standard quality, but when using DALL·E 3, you can set quality: "hd" for enhanced detail. Square, standard quality images are the fastest to generate.

You can request one image at a time with DALL·E 3 (request more by making parallel requests) or up to 10 images at a time using DALL·E 2 with the n parameter.

Example: Generate an image

from openai import OpenAI

client = OpenAI(
    api_key = '$ROCKAPI_API_KEY',
    base_url = 'https://api.rockapi.ru/openai/v1'
)

response = client.images.generate(
  model="dall-e-3",
  prompt="a white Siamese cat",
  size="1024x1024",
  quality="standard",
  n=1,
)

image_url = response.data[0].url

Prompting

With the release of DALL·E 3, the model now takes in the default prompt provided and automatically rewrites it for safety reasons, and to add more detail (more detailed prompts generally result in higher quality images).

While it is not currently possible to disable this feature, you can use prompting to get outputs closer to your requested image by adding the following to your prompt: I NEED to test how the tool works with extremely simple prompts. DO NOT add any detail, just use it AS-IS:.

The updated prompt is visible in the revised_prompt field of the data response object.

Example DALL·E 3 generations

Example prompt: "A photograph of a white Siamese cat."

Generated image:

Example Image

Each image can be returned as either a URL or Base64 data, using the response_format parameter. URLs will expire after an hour.

Edits (DALL·E 2 only)

Coming Soon

Also known as "inpainting", the image edits endpoint allows you to edit or extend an image by uploading an image and mask indicating which areas should be replaced. The transparent areas of the mask indicate where the image should be edited, and the prompt should describe the full new image, not just the erased area. This endpoint can enable experiences like DALL·E image editing in ChatGPT Plus.

Example: Edit an image

from openai import OpenAI

client = OpenAI(
    api_key = '$ROCKAPI_API_KEY',
    base_url = 'https://api.rockapi.ru/openai/v1'
)

response = client.images.edit(
  model="dall-e-2",
  image=open("sunlit_lounge.png", "rb"),
  mask=open("mask.png", "rb"),
  prompt="A sunlit indoor lounge area with a pool containing a flamingo",
  n=1,
  size="1024x1024"
)

image_url = response.data[0].url

The uploaded image and mask must both be square PNG images less than 4MB in size and also must have the same dimensions as each other. The non-transparent areas of the mask are not used when generating the output, so they don’t necessarily need to match the original image like the example above.

Variations (DALL·E 2 only)

Coming Soon

The image variations endpoint allows you to generate a variation of a given image.

Example: Generate an image variation

from openai import OpenAI

client = OpenAI(
    api_key = '$ROCKAPI_API_KEY',
    base_url = 'https://api.rockapi.ru/openai/v1'
)

response = client.images.create_variation(
  model="dall-e-2",
  image=open("corgi_and_cat_paw.png", "rb"),
  n=1,
  size="1024x1024"
)

image_url = response.data[0].url

Similar to the edits endpoint, the input image must be a square PNG image less than 4MB in size.

Content moderation

Prompts and images are filtered based on our content policy, returning an error when a prompt or image is flagged.

Language-specific tips

Node.js Example

import OpenAI from "openai";

const openai = new OpenAI();

const buffer = [your image data];
buffer.name = "image.png";

async function main() {
  const image = await openai.images.createVariation({
    model: "dall-e-2",
    image: buffer,
    n: 1,
    size: "1024x1024"
  });
  console.log(image.data);
}
main();

TypeScript Example

import fs from "fs";
import OpenAI from "openai";

const openai = new OpenAI();

async function main() {
  try {
    const image = await openai.images.createVariation({
      image: fs.createReadStream("image.png") as any,
      n: 1,
      size: "1024x1024",
    });
    console.log(image.data);
  } catch (error) {
    if (error.response) {
      console.log(error.response.status);
      console.log(error.response.data);
    } else {
      console.log(error.message);
    }
  }
}
main();

For more details, refer to the OpenAI Cookbook.

Image Generation

Introduction​

Usage​

Prompting​

Example DALL·E 3 generations​

Edits (DALL·E 2 only)​

Variations (DALL·E 2 only)​

Content moderation​

Language-specific tips​

Node.js Example​

TypeScript Example​