Docs
  1. ChatGPT (Audio)
Docs
  • Introduction
  • Quick Start Guide
  • Make a request
  • Chat Models
    • ChatGpt
      • ChatGPT (Audio)
        • Create transcription by gpt-4o-mini-transcribe & gpt-4o-transcribe
          POST
        • Create a voice with gpt-4o-mini-tts
          POST
        • Create a voice
          POST
        • Create a transcript
          POST
        • Create translation
          POST
      • ChatGPT (Chat)
        • Chat completion object
        • Create chat completion (streaming)
        • Create chat completion (non-streaming)
        • Create chat image recognition (streaming)
        • Create chat image recognition (streaming) base64
        • Create chat image recognition (non-streaming)
        • Function calling
        • N choices
        • Create chat function call (only non-streaming)
        • Create structured output
      • ChatGPT (Completions)
        • Completion object
        • Creation completed
      • ChatGPT(Embeddings)
        • Embedded Object
        • Create embed
    • Anthropic Claude
      • Offical Format
        • Messages (official Anthropic format)
        • Messages(Image Recognition)
        • Messages(function call)
        • Messages(Web search)
      • Create chat completion (streaming)
      • Create chat completion (non-streaming)
      • Create chat image recognition (streaming)
      • Create chat image recognition (non-streaming)
    • Gemini
      • Gemini Image creation interface (gemini-2.0-flash-exp-image-generation)
      • Chat interface
      • Image recognition interface
      • Function calling - Google Search
      • Function calling - codeExecution
  • Image Models
    • GPT-IMAGE-1
      • Generate Image by gpt-image-1
      • Edit Image by gpt-image-1
    • MJ
      • Submit Imagine task (mj_imagine)
      • Submit Blend task (mj_blend)
      • Submit Describe task (mj_describe)
      • Submit Change task (mj_variation, mj_upscale,mj_reroll)
      • Query task status based on task ID
    • Ideogram
      • Generate with Ideogram 3.0
      • Edit with Ideogram 3.0
      • Remix with Ideogram 3.0
      • Ideogram Upscale
    • Kling Image
      • Submit Image Generation
      • Get Image by Task ID
      • Submit Kolors Virtual Try On
      • Get Kolors Virtual Try On by Task ID
    • Flux
      • Flux on Replicate
        • Submit Image by flux-kontext-pro
        • Submit Image by flux-kontext-max
        • Submit Image by flux-pro
        • Get Image by ID
    • Recraft API
      • Recraft Image
      • Generate Image
      • Generate Vector Image
      • Remove Background
      • Clarity Upscale
      • Generative Upscale
    • Models use Dall-e Format
      • Google Imagen
      • Bytedance - seedream-3.0
      • Recraftv3 use Dall-e endpoint
      • Flux use Dall-e endpoint
    • DALL·E 3
      POST
  • Video Models
    • Kling Video
      • Create Video by Text
      • Get Video by Task ID(text2video)
      • Create Video by Image
      • Get Video by Task ID(image2video)
    • Runway ML Video
      • Create Video by Runway
      • Get Video by Task ID
    • Luma Video
      • Create Video by Luma
      • Get Video by Task ID
    • Pika Video
      • Create Video by Pika
      • Get Video by Task ID
    • Google Veo
      • Submit Video Request
      • Submit Video Request with Frames
      • Get Video by ID
    • Minimax - Hailuo
      • Submit Video Request
      • Get Video
  • Music Model - Suno
    • Illustrate
    • Parameter
    • Task submission
      • Generate songs (inspiration, customization, continuation)
      • Generate lyrics
    • Query interface
      • Query a single task
  • Python Samples
    • python openai official library (using AutoGPT, langchain, etc.)
    • Python uses speech to text
    • Python uses text to speech
    • Python uses Embeddings
    • python calls DALL·E
    • python simple call openai function-calling demo
    • python langchain
    • python llama_index
    • Python uses gpt-4o to identify pictures-local pictures
    • python library streaming output
    • Python uses gpt-4o to identify images
  • Plug-in/software usage tutorials
    • Setting HTTP for Make.com with Yescale
    • Sample Code for gpt-4o-audio/gpt-4o-mini-audio
  • Help Center
    • HTTP status codes
  • Tutorials
    • GPT-Image-1 API: A Step-by-Step Guide With Examples
  1. ChatGPT (Audio)

Create transcription by gpt-4o-mini-transcribe & gpt-4o-transcribe

POST
/v1/audio/transcriptions
GPT-4o (mini) Transcribe is a speech-to-text model that uses GPT-4o to transcribe audio. It offers improvements to word error rate and better language recognition and accuracy compared to original Whisper models. Use it for more accurate transcripts.

Request

Body Params multipart/form-data
file
file 
required
The audio file object (not file name) to transcribe, in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm.
model
string 
required
gpt-4o-mini-transcribe or gpt-4o-transcribe
prompt
string 
optional
An optional text to guide the model's style or continue a previous audio segment. The prompt should match the audio language.
https://platform.openai.com/docs/guides/speech-to-text#prompting

Request samples

Shell
JavaScript
Java
Swift
Go
PHP
Python
HTTP
C
C#
Objective-C
Ruby
OCaml
Dart
R
Request Request Example
Shell
JavaScript
Java
Swift
curl --location --request POST '/v1/audio/transcriptions' \
--form 'file=@""' \
--form 'model=""' \
--form 'prompt=""'

Responses

🟢200OK
application/json
Body
text
string 
required
Example
{
    "text": "ngàn đêm buông xuống phố xa vắng thành chỉ còn mình em với nỗi nhớ anh mây mong ngày xưa đôi ta tay nắm tay trông lối về giờ chỉ còn em ôm ký ức não nề tình mình như áng mây trôi cuốn theo gió bay xa vời lời yêu nắm mấy phơi phôi để lại trong em chơi vơi giọt lệ tuôn rơi sẽ tan tim em rồi anh ơi tại sao tình mình ràng dở vậy thôi nhìn lại những ước anh xưa nụ cười anh vẫn còn đây mà sao giờ đây chỉ thấy bóng hình anh nhạt vai em cố gắng quên đi những kỷ niệm mình có những càng cố gắng em càng nhớ anh thêm bao giờ tình mình như áng mây trôi cuốn theo gió bay xa vời lời yêu nắm mấy phơi phôi để lại trong em chơi vơi giọt lệ tuôn rơi sẽ tan tim em rồi anh ơi tại sao tình mình ràng dở vậy thôi có lẽ em sai khi quá yêu anh đầm sâu để rồi nhận lấy nỗi đau này đến muôn sâu em ước gì thời gian quay trở lại để em có thể giữ anh mới mơ tình mình như áng mây trôi cuốn theo gió bay xa vời lời yêu nắm mấy phơi phôi để lại trong em chơi vơi giọt lệ tuôn rơi sẽ tan tim em rồi anh ơi tại sao tình mình ràng dở vậy thôi dù biết rằng anh sẽ không quay về em vẫn sẽ giữ tình yêu này trong tim em mãi mãi"
}
Modified at 2025-06-29 07:54:20
Previous
Make a request
Next
Create a voice with gpt-4o-mini-tts
Built with