Skip to content

Audio Understanding

POST
/v1beta/models/gemini-2.5-pro:generateContent
  • Upload audio via inline_data (base64, e.g. audio/mp3)
  • Use text to specify tasks such as transcription, summarization, or Q&A
  • Supports native multimodal generateContent format
  • Official docs: Audio understanding

Authorizations

bearer
Type
HTTP (bearer)

Request Body

application/json
object
object[]
Required

Responses

Success

application/json

Playground

Authorization
Body

Samples