POST /v1/audio/speech
Python (OpenAI SDK)
from openai import OpenAI

client = OpenAI(
    api_key="<COMETAPI_KEY>",
    base_url="https://api.cometapi.com/v1"
)

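# Stream the audio to disk as it arrives instead of buffering the whole file in memory.
with client.audio.speech.with_streaming_response.create(
    model="tts-1",
    voice="alloy",
    input="The quick brown fox jumped over the lazy dog."
) as response:
    response.stream_to_file("output.mp3")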

Authorizations

Authorization (string, header, required)

Bearer token authentication. Use your CometAPI key.
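
For clients that do not use the OpenAI SDK, the header can be set by hand. The following is a minimal sketch using only Python's standard library. It builds the request without sending it. The helper name `build_speech_request` is hypothetical, and the URL is assumed to be the `base_url` from the SDK example joined with this endpoint's path.

```python
import json
import urllib.request

# Assumption: base_url from the SDK example plus the endpoint path.
API_URL = "https://api.cometapi.com/v1/audio/speech"


def build_speech_request(api_key: str, payload: dict) -> urllib.request.Request:
    """Build a POST request that carries the Bearer token (hypothetical helper)."""
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_speech_request(
    "<COMETAPI_KEY>",
    {"model": "tts-1", "voice": "alloy", "input": "Hello"},
)
# Sending is left out of this sketch; urllib.request.urlopen(req) would perform the call.
```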

Body

application/json
model (string, required, default: tts-1)

The TTS model to use. Choose a current speech model from the Models page.

input (string, required)

The text to generate audio for. Maximum length is 4096 characters.
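
Text longer than the limit has to be split across several requests. One possible approach is sketched below; `split_for_tts` is a hypothetical helper, not part of any SDK. It breaks on spaces where it can and falls back to a hard cut otherwise. It assumes the limit counts characters the way Python's `len` does, which this page does not specify.

```python
MAX_INPUT_CHARS = 4096  # limit stated for the `input` field


def split_for_tts(text: str, limit: int = MAX_INPUT_CHARS) -> list[str]:
    """Split text into pieces of at most `limit` characters, preferring breaks at spaces."""
    chunks = []
    remaining = text.strip()
    while len(remaining) > limit:
        # Find the last space that keeps the piece within the limit.
        cut = remaining.rfind(" ", 0, limit + 1)
        if cut <= 0:
            cut = limit  # no space available: hard cut
        chunks.append(remaining[:cut].rstrip())
        remaining = remaining[cut:].lstrip()
    if remaining:
        chunks.append(remaining)
    return chunks
```

Each returned piece can then be sent as the `input` of its own request, and the resulting audio files concatenated or played in order.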

voice (enum<string>, required, default: alloy)

The voice to use for speech synthesis.

Available options: alloy, ash, ballad, coral, echo, fable, onyx, nova, sage, shimmer
response_format (enum<string>, default: mp3)

The audio output format.

Available options: mp3, opus, aac, flac, wav, pcm
speed (number, default: 1)

The speed of the generated audio. Select a value between 0.25 and 4.0.

Required range: 0.25 <= x <= 4
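
The body constraints can be checked on the client before a request is sent. The sketch below is illustrative only: `build_speech_payload` is a hypothetical helper, and rejecting an empty `input` is an assumption (the page states only that the field is required).

```python
VOICES = {"alloy", "ash", "ballad", "coral", "echo", "fable", "onyx", "nova", "sage", "shimmer"}
FORMATS = {"mp3", "opus", "aac", "flac", "wav", "pcm"}


def build_speech_payload(text, model="tts-1", voice="alloy", response_format="mp3", speed=1.0):
    """Validate arguments against the documented constraints and return the JSON body."""
    if not text:
        # Assumption: an empty string is treated as a missing required field.
        raise ValueError("input must not be empty")
    if len(text) > 4096:
        raise ValueError("input exceeds 4096 characters")
    if voice not in VOICES:
        raise ValueError(f"unsupported voice: {voice!r}")
    if response_format not in FORMATS:
        raise ValueError(f"unsupported response_format: {response_format!r}")
    if not 0.25 <= speed <= 4.0:
        raise ValueError("speed must be between 0.25 and 4.0")
    return {
        "model": model,
        "input": text,
        "voice": voice,
        "response_format": response_format,
        "speed": speed,
    }


payload = build_speech_payload("Hello there.", voice="nova", response_format="wav", speed=1.25)
```

Only `model`, `input`, and `voice` are required; the sketch always sends `response_format` and `speed` as well, so the request states its settings explicitly rather than relying on server defaults.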

Response

200 - audio/mpeg

The audio file content.

The response is of type file.
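
Because the body is the raw audio file, a client that is not using the SDK can write the bytes to disk unchanged. A minimal sketch follows; `save_audio` is a hypothetical helper that names the file after the requested `response_format`. The page documents only the `audio/mpeg` content type, so what the server returns for other formats is not confirmed here.

```python
from pathlib import Path


def save_audio(body: bytes, stem: str, response_format: str = "mp3") -> Path:
    """Write raw response bytes to `<stem>.<response_format>` and return the path."""
    if not body:
        raise ValueError("empty response body")
    path = Path(f"{stem}.{response_format}")
    path.write_bytes(body)  # binary write; the audio is not decoded or altered
    return path
```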