Save a model, system prompt, and parameters as a reusable preset — then call it like a model ID.
Presets let you bundle a model, system prompt, and generation parameters into a single reusable configuration. Once created, you reference a preset exactly like a model ID — no extra code, no repeated config.Presets can be private (only your API key can use them) or public (shareable with anyone).
Give it a short, descriptive name. This is for your reference only.
2
Choose a model
Select any supported model, or choose Any Model to leave the model open at request time. With Any Model, callers must specify the model using the @ syntax — see Overriding the model below.
3
Write a system prompt
Enter the system prompt that should apply to every request using this preset. This is the main reason to use presets — ship a locked, versioned persona or instruction set without exposing the prompt in client code.
4
Set parameters (optional)
Configure defaults like temperature, max_tokens, top_p, and other generation parameters. These apply automatically to every request unless overridden.
5
Set visibility
Choose Private (default) or Public. Public presets can be used by anyone with the preset ID.
This is useful when you want a shared system prompt and parameter set but need the caller to decide which underlying model to use — for example, letting users pick between gpt-4o and gpt-4o-mini while keeping the same persona.
Presets work with all standard Chat Completions parameters, including stream:
stream = client.chat.completions.create( model="presets/0ede00c3-b202-4102-bda7-ec9a4b398767", messages=[ {"role": "user", "content": "Tell me a short story."} ], stream=True,)for chunk in stream: if chunk.choices[0].delta.content: print(chunk.choices[0].delta.content, end="", flush=True)
Public presets expose the system prompt to anyone who has the preset ID. Don’t include secrets, internal instructions, or sensitive information in the system prompt of a public preset.