POST /paas/v4/chat/completions
curl --request POST \
--url https://api.z.ai/api/paas/v4/chat/completions \
--header 'Authorization: Bearer <token>' \
--header 'Content-Type: application/json' \
--data '{
"model": "glm-4.6",
"messages": [
{
"role": "system",
"content": "You are a useful AI assistant."
},
{
"role": "user",
"content": "Please tell us about the development of artificial intelligence."
}
],
"temperature": 1,
"max_tokens": 65536,
"stream": false
}'
{
  "id": "<string>",
  "request_id": "<string>",
  "created": 123,
  "model": "<string>",
  "choices": [
    {
      "index": 123,
      "message": {
        "role": "assistant",
        "content": "<string>",
        "reasoning_content": "<string>",
        "tool_calls": [
          {
            "function": {
              "name": "<string>",
              "arguments": {}
            },
            "id": "<string>",
            "type": "<string>"
          }
        ]
      },
      "finish_reason": "<string>"
    }
  ],
  "usage": {
    "prompt_tokens": 123,
    "completion_tokens": 123,
    "prompt_tokens_details": {
      "cached_tokens": 123
    },
    "total_tokens": 123
  },
  "web_search": [
    {
      "title": "<string>",
      "content": "<string>",
      "link": "<string>",
      "media": "<string>",
      "icon": "<string>",
      "refer": "<string>",
      "publish_date": "<string>"
    }
  ]
}

Authorizations

Authorization
string
header
required

Use the following format for authentication: Bearer <your api key>

Headers

Accept-Language
enum<string>
default:en-US,en

Configures the desired response language for HTTP requests.

Available options:
en-US,en
Example:

"en-US,en"

Body

application/json
  • Text Model
  • Vision Model
model
enum<string>
default:glm-4.6
required

The model code to be called. GLM-4.6 is the latest flagship model series: foundational models designed specifically for agent applications.

Available options:
glm-4.6,
glm-4.5,
glm-4.5-air,
glm-4.5-x,
glm-4.5-airx,
glm-4.5-flash,
glm-4-32b-0414-128k
Example:

"glm-4.6"

messages
(User Message Β· object | System Message Β· object | Assistant Message Β· object | Tool Message Β· object)[]
required

The current conversation message list, provided as the model's prompt input in JSON array format, e.g., {"role": "user", "content": "Hello"}. Possible message types include system messages, user messages, assistant messages, and tool messages; a sketch combining them follows the list below. Note: the input must not consist of system messages or assistant messages only.

Minimum length: 1
  • User Message
  • System Message
  • Assistant Message
  • Tool Message
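
As a sketch, a minimal multi-turn messages array combining system, user, and assistant messages (tool messages follow the same pattern):

"messages": [
  {"role": "system", "content": "You are a helpful AI assistant."},
  {"role": "user", "content": "Hello"},
  {"role": "assistant", "content": "Hi! How can I help you today?"},
  {"role": "user", "content": "Please tell me about the development of artificial intelligence."}
]
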
request_id
string

Passed in by the client and must be unique; used to distinguish each request. If not provided, the platform generates one by default.

do_sample
boolean
default:true

When do_sample is true, the sampling strategy is enabled; when do_sample is false, sampling parameters such as temperature and top_p have no effect. The default value is true.

Example:

true
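
For example, to request near-deterministic output, disable sampling in the request body (a sketch; with sampling off, temperature and top_p are ignored):

{
  "model": "glm-4.6",
  "messages": [{"role": "user", "content": "Hello"}],
  "do_sample": false
}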

stream
boolean
default:false

Set this parameter to false, or omit it, for synchronous calls: the model returns all content at once after generation completes. The default value is false. If set to true, the model returns the generated content in chunks via a standard Event Stream; when the Event Stream ends, a data: [DONE] message is returned.

Example:

false
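
As a sketch, a streaming call and the resulting event stream might look like the following (the delta chunk payloads are illustrative, not verbatim server output):

curl --request POST \
--url https://api.z.ai/api/paas/v4/chat/completions \
--header 'Authorization: Bearer <token>' \
--header 'Content-Type: application/json' \
--data '{
"model": "glm-4.6",
"messages": [{"role": "user", "content": "Hello"}],
"stream": true
}'

# Each chunk arrives as a server-sent event:
# data: {"choices": [{"delta": {"content": "Hi"}}], ...}
# data: {"choices": [{"delta": {"content": " there"}}], ...}
# data: [DONE]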

thinking
object

Only supported by the GLM-4.5 series and later models. This parameter controls whether the model enables its chain of thought.
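
A sketch of the request fragment; the exact object shape is not spelled out above, so the type field and its "enabled" value are assumptions here:

"thinking": {"type": "enabled"}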

temperature
number
default:1

Sampling temperature controls the randomness of the output and must fall within the range [0.0, 1.0]. The GLM-4.6 series default is 1.0, the GLM-4.5 series default is 0.6, and the GLM-4-32B-0414-128K default is 0.75.

Required range: 0 <= x <= 1
Example:

1

top_p
number
default:0.95

Nucleus sampling, an alternative to temperature sampling; the value range is (0.0, 1.0]. The GLM-4.6 and GLM-4.5 series default is 0.95; the GLM-4-32B-0414-128K default is 0.9.

Required range: 0 <= x <= 1
Example:

0.95

max_tokens
integer

The maximum number of tokens for model output. The GLM-4.6 series supports up to 128K output tokens, the GLM-4.5 series up to 96K, the GLM-4.5V series up to 16K, and GLM-4-32B-0414-128K up to 16K.

Required range: 1 <= x <= 98304
Example:

1024

tool_stream
boolean
default:false

Whether to enable streaming responses for function calls. The default value is false. Only supported by GLM-4.6. Refer to Stream Tool Call.

Example:

false

tools
(Function Call Β· object | Retrieval Β· object | Web Search Β· object)[]

A list of tools the model may call. Currently, only functions are supported as a tool. Use this to provide a list of functions the model may generate JSON inputs for; a sketch follows the list below. A maximum of 128 functions is supported.

  • Function Call
  • Retrieval
  • Web Search
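
As a sketch of a function tool definition paired with tool_choice (the get_weather function and its parameters are hypothetical; the type/function/parameters layout assumes the common OpenAI-compatible schema):

"tools": [
  {
    "type": "function",
    "function": {
      "name": "get_weather",
      "description": "Get the current weather for a city",
      "parameters": {
        "type": "object",
        "properties": {
          "city": {"type": "string", "description": "City name"}
        },
        "required": ["city"]
      }
    }
  }
],
"tool_choice": "auto"
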
tool_choice
enum<string>

Controls how the model selects which function to call. This is only applicable when the tool type is function. The default value is auto, and only auto is currently supported.

Available options:
auto
stop
string[]

Stop word list. Generation stops when the model encounters any of the specified strings. Currently, only one stop word is supported, in the format ["stop_word1"].

Maximum length: 1
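
For example, to stop generation at a blank line (the stop string here is illustrative):

"stop": ["\n\n"]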
response_format
object

Specifies the response format of the model. Defaults to text. Two formats are supported: { "type": "text" }, plain text mode, which returns natural language text, and { "type": "json_object" }, JSON mode, which returns valid JSON data. When using JSON mode, it's recommended to explicitly request JSON output in the prompt.
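
A sketch of a JSON-mode request body (the prompt wording is illustrative):

{
  "model": "glm-4.6",
  "messages": [
    {"role": "user", "content": "List three milestones in AI history as JSON with fields year and event."}
  ],
  "response_format": {"type": "json_object"}
}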

user_id
string

Unique ID for the end user, 6-128 characters. Avoid using sensitive information.

Required string length: 6 - 128

Response

Processing successful

id
string

Task ID

request_id
string

Request ID

created
integer

Request creation time, Unix timestamp in seconds

model
string

Model name

choices
object[]

List of model responses

usage
object

Token usage statistics returned when the model call ends.

web_search
object[]

Search results.
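
As a usage sketch, the assistant's reply can be pulled out of a synchronous response with jq (assuming jq is installed):

curl --request POST \
--url https://api.z.ai/api/paas/v4/chat/completions \
--header 'Authorization: Bearer <token>' \
--header 'Content-Type: application/json' \
--data '{"model": "glm-4.6", "messages": [{"role": "user", "content": "Hello"}]}' \
| jq -r '.choices[0].message.content'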