GLM-4-32B-0414-128K

Overview

GLM-4-32B-0414-128K is a highly cost-effective foundation language model. It can efficiently perform complex tasks and has significantly enhanced capabilities in tool use, online search, and code-related intelligent tasks.

Price

$0.1 per million tokens

Input Modality

Text

Output Modality

Text

Context Length

128K

Maximum Output Tokens

16K
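
As a rough illustration of the listed price, the sketch below estimates spend at $0.1 per million tokens. It assumes the rate applies uniformly to input and output tokens, which this page does not spell out, and `estimate_cost_usd` is a hypothetical helper rather than part of any SDK.

```python
from decimal import Decimal

# Listed price from this page: $0.1 per million tokens.
PRICE_PER_MILLION_TOKENS_USD = Decimal("0.1")


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate cost, assuming one flat rate for input and output tokens."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total_tokens = input_tokens + output_tokens
    return PRICE_PER_MILLION_TOKENS_USD * total_tokens / Decimal(1_000_000)


if __name__ == "__main__":
    # Example: 100,000 prompt tokens plus 16,000 generated tokens.
    print(estimate_cost_usd(100_000, 16_000))
```

Decimal arithmetic is used here so that small per-request amounts do not pick up binary floating-point rounding noise when summed over many calls.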

Usage

Intelligent Q&A Assistant

Supports real-time online search to retrieve the latest information, accurately parses complex queries such as e-commerce product inquiries, financial service terms, and education course Q&A, and generates precise, professional answers based on enterprise knowledge bases.

Intelligent Quality Inspection

Accurately identifies and extracts key information and business fields from complex texts such as customer service tickets, automating analyses like sales pitch inspection and risk identification while adhering strictly to standard operating procedures (SOPs) and greatly reducing data processing time.

Financial Data Analysis

Cleanses financial data in real time, automatically extracts key insights, and detects potential trends and correlations. Supports scenarios such as bid document analysis, financial report interpretation, and market trend monitoring.

Code Generation

Based on intent decomposition and logical reasoning, accurately generates initial code frameworks or key functions in mainstream languages such as Python, Java, and JavaScript. Supports multi-turn, context-aware iterative development, and can comment existing code or rewrite functions to add clear annotations.

Job Market Analysis

Analyzes job descriptions and resumes in depth against real-time job information, industry salary trends, and talent demand, providing precise talent-matching recommendations for enterprises and analysis of employment trends and career development paths for job seekers.
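
The multi-turn iterative development mentioned under Code Generation can be sketched as a growing `messages` list, using the same `role` and `content` fields as the requests in Quick Start. The `assistant` role for earlier model replies follows the common chat-completions convention and is an assumption here, as are the helper names `add_turn` and `build_payload`.

```python
import json

MODEL = "glm-4-32b-0414-128k"


def add_turn(messages: list, role: str, content: str) -> list:
    """Return a new history with one more message appended."""
    if role not in {"system", "user", "assistant"}:
        raise ValueError(f"unexpected role: {role}")
    return messages + [{"role": role, "content": content}]


def build_payload(messages: list) -> dict:
    """Build a request body that carries the whole conversation so far."""
    return {"model": MODEL, "messages": messages}


history = []
history = add_turn(history, "user", "Write a Python function that parses an ISO 8601 date.")
# In practice this text would be copied from the model's previous response.
history = add_turn(history, "assistant", "def parse_date(s): ...")
history = add_turn(history, "user", "Add comments and handle invalid input.")

payload = build_payload(history)
print(json.dumps(payload, indent=2))
```

Each follow-up request resends the full history, so long sessions should be checked against the 128K context length.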

Resources

API Documentation: Learn how to call the API.

Introducing GLM-4-32B-0414-128K

GLM-4-32B-0414-128K was pre-trained on 15T of high-quality data, including a large amount of synthetic reasoning data, to lay a solid foundation for subsequent reinforcement learning. In the post-training phase, besides aligning the model with human preferences in dialogue scenarios, we also applied techniques such as rejection sampling and reinforcement learning to improve instruction following, engineering code generation, and function calling, strengthening the model's fundamental capabilities for intelligent tasks.

The model performs comparably to much larger mainstream models, both domestic and international, with some benchmark results approaching or even exceeding those of models such as GPT-4o and DeepSeek-V3-0324 (671B).

Capability

Streaming Output

Structured Output

Function Calling

Knowledge Base Retrieval

Web Search

The supported search engine is Jina AI, priced at $0.01 per use.
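
To make the Function Calling capability listed above more concrete, here is a minimal sketch of a request body that offers one tool, plus a local dispatcher for a returned tool call. The `tools` schema and the tool-call shape follow the widely used OpenAI-style convention and are assumptions, not confirmed by this page; `get_weather` and every helper name are hypothetical.

```python
import json

MODEL = "glm-4-32b-0414-128k"

# Hypothetical tool definition in the assumed OpenAI-style schema.
GET_WEATHER_TOOL = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string", "description": "City name"}},
            "required": ["city"],
        },
    },
}


def get_weather(city: str) -> str:
    """Stand-in implementation; a real tool would call a weather service."""
    return f"Sunny in {city}"


LOCAL_TOOLS = {"get_weather": get_weather}


def build_payload(question: str) -> dict:
    """Build a request body that advertises the tool to the model."""
    return {
        "model": MODEL,
        "messages": [{"role": "user", "content": question}],
        "tools": [GET_WEATHER_TOOL],
    }


def run_tool_call(tool_call: dict) -> str:
    """Run one tool call of the assumed shape {"function": {"name", "arguments"}}."""
    function = tool_call["function"]
    arguments = json.loads(function["arguments"])  # assumed to be a JSON string
    return LOCAL_TOOLS[function["name"]](**arguments)


# Hand-written example of what a returned tool call might look like.
example_call = {"function": {"name": "get_weather", "arguments": '{"city": "Paris"}'}}
print(run_tool_call(example_call))  # prints "Sunny in Paris"
```

Keeping the dispatcher as a plain name-to-function dictionary means the model can only trigger functions that were registered explicitly.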

Quick Start

The examples below use cURL. The official Python SDK, the official Java SDK, and the OpenAI Python SDK are also supported.

Basic Call

curl -X POST "https://api.z.ai/api/paas/v4/chat/completions" \
     -H "Authorization: Bearer your-api-key" \
     -H "Content-Type: application/json" \
     -d '{
       "model": "glm-4-32b-0414-128k",
       "messages": [
         {
           "role": "user",
           "content": "As a marketing expert, please create an attractive slogan for my product."
         }
       ]
     }'
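
For readers who prefer Python without installing an SDK, the same basic call can be sketched with the standard library alone. The endpoint, headers, and body mirror the cURL example above; the environment variable name `ZAI_API_KEY` and the helper names are assumptions made for illustration.

```python
import json
import os
import urllib.request

API_URL = "https://api.z.ai/api/paas/v4/chat/completions"
MODEL = "glm-4-32b-0414-128k"


def build_chat_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) the POST request shown in the cURL example."""
    body = {"model": MODEL, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 60.0) -> dict:
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # ZAI_API_KEY is a hypothetical variable name; nothing is sent without it.
    key = os.environ.get("ZAI_API_KEY")
    if key:
        prompt = "As a marketing expert, please create an attractive slogan for my product."
        print(json.dumps(send(build_chat_request(key, prompt)), indent=2))
```

Separating request construction from sending keeps the payload easy to inspect and unit-test without network access.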

Streaming Call

curl -X POST "https://api.z.ai/api/paas/v4/chat/completions" \
     -H "Authorization: Bearer your-api-key" \
     -H "Content-Type: application/json" \
     -d '{
       "model": "glm-4-32b-0414-128k",
       "messages": [
         {
           "role": "user",
           "content": "As a marketing expert, please create an attractive slogan for my product."
         }
       ],
       "stream": true
     }'
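
With `"stream": true`, the response arrives incrementally rather than as one JSON document. The sketch below shows one way a client might reassemble the text, assuming the stream uses server-sent events with `data: {...}` lines, a final `data: [DONE]` marker, and chunks shaped like `choices[0].delta.content`. That format is the common OpenAI-style convention and is not confirmed by this page; `iter_stream_text` is a hypothetical helper.

```python
import json
from typing import Iterable, Iterator


def iter_stream_text(lines: Iterable[str]) -> Iterator[str]:
    """Yield text fragments from SSE lines in the assumed chunk format."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separators and non-data fields
        data = line[len("data:"):].strip()
        if data == "[DONE]":
            break  # assumed end-of-stream marker
        chunk = json.loads(data)
        for choice in chunk.get("choices", []):
            text = choice.get("delta", {}).get("content")
            if text:
                yield text


# Hand-written sample lines standing in for a real streamed response.
sample = [
    'data: {"choices": [{"delta": {"content": "Hello"}}]}',
    "",
    'data: {"choices": [{"delta": {"content": ", world"}}]}',
    "",
    "data: [DONE]",
]
print("".join(iter_stream_text(sample)))  # prints "Hello, world"
```

Because the parser accepts any iterable of lines, it can be fed from a decoded HTTP response body or from recorded fixtures in tests.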
