Gpt Load Download

Enterprise-grade Go-based AI API transparent proxy with intelligent key rotation, load balancing, distributed cluster support, and management UI for OpenAI, Gemini, and Claude.

⭐ 5,640 stars on GitHub
Latest Release: v1.4.1

About Software

GPT-Load is a high-performance Go-based AI API transparent proxy service that preserves native API formats (OpenAI, Google Gemini, Anthropic Claude). It features intelligent key management with group-based pools, automatic rotation, weighted load balancing, and smart failure handling with blacklist mechanisms.

Built for production environments, it supports dynamic hot-reload configuration, distributed leader-follower deployment, dual authentication (management and proxy with global/group-level keys), comprehensive monitoring, and graceful shutdown. Available via Docker, Docker Compose, or source build with support for MySQL, PostgreSQL, SQLite, and optional Redis caching.

Use Cases:

  • Enterprise-grade AI API transparent proxy for OpenAI, Gemini, and Claude services
  • Intelligent key pool management with automatic rotation, failure recovery, and blacklist handling
  • Load balancing across multiple upstream endpoints with weighted distribution
  • Distributed leader-follower cluster deployment with horizontal scaling support
  • Vue 3 management interface with real-time monitoring, health checks, and request logging

Downloads

v1.4.1 November 23, 2025
gpt-load-windows-amd64.exeexe
v1.4.0 November 09, 2025
gpt-load-windows-amd64.exeexe
v1.4.0-beta.2 November 08, 2025
gpt-load-windows-amd64.exeexe
v1.4.0-beta.1 November 08, 2025
gpt-load-windows-amd64.exeexe
v1.3.2 October 19, 2025
gpt-load-windows-amd64.exeexe
v1.3.1 October 13, 2025
gpt-load-windows-amd64.exeexe
v1.3.0 October 08, 2025
gpt-load-windows-amd64.exeexe
v1.3.0-beta.6 October 06, 2025
gpt-load-windows-amd64.exeexe
v1.3.0-beta.5 October 04, 2025
gpt-load-windows-amd64.exeexe
v1.3.0-beta.4 October 01, 2025
gpt-load-windows-amd64.exeexe
v1.3.0-beta.3 October 01, 2025
gpt-load-windows-amd64.exeexe
v1.3.0-beta.2 September 30, 2025
gpt-load-windows-amd64.exeexe
v1.3.0-beta.1 September 30, 2025
gpt-load-windows-amd64.exeexe
v1.2.1 September 21, 2025
gpt-load-windows-amd64.exeexe
v1.2.0 September 14, 2025
gpt-load-windows-amd64.exeexe
v1.1.0 September 07, 2025
gpt-load-windows-amd64.exeexe
v1.0.22.1 August 24, 2025
gpt-load-windows-amd64.exeexe
v1.0.22 August 24, 2025
gpt-load-windows-amd64.exeexe
v1.0.21 August 17, 2025
gpt-load-windows-amd64.exeexe
v1.0.20 August 10, 2025
gpt-load-windows-amd64.exeexe
v1.0.19 August 04, 2025
gpt-load-windows-amd64.exeexe
v1.0.18 August 01, 2025
gpt-load.exeexe
v1.0.17 July 26, 2025
gpt-load.exeexe
v1.0.16 July 25, 2025
gpt-load.exeexe
v1.0.15 July 23, 2025
gpt-load.exeexe
v1.0.14 July 19, 2025
gpt-load.exeexe

Package Info

Last Updated
Nov 23, 2025
Latest Version
v1.4.1
License
MIT
Total Versions
26

README

GPT-Load

English | 中文 | 日本語

Release (https://img.shields.io/github/v/release/tbphp/gpt-load) !Go Version (https://img.shields.io/badge/Go-1.23+-blue.svg) License (https://img.shields.io/badge/license-MIT-green.svg)

A high-performance, enterprise-grade AI API transparent proxy service designed specifically for enterprises and developers who need to integrate multiple AI services. Built with Go, featuring intelligent key management, load balancing, and comprehensive monitoring capabilities, designed for high-concurrency production environments.

For detailed documentation, please visit Official Documentation (https://www.gpt-load.com/docs?lang=en)

Features

  • Transparent Proxy: Complete preservation of native API formats, supporting OpenAI, Google Gemini, and Anthropic Claude among other formats
  • Intelligent Key Management: High-performance key pool with group-based management, automatic rotation, and failure recovery
  • Load Balancing: Weighted load balancing across multiple upstream endpoints to enhance service availability
  • Smart Failure Handling: Automatic key blacklist management and recovery mechanisms to ensure service continuity
  • Dynamic Configuration: System settings and group configurations support hot-reload without requiring restarts
  • Enterprise Architecture: Distributed leader-follower deployment supporting horizontal scaling and high availability
  • Modern Management: Vue 3-based web management interface that is intuitive and user-friendly
  • Comprehensive Monitoring: Real-time statistics, health checks, and detailed request logging
  • High-Performance Design: Zero-copy streaming, connection pool reuse, and atomic operations
  • Production Ready: Graceful shutdown, error recovery, and comprehensive security mechanisms
  • Dual Authentication: Separate authentication for management and proxy, with proxy authentication supporting global and group-level keys

Supported AI Services

GPT-Load serves as a transparent proxy service, completely preserving the native API formats of various AI service providers:

  • OpenAI Format: Official OpenAI API, Azure OpenAI, and other OpenAI-compatible services
  • Google Gemini Format: Native APIs for Gemini Pro, Gemini Pro Vision, and other models
  • Anthropic Claude Format: Claude series models, supporting high-quality conversations and text generation

Quick Start

System Requirements

  • Go 1.23+ (for source builds)
  • Docker (for containerized deployment)
  • MySQL, PostgreSQL, or SQLite (for database storage)
  • Redis (for caching and distributed coordination, optional)

Method 1: Docker Quick Start

docker run -d --name gpt-load \
    -p 3001:3001 \
    -e AUTH_KEY=your-secure-key-here \
    -v "$(pwd)/data":/app/data \
    ghcr.io/tbphp/gpt-load:latest

Please change your-secure-key-here to a strong password (never use the default value), then you can log in to the management interface:

Method 2: Using Docker Compose (Recommended)

Installation Commands:

See full README on repository.