atopos31/llmio 有哪些主要特性？

多提供商支持：OpenAI、Claude、Gemini、DeepSeek 等; 智能负载均衡与自动故障转移; 速率限制与请求排队; 轻量级，易于部署; 基于 TypeScript，类型安全

atopos31/llmio 有哪些使用场景？

跨多个 LLM 提供商分发 API 调用; 构建具有自动故障转移能力的弹性 AI 应用; 在生产环境中管理 API 速率限制和成本; 为团队集中管理 AI 网关配置

atopos31/llmio 使用什么编程语言？

atopos31/llmio 主要使用 TypeScript 编写。

如何安装 atopos31/llmio？

运行命令：openclaw install atopos31/llmio

Claude Skill

atopos31/llmio

llmio 是一个基于 TypeScript 的轻量级 LLM API 负载均衡网关，支持 OpenAI、Claude、Gemini、DeepSeek 等。具备智能路由、故障转移和速率限制功能。

语言

概览

Stars311

Forks35

语言TypeScript

最后更新2026-05-29

最近同步2026-06-17

前往 GitHub

仓库信息

拥有者atopos31

仓库llmio

完整名称atopos31/llmio

Repo ID1,032,927,082

GitHub 地址https://github.com/atopos31/llmio

安装这个 Skill

docker run -d \

GitHub

Registry 信息

类型openclaw_skill

质量分80/100

验证状态readme_parsed

最近验证2026-06-15

平台

ClaudeOpenClawCodex

能力

searchimageterminalaiai-gatewayclaudeclaude-aiclaude-codecodexdeepseek

识别文件

README.mddocker-compose.ymldocs

配置键

TOKENYOUR_TOKENOPENAI_API_KEYANTHROPIC_API_KEYGEMINI_API_KEY

安装方式

docker run -d \
git clone https://github.com/atopos31/llmio.git

项目简介

llmio 是一个轻量级、高性能的 LLM API 负载均衡网关，使用 TypeScript 编写。它支持 OpenAI、Claude、Gemini、DeepSeek 等多个 AI 提供商，提供智能请求路由、故障转移和速率限制功能，适用于生产级 AI 应用。

英文描述

Unified LLM gateway with weighted load balancing, observability & cost tracking. 统一的 LLM 网关，提供权重负载均衡、可观测性与费用追踪。

要点

多提供商支持：OpenAI、Claude、Gemini、DeepSeek 等
智能负载均衡与自动故障转移
速率限制与请求排队
轻量级，易于部署
基于 TypeScript，类型安全

使用场景

跨多个 LLM 提供商分发 API 调用
构建具有自动故障转移能力的弹性 AI 应用
在生产环境中管理 API 速率限制和成本
为团队集中管理 AI 网关配置

README 摘要

# LLMIO English | [中文](README_cn.md) LLMIO is a Go-based LLM load‑balancing gateway that provides a unified REST API, weighted scheduling, observability, and a modern admin UI for LLM clients (openclaw / claude code / codex / gemini cli / cherry studio / open webui). It helps you integrate OpenAI, Anthropic, Gemini, and other model capabilities in a single service. **QQ group: 1083599685** ## Architecture ![LLMIO Architecture](./docs/llmio.svg) ## Features - **Unified API**: Compatible with OpenAI Chat Completions, OpenAI Responses, Gemini Native, and Anthropic Messages. Supports both streaming and non‑streaming passthrough. - **Weighted scheduling**: `balancers/` provides two strategies (random by weight / priority by weight). You can route based on tool calling, structured output, and multimodal capability. - **Admin Web UI**: React + TypeScript + Tailwind + Vite console for providers, models, associations, logs, and metrics. - **Rate limiting & failure handling**: Built‑in rate‑limit fallback and provider connectivity checks for fault isolation. - **Local persistence**: Pure Go SQLite (`db/llmio.db`) for config and request logs, ready to use out of the box. - **Session tracking**: Pass `session_id` in any request body (works with `extra_body` in OpenAI SDK) to tag logs with a session identifier. Filter and search by `session_id` in the admin UI or via `GET /api/logs?session_id=`. - **Observability**: Every request is recorded with TraceID, latency breakdown (proxy / first-chunk / completion time), TPS, token usage (input / cached / output), and optional full IO logging. Per-request cost is calculated from configurable per-million-token prices (CNY / USD) and shown in the log detail view alongside provider and model metadata. ## Deployment ### Docker Compose (

话题

ai ai-gateway claude claude-ai claude-code codex deepseek doubao gemini gemini-cli go golang kimi newapi openai openclaw qwen transformer

atopos31/llmio

概览

仓库信息

安装这个 Skill

Registry 信息

项目简介

要点

使用场景

README 摘要

话题

探索更多

相关技能

zhayujie/CowAgent

iOfficeAI/AionUi

NousResearch/hermes-agent

HKUDS/nanobot

Sylinko/Everywhere

BlockRunAI/ClawRouter