new-api

mirror of https://github.com/QuantumNous/new-api.git synced 2026-03-30 10:12:30 +00:00

Author	SHA1	Message	Date
Seefs	540cf6c991	fix: channel affinity (#2799 ) * fix: channel affinity log styles * fix: Issue with incorrect data storage when switching key sources * feat: support not retrying after a single rule configuration fails * fix: render channel affinity tooltip as multiline content * feat: channel affinity cache hit * fix: prevent ChannelAffinityUsageCacheModal infinite loading and hide data before fetch * chore: format backend with gofmt and frontend with prettier/eslint autofix	2026-02-02 14:37:31 +08:00
Seefs	478f1871d6	feat: grok Usage Guidelines Violation Fee (#2753 ) * feat: grok Usage Guidelines Violation Fee ui setting * feat: grok Usage Guidelines Violation Fee consume log * fix: grok Usage Guidelines Violation Fee log detail	2026-01-26 20:20:30 +08:00
Seefs	cc1da72d10	feat: openai response /v1/response/compact (#2644 ) * feat: openai response /v1/response/compact * feat: /v1/response/compact bill * feat: /v1/response/compact * feat: /v1/responses/compact -> codex channel * feat: /v1/responses/compact -> codex channel * feat: /v1/responses/compact -> codex channel * feat: codex channel default models * feat: compact model price * feat: /v1/responses/comapct test	2026-01-26 20:20:16 +08:00
Seefs	d7d3a2f763	feat: channel affinity (#2669 ) * feat: channel affinity * feat: channel affinity -> model setting * fix: channel affinity * feat: channel affinity op * feat: channel_type setting * feat: clean * feat: cache supports both memory and Redis. * feat: Optimise ui/ux * feat: Optimise ui/ux * feat: Optimise codex usage ui/ux * feat: Optimise ui/ux * feat: Optimise ui/ux * feat: Optimise ui/ux * feat: If the affinitized channel fails and a retry succeeds on another channel, update the affinity to the successful channel	2026-01-26 19:57:41 +08:00
Seefs	4bffc249d6	feat: customizable automatic retry status codes	2026-01-14 14:34:12 +08:00
Seefs	41da848c56	Merge pull request #2647 from seefs001/feature/status-code-auto-disable feat: status code auto-disable configuration	2026-01-12 18:47:45 +08:00
t0ng7u	8cb56fc319	🧹 fix: harden request-body size handling and error unwrapping Tighten oversized request handling across relay paths and make error matching reliable. - Align `MAX_REQUEST_BODY_MB` fallback to `32` in request body reader and decompression middleware - Stop ignoring `GetRequestBody` errors in relay retry paths; return consistent 413 on oversized bodies (400 for other read errors) - Add `Unwrap()` to `types.NewAPIError` so `errors.Is/As` can match wrapped underlying errors - `go test ./...` passes	2025-12-16 18:10:00 +08:00
t0ng7u	8e3f9b1faa	🛡️ fix: prevent OOM on large/decompressed requests; skip heavy prompt meta when token count is disabled Clamp request body size (including post-decompression) to avoid memory exhaustion caused by huge payloads/zip bombs, especially with large-context Claude requests. Add a configurable `MAX_REQUEST_BODY_MB` (default `32`) and document it. - Enforce max request body size after gzip/br decompression via `http.MaxBytesReader` - Add a secondary size guard in `common.GetRequestBody` and cache-safe handling - Return 413 Request Entity Too Large on oversized bodies in relay entry - Avoid building large `TokenCountMeta.CombineText` when both token counting and sensitive check are disabled (use lightweight meta for pricing) - Update READMEs (CN/EN/FR/JA) with `MAX_REQUEST_BODY_MB` - Fix a handful of vet/formatting issues encountered during the change - `go test ./...` passes	2025-12-16 17:00:19 +08:00
CaIon	c51936e068	refactor(channel_select): enhance retry logic and context key usage for channel selection	2025-12-13 16:43:38 +08:00
CaIon	ce6fb95f96	refactor(relay): update channel retrieval to use RelayInfo structure	2025-12-12 22:04:38 +08:00
Seefs	ca29fc5702	Merge pull request #2194 from NoahCodeGG/fix/process_channel_error	2025-12-11 18:12:06 +08:00
CaIon	f5b409d74f	feat: refactor token estimation logic - Introduced new OpenAI text models in `common/model.go`. - Added `IsOpenAITextModel` function to check for OpenAI text models. - Refactored token estimation methods across various channels to use estimated prompt tokens instead of direct prompt token counts. - Updated related functions and structures to accommodate the new token estimation approach, enhancing overall token management.	2025-12-02 21:34:39 +08:00
NoahCode	138810f19c	fix(channel): update channel identification logic in error processing	2025-11-08 20:33:14 +08:00
CaIon	c4ca9d7c3b	refactor(relay): enhance error logging and improve multipart form handling in audio requests #2127	2025-10-29 23:33:55 +08:00
CaIon	1ded19795a	feat: add special user usable group setting	2025-10-28 23:25:43 +08:00
CaIon	b3f50e9dd0	fix: remove redundant error message details for channel retrieval failures	2025-10-14 13:53:33 +08:00
Seefs	86c63ea4a7	feat: endpoint type log	2025-10-13 22:44:54 +08:00
Seefs	2624c48113	feat: endpoint type log	2025-10-13 22:25:39 +08:00
CaIon	ede47ef014	feat: support free model setting	2025-10-12 13:31:03 +08:00
Seefs	e1c7a4f41f	format: package name -> github.com/QuantumNous/new-api (#2017 )	2025-10-11 15:30:09 +08:00
Xyfacai	b25ac0bfb6	fix: 预扣额度使用 relay info 传递	2025-09-11 16:04:32 +08:00
Xyfacai	cda73a2ec5	fix: dalle log 显示张数 N	2025-09-10 19:53:32 +08:00
iszcz	289ed24899	task_relay_info	2025-08-25 18:01:10 +08:00
funnycups	e3473e3c39	fix: prompt calculation User will correctly get estimated prompt usage when upstream returns either zero or nothing.	2025-08-16 22:54:00 +08:00
CaIon	f8ca8d7cea	fix: refactor processChannelError to use goroutine for asynchronous handling	2025-08-16 15:15:19 +08:00
CaIon	f6a48434c1	feat: initialize channel metadata in mjproxy and relay processing	2025-08-15 19:14:29 +08:00
CaIon	28bd31a30b	feat: initialize channel metadata in relay processing	2025-08-15 19:00:16 +08:00
CaIon	491013e27a	refactor: comment out SetContextKey to prevent token count meta setting	2025-08-15 18:43:08 +08:00
CaIon	0bb43aa464	refactor: update function signatures to include context and improve file handling #1599	2025-08-15 18:40:54 +08:00
CaIon	5fe1ce89ec	refactor: improve request type validation and enhance sensitive information masking	2025-08-15 13:20:36 +08:00
CaIon	7f1f368065	refactor: improve channel base URL handling and enhance RelayInfo logging	2025-08-14 22:15:18 +08:00
CaIon	6748b006b7	refactor: centralize logging and update resource initialization This commit refactors the logging mechanism across the application by replacing direct logger calls with a centralized logging approach using the `common` package. Key changes include: - Replaced instances of `logger.SysLog` and `logger.FatalLog` with `common.SysLog` and `common.FatalLog` for consistent logging practices. - Updated resource initialization error handling to utilize the new logging structure, enhancing maintainability and readability. - Minor adjustments to improve code clarity and organization throughout various modules. This change aims to streamline logging and improve the overall architecture of the codebase.	2025-08-14 21:10:04 +08:00
CaIon	e2037ad756	refactor: Introduce pre-consume quota and unify relay handlers This commit introduces a major architectural refactoring to improve quota management, centralize logging, and streamline the relay handling logic. Key changes: - Pre-consume Quota: Implements a new mechanism to check and reserve user quota before making the request to the upstream provider. This ensures more accurate quota deduction and prevents users from exceeding their limits due to concurrent requests. - Unified Relay Handlers: Refactors the relay logic to use generic handlers (e.g., `ChatHandler`, `ImageHandler`) instead of provider-specific implementations. This significantly reduces code duplication and simplifies adding new channels. - Centralized Logger: A new dedicated `logger` package is introduced, and all system logging calls are migrated to use it, moving this responsibility out of the `common` package. - Code Reorganization: DTOs are generalized (e.g., `dalle.go` -> `openai_image.go`) and utility code is moved to more appropriate packages (e.g., `common/http.go` -> `service/http.go`) for better code structure.	2025-08-14 20:05:06 +08:00
CaIon	fa2edd9d3f	fix(relay): remove unnecessary channel type check for BadRequest	2025-08-12 16:12:47 +08:00
RedwindA	b70d2655ed	feat: support native Gemini Embedding	2025-08-09 00:27:33 +08:00
CaIon	6abbd036f8	feat: add recordErrorLog option to NewAPIError for conditional error logging	2025-08-02 11:07:50 +08:00
CaIon	953f1bdc3c	feat: add admin info to error logging with multi-key support	2025-08-01 18:19:28 +08:00
Xyfacai	1f5ef24ecd	feat: 显式指定 error 跳过重试	2025-07-30 22:35:31 +08:00
CaIon	f7b284ad73	feat: 错误内容脱敏	2025-07-30 19:08:35 +08:00
CaIon	ae0461692c	feat: support ollama claude format	2025-07-23 20:01:03 +08:00
CaIon	756a8c50d6	fix: improve error messages for channel retrieval failures in distributor and relay	2025-07-23 16:32:52 +08:00
feitianbubu	7bc9192f3f	chore: opt video channel and platform	2025-07-22 20:14:24 +08:00
CaIon	cd8c23c0ab	✨ feat(channel): enhance channel status management	2025-07-10 17:49:53 +08:00
CaIon	98952198bb	refactor: Introduce standardized API error This commit refactors the application's error handling mechanism by introducing a new standardized error type, `types.NewAPIError`. It also renames common JSON utility functions for better clarity. Previously, internal error handling was tightly coupled to the `dto.OpenAIError` format. This change decouples the internal logic from the external API representation. Key changes: - A new `types.NewAPIError` struct is introduced to serve as a canonical internal representation for all API errors. - All relay adapters (OpenAI, Claude, Gemini, etc.) are updated to return `*types.NewAPIError`. - Controllers now convert the internal `NewAPIError` to the client-facing `OpenAIError` format at the API boundary, ensuring backward compatibility. - Channel auto-disable/enable logic is updated to use the new standardized error type. - JSON utility functions are renamed to align with Go's standard library conventions (e.g., `UnmarshalJson` -> `Unmarshal`, `EncodeJson` -> `Marshal`).	2025-07-10 15:02:40 +08:00
CaIon	f0f277dc2a	🔧 refactor(auth, channel, context): improve context setup and validation for multi-key channels	2025-07-06 12:37:56 +08:00
CaIon	7e298f8ad1	✨ feat: refactor environment variable initialization and introduce new constant types for API and context keys	2025-07-03 13:10:25 +08:00
skynono	8a79de333a	feat: add video channel kling	2025-06-19 11:53:42 +08:00
creamlike1024	7fa21ce95f	feat: auto分组	2025-06-16 22:15:12 +08:00
creamlike1024	738a9a4558	gemini text generation	2025-05-26 13:34:41 +08:00
creamlike1024	425feb88d8	feat: support /v1/responses API	2025-05-02 13:59:46 +08:00

1 2 3 4 5

220 Commits