new-api

mirror of https://github.com/QuantumNous/new-api.git synced 2026-04-19 13:38:39 +00:00

Author	SHA1	Message	Date
RedwindA	c2464fc877	fix(gemini): fetch model list via native v1beta/models endpoint Use the native Gemini Models API (/v1beta/models) instead of the OpenAI-compatible path when listing models for Gemini channels, improving compatibility with third-party Gemini-format providers that don't implement OpenAI routes. - Add paginated model listing with timeout and optional proxy support - Select an enabled key for multi-key Gemini channels	2026-01-09 18:00:40 +08:00
Seefs	5f37a1e97c	fix: use the default HTTP client configuration when proxyURL is empty, and apply the relay timeout on the AWS calling side	2026-01-05 17:56:24 +08:00
Calcium-Ion	177553af37	Merge pull request #2578 from xyfacai/fix/gemini-mimetype fix: gemini file types did not support image/jpg	2026-01-04 22:19:16 +08:00
Xyfacai	5ed4583c0c	fix: gemini file types did not support image/jpg	2026-01-04 22:09:03 +08:00
Seefs	1519e97bc6	Merge pull request #2550 from shikaiwei1/patch-2	2026-01-04 18:11:46 +08:00
Seefs	67ba913b44	feat: add support for Doubao /v1/responses (#2567) * feat: add support for Doubao /v1/responses	2026-01-03 12:35:35 +08:00
Seefs	43c671b8b3	Merge pull request #2393 from prnake/fix-claude-haiku	2026-01-03 09:36:42 +08:00
Seefs	3510b3e6fc	Merge pull request #2532 from feitianbubu/pr/620211e02bd55545f0fa4568f3d55c3b4d7f3305	2026-01-03 09:36:17 +08:00
CaIon	23a68137ad	feat(adaptor): update resolution handling for wan2.6 model	2025-12-31 00:44:06 +08:00
CaIon	2a5b2add9a	refactor(image): remove unnecessary logging in oaiImage2Ali function	2025-12-31 00:23:19 +08:00
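John Chen	ab81d6e444	fix: the Zhipu and Moonshot channels could not obtain cachePrompt statistics when stream=true. Root cause: 1. In the OaiStreamHandler streaming handler, applyUsagePostProcessing(info, usage, nil) was called with a nil responseBody, so cached tokens could not be extracted from the response body. 2. The two channels report cached_tokens in different places: - Zhipu: standard location usage.prompt_tokens_details.cached_tokens - Moonshot: non-standard location choices[].usage.cached_tokens Fix: 1. Pass the body to applyUsagePostProcessing 2. Split the Zhipu and Moonshot parsing, and add a dedicated parsing method for Moonshot.	2025-12-30 17:38:32 +08:00
CaIon	48d358faec	feat(adaptor): add adaptors for several Bailian image generation models - wan2.6 series image generation and editing, with billing for multi-image generation - wan2.5 series image generation and editing - z-image-turbo image generation, with prompt_extend billing	2025-12-29 23:00:17 +08:00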
Seefs	8063897998	fix: glm 4.7 finish reason (#2545)	2025-12-29 19:41:15 +08:00
Seefs	24d359cf40	feat: Add "wan2.6-i2v" video ratio configuration to Ali adaptor.	2025-12-29 14:13:33 +08:00
Seefs	725d61c5d3	feat: ionet integrate (#2105) * wip ionet integrate * wip ionet integrate * wip ionet integrate * ollama wip * wip * feat: ionet integration & ollama manage * fix merge conflict * wip * fix: test conn cors * wip * fix ionet * fix ionet * wip * fix model select * refactor: Remove `pkg/ionet` test files and update related Go source and web UI model deployment components. * feat: Enhance model deployment UI with styling improvements, updated text, and a new description component. * Revert "feat: Enhance model deployment UI with styling improvements, updated text, and a new description component." This reverts commit 8b75cb5bf0d1a534b339df8c033be9a6c7df7964.	2025-12-28 15:55:35 +08:00
Your Name	b6a25d9f0f	feat(gemini): support tool_choice parameter conversion and improve error handling	2025-12-27 18:33:09 +08:00
RedwindA	1de78f8749	feat: map OpenAI developer role to Gemini system instructions	2025-12-27 02:52:33 +08:00
feitianbubu	37a1882798	fix: kling correct fail reason	2025-12-26 16:35:46 +08:00
papersnake	2c2dfea60f	Merge branch 'QuantumNous:main' into fix-claude-haiku	2025-12-26 16:23:34 +08:00
Calcium-Ion	654bb10b45	Merge pull request #2460 from seefs001/feature/gemini-flash-minial fix(gemini): handle minimal reasoning effort budget	2025-12-26 13:57:56 +08:00
Seefs	a0c3d37d66	Merge pull request #2493 from shikaiwei1/patch-1	2025-12-24 16:52:24 +08:00
feitianbubu	3652dfdbd5	fix: check claudeResponse delta StopReason nil point	2025-12-24 11:54:23 +08:00
John Chen	dbaba87c39	Add cached-token reading logic for Moonshot. It is the same as the Zhipu V4 logic, so the two share it.	2025-12-22 17:05:16 +08:00
Seefs	28f7a4feef	fix: filter content[].part[].functionResponse.id in the Vertex adapter	2025-12-21 17:22:04 +08:00
Seefs	da24a165d0	fix(gemini): handle minimal reasoning effort budget - Add minimal case to clampThinkingBudgetByEffort to avoid defaulting to full thinking budget	2025-12-18 08:10:46 +08:00
t0ng7u	8e3f9b1faa	🛡️ fix: prevent OOM on large/decompressed requests; skip heavy prompt meta when token count is disabled Clamp request body size (including post-decompression) to avoid memory exhaustion caused by huge payloads/zip bombs, especially with large-context Claude requests. Add a configurable `MAX_REQUEST_BODY_MB` (default `32`) and document it. - Enforce max request body size after gzip/br decompression via `http.MaxBytesReader` - Add a secondary size guard in `common.GetRequestBody` and cache-safe handling - Return 413 Request Entity Too Large on oversized bodies in relay entry - Avoid building large `TokenCountMeta.CombineText` when both token counting and sensitive check are disabled (use lightweight meta for pricing) - Update READMEs (CN/EN/FR/JA) with `MAX_REQUEST_BODY_MB` - Fix a handful of vet/formatting issues encountered during the change - `go test ./...` passes	2025-12-16 17:00:19 +08:00
CaIon	7cae4a640b	fix(audio): correct TotalTokens calculation for accurate usage reporting	2025-12-13 17:49:57 +08:00
CaIon	e36e2e1b69	feat(audio): enhance audio request handling with token type detection and streaming support	2025-12-13 17:24:23 +08:00
CaIon	21fca238bf	refactor(error): replace dto.OpenAIError with types.OpenAIError for consistency	2025-12-13 16:43:57 +08:00
CaIon	b58fa3debc	fix(helper): improve error handling in FlushWriter and related functions	2025-12-13 13:29:21 +08:00
Calcium-Ion	30cb224793	Merge pull request #2429 from QuantumNous/feat/xhigh feat(adaptor): add '-xhigh' suffix to reasoning effort options	2025-12-12 22:06:19 +08:00
CaIon	50854c17bb	feat(adaptor): add '-xhigh' suffix to reasoning effort options for model parsing	2025-12-12 20:53:48 +08:00
Calcium-Ion	147659fb6e	Merge pull request #2426 from QuantumNous/feat/auto-cross-group-retry feat(token): add cross-group retry option for token processing	2025-12-12 20:45:54 +08:00
CaIon	01b4039e96	feat(token): add cross-group retry option for token processing	2025-12-12 17:59:21 +08:00
zdwy5	e1bee48152	fix: support calling AWS via global or channel parameter passthrough (#2423) * fix: support calling AWS via global or channel parameter passthrough * fix(aws): replace json.Unmarshal with common.Unmarshal for request body processing --------- Co-authored-by: r0 <liangchunlei@01.ai> Co-authored-by: CaIon <i@caion.me>	2025-12-12 17:09:27 +08:00
Seefs	4e69c98b42	Merge pull request #2412 from seefs001/pr-2372 feat: add openai video remix endpoint	2025-12-11 23:35:23 +08:00
Calcium-Ion	e346f0bf16	Merge pull request #2398 from seefs001/fix/video-proxy fix: Use channel proxy settings for task query scenarios	2025-12-09 14:05:30 +08:00
Calcium-Ion	9561c7b50f	Merge pull request #2356 from seefs001/feature/zhipiu_4v_image feat: zhipu 4v image generations	2025-12-09 14:00:20 +08:00
Seefs	5889571108	fix: Use channel proxy settings for task query scenarios	2025-12-09 11:15:27 +08:00
Seefs	72d2a94b0d	Merge pull request #2229 from HynoR/chore/v1 fix: Set default to unsupported value for gpt-5 model series requests	2025-12-08 20:59:30 +08:00
Seefs	5eae6a3874	Merge pull request #2375 from FlowerRealm/feat/add-claude-haiku-4-5 feat: add claude-haiku-4-5-20251001 model support	2025-12-08 20:46:02 +08:00
Papersnake	681b37d104	feat: support claude-haiku-4-5-20251001 on vertex	2025-12-08 17:28:36 +08:00
firstmelody	121746a79e	fix(adaptor): fix reasoning suffix not processing in vertex adapter	2025-12-08 01:12:29 +08:00
FlowerRealm	c3c119a9b4	feat: add claude-haiku-4-5-20251001 model support - Add model to Claude ModelList - Add model ratio (0.5, $1/1M input tokens) - Add completion ratio support (5x, $5/1M output tokens) - Add cache read ratio (0.1, $0.10/1M tokens) - Add cache write ratio (1.25, $1.25/1M tokens) Model specs: - Context window: 200K tokens - Max output: 64K tokens - Release date: October 1, 2025	2025-12-05 18:54:20 +08:00
Seefs	2e37347851	feat: zhipu v4 image generations	2025-12-02 22:56:58 +08:00
Calcium-Ion	ffc45a756e	Merge pull request #2344 from seefs001/feature/gemini-thinking-level feat: gemini 3 thinking level gemini-3-pro-preview-high	2025-12-02 21:55:43 +08:00
CaIon	f5b409d74f	feat: refactor token estimation logic - Introduced new OpenAI text models in `common/model.go`. - Added `IsOpenAITextModel` function to check for OpenAI text models. - Refactored token estimation methods across various channels to use estimated prompt tokens instead of direct prompt token counts. - Updated related functions and structures to accommodate the new token estimation approach, enhancing overall token management.	2025-12-02 21:34:39 +08:00
CaIon	4dbdbdec1d	feat(gemini): implement markdown image handling in text processing	2025-12-01 17:54:41 +08:00
Seefs	b6a02d8303	feat: gemini 3 thinking level gemini-3-pro-preview-high	2025-12-01 16:40:46 +08:00
CaIon	98f92f990a	feat(gemini): add validation and conversion for imageConfig parameters in extra_body	2025-11-30 19:31:08 +08:00

1062 Commits