new-api

mirror of https://github.com/QuantumNous/new-api.git synced 2026-03-30 04:03:18 +00:00

Author	SHA1	Message	Date
CaIon	ec5c6b28ea	feat(task): add model redirection, per-call billing, and multipart retry fix for async tasks 1. Async task model redirection (aligned with sync tasks): - Integrate ModelMappedHelper in RelayTaskSubmit after model name determination, populating OriginModelName / UpstreamModelName on RelayInfo. - All task adaptors now send UpstreamModelName to upstream providers: - Gemini & Vertex: BuildRequestURL uses UpstreamModelName. - Doubao & Ali: BuildRequestBody conditionally overwrites body.Model. - Vidu, Kling, Hailuo, Jimeng: convertToRequestPayload accepts RelayInfo and unconditionally uses info.UpstreamModelName. - Sora: BuildRequestBody parses JSON and multipart bodies to replace the "model" field with UpstreamModelName. - Frontend log visibility: LogTaskConsumption and taskBillingOther now emit is_model_mapped / upstream_model_name in the "other" JSON field. - Billing safety: RecalculateTaskQuotaByTokens reads model name from BillingContext.OriginModelName (via taskModelName) instead of task.Data["model"], preventing billing leaks from upstream model names. 2. Per-call billing (TaskPricePatches lifecycle): - Rename TaskBillingContext.ModelName → OriginModelName; add PerCallBilling bool field, populated from TaskPricePatches at submission time. - settleTaskBillingOnComplete short-circuits when PerCallBilling is true, skipping both adaptor adjustments and token-based recalculation. - Remove ModelName from TaskSubmitResult; use relayInfo.OriginModelName consistently in controller/relay.go for billing context and logging. 3. Multipart retry boundary mismatch fix: - Root cause: after Sora (or OpenAI audio) rebuilds a multipart body with a new boundary and overwrites c.Request.Header["Content-Type"], subsequent calls to ParseMultipartFormReusable on retry would parse the cached original body with the wrong boundary, causing "NextPart: EOF". - Fix: ParseMultipartFormReusable now caches the original Content-Type in gin context key "_original_multipart_ct" on first call and reuses it for all subsequent parses, making multipart parsing retry-safe globally. - Sora adaptor reverted to the standard pattern (direct header set/get), which is now safe thanks to the root fix. 4. Tests: - task_billing_test.go: update makeTask to use OriginModelName; add PerCallBilling settlement tests (skip adaptor adjust, skip token recalc); add non-per-call adaptor adjustment test with refund verification.	2026-02-22 16:33:00 +08:00
CaIon	5ec4633cb8	refactor(task): add CAS-guarded updates to prevent concurrent billing conflicts Replace all bare task.Update() (DB.Save) calls with UpdateWithStatus(), which adds a WHERE status = ? guard to prevent concurrent processes from overwriting each other's state transitions. Key changes: model/task.go: - Add taskSnapshot struct with Equal() method for change detection - Add Snapshot() method to capture pre-update state - Add UpdateWithStatus(fromStatus) using DB.Where().Save() for CAS semantics with full-struct save (no explicit field listing needed) model/midjourney.go: - Add UpdateWithStatus(fromStatus string) with same CAS pattern service/task_polling.go (updateVideoSingleTask): - Snapshot before processing upstream response; skip DB write if unchanged - Terminal transitions (SUCCESS/FAILURE) use UpdateWithStatus CAS: billing/refund only executes if this process wins the transition - Non-terminal updates also use UpdateWithStatus to prevent overwriting a concurrent terminal transition back to IN_PROGRESS - Defer settleTaskBillingOnComplete to after CAS check (shouldSettle flag) relay/relay_task.go (tryRealtimeFetch): - Add snapshot + change detection; use UpdateWithStatus for CAS safety controller/midjourney.go (UpdateMidjourneyTaskBulk): - Capture preStatus before mutations; use UpdateWithStatus CAS - Gate refund (IncreaseUserQuota) on CAS success (won && shouldReturnQuota) This prevents the multi-instance race condition where: 1. Instance A reads task (IN_PROGRESS), fetches upstream (still IN_PROGRESS) 2. Instance B reads same task, fetches upstream (now SUCCESS), writes SUCCESS 3. Instance A's bare Save() overwrites SUCCESS back to IN_PROGRESS	2026-02-22 16:01:19 +08:00
CaIon	cda540180b	refactor(relay): improve channel locking and retry logic in RelayTask - Enhanced the RelayTask function to utilize a locked channel when available, allowing for better reuse during retries. - Updated error handling to ensure proper context setup for the selected channel. - Clarified comments in ResolveOriginTask regarding channel locking and retry behavior. - Introduced a new field in TaskRelayInfo to store the locked channel object, improving type safety and reducing import cycles.	2026-02-22 16:01:19 +08:00
CaIon	76892e8376	refactor(relay): enhance remix logic for billing context extraction - Updated the remix handling in ResolveOriginTask to prioritize extracting OtherRatios from the BillingContext of the original task if available. - Retained the previous logic for extracting seconds and size from task data as a fallback. - Improved clarity and maintainability of the remix logic by separating the new and old approaches.	2026-02-22 16:01:19 +08:00
CaIon	d6e11fd2e1	feat(task): add adaptor billing interface and async settlement framework Add three billing lifecycle methods to the TaskAdaptor interface: - EstimateBilling: compute OtherRatios from user request before pricing - AdjustBillingOnSubmit: adjust ratios from upstream submit response - AdjustBillingOnComplete: determine final quota at task terminal state Introduce BaseBilling as embeddable no-op default for adaptors without custom billing. Move Sora/Ali OtherRatios logic from shared validation into per-adaptor EstimateBilling implementations. Add TaskBillingContext to persist pricing params (model_price, group_ratio, other_ratios) in task private data for async polling settlement. Extract RecalculateTaskQuota as a general-purpose delta settlement function and unify polling billing via settleTaskBillingOnComplete (adaptor-first, then token-based fallback).	2026-02-22 16:00:27 +08:00
CaIon	9e3954428d	refactor(task): extract billing and polling logic from controller to service layer Restructure the task relay system for better separation of concerns: - Extract task billing into service/task_billing.go with unified settlement flow - Move task polling loop from controller to service/task_polling.go (supports Suno + video platforms) - Split RelayTask into fetch/submit paths with dedicated retry logic (taskSubmitWithRetry) - Add TaskDto, TaskResponse generics, and FetchReq to dto/task.go - Add taskcommon/helpers.go for shared task adaptor utilities - Remove controller/task_video.go (logic consolidated into service layer) - Update all task adaptors (ali, doubao, gemini, hailuo, jimeng, kling, sora, suno, vertex, vidu) - Simplify frontend task logs to use new TaskDto response format	2026-02-22 16:00:27 +08:00
feitianbubu	9c91b8fb18	feat: task pre consume modelPrice default use setting value	2026-01-24 15:32:06 +08:00
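郑伯涛	aed1900364	fix(task): fix Task Relay not recording logs or charging quota when the auto group is used Problem description: - When a token in the auto group calls Task endpoints such as /v1/videos, the task is created successfully, but no usage log entry is shown and no quota is charged Root cause: - After selecting a channel, the Distribute middleware stores the actually selected group in ContextKeyAutoGroup - However, RelayTaskSubmit does not read this value from the context to update info.UsingGroup - As a result, info.UsingGroup is always "auto" instead of the actually selected group (e.g. "sora2逆") - When the ratio configured for the auto group is 0, the computed quota is 0 - The logging condition "if quota != 0" is not met, so no log is recorded and no quota is charged Fix: - In RelayTaskSubmit, before computing the group ratio, add logic to read the actual group from ContextKeyAutoGroup - Use a safe type assertion to avoid a potential panic Scope of impact: - Only affects the Task Relay flow (/v1/videos, /suno, /kling, and similar endpoints) - Does not affect calls made with tokens bound to a specific group - Does not affect other Relay types (chat/completions and others already have similar handling)	2026-01-06 00:16:50 +08:00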
t0ng7u	8e3f9b1faa	🛡️ fix: prevent OOM on large/decompressed requests; skip heavy prompt meta when token count is disabled Clamp request body size (including post-decompression) to avoid memory exhaustion caused by huge payloads/zip bombs, especially with large-context Claude requests. Add a configurable `MAX_REQUEST_BODY_MB` (default `32`) and document it. - Enforce max request body size after gzip/br decompression via `http.MaxBytesReader` - Add a secondary size guard in `common.GetRequestBody` and cache-safe handling - Return 413 Request Entity Too Large on oversized bodies in relay entry - Avoid building large `TokenCountMeta.CombineText` when both token counting and sensitive check are disabled (use lightweight meta for pricing) - Update READMEs (CN/EN/FR/JA) with `MAX_REQUEST_BODY_MB` - Fix a handful of vet/formatting issues encountered during the change - `go test ./...` passes	2025-12-16 17:00:19 +08:00
Seefs	4e69c98b42	Merge pull request #2412 from seefs001/pr-2372 feat: add openai video remix endpoint	2025-12-11 23:35:23 +08:00
Seefs	5889571108	fix: Use channel proxy settings for task query scenarios	2025-12-09 11:15:27 +08:00
Seefs	9629c8a771	fix veo3 (#2140)	2025-10-31 15:29:17 +08:00
Sh1n3zZ	810641a264	feat: vertex veo sora-compatible video output	2025-10-25 02:00:35 +08:00
CaIon	43f2a8ac06	feat: add temporary TASK_PRICE_PATCH configuration to environment variables	2025-10-16 21:59:21 +08:00
CaIon	aa35d8db69	refactor: update ConvertToOpenAIVideo method to return byte array and improve error handling	2025-10-14 23:03:17 +08:00
Seefs	86c63ea4a7	feat: endpoint type log	2025-10-13 22:44:54 +08:00
Seefs	2624c48113	feat: endpoint type log	2025-10-13 22:25:39 +08:00
Seefs	e1c7a4f41f	format: package name -> github.com/QuantumNous/new-api (#2017)	2025-10-11 15:30:09 +08:00
CaIon	07b099006c	feat: add logging for model details and enhance action assignment in relay tasks	2025-10-11 11:56:44 +08:00
Calcium-Ion	eab768b4a0	Merge pull request #2006 from xyfacai/feat/sora-price feat: sora: add parameter validation and billing	2025-10-11 11:22:08 +08:00
feitianbubu	35422b316d	refactor: openAI video use OpenAIVideoConverter	2025-10-11 02:43:43 +08:00
Xyfacai	a54baf4998	feat: sora: add parameter validation and billing	2025-10-10 23:56:36 +08:00
feitianbubu	ff9f9fbbc9	feat: support openAI sdk retrieve videos	2025-10-10 18:59:52 +08:00
Seefs	6a34d365ec	Merge branch 'alpha' into feat-vertex-veo	2025-09-13 13:10:39 +08:00
creamlike1024	4b968d03a1	fix(relay): initialize TaskRelayInfo	2025-08-27 23:26:51 +08:00
Sh1n3zZ	81e29aaa3d	feat: vertex veo (#1450)	2025-08-27 18:06:47 +08:00
iszcz	289ed24899	task_relay_info	2025-08-25 18:01:10 +08:00
CaIon	6748b006b7	refactor: centralize logging and update resource initialization This commit refactors the logging mechanism across the application by replacing direct logger calls with a centralized logging approach using the `common` package. Key changes include: - Replaced instances of `logger.SysLog` and `logger.FatalLog` with `common.SysLog` and `common.FatalLog` for consistent logging practices. - Updated resource initialization error handling to utilize the new logging structure, enhancing maintainability and readability. - Minor adjustments to improve code clarity and organization throughout various modules. This change aims to streamline logging and improve the overall architecture of the codebase.	2025-08-14 21:10:04 +08:00
CaIon	e2037ad756	refactor: Introduce pre-consume quota and unify relay handlers This commit introduces a major architectural refactoring to improve quota management, centralize logging, and streamline the relay handling logic. Key changes: - Pre-consume Quota: Implements a new mechanism to check and reserve user quota before making the request to the upstream provider. This ensures more accurate quota deduction and prevents users from exceeding their limits due to concurrent requests. - Unified Relay Handlers: Refactors the relay logic to use generic handlers (e.g., `ChatHandler`, `ImageHandler`) instead of provider-specific implementations. This significantly reduces code duplication and simplifies adding new channels. - Centralized Logger: A new dedicated `logger` package is introduced, and all system logging calls are migrated to use it, moving this responsibility out of the `common` package. - Code Reorganization: DTOs are generalized (e.g., `dalle.go` -> `openai_image.go`) and utility code is moved to more appropriate packages (e.g., `common/http.go` -> `service/http.go`) for better code structure.	2025-08-14 20:05:06 +08:00
feitianbubu	28fdb8af37	feat: add jimeng video official api	2025-08-10 16:54:44 +08:00
feitianbubu	7bc9192f3f	chore: opt video channel and platform	2025-07-22 20:14:24 +08:00
feitianbubu	4369b18fbf	feat: priority use origin model name	2025-07-16 15:38:51 +08:00
Xiangyuan-liu	7b29f429ee	refactor: log params and channel params	2025-07-07 14:26:37 +08:00
skynono	05ea0dd54f	feat: add video channel jimeng	2025-06-27 17:08:20 +08:00
CaIon	fd040988a3	refactor: streamline price calculation in RelaySwapFace and RelayMidjourneySubmit functions	2025-06-22 17:52:48 +08:00
creamlike1024	39617bc8c6	task userGroupRatio	2025-06-22 15:52:25 +08:00
skynono	8a79de333a	feat: add video channel kling	2025-06-19 11:53:42 +08:00
Apple\Apple	a180d13182	🚚 Refactor(ratio_setting): refactor ratio management into standalone `ratio_setting` package Summary • Migrated all ratio-related sources into `setting/ratio_setting/` – `model_ratio.go` (renamed from model-ratio.go) – `cache_ratio.go` – `group_ratio.go` • Changed package name to `ratio_setting` and relocated initialization (`ratio_setting.InitRatioSettings()` in main). • Updated every import & call site: – Model / cache / completion / image ratio helpers – Group ratio helpers (`GetGroupRatio`, `ContainsGroupRatio`, `CheckGroupRatio`, etc.) – JSON-serialization & update helpers (`Ratio2JSONString`, `Update*RatioByJSONString`) • Adjusted controllers, middleware, relay helpers, services and models to reference the new package. • Removed obsolete `setting` / `operation_setting` imports; added missing `ratio_setting` imports. • Adopted idiomatic map iteration (`for key := range m`) where value is unused. • Ran static checks to ensure clean build. This commit centralises all ratio configuration (model, cache and group) in one cohesive module, simplifying future maintenance and improving code clarity.	2025-06-18 18:00:49 +08:00
1808837298@qq.com	4f194f4e6a	feat: Implement cache token ratio for more precise token pricing	2025-03-08 01:30:50 +08:00
1808837298@qq.com	7dbb6b017c	feat: Add self-use mode for model ratio and price configuration - Introduce `SelfUseModeEnabled` setting to allow flexible model ratio configuration - Update error handling to provide more informative messages when model ratios are not set - Modify pricing and relay logic to support self-use mode - Add UI toggle for enabling self-use mode in operation settings - Implement fallback mechanism for model ratios when self-use mode is enabled	2025-03-01 21:13:48 +08:00
1808837298@qq.com	069f2672c1	refactor: Enhance user context and quota management - Add new context keys for user-related information - Modify user cache and authentication middleware to populate context - Refactor quota and notification services to use context-based user data - Remove redundant database queries by leveraging context information - Update various components to use new context-based user retrieval methods	2025-02-25 20:56:16 +08:00
1808837298@qq.com	3da1344897	feat: Add user notification settings with quota warning and multiple notification methods - Implement user notification settings with email and webhook options - Add new user settings for quota warning threshold and notification preferences - Create backend API and database support for user notification configuration - Enhance frontend personal settings with notification configuration UI - Support custom notification email and webhook URL - Add service layer for sending user notifications	2025-02-18 14:54:21 +08:00
CalciumIon	ed435e5c8f	refactor: user cache logic	2024-12-29 16:50:26 +08:00
CalciumIon	4fc1fe318e	refactor: migrate group ratio and user usable groups logic to new setting package - Replaced references to common.GroupRatio and common.UserUsableGroups with corresponding functions from the new setting package across multiple controllers and services. - Introduced new setting functions for managing group ratios and user usable groups, enhancing code organization and maintainability. - Updated related functions to ensure consistent behavior with the new setting package integration.	2024-12-25 19:31:12 +08:00
CalciumIon	7180e6f114	feat: Enhance logging functionality with group support - Added a new 'group' parameter to various logging functions, including RecordConsumeLog, GetAllLogs, and GetUserLogs, to allow for more granular log tracking. - Updated the logs table component to display group information, improving the visibility of log data. - Refactored related functions to accommodate the new group parameter, ensuring consistent handling across the application. - Improved the initialization of the group column for PostgreSQL compatibility.	2024-12-24 14:48:11 +08:00
1808837298@qq.com	9a4ca1e210	feat: playground	2024-09-26 00:59:09 +08:00
Xiangyuan Liu	c993ab2746	feat: suno API support feat: debug suno feat: add suno documentation	2024-06-13 10:35:48 +08:00

47 Commits
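
The multi-instance race described in commit 5ec4633cb8 can be illustrated with a minimal sketch. It is not the repository's code: the commit says the real UpdateWithStatus uses DB.Where().Save(), whereas this example assumes a hypothetical in-memory Store whose mutex stands in for the database's atomic conditional write. The Task and Store types and the Put and Get helpers are invented for illustration; only the name UpdateWithStatus and the status values come from the commit message.

```go
package main

import (
	"fmt"
	"sync"
)

// Task is a hypothetical stand-in for a persisted task row.
type Task struct {
	ID     int
	Status string
}

// Store is a hypothetical in-memory table. The mutex plays the role of
// the database making a conditional UPDATE atomic.
type Store struct {
	mu    sync.Mutex
	tasks map[int]Task
}

// NewStore returns an empty store.
func NewStore() *Store { return &Store{tasks: make(map[int]Task)} }

// Put inserts or overwrites a row unconditionally (like a bare Save).
func (s *Store) Put(t Task) {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.tasks[t.ID] = t
}

// Get returns a copy of the row, as an instance would read it.
func (s *Store) Get(id int) (Task, bool) {
	s.mu.Lock()
	defer s.mu.Unlock()
	t, ok := s.tasks[id]
	return t, ok
}

// UpdateWithStatus writes next only if the stored status still equals
// fromStatus, mimicking "UPDATE ... WHERE id = ? AND status = ?".
// It reports whether this caller won the transition.
func (s *Store) UpdateWithStatus(next Task, fromStatus string) bool {
	s.mu.Lock()
	defer s.mu.Unlock()
	cur, ok := s.tasks[next.ID]
	if !ok || cur.Status != fromStatus {
		return false
	}
	s.tasks[next.ID] = next
	return true
}

func main() {
	store := NewStore()
	store.Put(Task{ID: 1, Status: "IN_PROGRESS"})

	// Instances A and B both read the task while it is IN_PROGRESS.
	a, _ := store.Get(1)
	b, _ := store.Get(1)

	// Instance B sees SUCCESS upstream and wins the transition.
	b.Status = "SUCCESS"
	wonB := store.UpdateWithStatus(b, "IN_PROGRESS")

	// Instance A still holds IN_PROGRESS; its stale write is rejected
	// because the stored status is no longer IN_PROGRESS.
	wonA := store.UpdateWithStatus(a, "IN_PROGRESS")

	final, _ := store.Get(1)
	fmt.Println(wonB, wonA, final.Status) // prints "true false SUCCESS"
}
```

In this sketch, billing or refund logic would run only for the caller that gets true back, which mirrors the commit's statement that settlement executes only if the process wins the transition.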