mirror of https://github.com/QuantumNous/new-api.git synced 2026-04-18 00:37:27 +00:00

t0ng7u 99fcc354e3 📊 feat: add comprehensive model monitoring dashboard

This commit introduces a complete model monitoring system that provides
real-time insights into model performance and usage statistics.

## ✨ Features Added

### Core Components
- **ModelMonitoringStats**: Four key metric cards displaying total models,
  active models, total requests, and average success rate
- **ModelMonitoringTable**: Interactive table with search, filtering, and
  pagination for detailed model data
- **useModelMonitoring**: Custom hook for data fetching and state management

### Dashboard Integration
- Added new "模型观测" (Model Monitoring) tab to main dashboard
- Integrated monitoring components with existing dashboard layout
- Maintained consistent UI/UX with shadcn/ui design system

### Data Processing
- Smart data aggregation from existing `/api/data/` endpoints
- Automatic calculation of success rates and average metrics
- Support for both admin and user-specific data views
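As a rough illustration of the aggregation described above, the sketch below merges usage rows per model and derives the four stat-card values. The row shape (`modelName`, `requests`, `successes`) and the function names are hypothetical; the actual `/api/data/` payload and the hook's internals are not shown in this commit message and may differ.

```typescript
// Hypothetical row shape; the real `/api/data/` payload may differ.
interface UsageRow {
  modelName: string;
  requests: number;
  successes: number;
}

interface ModelSummary {
  modelName: string;
  requests: number;
  successRate: number; // percentage, 0-100
}

interface DashboardStats {
  totalModels: number;
  activeModels: number;
  totalRequests: number;
  averageSuccessRate: number;
}

// Merge rows that belong to the same model and derive a success rate.
function aggregateByModel(rows: UsageRow[]): ModelSummary[] {
  const totals = new Map<string, { requests: number; successes: number }>();
  for (const row of rows) {
    const entry = totals.get(row.modelName) ?? { requests: 0, successes: 0 };
    entry.requests += row.requests;
    entry.successes += row.successes;
    totals.set(row.modelName, entry);
  }
  return Array.from(totals, ([modelName, t]) => ({
    modelName,
    requests: t.requests,
    // Guard against division by zero for models with no traffic.
    successRate: t.requests === 0 ? 0 : (t.successes / t.requests) * 100,
  }));
}

// Compute the four values shown on the stat cards.
// Assumption: "active" means at least one request, and the average
// success rate is taken over active models only.
function computeDashboardStats(models: ModelSummary[]): DashboardStats {
  const active = models.filter((m) => m.requests > 0);
  const totalRequests = models.reduce((sum, m) => sum + m.requests, 0);
  const averageSuccessRate =
    active.length === 0
      ? 0
      : active.reduce((sum, m) => sum + m.successRate, 0) / active.length;
  return {
    totalModels: models.length,
    activeModels: active.length,
    totalRequests,
    averageSuccessRate,
  };
}
```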

### Interactive Features
- Real-time search by model name
- Business group filtering
- Pagination with 10 items per page
- Color-coded success rate indicators (green >95%, yellow 90-95%, red <90%)
- Refresh capability for up-to-date data
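The interactive behaviors listed above can be sketched with three small helpers. This is an illustrative sketch, not the committed component code: the helper names are hypothetical, and treating exactly 95% and exactly 90% as yellow is an assumption drawn from the "90-95%" wording.

```typescript
type RateColor = "green" | "yellow" | "red";

// Thresholds from the feature list: green above 95%, red below 90%.
// Assumption: the boundary values 95 and 90 themselves count as yellow.
function successRateColor(ratePercent: number): RateColor {
  if (ratePercent > 95) return "green";
  if (ratePercent < 90) return "red";
  return "yellow";
}

// Case-insensitive substring match on the model name.
function filterByName<T extends { modelName: string }>(items: T[], query: string): T[] {
  const needle = query.trim().toLowerCase();
  if (needle === "") return items;
  return items.filter((item) => item.modelName.toLowerCase().includes(needle));
}

// Return one page of results; pages are 1-based, 10 items per page by default.
function paginate<T>(items: T[], page: number, pageSize = 10): T[] {
  const start = (Math.max(1, page) - 1) * pageSize;
  return items.slice(start, start + pageSize);
}
```

Filtering before paginating keeps page numbers meaningful for the filtered result set, for example `paginate(filterByName(models, "gpt"), 1)`.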

## 🔧 Technical Implementation

### Type Safety
- Added comprehensive TypeScript interfaces in `@/types/api.ts`
- Defined `ModelInfo`, `ModelMonitoringStats`, and related types

### Utility Functions
- Enhanced color utilities with `modelToColor()` for consistent model identification
- Improved formatters for quota, tokens, and percentage display
- Maintained existing utility function architecture

### Architecture
- Follows established patterns from dashboard components
- Reuses existing HTTP client and authentication utilities
- Consistent error handling and loading states

### Code Quality
- All components pass linter checks
- Proper import organization and formatting
- Responsive design for mobile compatibility

## 🎯 User Experience

### Visual Design
- Color-coded model indicators for quick identification
- Success rate visualization with icons and colors
- Clean table layout with proper spacing and typography
- Skeleton loading states for smooth UX

### Functionality
- Search models by name with instant filtering
- Filter by business groups for organized viewing
- Navigate through paginated results efficiently
- Refresh data manually when needed

## 📋 Files Modified

- `web/src/types/api.ts`: Added model monitoring type definitions
- `web/src/features/dashboard/hooks/use-model-monitoring.ts`: Core data hook
- `web/src/features/dashboard/components/model-monitoring-stats.tsx`: Stats cards
- `web/src/features/dashboard/components/model-monitoring-table.tsx`: Data table
- `web/src/features/dashboard/index.tsx`: Dashboard integration
- Various formatting and import organization improvements

This implementation provides a comprehensive solution for model monitoring
that aligns with the existing codebase architecture while delivering
powerful insights into model performance and usage patterns.

2025-09-26 02:41:46 +08:00

.github

2025-09-19 14:20:35 +08:00

bin

chore: add model parameter to the time_test script (#245)

2023-07-04 18:13:59 +08:00

common

fix: use u.Hostname() instead of u.Host to avoid ipv6 host parse failed

2025-09-17 23:47:59 +08:00

constant

feat: vidu video add starEnd and reference gen video

2025-09-19 18:54:45 +08:00

controller

fix: cast option.Value to string for ratio updates

2025-09-19 14:21:32 +08:00

docs

🍭 style(ui): update the README.md style

2025-08-19 01:44:44 +08:00

dto

feat: change ParallelToolCalls and Store fields to json.RawMessage type

2025-09-20 13:28:33 +08:00

logger

refactor: improve request type validation and enhance sensitive information masking

2025-08-15 13:20:36 +08:00

middleware

Merge branch 'alpha' into feat-vertex-veo

2025-09-13 13:10:39 +08:00

model

Merge branch 'alpha' into imageratio-and-audioratio-edit

2025-09-15 14:12:24 +08:00

relay

fix: gemini system prompt overwrite

2025-09-20 13:38:44 +08:00

router

feat(payment): add payment settings configuration and update payment methods handling

2025-09-12 19:29:34 +08:00

service

Merge branch 'alpha'

2025-09-19 14:20:15 +08:00

setting

Merge branch 'alpha'

2025-09-19 14:20:15 +08:00

types

Merge branch 'alpha'

2025-09-19 14:20:15 +08:00

web

📊 feat: add comprehensive model monitoring dashboard

2025-09-26 02:41:46 +08:00

.dockerignore

🎨 chore: integrate ESLint header automation with AGPL-3.0 notice

2025-07-19 03:30:44 +08:00

.env.example

feat(option): enhance UpdateOption to handle various value types and improve validation

2025-09-03 14:30:25 +08:00

.gitignore

✨ feat: Update .gitignore to exclude additional files

2025-09-25 23:36:47 +08:00

docker-compose.yml

fix(env): update STREAMING_TIMEOUT default value to 300 seconds

2025-08-12 19:58:04 +08:00

Dockerfile

🔄 update: add bun.lock file copy to Dockerfile for dependency management

2025-07-13 14:05:45 +08:00

go.mod

feat: replace pcopy with jinzhu/copier for deep copy functionality

2025-08-26 13:40:41 +08:00

go.sum

feat: replace pcopy with jinzhu/copier for deep copy functionality

2025-08-26 13:40:41 +08:00

LICENSE

⚖️ docs(LICENSE): update license information from Apache 2.0 to New API Licensing

2025-07-20 16:15:00 +08:00

main.go

Merge branch 'alpha' into feat-vertex-veo

2025-09-13 13:10:39 +08:00

makefile

feat: use bun when develop locally

2025-06-09 14:57:01 +08:00

one-api.service

chore: update one-api.service

2023-06-22 11:37:44 +08:00

README.en.md

🤝 docs(README): Enhancing Partner Layout

2025-08-21 12:49:56 +08:00

README.md

feat(readme): update format conversion feature details in README

2025-09-03 14:43:51 +08:00

VERSION

fix: add a blank VERSION file (#135)

2023-06-02 14:20:40 +08:00

README.en.md

New API

🍥 Next-Generation Large Model Gateway and AI Asset Management System

📝 Project Description

Note

This is an open-source project based on One API.

Important

This project is for personal learning purposes only, with no guarantee of stability or technical support.

Users must comply with OpenAI's Terms of Use and applicable laws and regulations, and must not use it for illegal purposes.

According to the Interim Measures for the Management of Generative Artificial Intelligence Services, please do not provide any unregistered generative AI services to the public in China.

🤝 Trusted Partners

No particular order

📚 Documentation

For detailed documentation, please visit our official Wiki: https://docs.newapi.pro/

You can also access the AI-generated DeepWiki:

✨ Key Features

New API offers a wide range of features. Please refer to Features Introduction for details:

🎨 Brand new UI interface
🌍 Multi-language support
💰 Online recharge functionality (YiPay)
🔍 Support for querying usage quotas with keys (works with neko-api-key-tool)
🔄 Compatible with the original One API database
💵 Support for pay-per-use model pricing
⚖️ Support for weighted random channel selection
📈 Data dashboard (console)
🔒 Token grouping and model restrictions
🤖 Support for more authorization login methods (LinuxDO, Telegram, OIDC)
🔄 Support for Rerank models (Cohere and Jina), API Documentation
⚡ Support for OpenAI Realtime API (including Azure channels), API Documentation
⚡ Support for Claude Messages format, API Documentation
Support for entering chat interface via /chat2link route
🧠 Support for setting reasoning effort through model name suffixes:
1. OpenAI o-series models
  - Add -high suffix for high reasoning effort (e.g.: o3-mini-high)
  - Add -medium suffix for medium reasoning effort (e.g.: o3-mini-medium)
  - Add -low suffix for low reasoning effort (e.g.: o3-mini-low)
2. Claude thinking models
  - Add -thinking suffix to enable thinking mode (e.g.: claude-3-7-sonnet-20250219-thinking)
🔄 Thinking-to-content functionality
🔄 Model rate limiting for users
💰 Cache billing support, which allows billing at a set ratio when cache is hit:
1. Set the Prompt Cache Ratio option in System Settings-Operation Settings
2. Set Prompt Cache Ratio in the channel, range 0-1, e.g., setting to 0.5 means billing at 50% when cache is hit
3. Supported channels:
  - OpenAI
  - Azure
  - DeepSeek
  - Claude
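Two of the features above lend themselves to a short sketch: model name suffixes and cache billing. The code below is illustrative only and is not the gateway's implementation. The function names are hypothetical, the family checks (names starting with `o` plus a digit, or with `claude`) are assumptions, and so is the billing rule that only the cached part of the prompt is charged at the Prompt Cache Ratio.

```typescript
type ReasoningEffort = "high" | "medium" | "low";

interface ParsedModelName {
  baseModel: string;
  reasoningEffort?: ReasoningEffort;
  thinking: boolean;
}

// Split a requested model name into the upstream model name and the
// option encoded by its suffix.
function parseModelSuffix(requested: string): ParsedModelName {
  const efforts: ReasoningEffort[] = ["high", "medium", "low"];
  // Assumption: o-series names start with "o" followed by a digit.
  if (/^o\d/.test(requested)) {
    for (const effort of efforts) {
      const suffix = `-${effort}`;
      if (requested.endsWith(suffix)) {
        return {
          baseModel: requested.slice(0, -suffix.length),
          reasoningEffort: effort,
          thinking: false,
        };
      }
    }
  }
  const thinkingSuffix = "-thinking";
  // Assumption: Claude model names start with "claude".
  if (requested.startsWith("claude") && requested.endsWith(thinkingSuffix)) {
    return { baseModel: requested.slice(0, -thinkingSuffix.length), thinking: true };
  }
  return { baseModel: requested, thinking: false };
}

// Assumption: uncached prompt tokens are billed in full and cached
// prompt tokens are billed at cacheRatio (0 to 1).
function billablePromptTokens(promptTokens: number, cachedTokens: number, cacheRatio: number): number {
  if (cacheRatio < 0 || cacheRatio > 1) {
    throw new RangeError("cacheRatio must be between 0 and 1");
  }
  // Clamp so that cached tokens never exceed the prompt size.
  const cached = Math.min(Math.max(cachedTokens, 0), promptTokens);
  return promptTokens - cached + cached * cacheRatio;
}
```

Under these assumptions, a 1000-token prompt with 400 cached tokens and a ratio of 0.5 is billed as 600 + 400 × 0.5 = 800 tokens.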

Model Support

This version supports multiple models. Please refer to API Documentation-Relay Interface for details:

Third-party models gpts (gpt-4-gizmo-*)
Third-party channel Midjourney-Proxy(Plus) interface, API Documentation
Third-party channel Suno API interface, API Documentation
Custom channels, supporting full call address input
Rerank models (Cohere and Jina), API Documentation
Claude Messages format, API Documentation
Dify, currently only supports chatflow

Environment Variable Configuration

For detailed configuration instructions, please refer to Installation Guide-Environment Variables Configuration:

GENERATE_DEFAULT_TOKEN: Whether to generate initial tokens for newly registered users, default is false
STREAMING_TIMEOUT: Streaming response timeout, default is 300 seconds
DIFY_DEBUG: Whether to output workflow and node information for Dify channels, default is true
FORCE_STREAM_OPTION: Whether to override client stream_options parameter, default is true
GET_MEDIA_TOKEN: Whether to count image tokens, default is true
GET_MEDIA_TOKEN_NOT_STREAM: Whether to count image tokens in non-streaming cases, default is true
UPDATE_TASK: Whether to update asynchronous tasks (Midjourney, Suno), default is true
COHERE_SAFETY_SETTING: Cohere model safety settings, options are NONE, CONTEXTUAL, STRICT, default is NONE
GEMINI_VISION_MAX_IMAGE_NUM: Maximum number of images for Gemini models, default is 16
MAX_FILE_DOWNLOAD_MB: Maximum file download size in MB, default is 20
CRYPTO_SECRET: Encryption key used for encrypting database content
AZURE_DEFAULT_API_VERSION: Azure channel default API version, default is 2025-04-01-preview
NOTIFICATION_LIMIT_DURATION_MINUTE: Notification limit duration, default is 10 minutes
NOTIFY_LIMIT_COUNT: Maximum number of user notifications within the specified duration, default is 2
ERROR_LOG_ENABLED: Whether to record and display error logs, default is false
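To make the defaults above concrete, here is a minimal sketch that reads a subset of these variables from a plain key/value map and falls back to the documented defaults. It is not how the Go service parses its environment: the helper names and the `GatewaySettings` shape are hypothetical, and the parsing rules (only the string `true` counts as true, unparsable numbers fall back to the default) are assumptions.

```typescript
type EnvMap = Record<string, string | undefined>;

// Assumption: only "true" (case-insensitive) enables a flag.
function envBool(env: EnvMap, key: string, fallback: boolean): boolean {
  const raw = env[key];
  if (raw === undefined || raw.trim() === "") return fallback;
  return raw.trim().toLowerCase() === "true";
}

// Assumption: a value that does not start with an integer uses the default.
function envInt(env: EnvMap, key: string, fallback: number): number {
  const raw = env[key];
  if (raw === undefined) return fallback;
  const parsed = Number.parseInt(raw, 10);
  return Number.isNaN(parsed) ? fallback : parsed;
}

const COHERE_SAFETY_OPTIONS = ["NONE", "CONTEXTUAL", "STRICT"] as const;
type CohereSafety = (typeof COHERE_SAFETY_OPTIONS)[number];

// Unknown values fall back to the documented default, NONE.
function envCohereSafety(env: EnvMap): CohereSafety {
  const raw = (env["COHERE_SAFETY_SETTING"] ?? "").trim().toUpperCase();
  return COHERE_SAFETY_OPTIONS.find((option) => option === raw) ?? "NONE";
}

// Hypothetical settings object covering a subset of the variables above.
interface GatewaySettings {
  generateDefaultToken: boolean;
  streamingTimeoutSeconds: number;
  forceStreamOption: boolean;
  cohereSafetySetting: CohereSafety;
  geminiVisionMaxImageNum: number;
  maxFileDownloadMb: number;
  notifyLimitCount: number;
  errorLogEnabled: boolean;
}

function readGatewaySettings(env: EnvMap): GatewaySettings {
  return {
    generateDefaultToken: envBool(env, "GENERATE_DEFAULT_TOKEN", false),
    streamingTimeoutSeconds: envInt(env, "STREAMING_TIMEOUT", 300),
    forceStreamOption: envBool(env, "FORCE_STREAM_OPTION", true),
    cohereSafetySetting: envCohereSafety(env),
    geminiVisionMaxImageNum: envInt(env, "GEMINI_VISION_MAX_IMAGE_NUM", 16),
    maxFileDownloadMb: envInt(env, "MAX_FILE_DOWNLOAD_MB", 20),
    notifyLimitCount: envInt(env, "NOTIFY_LIMIT_COUNT", 2),
    errorLogEnabled: envBool(env, "ERROR_LOG_ENABLED", false),
  };
}
```

In a Node.js script, passing `process.env` to `readGatewaySettings` would yield the effective values for the current shell.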

Deployment

For detailed deployment guides, please refer to Installation Guide-Deployment Methods:

Tip

Latest Docker image: calciumion/new-api:latest

Multi-machine Deployment Considerations

Environment variable SESSION_SECRET must be set, otherwise login status will be inconsistent across multiple machines
If sharing Redis, CRYPTO_SECRET must be set, otherwise Redis content cannot be accessed across multiple machines
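A pre-flight check for the two rules above might look like the following sketch. The function name and messages are hypothetical; using `REDIS_CONN_STRING` (documented in the cache section of this README) as the signal that Redis is shared is an assumption.

```typescript
// Return a list of problems for a node that is part of a multi-machine
// deployment. An empty list means both rules above are satisfied.
function multiNodeEnvProblems(env: Record<string, string | undefined>): string[] {
  const isSet = (key: string): boolean => (env[key] ?? "").trim() !== "";
  const problems: string[] = [];
  if (!isSet("SESSION_SECRET")) {
    problems.push("SESSION_SECRET is not set: login status will be inconsistent across machines");
  }
  // Assumption: a configured REDIS_CONN_STRING means Redis is shared.
  if (isSet("REDIS_CONN_STRING") && !isSet("CRYPTO_SECRET")) {
    problems.push("CRYPTO_SECRET is not set: Redis content cannot be read across machines");
  }
  return problems;
}
```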

Deployment Requirements

Local database (default): SQLite (Docker deployment must mount the /data directory)
Remote database: MySQL version >= 5.7.8, PgSQL version >= 9.6
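The minimum versions above can be checked with a simple dotted-version comparison. This is a sketch under assumptions: the function names are hypothetical, and the version string is assumed to be whatever the database server reports (for example `8.0.36` or `5.7.44-log`), of which only the leading numeric part of each component is compared.

```typescript
// Minimum versions taken from the requirements above.
const MINIMUM_DB_VERSIONS = { mysql: "5.7.8", postgres: "9.6" } as const;
type DbKind = keyof typeof MINIMUM_DB_VERSIONS;

// "5.7.44-log" -> [5, 7, 44]; non-numeric components count as 0.
function parseVersion(version: string): number[] {
  return version.split(".").map((part) => Number.parseInt(part, 10) || 0);
}

// Compare component by component; missing components count as 0.
function versionAtLeast(actual: string, minimum: string): boolean {
  const a = parseVersion(actual);
  const m = parseVersion(minimum);
  for (let i = 0; i < Math.max(a.length, m.length); i++) {
    const left = a[i] ?? 0;
    const right = m[i] ?? 0;
    if (left > right) return true;
    if (left < right) return false;
  }
  return true; // all components equal
}

function meetsDbRequirement(kind: DbKind, version: string): boolean {
  return versionAtLeast(version, MINIMUM_DB_VERSIONS[kind]);
}
```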

Deployment Methods

Using BaoTa Panel Docker Feature

Install BaoTa Panel (version 9.2.0 or above), find New-API in the application store and install it. Tutorial with images

Using Docker Compose (Recommended)

# Download the project
git clone https://github.com/Calcium-Ion/new-api.git
cd new-api
# Edit docker-compose.yml as needed
# Start
docker-compose up -d

Using Docker Image Directly

# Using SQLite
docker run --name new-api -d --restart always -p 3000:3000 -e TZ=Asia/Shanghai -v /home/ubuntu/data/new-api:/data calciumion/new-api:latest

# Using MySQL
docker run --name new-api -d --restart always -p 3000:3000 -e SQL_DSN="root:123456@tcp(localhost:3306)/oneapi" -e TZ=Asia/Shanghai -v /home/ubuntu/data/new-api:/data calciumion/new-api:latest

Channel Retry and Cache

Channel retry is implemented. You can set the number of retries in Settings->Operation Settings->General Settings. Enabling caching is recommended.
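The general idea of a bounded retry can be sketched as below. This is not the gateway's relay code: `relayWithRetry` is a hypothetical helper, it is synchronous for brevity, and the assumption is that a retry count of N allows one initial attempt plus up to N further attempts (each of which could, in a real gateway, pick a different channel).

```typescript
// Call `attempt` until it succeeds or the retry budget is used up.
// attemptIndex is 0 for the first try, 1 for the first retry, and so on.
function relayWithRetry<T>(attempt: (attemptIndex: number) => T, retryTimes: number): T {
  let lastError: unknown;
  for (let i = 0; i <= retryTimes; i++) {
    try {
      return attempt(i);
    } catch (err) {
      lastError = err; // remember the failure and try again
    }
  }
  // Every attempt failed: surface the most recent error.
  throw lastError;
}
```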

Cache Configuration Method

REDIS_CONN_STRING: Set Redis as cache
MEMORY_CACHE_ENABLED: Enable memory cache (no need to set manually if Redis is set)
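The relationship between the two variables can be sketched as a small decision function. The names `CachePlan` and `resolveCachePlan` are hypothetical, and reading "no need to set manually if Redis is set" as "memory cache is turned on whenever Redis is configured" is an interpretation, not confirmed behavior.

```typescript
interface CachePlan {
  useRedis: boolean;
  useMemoryCache: boolean;
}

function resolveCachePlan(env: Record<string, string | undefined>): CachePlan {
  const redisConfigured = (env["REDIS_CONN_STRING"] ?? "").trim() !== "";
  const memoryFlag = (env["MEMORY_CACHE_ENABLED"] ?? "").trim().toLowerCase() === "true";
  return {
    useRedis: redisConfigured,
    // Assumption: configuring Redis implies the memory cache as well.
    useMemoryCache: redisConfigured || memoryFlag,
  };
}
```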

API Documentation

For detailed API documentation, please refer to API Documentation:

Related Projects

One API: Original project
Midjourney-Proxy: Midjourney interface support
chatnio: Next-generation AI one-stop B/C-end solution
neko-api-key-tool: Query usage quota with key

Other projects based on New API:

new-api-horizon: High-performance optimized version of New API
VoAPI: Frontend beautified version based on New API

Help and Support

If you have any questions, please refer to Help and Support:
