Commit Graph

3041 Commits

Author SHA1 Message Date
山程漫悟
35c2d9d0d3 Merge pull request #950 from SuanmoSuanyangTechnology/fix/wxy_031
feat(model_parsing): add model reference resolution for LLM and relat…
2026-04-21 15:09:49 +08:00
yingzhao
27275eee43 Merge pull request #954 from SuanmoSuanyangTechnology/fix/v0.3.1_zy
Fix/v0.3.1 zy
2026-04-21 15:09:04 +08:00
zhaoying
7eb21f677f fix(web): custom model not support api key edit 2026-04-21 15:04:35 +08:00
wwq
6de5d413c4 fix(app_dsl_service): 修复模型和知识库引用解析逻辑
改进模型引用解析,优先使用ID匹配并处理异常情况
优化知识库引用解析,移除不必要的"None"字符串检查
统一返回字符串类型的ID,保持类型一致性
2026-04-21 15:03:18 +08:00
Mark
aecb0f6497 Merge branch 'feature/rag2' into release/v0.3.1
* feature/rag2:
  [modify] fix
  [modify] Optimize ES connections and add rerank security checks
2026-04-21 13:44:39 +08:00
zhaoying
83b7c6870d fix(web): knowledge config 2026-04-21 13:35:21 +08:00
山程漫悟
74157adb12 Merge pull request #952 from SuanmoSuanyangTechnology/fix/Timebomb_031
fix(model_service)
2026-04-21 12:21:46 +08:00
Timebomb2018
8011610acc fix(model_service): sync model capability and is_omni to associated api_keys 2026-04-21 12:15:14 +08:00
wwq
f1dc507b5c fix: 优化知识库和模型引用解析逻辑
移除对字符串长度的UUID验证,仅检查是否为有效UUID或非"None"字符串
2026-04-21 11:55:00 +08:00
yingzhao
f3ac7e084d Merge pull request #951 from SuanmoSuanyangTechnology/fix/v0.3.1_zy
fix(web): vision_input support file type variable
2026-04-21 11:38:31 +08:00
zhaoying
ba3743f9f1 fix(web): vision_input support file type variable 2026-04-21 11:37:04 +08:00
wwq
20ddc76a4d feat(model_parsing): add model reference resolution for LLM and related node types
- Add model reference resolution for LLM, Question Classifier, and Parameter Extractor nodes.
- Support parsing various model reference formats, including dictionaries, UUID strings, and name strings, when `model_id` is present.
- Add warning logs for cases where model resolution fails.
2026-04-20 21:48:45 +08:00
山程漫悟
84ca98555d Merge pull request #948 from SuanmoSuanyangTechnology/fix/wxy_031
refactor(knowledge_service): refactor model binding logic into generic function
2026-04-20 21:28:03 +08:00
山程漫悟
7e6d17e4e3 Merge pull request #949 from SuanmoSuanyangTechnology/fix/Timebomb_031
fix(model service)
2026-04-20 20:53:37 +08:00
Timebomb2018
7f3c48ce2a Merge remote-tracking branch 'origin/release/v0.3.1' into fix/Timebomb_031 2026-04-20 20:48:46 +08:00
Timebomb2018
e5c16a2a24 refactor(model_service): remove hardcoded extra_params from model initialization 2026-04-20 20:48:00 +08:00
wwq
8887600f7d refactor(knowledge_service): refactor model binding logic into generic function
- Extract duplicate model binding logic into `_get_model_by_name_or_fallback`.
- Implement logic to prioritize workspace default configuration, falling back to the tenant's first available model if not found.
- Simplify binding code for embedding, rerank, and LLM models.
2026-04-20 19:01:06 +08:00
山程漫悟
df6eb74b28 Merge pull request #947 from wanxunyang/feature/add-quota-check-decorator
refactor(api_key): change rate limit handling to auto-cap at tenant l…
2026-04-20 18:48:15 +08:00
wwq
b4b9974064 refactor(api_key): change rate limit handling to auto-cap at tenant limit
- Replace exception throwing with automatic capping when rate limit exceeds tenant plan limit, improving user experience.
2026-04-20 18:45:17 +08:00
yingzhao
ff65dee754 Merge pull request #946 from SuanmoSuanyangTechnology/fix/v0.3.1_zy
fix(web): check list add vision_input
2026-04-20 18:40:58 +08:00
zhaoying
2c2ed0ebf3 fix(web): check list add vision_input 2026-04-20 18:39:59 +08:00
山程漫悟
d60f838fb8 Merge pull request #939 from wanxunyang/feature/add-quota-check-decorator
feat(quota): refactor quota management and rate limiting services
2026-04-20 18:36:33 +08:00
wwq
817aa78d03 fix(rate_limit): differentiate between tenant plan and API Key QPS limit errors
- Add logic to detect tenant plan QPS limits and return a specific error message when triggered.
- Simplify boolean check in model activation quota validation.
2026-04-20 18:34:18 +08:00
Ke Sun
4c73887a48 Merge pull request #945 from SuanmoSuanyangTechnology/fix/read-appNone
fix(memory): use end_user.workspace_id instead of app.workspace_id in…
2026-04-20 18:30:39 +08:00
lanceyq
94d2d975ee fix(memory): use end_user.workspace_id instead of app.workspace_id in log message
Corrected variable reference in get_end_user_connected_config log statement. The previous code referenced app.workspace_id which could be incorrect or undefined in this context.
2026-04-20 18:26:20 +08:00
wwq
d59990d326 fix(rate_limit): differentiate between tenant plan and API Key QPS limit errors
- Add logic to detect tenant plan QPS limits and return a specific error message when triggered.
- Simplify boolean check in model activation quota validation.
2026-04-20 18:25:39 +08:00
wwq
3227c25b07 fix(quota): fix tenant ID retrieval and QPS counting logic
- Fix issue where tenant ID lookup from shared records failed to query the workspace correctly.
- Switch QPS counting from sliding window to simple counter to improve performance and simplify logic.
- Remove unnecessary `time` module import.
2026-04-20 18:10:28 +08:00
wwq
08b5c7bc8a perf(限流服务): 优化Redis查询以减少命令数量
使用zcount替代zremrangebyscore和zcard组合查询,减少一次Redis操作
2026-04-20 17:46:05 +08:00
Ke Sun
475e573891 Merge pull request #943 from SuanmoSuanyangTechnology/fix/v1create-end
fix(api): make unused message body parameter optional in create_end_user
2026-04-20 17:24:21 +08:00
wwq
b03300c804 refactor(rate_limit): refactor API Key rate limiting and remove tenant-level QPS check
- Streamline rate limit check flow by removing redundant tenant-level QPS checks.
- Restrict checks to API Key QPS and plan degradation protection only.
- Update constant naming and error message handling for consistency.
2026-04-20 17:18:05 +08:00
yingzhao
a5d07ee66d Merge pull request #944 from SuanmoSuanyangTechnology/fix/v0.3.1_zy
Fix/v0.3.1 zy
2026-04-20 17:05:32 +08:00
zhaoying
10a655772f fix(web): jump list 2026-04-20 17:04:00 +08:00
zhaoying
aeeb18581d fix(web): change search_result type log result 2026-04-20 17:00:58 +08:00
lanceyq
fb1160e833 fix(api): make unused message body parameter optional in create_end_user
Change Body(...) to Body(None) for the message parameter which is never
used directly (request body is read via request.json() instead).
The required marker caused unnecessary 422 validation errors.
2026-04-20 16:21:18 +08:00
wwq
c448cf0660 refactor(rate-limit): change rate limiting granularity from tenant to API Key
- Refactor rate limiting mechanism to limit per API Key instead of per tenant (workspace).
- Update error code logic and Redis key naming conventions.
- Adjust quota usage statistics to display the QPS of the API Key closest to its limit.
2026-04-20 16:13:30 +08:00
yingzhao
5289b3a2cb Merge pull request #940 from SuanmoSuanyangTechnology/fix/v0.3.1_zy
Fix/v0.3.1 zy
2026-04-20 15:34:48 +08:00
wwq
48f3d9b105 feat(quota): refactor quota management and rate limiting services
- Add `API_KEY_RATE_LIMIT_EXCEEDED` error code.
- Refactor `QuotaExceededError` to support resource type localization.
- Optimize rate limiting service by implementing the sliding window algorithm.
- Add rate limit validation for tenant plans.
- Unify quota check decorator to support both synchronous and asynchronous operations.
- Enhance quota usage statistics endpoints.
2026-04-20 15:10:12 +08:00
zhaoying
559b4bef6b fix(web): add tool_id required check list 2026-04-20 14:47:16 +08:00
zhaoying
4a39fd5f46 fix(web) if-else port y calculate update 2026-04-20 14:31:31 +08:00
yingzhao
b22c15cccc Merge pull request #938 from SuanmoSuanyangTechnology/fix/v0.3.1_zy
fix(web): update quotas key
2026-04-20 10:17:29 +08:00
zhaoying
a2f85b3d98 fix(web): update quotas key 2026-04-20 10:16:31 +08:00
Ke Sun
7f1cf13b23 Merge pull request #932 from SuanmoSuanyangTechnology/fix/extract-metadata
refactor(memory): insert new metadata values at list head for recency…
2026-04-17 21:04:38 +08:00
Ke Sun
d4129edcf5 Merge pull request #923 from SuanmoSuanyangTechnology/feat/enduser-info-apikey
feat(memory): add V1 memory config management endpoints and memory read/write API
2026-04-17 21:03:10 +08:00
yingzhao
ab2a58d68e Merge pull request #937 from SuanmoSuanyangTechnology/feature/if_else_zy
Feature/if else zy
2026-04-17 20:52:34 +08:00
zhaoying
a28b62763e fix(web): CaseItem interface 2026-04-17 20:48:17 +08:00
zhaoying
86540a81d1 fix(web): SubCondition interface 2026-04-17 20:46:03 +08:00
yingzhao
dcd874fecd Merge pull request #936 from SuanmoSuanyangTechnology/feature/if_else_zy
fix(web): if-else port position
2026-04-17 20:42:25 +08:00
zhaoying
bbd85733b8 fix(web): if-else port position 2026-04-17 20:41:23 +08:00
山程漫悟
22c5f12657 Merge pull request #935 from SuanmoSuanyangTechnology/feature/agent-tool_xjn
fix(workflow)
2026-04-17 20:29:34 +08:00
Timebomb2018
7b5d7696cb feat(workflow): support variable input type in if-else node conditions 2026-04-17 20:26:44 +08:00