Commit Graph

3216 Commits

Author SHA1 Message Date
wwq
cb2a7aa60a fix(知识服务): 添加工作空间模型配置的校验
在创建知识时检查工作空间是否配置了必要的模型,未配置时抛出异常提示用户
2026-04-21 17:18:11 +08:00
wwq
402c8aef5d refactor(knowledge_service): 简化模型绑定逻辑,直接使用工作区配置
移除_get_model_by_id_or_fallback方法,直接使用工作区配置的模型ID
对于image2text模型,放宽类型限制并移除composite检查
2026-04-21 17:04:42 +08:00
wwq
eb98a69a84 fix(知识服务): 修复创建知识库时未检查视觉模型存在的错误
当租户下没有可用的视觉模型时,抛出明确异常提示
2026-04-21 16:50:43 +08:00
wwq
152a84aff3 refactor(knowledge_service): 优化模型绑定逻辑,使用ID查询并简化回退机制
将模型绑定逻辑从按名称查询改为按ID查询,提高准确性
简化回退机制,直接查询租户下最新创建的模型
统一处理图像转文本模型的查询方式
2026-04-21 16:45:14 +08:00
zhaoying
a106f4e3cd fix(web): pageTabs style reset 2026-04-21 16:41:08 +08:00
zhaoying
9c20301a52 fix(web): prompt add loading 2026-04-21 16:31:32 +08:00
yingzhao
c5c8be89ed Merge pull request #955 from SuanmoSuanyangTechnology/fix/v0.3.1_zy
fix(web): package support unlimited
2026-04-21 15:54:08 +08:00
zhaoying
30aed72b74 fix(web): package support unlimited 2026-04-21 15:48:24 +08:00
山程漫悟
35c2d9d0d3 Merge pull request #950 from SuanmoSuanyangTechnology/fix/wxy_031
feat(model_parsing): add model reference resolution for LLM and relat…
2026-04-21 15:09:49 +08:00
yingzhao
27275eee43 Merge pull request #954 from SuanmoSuanyangTechnology/fix/v0.3.1_zy
Fix/v0.3.1 zy
2026-04-21 15:09:04 +08:00
yingzhao
cde02026d3 Merge pull request #953 from SuanmoSuanyangTechnology/fix/stream_zy
fix(web): stream support abort
2026-04-21 15:08:45 +08:00
zhaoying
1a826c0026 Revert "fix(web): abort reset"
This reverts commit 8cab49c2b1.
2026-04-21 15:08:15 +08:00
zhaoying
8cab49c2b1 fix(web): abort reset 2026-04-21 15:07:16 +08:00
zhaoying
7eb21f677f fix(web): custom model not support api key edit 2026-04-21 15:04:35 +08:00
wwq
6de5d413c4 fix(app_dsl_service): 修复模型和知识库引用解析逻辑
改进模型引用解析,优先使用ID匹配并处理异常情况
优化知识库引用解析,移除不必要的"None"字符串检查
统一返回字符串类型的ID,保持类型一致性
2026-04-21 15:03:18 +08:00
zhaoying
a2df14f658 fix(web): stream support abort 2026-04-21 15:00:28 +08:00
Mark
aecb0f6497 Merge branch 'feature/rag2' into release/v0.3.1
* feature/rag2:
  [modify] fix
  [modify] Optimize ES connections and add rerank security checks
2026-04-21 13:44:39 +08:00
zhaoying
83b7c6870d fix(web): knowledge config 2026-04-21 13:35:21 +08:00
山程漫悟
74157adb12 Merge pull request #952 from SuanmoSuanyangTechnology/fix/Timebomb_031
fix(model_service)
2026-04-21 12:21:46 +08:00
Timebomb2018
8011610acc fix(model_service): sync model capability and is_omni to associated api_keys 2026-04-21 12:15:14 +08:00
wwq
f1dc507b5c fix: 优化知识库和模型引用解析逻辑
移除对字符串长度的UUID验证,仅检查是否为有效UUID或非"None"字符串
2026-04-21 11:55:00 +08:00
yingzhao
f3ac7e084d Merge pull request #951 from SuanmoSuanyangTechnology/fix/v0.3.1_zy
fix(web): vision_input support file type variable
2026-04-21 11:38:31 +08:00
zhaoying
ba3743f9f1 fix(web): vision_input support file type variable 2026-04-21 11:37:04 +08:00
wwq
20ddc76a4d feat(model_parsing): add model reference resolution for LLM and related node types
- Add model reference resolution for LLM, Question Classifier, and Parameter Extractor nodes.
- Support parsing various model reference formats, including dictionaries, UUID strings, and name strings, when `model_id` is present.
- Add warning logs for cases where model resolution fails.
2026-04-20 21:48:45 +08:00
山程漫悟
84ca98555d Merge pull request #948 from SuanmoSuanyangTechnology/fix/wxy_031
refactor(knowledge_service): refactor model binding logic into generic function
2026-04-20 21:28:03 +08:00
山程漫悟
7e6d17e4e3 Merge pull request #949 from SuanmoSuanyangTechnology/fix/Timebomb_031
fix(model service)
2026-04-20 20:53:37 +08:00
Timebomb2018
7f3c48ce2a Merge remote-tracking branch 'origin/release/v0.3.1' into fix/Timebomb_031 2026-04-20 20:48:46 +08:00
Timebomb2018
e5c16a2a24 refactor(model_service): remove hardcoded extra_params from model initialization 2026-04-20 20:48:00 +08:00
wwq
8887600f7d refactor(knowledge_service): refactor model binding logic into generic function
- Extract duplicate model binding logic into `_get_model_by_name_or_fallback`.
- Implement logic to prioritize workspace default configuration, falling back to the tenant's first available model if not found.
- Simplify binding code for embedding, rerank, and LLM models.
2026-04-20 19:01:06 +08:00
山程漫悟
df6eb74b28 Merge pull request #947 from wanxunyang/feature/add-quota-check-decorator
refactor(api_key): change rate limit handling to auto-cap at tenant l…
2026-04-20 18:48:15 +08:00
wwq
b4b9974064 refactor(api_key): change rate limit handling to auto-cap at tenant limit
- Replace exception throwing with automatic capping when rate limit exceeds tenant plan limit, improving user experience.
2026-04-20 18:45:17 +08:00
yingzhao
ff65dee754 Merge pull request #946 from SuanmoSuanyangTechnology/fix/v0.3.1_zy
fix(web): check list add vision_input
2026-04-20 18:40:58 +08:00
zhaoying
2c2ed0ebf3 fix(web): check list add vision_input 2026-04-20 18:39:59 +08:00
山程漫悟
d60f838fb8 Merge pull request #939 from wanxunyang/feature/add-quota-check-decorator
feat(quota): refactor quota management and rate limiting services
2026-04-20 18:36:33 +08:00
wwq
817aa78d03 fix(rate_limit): differentiate between tenant plan and API Key QPS limit errors
- Add logic to detect tenant plan QPS limits and return a specific error message when triggered.
- Simplify boolean check in model activation quota validation.
2026-04-20 18:34:18 +08:00
Ke Sun
4c73887a48 Merge pull request #945 from SuanmoSuanyangTechnology/fix/read-appNone
fix(memory): use end_user.workspace_id instead of app.workspace_id in…
2026-04-20 18:30:39 +08:00
lanceyq
94d2d975ee fix(memory): use end_user.workspace_id instead of app.workspace_id in log message
Corrected variable reference in get_end_user_connected_config log statement. The previous code referenced app.workspace_id which could be incorrect or undefined in this context.
2026-04-20 18:26:20 +08:00
wwq
d59990d326 fix(rate_limit): differentiate between tenant plan and API Key QPS limit errors
- Add logic to detect tenant plan QPS limits and return a specific error message when triggered.
- Simplify boolean check in model activation quota validation.
2026-04-20 18:25:39 +08:00
wwq
3227c25b07 fix(quota): fix tenant ID retrieval and QPS counting logic
- Fix issue where tenant ID lookup from shared records failed to query the workspace correctly.
- Switch QPS counting from sliding window to simple counter to improve performance and simplify logic.
- Remove unnecessary `time` module import.
2026-04-20 18:10:28 +08:00
Eternity
dc3207b1d3 Merge branch 'develop' into refactor/memory_search
# Conflicts:
#	api/app/core/memory/storage_services/search/__init__.py
2026-04-20 18:07:07 +08:00
wwq
08b5c7bc8a perf(限流服务): 优化Redis查询以减少命令数量
使用zcount替代zremrangebyscore和zcard组合查询,减少一次Redis操作
2026-04-20 17:46:05 +08:00
Eternity
688503a1ca refactor(memory): integrate unified memory service into agent controller
- Replace direct memory agent service calls with unified MemoryService in read endpoint
- Update query preprocessor to use new prompt format and return structured queries
- Enhance MemorySearchResult model with filtering, merging, and ID tracking capabilities
- Add intermediate outputs display for problem split, perceptual retrieval, and search results
- Fix parameter alignment and remove unused history parameter in memory agent service
2026-04-20 17:43:52 +08:00
Ke Sun
475e573891 Merge pull request #943 from SuanmoSuanyangTechnology/fix/v1create-end
fix(api): make unused message body parameter optional in create_end_user
2026-04-20 17:24:21 +08:00
wwq
b03300c804 refactor(rate_limit): refactor API Key rate limiting and remove tenant-level QPS check
- Streamline rate limit check flow by removing redundant tenant-level QPS checks.
- Restrict checks to API Key QPS and plan degradation protection only.
- Update constant naming and error message handling for consistency.
2026-04-20 17:18:05 +08:00
yingzhao
a5d07ee66d Merge pull request #944 from SuanmoSuanyangTechnology/fix/v0.3.1_zy
Fix/v0.3.1 zy
2026-04-20 17:05:32 +08:00
zhaoying
10a655772f fix(web): jump list 2026-04-20 17:04:00 +08:00
zhaoying
aeeb18581d fix(web): change search_result type log result 2026-04-20 17:00:58 +08:00
lanceyq
fb1160e833 fix(api): make unused message body parameter optional in create_end_user
Change Body(...) to Body(None) for the message parameter which is never
used directly (request body is read via request.json() instead).
The required marker caused unnecessary 422 validation errors.
2026-04-20 16:21:18 +08:00
wwq
c448cf0660 refactor(rate-limit): change rate limiting granularity from tenant to API Key
- Refactor rate limiting mechanism to limit per API Key instead of per tenant (workspace).
- Update error code logic and Redis key naming conventions.
- Adjust quota usage statistics to display the QPS of the API Key closest to its limit.
2026-04-20 16:13:30 +08:00
yingzhao
c50969dea4 Merge pull request #942 from SuanmoSuanyangTechnology/feature/history_zy
feat(web): workflow support undo/redo
2026-04-20 16:10:33 +08:00