* feat(web): add PageEmpty component
* feat(web): add PageTabs component
* feat(web): add PageEmpty component
* feat(web): add PageTabs component
* feat(prompt): add history tracking for prompt releases
* feat(web): add prompt menu
* refactor: The PageScrollList component supports two generic parameters
* feat(web): BodyWrapper compoent update PageLoading
* feat(web): add Ontology menu
* feat(web): memory management add scene
* feat(tasks): add celery task configuration for periodic jobs
- Add ignore_result=True to prevent storing results for periodic tasks
- Set max_retries=0 to skip failed periodic tasks without retry attempts
- Configure acks_late=False for immediate acknowledgment in beat tasks
- Add time_limit and soft_time_limit to regenerate_memory_cache task (3600s/3300s)
- Add time_limit and soft_time_limit to workspace_reflection_task (300s/240s)
- Add time_limit and soft_time_limit to run_forgetting_cycle_task (7200s/7000s)
- Improve task reliability and resource management for scheduled jobs
* feat(sandbox): add Node.js code execution support to sandbox
* Release/v0.2.2 (#260)
* [modify] migration script
* [add] migration script
* fix(web): change form message
* fix(web): the memoryContent field is compatible with numbers and strings
* feat(web): code node hidden
* fix(model):
1. create a basic model to check if the name and provider are duplicated.
2. The result shows error models because the provider created API Keys for all matching models.
---------
Co-authored-by: Mark <zhuwenhui5566@163.com>
Co-authored-by: zhaoying <yzhao96@best-inc.com>
Co-authored-by: yingzhao <zhaoyingyz@126.com>
Co-authored-by: Timebomb2018 <18868801967@163.com>
* Feature/ontology class clean (#249)
* [add] Complete ontology engineering feature implementation
* [add] Add ontology feature integration and validation utilities
* [add] Add OWL validator and validation utilities
* [fix] Add missing render_ontology_extraction_prompt function
* [fix]Add dependencies, fix functionality
* [add] migration script
* feat(celery): add dedicated periodic tasks worker and queue (#261)
* fix(web): conflict resolve
* Fix/v022 bug (#263)
* [fix]Fix the issue of inconsistent language in explicit and episodic memory.
* [fix]Fix the issue of inconsistent language in explicit and episodic memory.
* [add]Add scene_id
* [fix]Based on the AI review to fix the code
* Fix/develop memory reflex (#265)
* 遗漏的历史映射
* 遗漏的历史映射
* 反思后台报错处理
* [add] migration script
* fix: chat conversation_id add node_start
* feat(web): show code node
* fix(web): Restructure the CustomSelect component, repair the interface that is called multiple times when the form is updated
* feat(web): RadioGroupCard support block mode
* feat(web): create space add icon
* feat(app and model): token consumption statistics
* Add/develop memory (#264)
* 遗漏的历史映射
* 遗漏的历史映射
* 遗漏的历史映射
* 遗漏的历史映射
* 遗漏的历史映射
* 遗漏的历史映射
* 遗漏的历史映射
* 遗漏的历史映射
* 遗漏的历史映射
* 新增长期记忆功能
* 新增长期记忆功能
* 新增长期记忆功能
* 知识库检索多余字段
* 长期
* feat(app and model): token consumption statistics of the cluster
* memory_BUG_fix
* fix(web): prompt history remove pageLoading
* fix(prompt): remove hard-coded import of prompt file paths (#279)
* Fix/develop memory bug (#274)
* 遗漏的历史映射
* 遗漏的历史映射
* fix_timeline_memories
* fix(web): update retrieve_type key
* Fix/develop memory bug (#276)
* 遗漏的历史映射
* 遗漏的历史映射
* fix_timeline_memories
* fix_timeline_memories
* write_gragp/bug_fix
* write_gragp/bug_fix
* write_gragp/bug_fix
* chore(celery): disable periodic task scheduling
* fix(prompt): remove hard-coded import of prompt file paths
---------
Co-authored-by: lixinyue11 <94037597+lixinyue11@users.noreply.github.com>
Co-authored-by: zhaoying <yzhao96@best-inc.com>
Co-authored-by: yingzhao <zhaoyingyz@126.com>
Co-authored-by: Ke Sun <kesun5@illinois.edu>
* fix(web): remove delete confirm content
* refactor(workflow): relocate template directory into workflow
* feat(memory): add long-term storage task routing and batching
* fix(web): PageScrollList loading update
* fix(web): PageScrollList loading update
* Ontology v1 bug (#291)
* [changes]Add 'id' as the secondary sorting key, and 'scene_id' now returns a UUID object
* [fix]Fix the "end_user" return to be sorted by update time.
* [fix]Set the default values of the memory configuration model based on the spatial model.
* [fix]Remove the entity extraction check combination model, read the configuration list, and add the return of scene_id
* [fix]Fix the "end_user" return to be sorted by update time.
* [fix]
* fix(memory): add Redis session validation
- Add macOS fork() safety configuration in celery_app.py to prevent initialization issues
- Add null/False checks for Redis session queries in term_memory_save to handle missing sessions gracefully
- Add null/False checks in memory_long_term_storage to prevent processing empty Redis results
- Add null/False checks in aggregate_judgment before format_parsing to avoid errors on missing data
- Initialize redis_messages variable in window_dialogue for consistency
- Add debug logging when no existing session found in Redis for better troubleshooting
- Add TODO comments for magic numbers (scope=6, time=5) to be extracted as constants
- Improve error handling when Redis returns False or empty results instead of crashing
* fix(web): PageScrollList style update
* fix(workflow): fix argument passing in code execution nodes
* fix(web): prompt add disabled
* fix(web): space icon required
* feat(app): modify the key of the token
* fix(fix the key of the app's token):
* fix(workflow): switch code input encoding to base64+URL encoding
* [add]The main project adds multi-API Key load balancing.
* [changes]Attribute security access, secure numerical conversion, unified use of local variables
* fix(web): save add session update
* fix(web): language editor support paste
* [changes]Active status filtering logic, API Key selection strategy
* memory_BUG
* memory_BUG_long_term
* [changes]
* memory_BUG_long_term
* memory_BUG_long_term
* Fix/release memory bug (#306)
* memory_BUG_fix
* memory_BUG
* memory_BUG_long_term
* memory_BUG_long_term
* memory_BUG_long_term
* knowledge_retrieval/bug/fix
* knowledge_retrieval/bug/fix
* knowledge_retrieval/bug/fix
* [fix]1.The "read_all_config" interface returns "scene_name";2.Memory configuration for lightweight query ontology scenarios
* fix(web): replace code editor
* [changes]Modify the description of the time for the recent event
* [changes]Modify the code based on the AI review
* feat(web): update memory config ontology api
* fix(web): ui update
* knowledge_retrieval/bug/fix
* knowledge_retrieval/bug/fix
* knowledge_retrieval/bug/fix
* feat(workflow): add token usage statistics for question classifier and parameter extraction
* feat(web): move prompt menu
* Multiple independent transactions - single transaction
* Multiple independent transactions - single transaction
* Multiple independent transactions - single transaction
* Multiple independent transactions - single transaction
* Write Missing None (#321)
* Write Missing None
* Write Missing None
* Write Missing None
* Apply suggestion from @sourcery-ai[bot]
Co-authored-by: sourcery-ai[bot] <58596630+sourcery-ai[bot]@users.noreply.github.com>
* Write Missing None
---------
Co-authored-by: sourcery-ai[bot] <58596630+sourcery-ai[bot]@users.noreply.github.com>
* Fix/release memory bug (#324)
* Write Missing None
* Write Missing None
* Write Missing None
* Apply suggestion from @sourcery-ai[bot]
Co-authored-by: sourcery-ai[bot] <58596630+sourcery-ai[bot]@users.noreply.github.com>
* Write Missing None
* redis update
* redis update
* redis update
* redis update
---------
Co-authored-by: sourcery-ai[bot] <58596630+sourcery-ai[bot]@users.noreply.github.com>
* Fix/writer memory bug (#326)
* [fix]Fix the bug
* [fix]Fix the bug
* [fix]Correct the direction indication.
* fix(web): markdown table ui update
* Fix/release memory bug (#332)
* Write Missing None
* Write Missing None
* Write Missing None
* Apply suggestion from @sourcery-ai[bot]
Co-authored-by: sourcery-ai[bot] <58596630+sourcery-ai[bot]@users.noreply.github.com>
* Write Missing None
* redis update
* redis update
* redis update
* redis update
* writer_dup_bug/fix
---------
Co-authored-by: sourcery-ai[bot] <58596630+sourcery-ai[bot]@users.noreply.github.com>
* Fix/fact summary (#333)
* [fix]Disable the contents related to fact_summary
* [fix]Disable the contents related to fact_summary
* [fix]Modify the code based on the AI review
* Fix/release memory bug (#335)
* Write Missing None
* Write Missing None
* Write Missing None
* Apply suggestion from @sourcery-ai[bot]
Co-authored-by: sourcery-ai[bot] <58596630+sourcery-ai[bot]@users.noreply.github.com>
* Write Missing None
* redis update
* redis update
* redis update
* redis update
* writer_dup_bug/fix
* writer_graph_bug/fix
* writer_graph_bug/fix
---------
Co-authored-by: sourcery-ai[bot] <58596630+sourcery-ai[bot]@users.noreply.github.com>
* Revert "feat(web): move prompt menu"
This reverts commit 9e6e8f50f8.
* fix(web): ui update
* fix(web): update text
* fix(web): ui update
* fix(model): change the "vl" model type of dashscope to "chat"
* fix(model): change the "vl" model type of dashscope to "chat"
---------
Co-authored-by: zhaoying <yzhao96@best-inc.com>
Co-authored-by: Eternity <1533512157@qq.com>
Co-authored-by: Mark <zhuwenhui5566@163.com>
Co-authored-by: yingzhao <zhaoyingyz@126.com>
Co-authored-by: Timebomb2018 <18868801967@163.com>
Co-authored-by: 乐力齐 <162269739+lanceyq@users.noreply.github.com>
Co-authored-by: lixinyue11 <94037597+lixinyue11@users.noreply.github.com>
Co-authored-by: lixinyue <2569494688@qq.com>
Co-authored-by: Eternity <61316157+myhMARS@users.noreply.github.com>
Co-authored-by: lanceyq <1982376970@qq.com>
Co-authored-by: sourcery-ai[bot] <58596630+sourcery-ai[bot]@users.noreply.github.com>
517 lines
25 KiB
Python
517 lines
25 KiB
Python
"""
|
||
|
||
"""
|
||
|
||
from typing import Any, Optional, List, Dict, Literal, Union
|
||
import time
|
||
import uuid
|
||
from pydantic import BaseModel, Field, ConfigDict, field_validator, model_validator
|
||
|
||
|
||
|
||
|
||
|
||
# ============================================================================
|
||
# 从 json_schema.py 迁移的 Schema
|
||
# ============================================================================
|
||
class BaseDataSchema(BaseModel):
|
||
"""Base schema for the data"""
|
||
# 保持原有必需字段为可选,以兼容不同数据源
|
||
id: Optional[str] = Field(None, description="The unique identifier for the data entry.")
|
||
statement: Optional[str] = Field(None, description="The statement text.")
|
||
created_at: Optional[str] = Field(None, description="The creation timestamp in ISO 8601 format.")
|
||
expired_at: Optional[str] = Field(None, description="The expiration timestamp in ISO 8601 format.")
|
||
description: Optional[str] = Field(None, description="The description of the data entry.")
|
||
|
||
# 新增字段以匹配实际输入数据 - 改为可选以支持resolved_memory场景
|
||
entity1_name: Optional[str] = Field(None, description="The first entity name.")
|
||
entity2_name: Optional[str] = Field(None, description="The second entity name.")
|
||
statement_id: Optional[str] = Field(None, description="The statement identifier.")
|
||
# 新增字段 - 设为可选以保持向后兼容性
|
||
predicate: Optional[str] = Field(None, description="The predicate describing the relationship between entities.")
|
||
relationship_statement_id: Optional[str] = Field(None, description="The relationship statement identifier.")
|
||
# 保留原有字段 - 修改relationship字段类型以支持字符串和字典
|
||
relationship: Optional[Union[str, Dict[str, Any]]] = Field(None, description="The relationship object or string.")
|
||
entity2: Optional[Dict[str, Any]] = Field(None, description="The second entity object.")
|
||
|
||
@model_validator(mode="before")
|
||
def _set_default_created_at(cls, v):
|
||
"""Set default created_at if missing"""
|
||
if isinstance(v, dict) and v.get("created_at") is None:
|
||
from datetime import datetime
|
||
v["created_at"] = datetime.now().isoformat()
|
||
return v
|
||
|
||
|
||
class QualityAssessmentSchema(BaseModel):
|
||
"""Schema for memory quality assessment results."""
|
||
score: int = Field(..., ge=0, le=100, description="Quality score percentage (0-100).")
|
||
summary: str = Field(..., description="Brief summary of data quality status, including main issues and strengths.")
|
||
|
||
|
||
class MemoryVerifySchema(BaseModel):
|
||
"""Schema for memory privacy verification results."""
|
||
has_privacy: bool = Field(..., description="Whether privacy information was detected.")
|
||
privacy_types: List[str] = Field([], description="List of detected privacy information types.")
|
||
summary: str = Field(..., description="Brief summary of privacy detection results.")
|
||
|
||
|
||
class ConflictResultSchema(BaseModel):
|
||
"""Schema for the conflict result data in the reflexion_data.json file."""
|
||
data: List[BaseDataSchema] = Field(..., description="The conflict memory data. Only contains conflicting records when conflict is True.")
|
||
conflict: bool = Field(..., description="Whether the memory is in conflict.")
|
||
quality_assessment: Optional[QualityAssessmentSchema] = Field(None, description="The quality assessment object. Contains score and summary when quality_assessment is enabled, null otherwise.")
|
||
memory_verify: Optional[MemoryVerifySchema] = Field(None, description="The memory privacy verification object. Contains privacy detection results when memory_verify is enabled, null otherwise.")
|
||
|
||
@model_validator(mode="before")
|
||
def _normalize_data(cls, v):
|
||
if isinstance(v, dict):
|
||
d = v.get("data")
|
||
if isinstance(d, dict):
|
||
v["data"] = [d]
|
||
return v
|
||
|
||
|
||
class ConflictSchema(BaseModel):
|
||
"""Schema for the conflict data in the reflexion_data"""
|
||
data: List[BaseDataSchema] = Field(..., description="The conflict memory data.")
|
||
conflict_memory: Optional[BaseDataSchema] = Field(None, description="The conflict memory data.")
|
||
|
||
@model_validator(mode="before")
|
||
def _normalize_data(cls, v):
|
||
if isinstance(v, dict):
|
||
d = v.get("data")
|
||
if isinstance(d, dict):
|
||
v["data"] = [d]
|
||
return v
|
||
|
||
|
||
class ReflexionSchema(BaseModel):
|
||
"""Schema for the reflexion data in the reflexion_data"""
|
||
reason: str = Field(..., description="The reason for the reflexion.")
|
||
solution: str = Field(..., description="The solution for the reflexion.")
|
||
|
||
|
||
class ChangeRecordSchema(BaseModel):
|
||
"""Schema for individual change records
|
||
|
||
字段值格式说明:
|
||
- id: 字符串,表示修改字段对应的记录ID
|
||
- 其他字段: 可以是字符串、None,数组 [修改前的值, 修改后的值],或嵌套字典结构
|
||
- entity2等嵌套对象的字段也遵循 [old_value, new_value] 格式
|
||
"""
|
||
field: List[Dict[str, Any]] = Field(
|
||
...,
|
||
description="List of field changes. First item: {id: value}, followed by changed fields as {field_name: [old_value, new_value]} or {field_name: new_value} or nested structures like {entity2: {field_name: [old, new]}}"
|
||
)
|
||
|
||
class ResolvedSchema(BaseModel):
|
||
"""Schema for the resolved memory data in the reflexion_data"""
|
||
original_memory_id: Optional[str] = Field(None, description="The original memory identifier.")
|
||
# resolved_memory: Optional[BaseDataSchema] = Field(None, description="The resolved memory data (only contains records that need modification).")
|
||
resolved_memory: Optional[Union[BaseDataSchema, List[BaseDataSchema]]] = Field(None, description="The resolved memory data (only contains records that need modification). Can be a single record or list of records.")
|
||
change: Optional[List[ChangeRecordSchema]] = Field(None, description="List of detailed change records with IDs and field information.")
|
||
|
||
|
||
class SingleReflexionResultSchema(BaseModel):
|
||
"""Schema for a single reflexion result item."""
|
||
conflict: ConflictResultSchema = Field(..., description="The conflict result data for this specific conflict type.")
|
||
reflexion: ReflexionSchema = Field(..., description="The reflexion data for this conflict.")
|
||
resolved: Optional[ResolvedSchema] = Field(None, description="The resolved memory data for this conflict.")
|
||
type: str = Field("reflexion_result", description="The type identifier.")
|
||
|
||
class ReflexionResultSchema(BaseModel):
|
||
"""Schema for the complete reflexion result data - a list of individual conflict resolutions."""
|
||
results: List[SingleReflexionResultSchema] = Field(..., description="List of individual conflict resolution results, grouped by conflict type.")
|
||
|
||
@model_validator(mode="before")
|
||
def _normalize_resolved(cls, v):
|
||
if isinstance(v, dict):
|
||
conflict = v.get("conflict")
|
||
if isinstance(conflict, dict) and conflict.get("conflict") is False:
|
||
v["resolved"] = None
|
||
else:
|
||
resolved = v.get("resolved")
|
||
if isinstance(resolved, dict):
|
||
orig = resolved.get("original_memory_id")
|
||
mem = resolved.get("resolved_memory")
|
||
if orig is None and (mem is None or mem == {}):
|
||
v["resolved"] = None
|
||
return v
|
||
|
||
|
||
# ============================================================================
|
||
# 从 messages.py 迁移的 Schema
|
||
# ============================================================================
|
||
|
||
# Composite key identifying a config row
|
||
class ConfigKey(BaseModel): # 配置参数键模型
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
config_id:Union[uuid.UUID, int, str] = Field(..., description="配置唯一标识(UUID或int)")
|
||
user_id: str = Field("user_id", description="用户标识(字符串)")
|
||
apply_id: str = Field("apply_id", description="应用或场景标识(字符串)")
|
||
|
||
|
||
# Allowed chunking strategies (extendable later)
|
||
ChunkerStrategy = Literal[ # 分块策略枚举
|
||
"RecursiveChunker",
|
||
"TokenChunker",
|
||
"SemanticChunker",
|
||
"NeuralChunker",
|
||
"HybridChunker",
|
||
"LLMChunker",
|
||
"SentenceChunker",
|
||
"LateChunker"
|
||
]
|
||
|
||
|
||
# 这是 Request body示例
|
||
class ConfigParams(ConfigKey): # 创建配置参数模型 旧
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
|
||
# Boolean switches
|
||
enable_llm_dedup_blockwise: bool = Field(True, description="启用LLM决策去重")
|
||
enable_llm_disambiguation: bool = Field(True, description="启用LLM决策消歧")
|
||
deep_retrieval: bool = Field(True, description="深度检索开关(保留既有拼写)")
|
||
|
||
# Thresholds in [0, 1]
|
||
t_type_strict: float = Field(0.8, ge=0.0, le=1.0, description="类型严格阈值")
|
||
t_name_strict: float = Field(0.8, ge=0.0, le=1.0, description="名称严格阈值")
|
||
t_overall: float = Field(0.8, ge=0.0, le=1.0, description="综合阈值")
|
||
state: bool = Field(False, description="配置使用状态(True/False)")
|
||
# Chunker strategy selection (must be one of the declared literals)
|
||
chunker_strategy: ChunkerStrategy = Field(
|
||
"RecursiveChunker",
|
||
description=(
|
||
"分块策略:RecursiveChunker/TokenChunker/SemanticChunker/NeuralChunker/"
|
||
"HybridChunker/LLMChunker/SentenceChunker/LateChunker"
|
||
),
|
||
)
|
||
|
||
@field_validator("chunker_strategy", mode="before")
|
||
@classmethod
|
||
def map_chunker_aliases(cls, v: str):
|
||
# 允许常见别名并映射到合法枚举
|
||
if isinstance(v, str):
|
||
m = v.strip().lower()
|
||
alias_map = {
|
||
"auto": "RecursiveChunker",
|
||
"by_sentence": "SentenceChunker",
|
||
"by_paragraph": "SemanticChunker",
|
||
"fixed_tokens": "TokenChunker",
|
||
"递归分块": "RecursiveChunker",
|
||
"token 分块": "TokenChunker",
|
||
"token分块": "TokenChunker",
|
||
"语义分块": "SemanticChunker",
|
||
"神经网络分块": "NeuralChunker",
|
||
"混合分块": "HybridChunker",
|
||
"llm 分块": "LLMChunker",
|
||
"llm分块": "LLMChunker",
|
||
"句子分块": "SentenceChunker",
|
||
"延迟分块": "LateChunker",
|
||
}
|
||
if m in alias_map:
|
||
return alias_map[m]
|
||
return v
|
||
|
||
@field_validator("config_id", "user_id", "apply_id")
|
||
@classmethod
|
||
def non_empty_str(cls, v: str) -> str:
|
||
s = str(v).strip() if v is not None else ""
|
||
if not s:
|
||
raise ValueError("标识字段必须为非空字符串")
|
||
return s
|
||
|
||
|
||
class ConfigParamsCreate(BaseModel): # 创建配置参数模型(仅 body,去除主键)
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
config_name: str = Field("配置名称", description="配置名称(字符串)")
|
||
config_desc: str = Field("配置描述", description="配置描述(字符串)")
|
||
workspace_id: Optional[uuid.UUID] = Field(None, description="工作空间ID(UUID)")
|
||
|
||
# 本体场景关联(可选)
|
||
scene_id: Optional[uuid.UUID] = Field(None, description="本体场景ID(UUID),关联ontology_scene表")
|
||
|
||
# 模型配置字段(可选,用于手动指定或自动填充)
|
||
llm_id: Optional[str] = Field(None, description="LLM模型配置ID")
|
||
embedding_id: Optional[str] = Field(None, description="嵌入模型配置ID")
|
||
rerank_id: Optional[str] = Field(None, description="重排序模型配置ID")
|
||
reflection_model_id: Optional[str] = Field(None, description="反思模型ID,默认与llm_id一致")
|
||
emotion_model_id: Optional[str] = Field(None, description="情绪分析模型ID,默认与llm_id一致")
|
||
|
||
|
||
class ConfigParamsDelete(BaseModel): # 删除配置参数模型(请求体)
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
# config_name: str = Field("配置名称", description="配置名称(字符串)")
|
||
config_id:Union[uuid.UUID, int, str] = Field(..., description="配置ID(支持UUID、整数或字符串)")
|
||
|
||
|
||
class ConfigUpdate(BaseModel): # 更新记忆萃取引擎配置参数时使用的模型
|
||
config_id: Union[uuid.UUID, int, str] = None
|
||
config_name: Optional[str] = Field(None, description="配置名称(字符串)")
|
||
config_desc: Optional[str] = Field(None, description="配置描述(字符串)")
|
||
scene_id: Optional[uuid.UUID] = Field(None, description="本体场景ID")
|
||
|
||
|
||
class ConfigUpdateExtracted(BaseModel): # 更新记忆萃取引擎配置参数时使用的模型
|
||
config_id:Union[uuid.UUID, int, str] = None
|
||
llm_id: Optional[str] = Field(None, description="LLM模型配置ID")
|
||
embedding_id: Optional[str] = Field(None, description="嵌入模型配置ID")
|
||
rerank_id: Optional[str] = Field(None, description="重排序模型配置ID")
|
||
enable_llm_dedup_blockwise: Optional[bool] = None
|
||
enable_llm_disambiguation: Optional[bool] = None
|
||
deep_retrieval: Optional[bool] = Field(None, validation_alias="deep_retrieval")
|
||
|
||
t_type_strict: Optional[float] = Field(None, ge=0.0, le=1.0)
|
||
t_name_strict: Optional[float] = Field(None, ge=0.0, le=1.0)
|
||
t_overall: Optional[float] = Field(None, ge=0.0, le=1.0)
|
||
state: Optional[bool] = None
|
||
chunker_strategy: Optional[ChunkerStrategy] = None
|
||
# 句子提取
|
||
statement_granularity: Optional[int] = Field(2, ge=1, le=3, description="陈述提取颗粒度,挡位 1/2/3;默认 2")
|
||
include_dialogue_context: Optional[bool] = None
|
||
max_context: Optional[int] = Field(1000, gt=100, description="对话语境中包含字符的最大数量(>100);默认 1000")
|
||
|
||
# 剪枝配置:与 runtime.json 中 pruning 段对应
|
||
pruning_enabled: Optional[bool] = Field(None, description="是否启动智能语义剪枝")
|
||
pruning_scene: Optional[Literal["education", "online_service", "outbound"]] = Field(
|
||
None, description="智能剪枝场景:education/online_service/outbound"
|
||
)
|
||
pruning_threshold: Optional[float] = Field(
|
||
None, ge=0.0, le=0.9, description="智能语义剪枝阈值(0-0.9)"
|
||
)
|
||
|
||
# 反思配置
|
||
enable_self_reflexion: Optional[bool] = Field(None, description="是否启用自我反思")
|
||
iteration_period: Optional[Literal["1", "3", "6", "12", "24"]] = Field(
|
||
"3", description="反思迭代周期,单位小时"
|
||
)
|
||
reflexion_range: Optional[Literal["partial", "all"]] = Field(
|
||
"partial", description="反思范围:部分/全部"
|
||
)
|
||
baseline: Optional[Literal["TIME", "FACT", "TIME-FACT"]] = Field(
|
||
"TIME", description="基线:时间/事实/时间和事实"
|
||
)
|
||
|
||
@field_validator("chunker_strategy", mode="before")
|
||
@classmethod
|
||
def map_chunker_aliases_update(cls, v: str):
|
||
if isinstance(v, str):
|
||
m = v.strip().lower()
|
||
alias_map = {
|
||
"auto": "RecursiveChunker",
|
||
"by_sentence": "SentenceChunker",
|
||
"by_paragraph": "SemanticChunker",
|
||
"fixed_tokens": "TokenChunker",
|
||
"递归分块": "RecursiveChunker",
|
||
"token 分块": "TokenChunker",
|
||
"token分块": "TokenChunker",
|
||
"语义分块": "SemanticChunker",
|
||
"神经网络分块": "NeuralChunker",
|
||
"混合分块": "HybridChunker",
|
||
"llm 分块": "LLMChunker",
|
||
"llm分块": "LLMChunker",
|
||
"句子分块": "SentenceChunker",
|
||
"延迟分块": "LateChunker",
|
||
}
|
||
if m in alias_map:
|
||
return alias_map[m]
|
||
return v
|
||
|
||
|
||
class ConfigUpdateForget(BaseModel): # 更新遗忘引擎配置参数时使用的模型
|
||
# 遗忘引擎配置参数更新模型
|
||
config_id:Union[uuid.UUID, int, str] = None
|
||
lambda_time: Optional[float] = Field(0.5, ge=0.0, le=1.0, description="最低保持度,0-1 小数;默认 0.5")
|
||
lambda_mem: Optional[float] = Field(0.5, ge=0.0, le=1.0, description="遗忘率,0-1 小数;默认 0.5")
|
||
offset: Optional[float] = Field(0.0, ge=0.0, le=1.0, description="偏移度,0-1 小数;默认 0.0")
|
||
|
||
|
||
class ConfigPilotRun(BaseModel): # 试运行触发请求模型
|
||
config_id:Union[uuid.UUID, int, str] = Field(..., description="配置ID(唯一,支持UUID、整数或字符串)")
|
||
dialogue_text: str = Field(..., description="前端传入的对话文本,格式如 '用户: ...\nAI: ...' 可多行,试运行必填")
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
|
||
|
||
class ConfigFilter(BaseModel): # 查询配置参数时使用的模型
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
|
||
config_id: Union[uuid.UUID, int, str] = None
|
||
user_id: Optional[str] = None
|
||
apply_id: Optional[str] = None
|
||
|
||
limit: int = Field(20, ge=1, le=200, description="返回数量上限")
|
||
offset: int = Field(0, ge=0, description="起始偏移")
|
||
|
||
|
||
class ApiResponse(BaseModel): # 通用API响应模型
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
code: int = Field(..., description="0=成功,非0=各类业务异常")
|
||
msg: str = Field("", description="说明信息")
|
||
data: Optional[Any] = Field(None, description="返回数据载荷")
|
||
error: str = Field("", description="错误信息,失败时有值,成功为空字符串")
|
||
time: Optional[int] = Field(None, description="响应时间(毫秒,Unix 时间戳)")
|
||
|
||
|
||
def _now_ms() -> int:
|
||
return round(time.time() * 1000)
|
||
|
||
|
||
def ok(msg: str = "OK", data: Optional[Any] = None, time: Optional[int] = None) -> ApiResponse:
|
||
return ApiResponse(code=0, msg=msg, data=data, error="", time=time or _now_ms())
|
||
|
||
|
||
def fail(
|
||
msg: str,
|
||
error_code: str = "ERROR",
|
||
data: Optional[Any] = None,
|
||
time: Optional[int] = None,
|
||
query_preview: Optional[str] = None,
|
||
) -> ApiResponse:
|
||
payload = data
|
||
if query_preview is not None:
|
||
if payload is None:
|
||
payload = {"query_preview": query_preview}
|
||
elif isinstance(payload, dict):
|
||
payload = {**payload, "query_preview": query_preview}
|
||
else:
|
||
payload = {"data": payload, "query_preview": query_preview}
|
||
|
||
return ApiResponse(
|
||
code=1,
|
||
msg=msg,
|
||
data=payload,
|
||
error=error_code,
|
||
time=time or _now_ms(),
|
||
)
|
||
|
||
class GenerateCacheRequest(BaseModel):
|
||
"""缓存生成请求模型"""
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
|
||
end_user_id: Optional[str] = Field(
|
||
None,
|
||
description="终端用户ID(UUID格式)。如果提供,只为该用户生成;如果不提供,为当前工作空间的所有用户生成"
|
||
)
|
||
|
||
|
||
# ============================================================================
|
||
# 遗忘引擎相关 Schema
|
||
# ============================================================================
|
||
|
||
class ForgettingTriggerRequest(BaseModel):
|
||
"""手动触发遗忘周期请求模型"""
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
|
||
end_user_id: str = Field(..., description="组ID(即终端用户ID,必填)")
|
||
max_merge_batch_size: int = Field(100, ge=1, le=1000, description="单次最大融合节点对数(默认100)")
|
||
min_days_since_access: int = Field(30, ge=1, le=365, description="最小未访问天数(默认30天)")
|
||
|
||
|
||
class ForgettingConfigResponse(BaseModel):
|
||
"""遗忘引擎配置响应模型"""
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
|
||
config_id: Union[uuid.UUID, int, str] = Field(..., description="配置ID(支持UUID、整数或字符串)")
|
||
decay_constant: float = Field(..., description="衰减常数 d")
|
||
lambda_time: float = Field(..., description="时间衰减参数")
|
||
lambda_mem: float = Field(..., description="记忆衰减参数")
|
||
forgetting_rate: float = Field(..., description="遗忘速率(根据 lambda_time / lambda_mem 计算得出)")
|
||
offset: float = Field(..., description="偏移量")
|
||
max_history_length: int = Field(..., description="访问历史最大长度")
|
||
forgetting_threshold: float = Field(..., description="遗忘阈值")
|
||
min_days_since_access: int = Field(..., description="最小未访问天数")
|
||
enable_llm_summary: bool = Field(..., description="是否使用 LLM 生成摘要")
|
||
max_merge_batch_size: int = Field(..., description="单次最大融合节点对数")
|
||
forgetting_interval_hours: int = Field(..., description="遗忘周期间隔(小时)")
|
||
|
||
|
||
class ForgettingConfigUpdateRequest(BaseModel):
|
||
"""遗忘引擎配置更新请求模型"""
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
|
||
config_id: Union[uuid.UUID, int,str] = Field(..., description="配置唯一标识(UUID或int)")
|
||
decay_constant: Optional[float] = Field(None, ge=0.0, le=1.0, description="衰减常数 d")
|
||
lambda_time: Optional[float] = Field(None, ge=0.0, le=1.0, description="时间衰减参数")
|
||
lambda_mem: Optional[float] = Field(None, ge=0.0, le=1.0, description="记忆衰减参数")
|
||
offset: Optional[float] = Field(None, ge=0.0, le=1.0, description="偏移量")
|
||
max_history_length: Optional[int] = Field(None, ge=10, le=1000, description="访问历史最大长度")
|
||
forgetting_threshold: Optional[float] = Field(None, ge=0.0, le=1.0, description="遗忘阈值")
|
||
min_days_since_access: Optional[int] = Field(None, ge=1, le=365, description="最小未访问天数")
|
||
enable_llm_summary: Optional[bool] = Field(None, description="是否使用 LLM 生成摘要")
|
||
max_merge_batch_size: Optional[int] = Field(None, ge=1, le=1000, description="单次最大融合节点对数")
|
||
forgetting_interval_hours: Optional[int] = Field(None, ge=1, le=168, description="遗忘周期间隔(小时)")
|
||
|
||
|
||
class ForgettingCycleHistoryPoint(BaseModel):
|
||
"""遗忘周期历史数据点模型(用于趋势图)"""
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
|
||
date: str = Field(..., description="日期(格式: '1/1', '1/2')")
|
||
merged_count: int = Field(..., description="每日融合节点数")
|
||
average_activation: Optional[float] = Field(None, description="平均激活值")
|
||
total_nodes: int = Field(..., description="总节点数")
|
||
execution_time: int = Field(..., description="执行时间(Unix时间戳,秒)")
|
||
|
||
|
||
class PendingForgettingNode(BaseModel):
|
||
"""待遗忘节点模型"""
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
|
||
node_id: str = Field(..., description="节点ID")
|
||
node_type: str = Field(..., description="节点类型:statement/entity/summary")
|
||
content_summary: str = Field(..., description="内容摘要")
|
||
activation_value: float = Field(..., description="激活值")
|
||
last_access_time: int = Field(..., description="最后访问时间(Unix时间戳,秒)")
|
||
|
||
|
||
class ForgettingStatsResponse(BaseModel):
|
||
"""遗忘引擎统计信息响应模型"""
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
activation_metrics: Dict[str, Any] = Field(..., description="激活值相关指标")
|
||
node_distribution: Dict[str, int] = Field(..., description="节点类型分布")
|
||
recent_trends: List[ForgettingCycleHistoryPoint] = Field(..., description="最近7个日期的遗忘趋势数据(每天取最后一次执行)")
|
||
pending_nodes: List[PendingForgettingNode] = Field(..., description="待遗忘节点列表(前20个满足遗忘条件的节点)")
|
||
timestamp: int = Field(..., description="统计时间(时间戳)")
|
||
|
||
|
||
class ForgettingReportResponse(BaseModel):
|
||
"""遗忘周期报告响应模型"""
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
|
||
merged_count: int = Field(..., description="融合的节点对数量")
|
||
nodes_before: int = Field(..., description="遗忘前的节点总数")
|
||
nodes_after: int = Field(..., description="遗忘后的节点总数")
|
||
reduction_rate: float = Field(..., description="节点减少率(0-1)")
|
||
duration_seconds: float = Field(..., description="执行耗时(秒)")
|
||
start_time: str = Field(..., description="开始时间(ISO格式)")
|
||
end_time: str = Field(..., description="结束时间(ISO格式)")
|
||
failed_count: int = Field(..., description="失败的融合数量")
|
||
success_rate: float = Field(..., description="成功率(0-1)")
|
||
|
||
|
||
class ForgettingCurvePoint(BaseModel):
|
||
"""遗忘曲线数据点模型"""
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
|
||
day: int = Field(..., description="天数")
|
||
activation: float = Field(..., description="激活值")
|
||
retention_rate: float = Field(..., description="保持率(与激活值相同)")
|
||
|
||
|
||
class ForgettingCurveRequest(BaseModel):
|
||
"""遗忘曲线请求模型"""
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
|
||
importance_score: float = Field(0.5, ge=0.0, le=1.0, description="重要性分数(0-1)")
|
||
days: int = Field(60, ge=1, le=365, description="模拟天数(默认60天)")
|
||
config_id: Union[uuid.UUID, int, str] = Field(..., description="配置唯一标识(UUID或int)")
|
||
|
||
|
||
class ForgettingCurveResponse(BaseModel):
|
||
"""遗忘曲线响应模型"""
|
||
model_config = ConfigDict(populate_by_name=True, extra="forbid")
|
||
|
||
curve_data: List[ForgettingCurvePoint] = Field(..., description="遗忘曲线数据点列表")
|
||
config: Dict[str, Any] = Field(..., description="使用的配置参数")
|