MemoryBear

Author	SHA1	Message	Date
lanceyq	6419dcd932	[commit] Refactor write pipeline	2026-05-08 11:28:24 +08:00
lanceyq	9dc9b7aee7	refactor(memory): remove legacy extraction pipeline and add dialog_at temporal grounding - Delete ExtractionOrchestrator (~2500 lines) and write_tools legacy path; MemoryService/WritePipeline is now the sole write path - Remove NEW_PIPELINE_ENABLED feature flag from memory_agent_service - Simplify pilot_run_service to always use PilotWritePipeline - Add dialog_at field to statement and triplet extraction prompts as the primary reference time for resolving relative temporal expressions - Rewrite relative time phrases (e.g. 昨天, 下周) into concrete dates directly in statement_text when stably resolvable from dialog_at - Rename extracat_Pruning.jinja2 to extracat_pruning.jinja2; expand few-shot examples and update memory type enum (drop NULL, add agreement/repetition/other)	2026-05-08 11:28:24 +08:00
lanceyq	cf389bb978	refactor(memory): remove expired_at field and add dialog_at timestamp Remove the deprecated expired_at field from all graph models, Neo4j Cypher queries, repositories, and pipeline code. Replace with dialog_at on StatementNode to track the original dialog timestamp. - Strip expired_at from DialogueNode, ChunkNode, StatementNode, ExtractedEntityNode, edges, and all Cypher queries - Add dialog_at to MessageItem schema and propagate through extraction and graph build steps - Extract emotion/metadata async submission from WritePipeline into a generic _submit_celery_task helper - Add post_store_dedup_and_alias_merge Celery task for async alias merging and second-layer dedup after Neo4j write - Switch pytest async backend from anyio to asyncio_mode=auto	2026-05-08 11:27:59 +08:00
lanceyq	d66d601e41	refactor(memory): redesign metadata extraction as async pipeline step - Replace extract_user_metadata_task with entity-level extract_metadata_batch_task - Add MetadataExtractionStep following ExtractionStep pattern with Jinja2 prompts - Flatten MetadataExtractionResponse to 9-field schema (aliases, core_facts, etc.) - Add Cypher queries for incremental metadata writeback and alias edge redirection - Wire _extract_metadata into WritePipeline as Step 3.6 (fire-and-forget) - Add pilot_write() to MemoryService; refactor pilot_run_service to use it - Extract snapshot logic into WriteSnapshotRecorder	2026-05-08 11:27:51 +08:00
lanceyq	4af9b02815	feat(memory): propagate temporal validity fields through extraction pipeline - Add valid_at/invalid_at passthrough in triplet extraction prompt (both zh/en) - Propagate temporal_validity to EntityEntityEdge in ExtractionOrchestrator - Use coalesce() for valid_at/invalid_at in Neo4j cypher queries to handle NULLs - Fix workspace_id/config_id UUID parsing in read_memory config resolution - Downgrade verbose extraction pipeline logs from info to debug - Remove UUID and short API key patterns from sensitive filter to reduce false positives - Standardize log message format (use = spacing, end_user_id label) - Fix misindented TODO comment in write_pipeline.py	2026-05-08 11:26:24 +08:00
lanceyq	1f0c88a5f0	refactor(memory): consolidate write pipeline and rename statement extraction step - Rename StatementExtractionStep → StatementTemporalExtractionStep and extract_statement.jinja2 → extract_statement_temporal.jinja2 to reflect merged temporal extraction logic - Move extraction_pipeline_orchestrator.py out of steps/ to engine root - Move dedup_step.py into steps/ directory - Introduce WriteMemoryRequest schema to replace positional args in write_memory() - Extract _resolve_and_load_config, _preprocess_files, _write_neo4j, and _invalidate_interest_cache as private helpers in MemoryAgentService - Remove shadow pipeline and simplify NEW_PIPELINE_ENABLED branch - Merge 类型归属/成员隶属/任职服务 relation types into single 归属身份关系 in triplet prompt - Add alias merge logic (别名属于) in deduplication and MERGE_ALIAS_BELONGS_TO Cypher query - Add StorageType, Language, MessageItem enums/models to memory_agent_schema - Reduce AgentMemory_Long_Term.DEFAULT_SCOPE from 6 to 1 - Delete standalone extract_temporal.jinja2 (logic merged into statement step)	2026-05-08 11:26:24 +08:00
lanceyq	7747ed7ac1	refactor(memory): enhance extraction ontology and add assistant pruning graph support - Expand entity type ontology with detailed definitions, examples, and notes (merged types: 地点设施, 物品设备, 产品服务, 软件平台, 角色职业, 知识能力, 偏好习惯目标, 称呼别名, 智能体) - Add relation ontology taxonomy with 15 predicate categories and usage rules - Strengthen reference resolution rules: resolve pronouns before extraction, skip unresolvable references entirely - Add guidelines to avoid extracting abstract propositions, emotions, and low-value entities (effort/reward/success patterns) - Add 7 new extraction examples covering edge cases - Add AssistantOriginal/AssistantPruned node models and graph persistence (PRUNED_TO and BELONGS_TO_DIALOG edges, Neo4j indexes and constraints) - Add graph_build_step.py for building graph nodes/edges from DialogData - Update write_pipeline.py to pass assistant pruning nodes/edges to graph saver - Update data_pruning.py with related preprocessing changes	2026-05-08 11:26:24 +08:00
lanceyq	2355536b44	refactor(memory): add PilotWritePipeline and enrich extraction schema - Add dedicated PilotWritePipeline (statement → triplet → graph_build → layer-1 dedup, no Neo4j write) - Add type_description/predicate_description fields across entity and triplet models, Cypher queries, and graph builders - Refactor data_pruning with LRU cache and snapshot support; skip assistant chunks in extraction - Remove strict Predicate enum whitelist; support statement_text alias in legacy extractor - Wire PipelineSnapshot through preprocessing and emotion extraction for debug tracing - Add PILOT_RUN_USE_REFACTORED_PIPELINE env toggle for pipeline selection	2026-05-08 11:26:04 +08:00
lanceyq	b0ddd12cc6	feat(memory): add emotion batch extraction task and improve extraction prompts - Add extract_emotion_batch_task for async emotion extraction - Refine Chinese entity types and relation types in extraction prompts - Add STATEMENT_EMOTION_UPDATE Cypher query for Neo4j backfill - Refactor statement_step and triplet_step implementations	2026-05-08 11:26:04 +08:00
lanceyq	a98011fc8a	feat(memory): implement step-based extraction pipeline architecture Introduce ExtractionStep abstraction with modular pipeline stages: - Add base ExtractionStep class with render/call/parse lifecycle - Implement StatementExtractionStep, TripletExtractionStep, EmbeddingStep, EmotionStep, GraphBuildStep, and DedupStep - Add SidecarStepFactory for hot-pluggable non-critical steps - Define Pydantic I/O schemas for all pipeline stages - Refactor WritePipeline to orchestrate new step-based flow - Add NEW_PIPELINE_ENABLED env switch for old/new pipeline routing - Add emotion_enabled config flag to MemoryConfig - Fix workspace_id reference in get_end_user_connected_config	2026-05-08 11:26:04 +08:00
lanceyq	41535c34e6	feat(memory): add WritePipeline and MemoryService facade Introduce a layered pipeline architecture for the memory write flow: - WritePipeline: orchestrates preprocess → extract → store → cluster → summarize with deadlock retry, resource cleanup, and pilot-run support - MemoryService: facade that delegates to WritePipeline, placeholder methods for read/forget/reflect - BearLogger: structured step-level logging with perf threshold alerts - Shadow pipeline integration in MemoryAgentService (env-gated pilot run) Also includes: - Fix deprecated SQLAlchemy declarative_base import - Extend Neo4j Entity fulltext index to cover description and aliases - Migrate Pydantic schemas to v2 (ConfigDict, field_validator)	2026-05-08 11:26:04 +08:00
Ke Sun	feae2f2e1e	Merge pull request #1033 from SuanmoSuanyangTechnology/release/v0.3.2 Release/v0.3.2 v0.3.2	2026-04-30 11:12:12 +08:00
Mark	415234d4c8	Merge pull request #1032 from SuanmoSuanyangTechnology/fix/sandbox feat(core): add configurable SANDBOX_URL for code node sandbox requests	2026-04-29 20:26:55 +08:00
Eternity	e38a60e107	feat(core): add configurable SANDBOX_URL for code node sandbox requests	2026-04-29 20:24:10 +08:00
yingzhao	86eb08c73f	Merge pull request #1027 from SuanmoSuanyangTechnology/fix/release0.3.2_zy fix(web): node executionStatus update remove silent	2026-04-29 12:26:26 +08:00
zhaoying	53f1b0e586	fix(web): node executionStatus update remove silent	2026-04-29 12:24:34 +08:00
yingzhao	49cc47a79a	Merge pull request #1026 from SuanmoSuanyangTechnology/fix/release0.3.2_zy fix(web): ontology tag	2026-04-29 12:17:40 +08:00
zhaoying	1817f52edf	fix(web): ontology tag	2026-04-29 11:55:43 +08:00
山程漫悟	40633d72c3	Merge pull request #1024 from SuanmoSuanyangTechnology/fix/Timebomb_032 fix(workspace)	2026-04-28 18:37:50 +08:00
Timebomb2018	6f10296969	fix(workspace): deactivate user when removed from last active workspace	2026-04-28 18:34:06 +08:00
yingzhao	89228825cf	Merge pull request #1023 from SuanmoSuanyangTechnology/fix/v0.3.2_zy fix(web): workflow redo/undo	2026-04-28 17:41:45 +08:00
zhaoying	cab4deb2ff	fix(web): workflow redo/undo	2026-04-28 17:37:59 +08:00
Ke Sun	4048a10858	ci: add GitHub Actions workflow to sync all branches and tags to Gitee	2026-04-28 16:44:50 +08:00
yingzhao	d6ef0f4923	Merge pull request #1022 from SuanmoSuanyangTechnology/fix/v0.3.2_zy fix(web): thinking_budget_tokens add min & default value	2026-04-28 16:18:11 +08:00
zhaoying	75fbe44839	fix(web): add min validator	2026-04-28 16:17:31 +08:00
山程漫悟	06597c567b	Merge pull request #1019 from SuanmoSuanyangTechnology/fix/Timebomb_032 fix(workspace)	2026-04-28 16:11:44 +08:00
Timebomb2018	28694fefb0	fix(app): adjust thinking budget tokens default and validation range The default thinking budget tokens value was changed from 10000 to 1024 in base.py, and the minimum validation constraint was updated from 1024 to 1 in app_schema.py to allow smaller budgets while maintaining backward compatibility.	2026-04-28 16:10:44 +08:00
zhaoying	7a0f08148e	fix(web): thinking_budget_tokens add min & default value	2026-04-28 16:10:18 +08:00
Timebomb2018	d3058ce379	fix(workspace): make delete workspace member async and invalidate user tokens	2026-04-28 15:04:13 +08:00
Ke Sun	8d88df391d	Merge pull request #1017 from SuanmoSuanyangTechnology/revert-1016-feat/episodic-memory-detail-and-pagination Revert "refactor(memory): replace raw dict responses with Pydantic schema mod…"	2026-04-27 18:50:43 +08:00
Ke Sun	7621321d1b	Revert "refactor(memory): replace raw dict responses with Pydantic schema mod…"	2026-04-27 18:50:26 +08:00
Ke Sun	0e29b0b2a5	Merge pull request #1016 from SuanmoSuanyangTechnology/feat/episodic-memory-detail-and-pagination refactor(memory): replace raw dict responses with Pydantic schema mod…	2026-04-27 18:43:53 +08:00
lanceyq	2fa4d29548	fix(memory): use explicit None checks and remove unnecessary Optional type - Replace truthiness checks with 'is not None' for data.message in graph_data and community_graph endpoints to handle empty string correctly - Remove Optional wrapper from GraphStatistics.edge_types since it already has a default_factory	2026-04-27 18:39:33 +08:00
yingzhao	7bb181c1c7	Merge pull request #1014 from SuanmoSuanyangTechnology/fix/v0.3.2_zy Fix/v0.3.2 zy	2026-04-27 18:07:10 +08:00
zhaoying	a9c87b03ff	Merge branch 'fix/v0.3.2_zy' of github.com:SuanmoSuanyangTechnology/MemoryBear into fix/v0.3.2_zy	2026-04-27 18:05:59 +08:00
zhaoying	720af8d261	fix(web): file icon	2026-04-27 18:04:55 +08:00
山程漫悟	09d32ed446	Merge pull request #1015 from SuanmoSuanyangTechnology/fix/Timebomb_032 fix(multimodal)	2026-04-27 18:01:12 +08:00
lanceyq	9a5ce7f7c6	refactor(memory): replace raw dict responses with Pydantic schema models in user memory controllers - Add user_memory_schema.py with typed Pydantic models for all user memory API responses: MemoryInsightReportData, UserSummaryData, GraphData, MemoryTypeStatItem, cache result models, and RelationshipEvolutionData - Refactor user_memory_controllers.py to construct schema instances and return model_dump() instead of raw dicts - Remove unused imports (datetime, timestamp_to_datetime, EndUserInfoResponse, EndUserInfoCreate, EndUser)	2026-04-27 17:57:06 +08:00
Timebomb2018	531d785629	fix(multimodal): support HTML image tags in document extraction and chat responses - Replace plain image URLs with `<img src="..." data-url="...">` HTML tags in multimodal and document extractor services - Propagate citations from workflow end events to client responses - Update system prompts to instruct LLMs to render images using Markdown `![alt](url)` with strict UUID-preserving URL copying	2026-04-27 17:56:58 +08:00
zhaoying	6d80d74f4a	Merge branch 'fix/v0.3.2_zy' of github.com:SuanmoSuanyangTechnology/MemoryBear into fix/v0.3.2_zy	2026-04-27 17:55:51 +08:00
Ke Sun	3d9882643e	ci: add GitHub Actions workflow to sync all branches and tags to Gitee	2026-04-27 17:48:35 +08:00
zhaoying	b4e4be1133	fix(web): chat file icon	2026-04-27 17:42:56 +08:00
zhaoying	16926d9db5	fix(web): tool node config reset	2026-04-27 17:10:02 +08:00
zhaoying	f369a63c8d	fix(web): loop & iteration child node history	2026-04-27 16:31:10 +08:00
zhaoying	1861b0fbc9	Merge branch 'fix/v0.3.2_zy' of github.com:SuanmoSuanyangTechnology/MemoryBear into fix/v0.3.2_zy	2026-04-27 16:07:20 +08:00
zhaoying	750d4ca841	fix(web): custom tool schema api add case Co-authored-by: Copilot <copilot@github.com>	2026-04-27 16:04:02 +08:00
山程漫悟	ce4a3daec7	Merge pull request #1012 from SuanmoSuanyangTechnology/fix/wxy-032 feat(workflow): augment logging queries and ameliorate error handling	2026-04-27 16:00:49 +08:00
山程漫悟	c12d06bb07	Merge pull request #1013 from SuanmoSuanyangTechnology/fix/Timebomb_032 fix(workflow)	2026-04-27 15:51:18 +08:00
Timebomb2018	98d8d7b261	fix(conversation_schema): refine citations field type to Dict[str, Any]	2026-04-27 15:49:21 +08:00
Timebomb2018	12a08a487d	fix(tool_controller): re-raise HTTPException to preserve original status codes	2026-04-27 15:47:34 +08:00

1 2 3 4 5 ...

3268 Commits