MemoryBear

Author	SHA1	Message	Date
lanceyq	e3ab19dd4f	feat(memory): sync user entity aliases and metadata to PostgreSQL - Add `aliases` and `end_user_id` fields to user entity dicts in `collect_user_entities_for_metadata` so downstream tasks can write them to PostgreSQL - Add `update_aliases_and_metadata` method to `EndUserInfoRepository` for incremental, case-insensitive dedup merge of aliases and structured metadata fields - Add `_sync_end_user_info_pg` helper in tasks.py that writes aliases and extracted metadata to `end_user_info`, and back-fills `end_user.other_name` when empty - Call `_sync_end_user_info_pg` from `extract_metadata_batch_task` after Neo4j write, and also when no new metadata but aliases exist - Filter `meta_data` response in `UserMemoryService.get_end_user_info` to expose only four core fields: goals, traits, interests, core_facts	2026-05-08 11:28:44 +08:00
lanceyq	9dc9b7aee7	refactor(memory): remove legacy extraction pipeline and add dialog_at temporal grounding - Delete ExtractionOrchestrator (~2500 lines) and write_tools legacy path; MemoryService/WritePipeline is now the sole write path - Remove NEW_PIPELINE_ENABLED feature flag from memory_agent_service - Simplify pilot_run_service to always use PilotWritePipeline - Add dialog_at field to statement and triplet extraction prompts as the primary reference time for resolving relative temporal expressions - Rewrite relative time phrases (e.g. 昨天, 下周) into concrete dates directly in statement_text when stably resolvable from dialog_at - Rename extracat_Pruning.jinja2 to extracat_pruning.jinja2; expand few-shot examples and update memory type enum (drop NULL, add agreement/repetition/other)	2026-05-08 11:28:24 +08:00
lanceyq	cf389bb978	refactor(memory): remove expired_at field and add dialog_at timestamp Remove the deprecated expired_at field from all graph models, Neo4j Cypher queries, repositories, and pipeline code. Replace with dialog_at on StatementNode to track the original dialog timestamp. - Strip expired_at from DialogueNode, ChunkNode, StatementNode, ExtractedEntityNode, edges, and all Cypher queries - Add dialog_at to MessageItem schema and propagate through extraction and graph build steps - Extract emotion/metadata async submission from WritePipeline into a generic _submit_celery_task helper - Add post_store_dedup_and_alias_merge Celery task for async alias merging and second-layer dedup after Neo4j write - Switch pytest async backend from anyio to asyncio_mode=auto	2026-05-08 11:27:59 +08:00
lanceyq	d66d601e41	refactor(memory): redesign metadata extraction as async pipeline step - Replace extract_user_metadata_task with entity-level extract_metadata_batch_task - Add MetadataExtractionStep following ExtractionStep pattern with Jinja2 prompts - Flatten MetadataExtractionResponse to 9-field schema (aliases, core_facts, etc.) - Add Cypher queries for incremental metadata writeback and alias edge redirection - Wire _extract_metadata into WritePipeline as Step 3.6 (fire-and-forget) - Add pilot_write() to MemoryService; refactor pilot_run_service to use it - Extract snapshot logic into WriteSnapshotRecorder	2026-05-08 11:27:51 +08:00
lanceyq	4af9b02815	feat(memory): propagate temporal validity fields through extraction pipeline - Add valid_at/invalid_at passthrough in triplet extraction prompt (both zh/en) - Propagate temporal_validity to EntityEntityEdge in ExtractionOrchestrator - Use coalesce() for valid_at/invalid_at in Neo4j cypher queries to handle NULLs - Fix workspace_id/config_id UUID parsing in read_memory config resolution - Downgrade verbose extraction pipeline logs from info to debug - Remove UUID and short API key patterns from sensitive filter to reduce false positives - Standardize log message format (use = spacing, end_user_id label) - Fix misindented TODO comment in write_pipeline.py	2026-05-08 11:26:24 +08:00
lanceyq	1f0c88a5f0	refactor(memory): consolidate write pipeline and rename statement extraction step - Rename StatementExtractionStep → StatementTemporalExtractionStep and extract_statement.jinja2 → extract_statement_temporal.jinja2 to reflect merged temporal extraction logic - Move extraction_pipeline_orchestrator.py out of steps/ to engine root - Move dedup_step.py into steps/ directory - Introduce WriteMemoryRequest schema to replace positional args in write_memory() - Extract _resolve_and_load_config, _preprocess_files, _write_neo4j, and _invalidate_interest_cache as private helpers in MemoryAgentService - Remove shadow pipeline and simplify NEW_PIPELINE_ENABLED branch - Merge 类型归属/成员隶属/任职服务 relation types into single 归属身份关系 in triplet prompt - Add alias merge logic (别名属于) in deduplication and MERGE_ALIAS_BELONGS_TO Cypher query - Add StorageType, Language, MessageItem enums/models to memory_agent_schema - Reduce AgentMemory_Long_Term.DEFAULT_SCOPE from 6 to 1 - Delete standalone extract_temporal.jinja2 (logic merged into statement step)	2026-05-08 11:26:24 +08:00
lanceyq	2355536b44	refactor(memory): add PilotWritePipeline and enrich extraction schema - Add dedicated PilotWritePipeline (statement → triplet → graph_build → layer-1 dedup, no Neo4j write) - Add type_description/predicate_description fields across entity and triplet models, Cypher queries, and graph builders - Refactor data_pruning with LRU cache and snapshot support; skip assistant chunks in extraction - Remove strict Predicate enum whitelist; support statement_text alias in legacy extractor - Wire PipelineSnapshot through preprocessing and emotion extraction for debug tracing - Add PILOT_RUN_USE_REFACTORED_PIPELINE env toggle for pipeline selection	2026-05-08 11:26:04 +08:00
lanceyq	a98011fc8a	feat(memory): implement step-based extraction pipeline architecture Introduce ExtractionStep abstraction with modular pipeline stages: - Add base ExtractionStep class with render/call/parse lifecycle - Implement StatementExtractionStep, TripletExtractionStep, EmbeddingStep, EmotionStep, GraphBuildStep, and DedupStep - Add SidecarStepFactory for hot-pluggable non-critical steps - Define Pydantic I/O schemas for all pipeline stages - Refactor WritePipeline to orchestrate new step-based flow - Add NEW_PIPELINE_ENABLED env switch for old/new pipeline routing - Add emotion_enabled config flag to MemoryConfig - Fix workspace_id reference in get_end_user_connected_config	2026-05-08 11:26:04 +08:00
lanceyq	41535c34e6	feat(memory): add WritePipeline and MemoryService facade Introduce a layered pipeline architecture for the memory write flow: - WritePipeline: orchestrates preprocess → extract → store → cluster → summarize with deadlock retry, resource cleanup, and pilot-run support - MemoryService: facade that delegates to WritePipeline, placeholder methods for read/forget/reflect - BearLogger: structured step-level logging with perf threshold alerts - Shadow pipeline integration in MemoryAgentService (env-gated pilot run) Also includes: - Fix deprecated SQLAlchemy declarative_base import - Extend Neo4j Entity fulltext index to cover description and aliases - Migrate Pydantic schemas to v2 (ConfigDict, field_validator)	2026-05-08 11:26:04 +08:00
Timebomb2018	6f10296969	fix(workspace): deactivate user when removed from last active workspace	2026-04-28 18:34:06 +08:00
Timebomb2018	d3058ce379	fix(workspace): make delete workspace member async and invalidate user tokens	2026-04-28 15:04:13 +08:00
Timebomb2018	531d785629	fix(multimodal): support HTML image tags in document extraction and chat responses - Replace plain image URLs with `<img src="..." data-url="...">` HTML tags in multimodal and document extractor services - Propagate citations from workflow end events to client responses - Update system prompts to instruct LLMs to render images using Markdown `![alt](url)` with strict UUID-preserving URL copying	2026-04-27 17:56:58 +08:00
山程漫悟	ce4a3daec7	Merge pull request #1012 from SuanmoSuanyangTechnology/fix/wxy-032 feat(workflow): augment logging queries and ameliorate error handling	2026-04-27 16:00:49 +08:00
Timebomb2018	f7fa33c0c4	Merge remote-tracking branch 'origin/release/v0.3.2' into fix/Timebomb_032	2026-04-27 15:36:03 +08:00
Timebomb2018	faf8d1a51a	fix(workflow): add reasoning content, suggested questions, citations and audio status support - Introduce `reasoning_content`, `suggested_questions`, `citations`, and `audio_status` fields in conversation and app response schemas - Conditionally set `audio_status` to `"pending"` only when `audio_url` is present - Replace `model_dump` override with `@model_serializer(mode="wrap")` for cleaner serialization logic - Change knowledge base validation failure from `RuntimeError` to warning + `continue` to avoid halting retrieval on invalid KB	2026-04-27 15:35:26 +08:00
wxy	adb7f873b5	Merge remote-tracking branch 'origin/fix/wxy-032' into fix/wxy-032	2026-04-27 15:29:54 +08:00
wxy	b64bcc2c50	feat(workflow): augment logging queries and ameliorate error handling - Augment log search with app type filtering to enable keyword searching within workflow_executions. - Introduce execution sequence markers to ensure logs are displayed in the correct chronological order. - Ameliorate error handling to capture successful node outputs alongside failure details. - Rectify the processing of empty JSON bodies in HTTP request nodes.	2026-04-27 15:20:25 +08:00
山程漫悟	d9de96cffa	Merge pull request #1011 from wanxunyang/fix/wxy-032 fix(api_key): bypass publication check for SERVICE type API keys	2026-04-27 14:44:19 +08:00
wxy	546bfb9627	fix(api_key): bypass publication check for SERVICE type API keys - Exclude SERVICE type keys from application publication validation since their resource_id targets the workspace instead of an application.	2026-04-27 14:05:06 +08:00
Timebomb2018	a268d0f7f1	fix(multimodal_service): add '文档内容：' prefix to document text and simplify image placeholder text	2026-04-27 12:25:27 +08:00
山程漫悟	2c14344d3f	Merge pull request #1002 from SuanmoSuanyangTechnology/feature/agent-tool_xjn fix(multimodal_service)	2026-04-24 19:42:38 +08:00
Timebomb2018	141fd94513	fix(multimodal_service): refactor image processing to use intermediate list before extending result	2026-04-24 19:40:57 +08:00
山程漫悟	6cb48664b7	Merge pull request #992 from wanxunyang/develop-wxy fix(workflow): rectify error handling and bolster execution logging	2026-04-24 18:58:40 +08:00
wxy	f63bcd6321	refactor(tool): flatten request body parameters for model exposure - Refactor the extraction logic in tool service to flatten request body parameters into independent arguments exposed to the model.	2026-04-24 18:49:55 +08:00
wxy	21eb500680	refactor(workflow): streamline node execution handling and log service logic - Consolidate node data retrieval from workflow_executions.output_data to unify storage access. - Optimize the construction of messages and execution records to support opening suggestions. - Eliminate redundant queries and storage logic to simplify the overall codebase structure.	2026-04-24 18:20:14 +08:00
Ke Sun	c70f536acc	Merge pull request #986 from SuanmoSuanyangTechnology/feat/episodic-memory-detail-and-pagination feat:episodic memory detail and pagination	2026-04-24 18:19:11 +08:00
Ke Sun	5f96a6380e	Merge pull request #990 from SuanmoSuanyangTechnology/feature/celery-task-scheduler Feature/celery task scheduler	2026-04-24 18:19:00 +08:00
Timebomb2018	4b0afe867a	fix(app_chat_service,draft_run_service): move system_prompt augmentation before LangChainAgent instantiation	2026-04-24 18:00:44 +08:00
Timebomb2018	8f31236303	fix(app_chat_service,draft_run_service): move system_prompt augmentation before LangChainAgent instantiation	2026-04-24 17:48:15 +08:00
wwq	cf8db47389	feat(workflow): augment logging capabilities with execution status and loop support - Augment workflow logs with execution status fields and loop node information. - Refactor log service to handle distinct processing logic for workflows and agents. - Construct message and node logs derived from workflow_executions data.	2026-04-24 17:02:03 +08:00
Timebomb2018	74be09340c	feat(multimodal): support tenant-aware document image storage and improve image placeholder labeling - Pass workspace_id to multimodal_service.process_files across app_chat_service, draft_run_service - Fetch tenant_id from workspace in multimodal_service for proper file storage scoping - Update image placeholder format from "[第N页第M张图片]" to "[图片第N页第M张图片]" for clarity - Add strict URL preservation rules to system prompt for agents handling document images - Refactor _save_doc_image_to_storage to accept explicit tenant_id and workspace_id instead of inferring from FileMetadata	2026-04-24 15:56:06 +08:00
wwq	cedf47b3bc	fix(workflow): rectify error handling and bolster execution logging	2026-04-24 15:29:33 +08:00
Timebomb2018	2c2551e15c	feat(citation): add download_url to citations when allow_download is enabled	2026-04-24 14:44:27 +08:00
Eternity	be10bab763	refactor(core): migrate task scheduler to per-user queue with dynamic sharding	2026-04-24 14:21:18 +08:00
Timebomb2018	89f2f9a045	feat(citation): support downloading cited documents with allow_download toggle Added `allow_download` flag to citation config and `download_url` field to citation output. Implemented `/citations/{document_id}/download` endpoint to serve original files when enabled. Removed unused `files` field and `HttpRequestDataProcessing` model from HTTP request node config.	2026-04-24 14:18:25 +08:00
wwq	0f7a7263eb	fix(workflow): rectify error handling and bolster execution logging - Rectify exception propagation during node execution failures to ensure errors are correctly raised. - Bolster workflow logging to support failed status records and persist node execution data, including loop nodes.	2026-04-24 11:39:33 +08:00
Timebomb2018	767eb5e6f2	feat(multimodal): support document image extraction and inline vision processing Added document image extraction capability for PDF and DOCX files, including page/index metadata and storage integration. Extended `process_files` with `document_image_recognition` flag to conditionally enable vision-based image processing when model supports it. Updated knowledge repository and workflow node logic to enforce status=1 checks. Added PyMuPDF dependency.	2026-04-24 11:18:50 +08:00
wwq	5c89acced6	fix(api_key): validate application publication status before key generation - Ensure the application exists and is published when resource_id is present; raise an exception otherwise.	2026-04-24 10:29:41 +08:00
山程漫悟	9fdb952396	Merge pull request #985 from wanxunyang/develop-wxy feat: enhance workflow debugging, logging and auth middleware	2026-04-24 10:17:32 +08:00
wwq	fb23c34475	feat: enhance HTTP request debugging and extend logging data - feat(http_request): augment debugging capabilities with raw request generation and improved error handling. - feat(app_log): extend session filtering logic to support retrieving all session types. - feat(log): add 'process' field to node execution records for better data tracking.	2026-04-23 20:55:34 +08:00
miao	4619b40d03	fix(memory): fix timezone and add generate_cache API endpoint - Fix episodic memory time filter to use UTC (datetime.fromtimestamp with tz=timezone.utc) to match Neo4j stored UTC timestamps - Add POST /v1/memory/analytics/generate_cache endpoint for cache generation via API Key Modified files: - api/app/services/memory_explicit_service.py - api/app/controllers/service/user_memory_api_controller.py	2026-04-23 19:32:13 +08:00
miao	7ac0eff0b8	fix(memory): fix problems - Parameterize SKIP/LIMIT in Cypher query instead of f-string interpolation - Add UUID format validation in validate_end_user_in_workspace before DB query - Update limit/depth Query descriptions to clarify auto-cap behavior in service layer - Move uuid import to module level in api_key_utils.py Modified files: - api/app/services/memory_explicit_service.py - api/app/core/api_key_utils.py - api/app/controllers/service/user_memory_api_controller.py	2026-04-23 16:29:22 +08:00
wwq	404ce9f9ba	feat(workflow): enhance HTTP request node with curl debugging support - Augment HTTP request node capabilities and add generated curl commands for easier debugging. feat(log): implement workflow execution logs and search functionality - Add detailed logging for workflow node execution and enable search capabilities within application logs. feat(auth): introduce middleware to verify application publication status - Add a check to ensure the application is published before allowing access. fix(converter): rectify variable handling logic in Dify converter - Correct issues related to processing variables within the Dify converter module. refactor(model): remove quota check decorator from model update operations - Decouple quota validation from the model update process to streamline the logic.	2026-04-23 15:46:12 +08:00
miao	aac89b172f	fix(memory): remove unused date import and fix docstring route paths Remove unused rom datetime import date in controller and service Fix Examples route paths from /episodic-list to /episodics to match actual router	2026-04-23 15:37:54 +08:00
miao	5c836c90c9	feat(memory): add episodic memory pagination and semantic memory list API Split explicit memory overview into two independent endpoints: - GET /memory/explicit-memory/episodics: episodic memory paginated query with date range filter (millisecond timestamp) and episodic type filter using Neo4j datetime() for precise time comparison - GET /memory/explicit-memory/semantics: semantic memory full list query returns data as array directly Modified files: - api/app/controllers/memory_explicit_controller.py - api/app/services/memory_explicit_service.py	2026-04-23 15:30:58 +08:00
Ke Sun	b8009074d5	Merge branch 'release/v0.3.1' into develop	2026-04-23 12:16:57 +08:00
Eternity	f93ec8d609	fix(core): fix end_user_id reference and add task status tracking - Fix write_router to use actual_end_user_id instead of end_user_id - Add task status tracking via Redis in scheduler - Expose task_id in memory write response - Fix logging import path in scheduler	2026-04-22 18:06:14 +08:00
Eternity	c5ae82c3c2	refactor(core): migrate memory write tasks to centralized scheduler	2026-04-22 16:50:06 +08:00
Mark	363d775270	Merge pull request #961 from SuanmoSuanyangTechnology/fix/wxy_031 fix(api): fix API Key rate limiting and terminal user quota checks	2026-04-21 20:57:25 +08:00
wwq	ad4121b0d8	fix(api): fix API Key rate limiting and terminal user quota checks - Revert API Key rate limit handling to throw an error instead of auto-capping when exceeding the plan limit. - Optimize terminal user quota check logic to validate only during new user creation, avoiding redundant checks. - Add method to query terminal users by `workspace_id` and `other_id`.	2026-04-21 20:48:06 +08:00

1 2 3 4 5 ...

682 Commits