ai-space/opencode-working-memory

Fork 0

mirror of https://github.com/sdwolf4103/opencode-working-memory.git synced 2026-06-02 06:19:36 +02:00

Files

T

Ralph Chang 01bda7c134 fix(memory): freeze hot session prompt epoch

2026-05-20 12:38:16 +08:00

17 KiB

Raw Blame History

Architecture Documentation

Overview

OpenCode Working Memory implements a three-layer memory architecture designed to preserve context across OpenCode session compactions.

┌─────────────────────────────────────────────────────────────┐
│  LAYER 1: WORKSPACE MEMORY (Long-term, cross-session)      │
│  • Persistent storage: ~/.local/share/opencode-working-...  │
│  • Types: feedback | project | decision | reference        │
│  • Sources: explicit | compaction | manual                  │
│  • Render limits: 3600 chars / 28 entries                    │
│  • Survives: session reset, compaction (same workspace)    │
└─────────────────────────────────────────────────────────────┘
                              ↓
┌─────────────────────────────────────────────────────────────┐
│  LAYER 2: HOT SESSION STATE (Short-term, per-session)        │
│  • Session-scoped tracking: active files, open errors       │
│  • Storage: sessions/{sessionID}.json                       │
│  • Frozen prompt snapshot shares the workspace epoch        │
│  • Auto-extracted from tool usage and explicit remembers    │
│  • Cleared: on new session start                             │
└─────────────────────────────────────────────────────────────┘
                              ↓
┌─────────────────────────────────────────────────────────────┐
│  LAYER 3: NATIVE OPENCODE STATE                             │
│  • Uses OpenCode's built-in todos during compaction          │
│  • No additional storage required                            │
│  • Delegated to OpenCode's native features                   │
└─────────────────────────────────────────────────────────────┘

Layer 1: Workspace Memory

Purpose

Long-term memory that persists across sessions within the same workspace. Perfect for:

Project conventions and patterns
Important decisions that span sessions
User preferences for this codebase

Storage

Location: ~/.local/share/opencode-working-memory/workspaces/{workspaceKey}/workspace-memory.json
Workspace Key: First 16 chars of sha256(realpath(workspaceRoot))

Schema:

{
  version: 1,
  workspace: { root: string, key: string },
  limits: { maxRenderedChars: 3600, maxEntries: 28 },
  entries: LongTermMemoryEntry[],
  lastActivityAt?: string,
  updatedAt: string
}

Evidence Log Schema

Workspace diagnostics also read the append-only evidence log for the current workspace. New evidence records are additive and keep historical records valid:

{
  version: 1,
  eventId: string,
  createdAt: string,
  workspaceKey: string,
  workspaceRootHash: string,
  producerName?: string,
  producerVersion?: string,
  instrumentationVersion?: number,
  type: EvidenceEventType,
  phase: EvidencePhase,
  outcome: EvidenceOutcome,
  reasonCodes: string[],
  details?: Record<string, string | number | boolean | null | string[] | number[]>
}

Instrumentation version 2 added optional causal block details for diagnostics without backfilling old JSONL records. Instrumentation version 3 adds elapsed-window reinforcement details such as details.elapsedMs, details.requiredElapsedMs, details.sameSession, details.reinforcementMode, and details.legacyMissingTimestamp. Historical reinforcement-block events may still include older details.blockReason values such as same_session, same_utc_day, min_interval, max_count, or may have missing block details. Capacity-removal events may include details.strengthAtRemoval, details.rankAtRemoval, details.typeRankAtRemoval, and details.ageDaysAtRemoval. memory-diag quality treats missing producer/instrumentation fields as historical or ambiguous rather than proof of current behavior.

Entry Types

Type	Purpose	Example
`feedback`	User preferences	"User prefers functional React components"
`project`	Project-level info	"This monorepo uses turborepo"
`decision`	Important decisions	"Use PostgreSQL for primary database"
`reference`	Key references	"API endpoints defined in `src/api/`"

Entry Sources

Source	Confidence	How Added
`explicit`	1.0	User said "remember this"
`compaction`	0.75	Extracted during compaction
`manual`	varies	Programmatically added

Memory Extraction

During compaction, OpenCode Working Memory scans for Memory candidates: sections:

Memory candidates:
- [decision] Use npm cache for plugin loading
- [project] This repo uses TypeScript with strict mode

Legacy Format: OpenCode Working Memory also accepts <workspace_memory_candidates> XML blocks for backward compatibility, but this format is deprecated.

Quality Gate: Not all candidates become memories. OpenCode Working Memory rejects:

Git commit hashes (e.g., abc1234)
Raw errors (e.g., Error: something failed)
Stack traces
Path-heavy facts (>50% paths)
Very short text (<20 chars)

Consolidation, Deduplication, and Retention

Memories are deduplicated and consolidated with accounting:

Normalize exact text: lowercase, strip punctuation, collapse whitespace.
Group project/reference entries by identity where possible.
Keep decision and feedback entries on exact canonical matching to avoid broad semantic merges.
Keep the best surviving entry by source, confidence, specificity, and freshness tie-breakers.
Emit accounting events so pending memories can be classified as promoted, absorbed, superseded, or rejected.

This prevents absorbed or superseded pending memories from retrying forever while still preserving the active surviving memory.

Retention then decides which active memories are rendered into prompt context. It does not hard-delete old memories by age.

strength = initialStrength * 2 ** (-effectiveAgeDays / effectiveHalfLifeDays)

Initial strength is based on memory type, source, and optional user importance. Confidence remains stored for compatibility but is not part of retention scoring.

Rendered candidates are selected in this order:

Exclude status: "superseded" entries.
Compute current retention strength.
Sort by strength descending.
Apply per-type caps.
Keep the top 28 rendered entries under the workspace memory character budget.

Default type caps:

Type	Rendered cap
`feedback`	10
`decision`	10
`project`	8
`reference`	6

The type-cap total is 34, intentionally above the global 28-entry cap. These are maximums, not quotas.

Dormant workspaces age more slowly: after 14 inactive days, additional dormant time counts at 0.25x for retention decay. Repeated duplicate memories reinforce the surviving entry only after a rolling 7-day elapsed window. Below reinforcement count 6, an allowed recurrence increments the reinforcement count and refreshes retention timestamps; at count 6 or higher, an allowed recurrence refreshes retention timestamps without increasing the count. Same-session status is recorded as diagnostic evidence, not as a new-policy block reason.

Safety-Critical Deprecation

The safetyCritical field on LongTermMemoryEntry is deprecated as of the retention v1.5.1 model update. It no longer affects retention strength or type-cap bypass. The field is preserved in the type definition for backward compatibility with existing workspace memory JSON files, but has no active behavior. Safety rules should be maintained in user-controlled files such as agent.md rather than in system memory.

System Prompt Injection

Workspace memory is injected at the top of every message:

Workspace memory (cross-session, verify if stale):
decision:
- Use npm cache for plugin loading, not npm link
project:
- This repo uses the opencode-agenthub plugin system
reference:
- Storage: ~/.local/share/opencode-working-memory/...

Layer 2: Hot Session State

Purpose

Track current session context automatically:

What files are you working on?
What errors are currently open?
What decisions were made recently?
Which explicit memories are pending promotion?

Hot session state is stored continuously during a session, but it is not rendered as a per-turn dynamic prompt. The prompt layer uses a frozen hot snapshot created or refreshed at the same epoch boundary as frozen workspace memory. Active files and open errors are current at epoch boundaries, not on every normal turn. After epoch start, the conversation/tool transcript is the source of truth for newer events.

Storage

Location: ~/.local/share/opencode-working-memory/workspaces/{workspaceKey}/sessions/{hashedSessionID}.json

Schema:

{
  version: 1,
  sessionID: string,
  turn: number,
  updatedAt: string,
  activeFiles: ActiveFile[],
  openErrors: OpenError[],
  recentDecisions: SessionDecision[],
  pendingMemories: LongTermMemoryEntry[]
}

Active Files

Automatically tracked from tool.execute.after events:

Action	Weight
`edit`	50
`write`	45
`grep`	30
`read`	20

Files are ranked by: ACTION_WEIGHT[action] + count * 3

Open Errors

Tracked from tool.execute.after events when exitCode !== 0:

Category	Trigger Pattern
`typecheck`	`TS####:` or TypeScript errors
`test`	Test failures
`lint`	ESLint warnings/errors
`build`	Build failures
`runtime`	`Error:`, `TypeError:`, etc.

False Positive Guards:

Commands like git log, cat with "error" in output are ignored
Only actual command failures (exitCode !== 0) trigger errors
exitCode === undefined is ignored (no error created, no error cleared)

Error Fingerprinting

Errors are fingerprinted by:

Extract error message summary
Generate fingerprint: first 12 chars of sha256(summary)
Group similar errors by fingerprint

recentDecisions

Short-term decisions made this session. Candidates for promotion to workspace memory during compaction.

System Prompt Injection

Workspace memory and hot session state are separate cached prompt layers that share a prompt epoch lifecycle:

system[1]*: frozen workspace memory
system[2+]*: frozen hot snapshot

The hot state example below is included in a frozen hot snapshot when the epoch is created or refreshed, not rendered again on every normal turn. Active files and open errors are current at epoch boundaries, not on every normal turn; the plugin intentionally does not invalidate the hot snapshot on active-file or open-error changes because doing so would defeat prefix KV-cache reuse. Explicit pending memories persist in session state and the pending journal, then promote safely at compaction; once the current epoch caches exist, new pending memories do not force pre-history prompt refresh. After epoch start, the conversation/tool transcript is the source of truth for newer events.

Hot session state snapshot (epoch start; conversation history may be newer):

active_files:
- src/plugin.ts (edit, 18x)
- tests/plugin.test.ts (edit, 5x)

open_errors: (none)

recent_decisions:
- Use frozen workspace memory snapshots for cache stability

pending_memories:
- [decision] Parser supports 3 candidate formats

Layer 3: Native OpenCode State

Purpose

Delegate task tracking to OpenCode's native features.

Behavior

Uses OpenCode's built-in todos during compaction
No additional storage or injection required
Allows the agent to manage task lists natively

Plugin Hooks

OpenCode Working Memory hooks into OpenCode lifecycle events:

`experimental.chat.system.transform`

Injects cached frozen workspace memory and cached frozen hot snapshot prompts into the system prompt. Normal tool/user churn updates storage but does not mutate these pre-history prompts until a new epoch starts.

`tool.execute.after`

Tracks active files (read, grep, edit, write actions)
Tracks open errors from failed commands
Clears errors when commands succeed
Ignores exitCode === undefined (successful commands without explicit exit codes)

`experimental.session.compacting`

Extracts workspace memory candidates from conversation. Applies quality gate, redaction, migration, consolidation accounting, deduplication, and source priority.

`event` (session.compacted, session.deleted)

session.compacted: Promote session decisions to workspace memory
session.deleted: Clean up session state files

Promotion uses accounting results from workspace memory normalization. Pending memories that are kept are promoted; duplicate memories are absorbed; exact decision replacements can be superseded; over-capacity compaction memories are rejected. Stale-marked memories are not hard-pruned by age; they lose rendered space through retention strength and cap competition.

Quality Guarantees

No False Positive Errors

// Bad: Would create false positive
"Error: something failed" in output

// Good: Actually failed
exitCode === 1 && output.includes("Error")

// Good: Actually succeeded
exitCode === 0 (clears errors for that category)

// Good: Ignore ambiguous cases
exitCode === undefined → skip error tracking

Negative Memory Filtering

// Correctly interpreted
"don't remember this" → NOT added to memory
"不要記住這個" → NOT added to memory
"remember this" → added to memory candidates

Canonical Deduplication

// Same memory (after normalization)
"Use npm cache for plugins"
"USE NPM CACHE for plugins!!"
"use npm cache for plugins."

// All map to same canonical key
canonical("Use npm cache for plugins") === "use npm cache for plugins"

Compaction Quality Gate

// Rejected (not valuable as long-term memory)
"4832b38 fix: something"  // git hash
"Error: something failed"  // raw error
"at Object.method (file.ts:42)"  // stack trace
"/Users/x/project/file.ts /Users/x/project/other.ts"  // path-heavy

// Accepted
"[decision] Use npm cache for plugin loading"  // good pattern

File System Layout

~/.local/share/opencode-working-memory/
└── workspaces/
    └── {workspaceKey}/
        ├── workspace-memory.json      # Long-term memory
        └── sessions/
            └── {hashedSessionID}.json  # Session state

Workspace Key

// First 16 chars of SHA-256 hash of workspace root realpath
const workspaceKey = sha256(realpath(workspaceRoot)).slice(0, 16)

Storage Safety

All read-modify-write updates go through updateJSON(), which combines an in-process promise queue with an on-disk .lock file. The lock file uses exclusive creation, heartbeat refreshes while held, stale-lock recovery after 30 seconds, and a 5 second wait timeout for live contention. Direct readJSON() is fallback-oriented and does not mutate data except when corrupt JSON is quarantined.

Performance Considerations

Memory Budgets

Layer	Max Chars	Max Entries
Workspace Memory	3600	28
Hot Session State	700	8 files, 3 errors

Injection Overhead

Workspace memory: usually under ~2000 chars in observed rendered output
Hot session state: usually under ~500 chars in observed rendered output
Total: typically well below the configured maximums

Storage Footprint

Workspace memory: ~2-5 KB per workspace
Session state: ~1-3 KB per session
Auto-cleanup on workspace/session deletion

Extension Points

Custom Memory Types

Add new types in src/types.ts:

export type LongTermType = "feedback" | "project" | "decision" | "reference" | "custom";

Custom Error Categories

Add new categories in src/types.ts:

export type ErrorCategory = "typecheck" | "test" | "lint" | "build" | "runtime" | "custom";

Custom Extraction Patterns

Modify src/extractors.ts to add new extraction patterns.

Migration Notes

Memory V1 to V2

OpenCode Working Memory automatically migrates old format files to the new three-layer architecture. No manual intervention needed.

Last Updated: April 2026
Implementation: src/plugin.ts, src/extractors.ts, src/workspace-memory.ts, src/session-state.ts

17 KiB Raw Blame History

Architecture Documentation

Overview

Layer 1: Workspace Memory

Purpose

Storage

Evidence Log Schema

Entry Types

Entry Sources

Memory Extraction

Consolidation, Deduplication, and Retention

Safety-Critical Deprecation

System Prompt Injection

Layer 2: Hot Session State

Purpose

Storage

Active Files

Open Errors

Error Fingerprinting

recentDecisions

System Prompt Injection

Layer 3: Native OpenCode State

Purpose

Behavior

Plugin Hooks

experimental.chat.system.transform

tool.execute.after

experimental.session.compacting

event (session.compacted, session.deleted)

Quality Guarantees

No False Positive Errors

Negative Memory Filtering

Canonical Deduplication

Compaction Quality Gate

File System Layout

Workspace Key

Storage Safety

Performance Considerations

Memory Budgets

Injection Overhead

Storage Footprint

Extension Points

Custom Memory Types

Custom Error Categories

Custom Extraction Patterns

Migration Notes

Memory V1 to V2

17 KiB

Raw Blame History

`experimental.chat.system.transform`

`tool.execute.after`

`experimental.session.compacting`

`event` (session.compacted, session.deleted)