Memory Systems Deep Dive: Technical Appendix

October 28, 2025 • 22 min read

Complete technical comparison of MemMachine vs mem0, PostgreSQL graph alternatives (recursive CTEs, ltree, adjacency lists), and detailed...

Note: This is the full technical appendix for "Memory Systems and the Graph That Wasn't". Read that post first for context and the high-level decision narrative. This document contains detailed comparisons, code examples, and implementation patterns.

Or: How We Learned That Relationships Can Just Be Rows with JSONB

The Memory Problem

"Memory" sounds singular. In practice it spans: working context (what are we talking about right now), episodic history (what did we talk about before), and durable facts (what have I learned about you). These are different problems with different storage needs.

You start simple: the 8-Bit Oracle should remember what users ask about. When someone consults the I-Ching about their career transition on Monday, then comes back Friday with a follow-up question, the oracle shouldn't start from scratch¹.

The straightforward approach: store full conversations as JSONB, one row per divination session. This works. This is what 8-Bit Oracle ships today.

But then you read about "memory systems" and discover an entire field dedicated to how AI agents should remember things. You find MemMachine (graph-based temporal relationships) and mem0 (LLM-powered fact extraction). Both are open source. Both solve memory elegantly. Both suggest we might be missing something.

So we analyzed them thoroughly—chunking strategies, deduplication logic, storage architectures—and arrived at a realization: we need their ideas, not their implementations. Not yet, anyway.

This is the story of what we actually ship, what we considered adding, and why we're shipping boring Postgres instead of exciting graph databases.

What 8-Bit Oracle Actually Ships Today

Before we explore what we could build, let's be precise about what we have.

Single Source: Supabase Postgres

No Neo4j. No separate vector database. No external memory service. Just Postgres with JSONB columns and GIN indexes.

The Core Table: divination_sessions

// From src/lib/database.types.ts
interface DivinationSession {
  id: string;
  anonymous_user_id: string;
  user_id: string | null;
  session_id: string;
  sequence_number: number;

  // The full conversation (JSONB)
  message_history: Array<{
    role: 'user' | 'assistant';
    content: string;
    timestamp: string;
  }>;

  // Hexagram context (JSONB)
  hexagram_data: {
    currentHexagram: {
      number: number;
      upperTrigram: { english: string };
      lowerTrigram: { english: string };
    };
    transformedHexagram?: {
      number: number;
    };
  } | null;

  // Metadata
  title: string;                        // Generated from question
  summary: string | null;               // Column exists, currently unused
  neutralized_question_text: string | null;

  // LLM accounting
  ai_provider: string | null;
  ai_model: string | null;
  input_tokens: number | null;
  output_tokens: number | null;
  finish_reason: string | null;

  // Settings
  locale: string;
  counselor: string | null;
  divination_mode: string | null;

  timestamp: string | null;
  is_active: boolean | null;
}

The SQL schema confirms the shape:

CREATE TABLE divination_sessions (
    id uuid PRIMARY KEY,
    anonymous_user_id text NOT NULL,
    user_id uuid REFERENCES auth.users(id),
    session_id uuid NOT NULL,
    sequence_number integer NOT NULL,

    message_history jsonb NOT NULL,      -- Full conversation
    hexagram_data jsonb,                 -- Structured I-Ching context

    title varchar(255) NOT NULL,
    summary text,                        -- Exists but unused
    neutralized_question_text text,

    ai_provider text,
    ai_model text,
    input_tokens integer,
    output_tokens integer,
    finish_reason text,

    locale varchar(15) DEFAULT 'en',
    counselor text DEFAULT 'MORPHEUS',
    divination_mode text,

    timestamp timestamptz DEFAULT now(),
    is_active boolean DEFAULT true
);

The Conversation View

We expose a conversation_history view that aggregates messages per session:

CREATE VIEW conversation_history AS
SELECT
  s.id AS session_id,
  s.session_type,
  s.anonymous_user_id,
  s.user_id,
  json_agg(
    json_build_object(
      'id', m.id,
      'role', m.role,
      'content', m.content,
      'name', m.name,
      'function_call', m.function_call
    ) ORDER BY m.sequence_number
  ) AS message_history
FROM sessions s
JOIN messages m ON s.id = m.session_id
GROUP BY s.id;

The Profile Table

User profiles track identity and preferences, but no divination-specific memory:

interface Profile {
  id: string;
  username: string | null;
  full_name: string | null;
  avatar_url: string | null;
  bio: string | null;
  website: string | null;

  // Anonymous user tracking
  active_anonymous_id: string | null;
  anonymous_user_ids: string[] | null;   // Links to divination_sessions

  // Preferences
  theme_preference: string | null;
  notification_preferences: Json | null;
  privacy_settings: Json | null;

  is_admin: boolean | null;
  updated_at: string | null;
}

Notice what's not here:

❌ No divination_profile JSON with consolidated facts
❌ No extracted_facts per session
❌ No related_sessions array
❌ No cross-session similarity graph
❌ No fact extraction or LLM consolidation

What We Store vs. What We Could Store

Data Type	Currently Stored	Could Add (MemMachine/mem0 inspired)
Full conversation	✅ `message_history` JSONB	(already have)
Hexagram context	✅ `hexagram_data` JSONB	(already have)
Session summary	⚠️ `summary` column exists, unused	Populate with LLM summary
Extracted facts	❌ Not stored	`extracted_facts` JSONB column
Related sessions	❌ Not computed	`related_sessions` JSONB array
User profile facts	❌ Not consolidated	`divination_profile` in profiles
Vector embeddings	⚠️ pgvector exists (for Cantonese slang)	Use for session semantic search

This is boring technology. This is what we're running in production. This works².

The MemMachine Architecture

MemMachine is what happens when you take memory seriously as a graph problem. The core insight: memories aren't isolated facts, they're nodes in a temporal graph connected by relationships.

Here's what MemMachine gives you:

Storage: Neo4j + PostgreSQL

Neo4j stores episodic memories as graph nodes
Episodes connect via temporal edges (this happened before that)
PostgreSQL stores user profiles (consolidated facts)
Vector embeddings live in Neo4j's vector index³

Memory Types:

Episodic Memory - Short-term (working memory) and long-term (historical conversations)
Declarative Memory - Searchable derivatives of episodes (chunks, summaries)
Profile Memory - User facts consolidated across sessions using LLM

Chunking Strategies:

Sentence chunking: NLTK sentence tokenizer, each sentence becomes searchable
Concatenation: Join related episodes with separator
Identity: Keep episodes intact without splitting

Deduplication:

UUID-based episode dedup (simple, deterministic)
Set-based node dedup (automatic via Python sets)
LLM-powered profile consolidation (semantic merging)

The configuration looks like this:

long_term_memory:
  derivative_deriver: sentence
  max_episodes: 100

profile_memory:
  storage: postgresql
  consolidation_frequency: 5_sessions

This is elegant. This is well-designed. This is what a proper memory system looks like when you have a PhD in cognitive architecture and know what you're doing⁴.

The mem0 Philosophy

mem0 takes a different approach. Instead of storing full conversation chunks, it extracts and stores salient facts.

Core Insight: Facts > Full Text

Where MemMachine says "store the conversation as searchable derivatives," mem0 says "extract only what matters and throw away the rest."

from mem0 import MemoryClient

memory = MemoryClient(apiKey=API_KEY)

# This conversation:
messages = [
  {"role": "user", "content": "I'm transitioning from law to tech"},
  {"role": "assistant", "content": "Big career change. What prompted this?"},
  {"role": "user", "content": "Burnout. Need more creative work."}
]

# Becomes this stored memory:
# "User is transitioning from law to tech due to burnout, seeking creative work"

Memory Types:

User memory - Facts about individual users across all sessions
Session memory - Facts about a specific conversation
Agent memory - Facts the agent should remember about itself
Multi-level hierarchy - Facts can be session-scoped or user-scoped

Deduplication: LLM-First

When you add new memories, mem0 doesn't just append. It asks the LLM: "Given existing memories and new information, should I add, update, or delete?"

# Existing: "User prefers practical advice"
# New conversation suggests user wants philosophical depth
# LLM consolidates: "User prefers practical advice but appreciates philosophical context"

The result is 10% higher accuracy than RAG, 92% lower latency (1.44s vs 17s), and massive token savings because you're not stuffing full conversation history into every prompt⁵.

The Comparison

Let me lay out what each system offers:

Feature	MemMachine	mem0
Storage	Neo4j + PostgreSQL	Vector + Graph + Key-Value
Chunking	Configurable (sent/concat/id)	LLM fact extraction
Deduplication	UUID + Sets + LLM (profiles)	LLM consolidation (everything)
Retrieval	Graph-based temporal proximity	Vector search + consolidated facts
Infrastructure	Neo4j required	Flexible (Qdrant, Chroma, etc.)
Philosophy	Store derivatives, traverse graph	Extract facts, forget details

MemMachine is for when you need full conversation preservation with graph-based temporal relationships. You want to ask: "How did the user's thinking evolve from session A through B to C?" You want deterministic deduplication and complete audit trails.

mem0 is for when you need performance and intelligence. You want the LLM to decide what's important. You want fast retrieval and low token costs. You don't care about preserving every message, only the salient information.

Both are excellent. Both solve real problems. Both would work.

So here's the question: which one should 8-Bit Oracle use?

The Hidden Third Option

Let's look at what 8-Bit Oracle actually stores right now:

// From database.types.ts
interface DivinationSession {
  id: string;
  user_id: string;
  anonymous_user_ids: string[];

  // The full conversation
  message_history: Array<{
    role: 'user' | 'assistant';
    content: string;
    timestamp: string;
  }>;

  // The divination context
  hexagram_data: {
    currentHexagram: {
      number: number;
      upperTrigram: { english: string };
      lowerTrigram: { english: string };
    };
    transformedHexagram?: {
      number: number;
    };
  };

  // Extracted metadata
  neutralized_question_text: string;
  summary: string;
  locale: string;

  // AI metadata
  ai_provider: string;
  ai_model: string;
  input_tokens: number;
  output_tokens: number;

  timestamp: string;
}

This is already a hybrid memory system. We store:

Full message history (MemMachine-style preservation)
Extracted question (mem0-style fact extraction)
Summary (mem0-style consolidation)
Hexagram context (domain-specific structured data)
Anonymous user tracking (cross-session identity)

The only thing we're missing is:

Cross-session fact extraction - "This user tends to ask about career decisions"
Session relationships - "This reading is similar to their reading from last week"
Profile consolidation - "Automatically build user profile from history"

And here's the realization: we can add all three without Neo4j or mem0.

The JSONB Solution

Let's state the actual requirements:

Requirement 1: Find similar past readings

Not: Build a graph database with temporal edges. Not: Deploy Neo4j and maintain graph infrastructure.

Just: Given a new divination session, find related past sessions.

The graph database solution:

// Neo4j query
MATCH (s1:Session {id: $sessionId})
MATCH (s2:Session {userId: s1.userId})
WHERE s2.timestamp < s1.timestamp
  AND (
    s1.hexagram = s2.hexagram OR
    s1.theme = s2.theme OR
    s1.timestamp - s2.timestamp < 7_days
  )
CREATE (s1)-[:RELATED_TO {similarity: $score}]->(s2)

The JSONB solution:

// After saving new session, find related sessions
const similarSessions = await supabase
  .from('divination_sessions')
  .select('id, title, hexagram_data, timestamp, extracted_facts')
  .eq('user_id', userId)
  .neq('id', newSessionId)
  .order('timestamp', { ascending: false })
  .limit(10);

// Calculate similarity (pure TypeScript)
const relatedSessions = similarSessions
  .map(session => ({
    session_id: session.id,
    relationship: determineRelationship(newSession, session),
    similarity: calculateSimilarity(newSession, session),
  }))
  .filter(r => r.similarity > 0.5)
  .sort((a, b) => b.similarity - a.similarity)
  .slice(0, 5);

// Store denormalized in the new session
await supabase
  .from('divination_sessions')
  .update({ related_sessions: relatedSessions })
  .eq('id', newSessionId);

The similarity calculation doesn't need graph traversal:

function calculateSimilarity(s1: Session, s2: Session): number {
  let score = 0;

  // Same hexagram = high similarity
  if (s1.hexagram_data.currentHexagram.number ===
      s2.hexagram_data.currentHexagram.number) {
    score += 0.5;
  }

  // Shared trigrams = medium similarity
  const trigrams1 = [s1.upperTrigram, s1.lowerTrigram];
  const trigrams2 = [s2.upperTrigram, s2.lowerTrigram];
  const shared = trigrams1.filter(t => trigrams2.includes(t)).length;
  score += shared * 0.15;

  // Same theme = medium similarity
  if (s1.extracted_facts?.primary_concern ===
      s2.extracted_facts?.primary_concern) {
    score += 0.2;
  }

  // Temporal proximity (within 7 days) = small boost
  const daysDiff = Math.abs(
    new Date(s1.timestamp).getTime() -
    new Date(s2.timestamp).getTime()
  ) / (1000 * 60 * 60 * 24);
  if (daysDiff < 7) {
    score += 0.15 * (1 - daysDiff / 7);
  }

  return Math.min(score, 1.0);
}

No graph database. No Neo4j hosting ($65/month for Aura). No learning Cypher. Just Postgres with GIN indexes and TypeScript similarity scoring⁶.

Requirement 2: Extract facts from sessions

The mem0 approach: Send full conversation to LLM, extract salient facts, store in separate memory store.

The JSONB approach: Send full conversation to LLM, extract salient facts, store in same table.

// After saving divination session
const facts = await extractFacts({
  question: lastUserMessage.content,
  interpretation: lastAssistantMessage.content,
  hexagram: hexagramData.currentHexagram.number,
  locale: locale,
});

interface ExtractedFacts {
  primary_concern: string;      // "Career transition"
  emotional_state: string;      // "Anxious but hopeful"
  key_themes: string[];         // ["timing", "patience"]
  hexagram_themes: string[];    // ["waiting", "nourishment"]
}

// Store alongside message_history
await supabase
  .from('divination_sessions')
  .update({
    extracted_facts: facts,
    question_theme: facts.primary_concern
  })
  .eq('id', sessionId);

Schema change required:

ALTER TABLE divination_sessions
ADD COLUMN extracted_facts JSONB,
ADD COLUMN question_theme TEXT;

CREATE INDEX idx_question_theme
  ON divination_sessions(question_theme);

CREATE INDEX idx_extracted_facts_gin
  ON divination_sessions USING GIN (extracted_facts);

That's it. No separate vector database. No fact store. Just one more JSONB column⁷.

Requirement 3: Build user profiles across sessions

The MemMachine approach: Consolidate profile facts in separate PostgreSQL table, use LLM to merge semantically.

The JSONB approach: Consolidate profile facts in existing profiles table, use LLM to merge semantically.

// Add to existing profiles table
interface UserProfile {
  user_id: string;

  // NEW: Divination profile built from history
  divination_profile: {
    life_stage?: string;           // "Early career professional"
    recurring_themes?: string[];   // ["career", "relationships"]
    hexagram_affinity?: Array<{
      number: number;
      count: number;
    }>;
    last_consolidated?: string;    // ISO date
  };
}

// Background job runs after every 5 sessions
async function consolidateUserProfile(userId: string) {
  const recentSessions = await supabase
    .from('divination_sessions')
    .select('extracted_facts, hexagram_data')
    .eq('user_id', userId)
    .order('timestamp', { ascending: false })
    .limit(5);

  const existingProfile = await supabase
    .from('profiles')
    .select('divination_profile')
    .eq('user_id', userId)
    .single();

  // Use LLM to consolidate
  const consolidated = await llm.consolidate({
    existingProfile: existingProfile.divination_profile,
    newSessions: recentSessions.map(s => s.extracted_facts),
  });

  await supabase
    .from('profiles')
    .update({
      divination_profile: consolidated,
    })
    .eq('user_id', userId);
}

This is MemMachine's profile consolidation logic, but using Supabase tables you already have instead of a separate PostgreSQL instance⁸.

What We're Actually Stealing

Let's be precise about what ideas we're taking from each system:

From mem0:

Fact extraction - Don't just store full conversations, extract salient points
Selective loading - Load facts first, full history only on demand
Multi-level context - Session facts + user profile + similar readings

From MemMachine:

Profile consolidation - LLM-powered semantic merging across sessions
Temporal relationships - Track session similarity, not just chronology
Derivative storage - Store both raw (message_history) and processed (extracted_facts)

What we're NOT taking:

❌ Neo4j graph database
❌ Vector search for memory retrieval (we have pgvector for Cantonese slang, but not for session search)
❌ Separate memory service (hosted MemMachine or mem0)
❌ Graph traversal queries
❌ Complex deduplication logic

Why? Because we don't actually need them for the memory use cases we're solving.

The Graph Question

Here's the thing about graph databases: they're fantastic for graph problems. But not everything that has "relationships" is a graph problem.

A graph problem is when you need to:

Find shortest path between nodes
Detect communities/clusters
Calculate PageRank or centrality
Traverse multiple hops efficiently (5+ levels deep)
Pattern match on relationship types

An array-of-relationships problem is when you need to:

Find top 5 most similar sessions
Check if two sessions share a hexagram
Look up related sessions for context
Filter by temporal proximity

8-Bit Oracle has the second problem, not the first.

Let's look at actual queries we'd run:

Query: "Show similar past readings"

Graph database (Neo4j):

MATCH (s1:Session {id: $id})
MATCH (s1)-[r:RELATED_TO]->(s2:Session)
WHERE r.similarity > 0.5
RETURN s2
ORDER BY r.similarity DESC
LIMIT 5

JSONB:

const session = await supabase
  .from('divination_sessions')
  .select('related_sessions')
  .eq('id', sessionId)
  .single();

const similar = session.related_sessions
  .filter(r => r.similarity > 0.5)
  .sort((a, b) => b.similarity - a.similarity)
  .slice(0, 5);

Performance: JSONB is faster (no network hop, GIN indexed). Complexity: JSONB is simpler (no Cypher, no graph traversal). Maintenance: JSONB is easier (no Neo4j instance to manage).

Query: "How did my thinking evolve from session A to session C?"

This is actual graph traversal. You want the path: A → B → C, following relationship edges.

Neo4j:

MATCH path = shortestPath(
  (a:Session {id: $startId})-[:RELATED_TO*]->(c:Session {id: $endId})
)
RETURN path

JSONB:

// Recursive search through related_sessions arrays
function findPath(start: string, end: string, visited = new Set()): Session[] {
  if (start === end) return [start];
  if (visited.has(start)) return null;

  visited.add(start);
  const current = await getSession(start);

  for (const related of current.related_sessions) {
    const path = findPath(related.session_id, end, visited);
    if (path) return [start, ...path];
  }

  return null;
}

Here, Neo4j wins on elegance and performance. But here's the question: do users actually want this?

If a user has sessions A, B, and C about career transitions, do they care about the path from A to C? Or do they just want to see "you asked about this before"?

The path query is intellectually interesting. The similarity query is practically useful. And you can implement the similarity query without a graph database⁹.

PostgreSQL as Graph Lite

If you do decide you need graph-like queries, PostgreSQL has options:

Option 1: Recursive CTEs

WITH RECURSIVE session_path AS (
  -- Base case
  SELECT id, related_sessions, 0 as depth, ARRAY[id] as path
  FROM divination_sessions
  WHERE id = $start_id

  UNION ALL

  -- Recursive case
  SELECT
    ds.id,
    ds.related_sessions,
    sp.depth + 1,
    sp.path || ds.id
  FROM divination_sessions ds
  JOIN session_path sp ON ds.id = ANY(
    SELECT (jsonb_array_elements(sp.related_sessions)->>'session_id')::uuid
  )
  WHERE ds.id != ALL(sp.path)
    AND sp.depth < 5
)
SELECT * FROM session_path WHERE id = $end_id;

This gives you graph traversal without Neo4j. It's standard SQL. It works in Supabase. It's fast enough for small graphs (which user session graphs always are)¹⁰.

Option 2: ltree Extension

PostgreSQL's ltree extension is designed for hierarchical data:

CREATE EXTENSION ltree;

ALTER TABLE divination_sessions
ADD COLUMN session_path ltree;

-- Path format: user_id.year.theme.hex_number
-- Example: user123.2024.career.hex03
UPDATE divination_sessions
SET session_path = text2ltree(
  user_id || '.' ||
  EXTRACT(YEAR FROM timestamp)::text || '.' ||
  question_theme || '.' ||
  'hex' || LPAD(hexagram_number::text, 2, '0')
);

-- Find all sessions in career journey
SELECT * FROM divination_sessions
WHERE session_path <@ 'user123.2024.career'::ltree
ORDER BY session_path;

This is elegant for "journey" visualization. It's built into Postgres. It's GiST-indexed for performance.

Option 3: Just Use Arrays

-- Add simple relationship table if JSONB feels wrong
CREATE TABLE session_links (
  from_session UUID REFERENCES divination_sessions(id),
  to_session UUID REFERENCES divination_sessions(id),
  relationship_type TEXT,
  similarity_score FLOAT,
  PRIMARY KEY (from_session, to_session)
);

CREATE INDEX idx_links_from ON session_links(from_session, similarity_score DESC);
CREATE INDEX idx_links_to ON session_links(to_session, similarity_score DESC);

-- Query similar sessions
SELECT ds.*, sl.similarity_score
FROM divination_sessions ds
JOIN session_links sl ON sl.to_session = ds.id
WHERE sl.from_session = $sessionId
  AND sl.similarity_score > 0.5
ORDER BY sl.similarity_score DESC
LIMIT 5;

This is the classic adjacency list pattern. It's been solving graph-like problems in relational databases since the 1970s. It still works¹¹.

What We Deliberately Don't Have (Yet)

Let's be explicit about what we're not shipping:

❌ Neo4j or any graph database - No temporal relationship edges, no graph traversal
❌ Vector-based memory retrieval - We have pgvector (used for Cantonese slang matching in HK commentary generation), but not for session/conversation retrieval
❌ mem0-style fact store - No extracted facts, no LLM consolidation loops
❌ Cross-session similarity - No related_sessions column, no similarity graph
❌ Profile consolidation - No divination_profile in the profiles table

The summary column exists but is currently unused—not populated on insert, not read in UI or server code³.

Why We Didn't Ship MemMachine or mem0

Scope fit: Our immediate need is storing complete conversations and hexagram context per reading. JSONB fits perfectly. We don't need multi-hop graph traversal when users just want "show me my past readings about career."

Operational simplicity: One database (Supabase Postgres), one mental model (JSONB + GIN indexes), one set of backups. Adding Neo4j means another service to deploy, monitor, version, and maintain. Adding mem0 means managing consolidation loops and fact extraction pipelines.

Cost/benefit: Graph traversal and LLM consolidation add complexity. What do users actually get? Faster follow-up responses? Better related reading suggestions? We haven't validated the user benefit yet, so we're not adding the infrastructure.

The boring technology principle: When in doubt, ship the simplest thing that works. We can always add sophistication later if the use case emerges. We can't easily remove complexity once we've built dependencies on it⁴.

If/When We Add "Memory++"

We kept the ideas on our roadmap. If we do add enhanced memory, here's how we'd do it—incrementally, Postgres-first, without new infrastructure:

Option 1: Fact Extraction per Session

Add extracted_facts JSONB beside message_history:

ALTER TABLE divination_sessions
ADD COLUMN extracted_facts JSONB,
ADD COLUMN question_theme TEXT;

CREATE INDEX idx_question_theme ON divination_sessions(question_theme);
CREATE INDEX idx_facts_gin ON divination_sessions USING GIN (extracted_facts);

interface ExtractedFacts {
  primary_concern: string;      // "Career transition"
  emotional_state: string;      // "Anxious but hopeful"
  key_themes: string[];         // ["timing", "patience"]
  hexagram_themes: string[];    // ["waiting", "nourishment"]
}

// After storing session, extract facts
const facts = await extractWithLLM(lastUserMessage, lastAssistantMessage);
await supabase.from('divination_sessions')
  .update({ extracted_facts: facts })
  .eq('id', sessionId);

This enables fast filtering ("show readings about career") without full-text search on message_history.

Option 2: Shallow Related Links

Add a simple adjacency list for "similar past readings":

CREATE TABLE session_links (
  from_session UUID REFERENCES divination_sessions(id) ON DELETE CASCADE,
  to_session UUID REFERENCES divination_sessions(id) ON DELETE CASCADE,
  relationship_type TEXT,  -- 'same_hexagram', 'theme_related', 'temporal'
  similarity_score FLOAT,
  PRIMARY KEY (from_session, to_session)
);

CREATE INDEX idx_links_from ON session_links(from_session, similarity_score DESC);

Or denormalize into JSONB (fewer joins):

ALTER TABLE divination_sessions
ADD COLUMN related_sessions JSONB DEFAULT '[]'::jsonb;

CREATE INDEX idx_related_gin ON divination_sessions USING GIN (related_sessions);

// After saving new session, compute top 5 similar
const similar = await findSimilarSessions(userId, newSessionId);
const related = similar
  .map(s => ({
    session_id: s.id,
    hexagram: s.hexagram_data.currentHexagram.number,
    similarity: calculateSimilarity(newSession, s)
  }))
  .sort((a, b) => b.similarity - a.similarity)
  .slice(0, 5);

await supabase.from('divination_sessions')
  .update({ related_sessions: related })
  .eq('id', newSessionId);

Similarity calculation doesn't need Neo4j—just TypeScript:

function calculateSimilarity(s1: Session, s2: Session): number {
  let score = 0;
  if (s1.hexagram === s2.hexagram) score += 0.5;
  if (s1.question_theme === s2.question_theme) score += 0.2;
  // Temporal proximity boost
  const daysDiff = Math.abs(s1.timestamp - s2.timestamp) / (24*60*60*1000);
  if (daysDiff < 7) score += 0.15 * (1 - daysDiff/7);
  return Math.min(score, 1.0);
}

Option 3: Periodic Profile Summary

Add a compact JSON summary per user in the existing profiles table:

ALTER TABLE profiles
ADD COLUMN divination_profile JSONB;

CREATE INDEX idx_div_profile_gin ON profiles USING GIN (divination_profile);

interface DivinationProfile {
  life_stage?: string;           // "Early career professional"
  recurring_themes?: string[];   // ["career", "relationships"]
  hexagram_affinity?: Array<{
    number: number;
    count: number;
  }>;
  last_consolidated?: string;
}

// Background job: consolidate after every 5 sessions
async function consolidateProfile(userId: string) {
  const recent = await getRecentSessions(userId, 5);
  const existing = await getProfile(userId);
  const consolidated = await llm.consolidate({
    old: existing.divination_profile,
    new: recent.map(s => s.extracted_facts)
  });
  await updateProfile(userId, { divination_profile: consolidated });
}

Each step stays inside Supabase, leverages GIN indexes, and avoids introducing graph infrastructure until we truly need graph problems solved.

The Philosophy

This is the same lesson as the binary calendar: infrastructure is a smell.

When you find yourself adding databases (Neo4j), you should ask: "Am I solving a graph problem, or do I just have relationships?"

When you find yourself adding services (separate memory layer), you should ask: "Am I solving a distributed systems problem, or do I just have data?"

Most of the time, the answer is: you just have data. And Postgres is very good at data.

MemMachine's insight about temporal graphs is correct. mem0's insight about fact extraction is correct. But the implementation choice—Neo4j, separate vector stores—is solving problems we don't have.

We have:

Small graphs (user session histories, not social networks)
Simple traversal (1-2 hops, not PageRank)
Structured data (hexagrams, themes, timestamps)
Existing Postgres infrastructure (Supabase)

For this, JSONB + GIN indexes + TypeScript logic is sufficient. It's not just "good enough." It's actually better:

Better Performance:

No network hop to Neo4j
GIN indexes are fast
Denormalized related_sessions = O(1) lookup

Better Complexity:

One database instead of two
SQL instead of SQL + Cypher
One mental model instead of two

Better Operations:

Supabase Postgres (already deployed)
No Neo4j or separate vector db instance to maintain
Backups are just Postgres backups

The graph database version would be more "real engineering." More components, more complexity, more résumé bullet points. But the JSONB version is more real—it acknowledges what we actually need rather than what we think we should need¹².

What We Learned

Lesson 1: Relationships ≠ Graphs

Just because data has relationships doesn't mean you need a graph database. Most relationship queries are shallow (1-2 hops). Postgres handles these fine.

Lesson 2: Steal Ideas, Not Implementations

MemMachine's profile consolidation is brilliant. mem0's fact extraction is brilliant. Neo4j and separate vector stores are not essential to those ideas. AI Tooling lets you more easily extract what you need, rather than integrating entire frameworks.

Lesson 3: Denormalization Is Underrated

Storing related_sessions as a JSONB array in each session is denormalization. It duplicates data. It violates normal forms. It's also fast, simple, and exactly what we need.

Lesson 4: JSONB Is a Superpower

Postgres JSONB with GIN indexes gives you:

Schema flexibility (like NoSQL)
Query power (like SQL)
Indexing performance (like specialized stores)

It's not "almost as good as a real graph database." It's better for most use cases.

The Test

Here's how you know you've made the right choice: can you explain it without invoking architecture diagrams?

The Neo4j version requires explaining:

Graph nodes and edges
Cypher query language
Vector indexes in graph context
When to use graph vs relational
How temporal relationships work in graph databases

The JSONB version is:

Each session stores an array of related sessions
Find similar sessions by comparing hexagrams and themes
Store the top 5 in the array
Query the array when you need related sessions

One requires a whiteboard. The other fits in a sentence¹³.

The Principle

Steal ideas, not implementations. Keep the storage model boring, the queries legible, and the infrastructure minimal.

MemMachine taught us: profile consolidation matters, temporal relationships have value, deduplication should be semantic not just structural.

mem0 taught us: fact extraction reduces token costs, selective loading beats full-context dumping, LLM-powered consolidation works.

What we're shipping: Postgres with JSONB, full conversations, hexagram context, zero external dependencies.

What we're not shipping (yet): fact extraction, cross-session links, profile consolidation, graph traversal.

Conclusion

We started by analyzing MemMachine (graph-based temporal memory) and mem0 (LLM-powered fact extraction). Both are excellent. Both solve real problems. Both would work.

But after mapping their architectures to our actual use cases, we realized: we don't need their implementations yet. We need their ideas, stored for when we validate the user benefit.

The current system is boring: one Postgres database, JSONB for conversations, JSONB for hexagram context, GIN indexes for fast queries. No Neo4j ($65/month + ops overhead). No separate vector store. No fact extraction pipelines. No consolidation loops.

This isn't "good enough for now." This is correct for the problem we're solving. Users want the oracle to remember their conversations. We remember them—in message_history JSONB, queryable by session_id, scoped by user_id or anonymous_user_id.

When the day comes that we need multi-hop graph traversal ("show me how my thinking evolved from reading A through B to C"), we'll add it. When we need semantic search across all past readings, we'll add pgvector. When we need LLM-consolidated user profiles, we'll add the background job.

Until then, the oracle remembers through JSONB rows, typed columns, and very fast Postgres queries⁵.

Sometimes the journey from "we should use a memory system" to "we already have one" requires researching graph databases, analyzing chunking strategies, comparing deduplication logic, and understanding what MemMachine and mem0 actually do. You have to understand why Neo4j exists before you can confidently decide you don't need it.

The memory layer was there all along. We just had to stop trying so hard to add sophistication and start shipping the simple thing.

Footnotes

Technical Specifications (Current Production State)

Database: Supabase Postgres
Primary table: divination_sessions
Conversation storage: message_history JSONB (ordered by sequence_number)
Context storage: hexagram_data JSONB, plus title (schema has summary, currently unused)
Identity: anonymous_user_id, optional user_id, session_id
LLM accounting: ai_provider, ai_model, input_tokens, output_tokens, finish_reason
Views: conversation_history for aggregated messages
External services for memory: None
Graph database: None
Vector database: pgvector enabled (used for Cantonese slang matching, not memory retrieval)
Fact extraction: Not implemented
Profile consolidation: Not implemented
Infrastructure cost: $0 additional (existing Supabase)

The oracle remembers. The oracle uses JSONB. The oracle ships boring technology.

This is the core problem of "stateless" LLM APIs. Each request is independent. The model doesn't remember you. Context windows are temporary. If you want memory, you have to build it yourself. This is by design—statelessness scales better—but it means memory becomes the application's responsibility. ↩
"Boring technology" per Dan McKinley's "Choose Boring Technology" (2015): use new tech for competitive advantage, use boring tech for everything else. Postgres is 30 years old. JSONB is 10 years old. They're boring. They work. This is exactly where you want boring. ↩
The summary column was created with optimistic foresight but never wired into the persistence or display logic. It exists in the schema snapshot and generated types, unused. This is fine. Empty columns cost almost nothing in Postgres. If we need it later, it's there. If we don't, it's ignored. ↩ ↩²
"Complexity is easy to add, hard to remove" is a corollary of Lehman's Laws of Software Evolution. Every dependency you add—Neo4j, mem0, vector stores—creates inertia. Teams build on it, monitoring depends on it, backups expect it. Removing it later means migration risk and coordination cost. Better to add complexity when you have validated user demand, not speculative architectural elegance. ↩ ↩²
JSONB performance at scale: Postgres JSONB with GIN indexes handles millions of rows efficiently. Our message_history arrays are small (typically 2-10 messages per session). GIN index lookups are O(log n) on keys, O(1) on row retrieval. For our data shape (thousands of users, tens of sessions each), this is sub-10ms query time. We're nowhere near needing distributed systems or specialized document stores. ↩ ↩²
GIN (Generalized Inverted Index) is Postgres's index type for JSONB, arrays, and full-text search. It's based on inverted indexes (like Lucene/Elasticsearch) but integrated into Postgres. For JSONB queries with containment operators (@>, ?, etc.), GIN indexes are extremely fast—often faster than dedicated document stores because there's no network hop. ↩
The field of memory systems for AI agents spans "memory-augmented neural networks" and "episodic memory for agents." Both research communities independently discovered that AI systems need three types of memory: working (current context), episodic (past events), and semantic (learned facts). Humans figured this out in cognitive psychology in the 1960s. AI is catching up. MemMachine and mem0 represent two different implementations of these principles. ↩
MemMachine comes from academic research on cognitive architectures for AI agents. The design shows: theoretically grounded, well-abstracted, handles edge cases you wouldn't think of. This is a compliment. It's also why it might be overkill for a production app that just needs to remember user conversations without multi-hop graph traversal. ↩
Mem0's benchmarks: 10% higher accuracy (67% vs 61%) compared to RAG baseline, and 92% lower latency (1.44s vs 17s) compared to full-context approach. The latency improvement comes from storing facts instead of full conversations—smaller retrieval, smaller prompts, faster inference. The accuracy comes from LLM-powered consolidation that removes contradictions and duplicates. These are real benefits, if you need them. ↩
Recursive CTEs in Postgres have depth limits (default 100) and performance degrades with depth. But user session graphs are shallow. Most users have dozens of sessions, not thousands. Most relationships are 1-2 hops, not 10. For this scale, recursive CTEs are perfectly adequate. You don't need Neo4j until you're doing multi-hop traversal on millions of nodes with complex relationship types. ↩
The adjacency list pattern (relationships in a separate table with from/to foreign keys) dates to the 1970s. It's in every database textbook. It's "boring technology" in the best sense—well-understood, widely supported, obviously correct. Modern graph databases offer query expressiveness and traversal optimizations. But for simple relationship queries (top N similar sessions), the old pattern still works fine. ↩
Rich Hickey's "Simple Made Easy" distinction applies: graph databases are easy to reach for because they're marketed as "the solution for connected data." Postgres + JSONB is simple because it doesn't add dependencies, doesn't require new mental models, solves the actual problem without extra machinery. Easy is what you think of first. Simple is what you wish you'd thought of first. ↩
The "can you explain it without a whiteboard" test is surprisingly reliable. If the architecture requires diagrams to understand, it's probably too complex for the problem. If you can explain it in one sentence ("we store full conversations as JSONB in Postgres"), it's probably about right. This isn't always true—sometimes complex problems require complex solutions—but it's true more often than we admit. ↩

This post was written with the assistance of Claude (Anthropic). The author provided editorial direction, project context, and fact-checked all claims. The AI assisted with drafting and research.