Skip to content

Related Work

This page maps Memoir's design choices to the research that informs them. Sections are organized by which Memoir component the citation grounds, with a short note on relevance — not a literature review, just enough to point a curious reader at the strongest prior art.

Classifier — Hierarchical Text Classification with LLMs

Memoir's IntelligentClassifier performs zero/few-shot hierarchical multi-label classification with a single LLM call against a structured taxonomy prompt. This is a quietly active subfield (2024–2025) with several papers describing nearly the same construction.

  • TELEClass: "Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision", 2024. Minimally-supervised hierarchical classification that uses LLM-generated class-indicative terms to augment the taxonomy. Closest match to Memoir's LLMIterativeTaxonomy expansion logic, where the taxonomy itself grows from observed content. arXiv:2403.00165
  • TaxMorph: "Hierarchical Text Classification with LLM-Refined Taxonomies". Frames the LLM as a taxonomist — the model can reshape an input hierarchy into a more semantically coherent one. Justifies Memoir's stance that the taxonomy is a living artifact, not a fixed schema. arXiv:2601.18375
  • Single-pass Hierarchical Text Classification with LLMs (Payberah, 2024). Single-call hierarchical classification, mirroring Memoir's "one prompt, full taxonomy in context, structured JSON out" approach. Compares flat vs. hierarchical vs. discriminative baselines. PDF
  • HierPrompt: "Zero-Shot Hierarchical Text Classification", EMNLP Findings 2025. Pure zero-shot hierarchical classification — relevant because Memoir's taxonomy can be deployed cold against new domains without labelled data. PDF
  • KG-HTC: "Integrating Knowledge Graphs into LLMs for Effective Zero-shot Hierarchical Text Classification", 2025. Treating the taxonomy as a structured prompt input. Same intuition Memoir uses when injecting ${TAXONOMY_BLOCK} into the classifier and Stop-hook prompts. arXiv:2505.05583
  • "Effective Hierarchical Text Classification with Large Language Models", SN Computer Science, 2025. Survey-style empirical comparison of LLM-based HTC approaches. Useful background. Springer

Long-Term Memory Architecture for LLM Agents

The broader frame Memoir competes in: how should an agent's persistent memory be organized, retrieved, and pruned. Memoir's distinguishing claim — versionable, branchable, taxonomy-keyed memory with cryptographic proofs — sits adjacent to but not duplicated by these systems.

  • Packer et al., "MemGPT: Towards LLMs as Operating Systems", 2023. The foundational hierarchical-memory-tier paper (main / recall / archival). Memoir's branches and namespaces are an alternative topology for the same problem — same goal of fitting unbounded history into a bounded working set, different solution. arXiv:2310.08560
  • Xu et al., "A-MEM: Agentic Memory for LLM Agents", 2025. Current SOTA on multi-hop memory benchmarks, reporting 85–93% token-usage reduction vs. MemGPT. A-MEM's self-organized note-linking is the closest published analog to Memoir's related_keys cross-references. arXiv:2502.12110
  • Mem0: "Building Production-Ready AI Agents with Scalable Long-Term Memory", 2025. The production-systems framing Memoir competes with most directly. Useful as a comparison point on operational concerns (latency, cost, retrieval quality). arXiv:2504.19413
  • "Hierarchical Memory for High-Efficiency Long-Term Reasoning in LLM Agents", July 2025. Direct support for the hierarchical-taxonomy-as-memory thesis Memoir advances.
  • "Memory in the Age of AI Agents: A Survey", 2025. The survey to cite when framing Memoir's place in the landscape; identifies five mechanism families (context-resident compression, retrieval-augmented stores, reflective self-improvement, hierarchical virtual context, policy-learned management). Memoir is primarily retrieval-augmented + hierarchical virtual context with a versioning twist. Paper list
  • "Memory for Autonomous LLM Agents: Mechanisms, Evaluation, and Emerging Frontiers", 2026. Recent companion survey covering evaluation methodology — useful for grounding Memoir's benchmark choices. arXiv:2603.07670

Versioned Structured Data and Content-Addressed Storage

Memoir's git-like-versioning-for-memory thesis sits on a body of systems work, much of it grey literature, but with one foundational academic citation.

  • Benet, "IPFS — Content Addressed, Versioned, P2P File System", 2014. The canonical content-addressed-DAG paper. Memoir applies the same construction (immutable content-addressed chunks → Merkle DAG of revisions) to memory blobs rather than files. arXiv:1407.3561

Memento Pattern — Episodic, Semantic, and Spatial Memory

Memoir's three Memento containers (ProfileMemento, TimelineMemento, the location-keyed equivalent) map cleanly onto the cognitive-science distinction between semantic, episodic, and spatial memory that has been formalized for LLM agents over the last two years. This is the most direct theoretical alignment in Memoir.

  • Park et al., "Generative Agents: Interactive Simulacra of Human Behavior", UIST 2023. Introduces the episodic-event-stream + reflection-into-semantic-memory pattern that Memoir's Timeline → Profile flow mirrors. Their three retrieval scores (recency, importance, relevance) define the design space Memoir's search engine operates in. arXiv:2304.03442
  • Sumers et al., "Cognitive Architectures for Language Agents" (CoALA), 2023. The canonical academic taxonomy for agent memory; episodic / semantic / procedural categories are CoALA's. Letta, Mem0, and LangChain all use CoALA as their framing — Memoir's Mementos sit in the same ontology. arXiv:2309.02427
  • "Synapse: Empowering LLM Agents with Episodic-Semantic Memory via Spreading Activation", 2025. Activation-based retrieval across the episodic ↔ semantic boundary. Closest prior art for Memoir's related_keys cross-reference design, which is structurally a static spreading-activation graph. arXiv:2601.02744

Notable Gaps

Two parts of Memoir's design currently sit ahead of the published literature, worth flagging honestly:

  • "Memory worthiness" filtering — the should-this-turn-become-a-memory? decision Memoir's Stop hook performs. Generative Agents and A-MEM both touch importance scoring, but no paper is dedicated to this filter.
  • Multi-key sibling backlinks at storage time — A-MEM's note-linking is the closest analog, but it is computed at retrieval time, not frozen into the blob at write. Memoir's design (write-time related_keys, edit-preserved) appears to be novel.

These are reasonable areas for a future Memoir whitepaper to claim contribution.