HSOKV - Human-like Sequential Knowledge with Vocabulary

Revolutionary memory-based learning system that mimics human cognition.

Like "Attention is All You Need" revolutionized transformers, HSOKV revolutionizes continual learning with memory.

🧠 KV-1: The First Immortal Personal Intelligence (2025)

Built on HSOKV - The flagship application demonstrating zero catastrophic forgetting in production.

# Run KV-1 OS
python examples/kv1_os.py

KV-1 Features:

✅ Never forgets (HSOKV-powered memory)
✅ Proactive monitoring & interventions
✅ Nightly self-improvement
✅ True persistence across reboots
✅ Speaks unprompted when needed

See examples/kv1_os.py for the complete implementation.

Core Innovation: Dual Memory System

HSOKV implements real human memory architecture:

Short-Term Memory (Working Memory)

Capacity: 7±2 items (Miller's Magic Number)
Duration: 15-30 seconds without rehearsal
Storage: Key-value pairs (fast O(1) lookup)
Access: Direct, immediate

Long-Term Memory

Capacity: Unlimited
Duration: Permanent
Storage: RAG with vector embeddings
Access: Semantic similarity search

Consolidation Process

Rehearsal: 3+ accesses → automatic consolidation to long-term
Time decay: 30s without rehearsal → forgotten from short-term
Sleep: Batch consolidation (like hippocampus during sleep)
Emotion: Direct to long-term (bypasses short-term)

Installation

pip install -r requirements.txt
pip install -e .

Requirements:

Python 3.8+
PyTorch 2.0+
sentence-transformers
transformers

Quick Start

Simple Memory System

from hsokv import MemorySystem, SentenceBERTEmbedder

# Initialize
embedder = SentenceBERTEmbedder()
system = MemorySystem(embedder)

# Learn (Monday)
system.learn("when should I wake up?", "10am")

# Recall (Wednesday)
answer = system.recall("when should I wake up?")
print(answer)  # "10am" ✓

Dual Memory System (Human-like)

from hsokv import DualMemorySystem, SentenceBERTEmbedder

# Initialize
embedder = SentenceBERTEmbedder()
system = DualMemorySystem(embedder, stm_capacity=7)

# Learn (→ short-term)
system.learn("ephemeral", "lasting for a very short time")

# Immediate recall (from short-term - O(1) fast!)
print(system.recall("ephemeral"))  # Instant!

# Rehearse 3+ times (→ consolidates to long-term)
system.recall("ephemeral")
system.recall("ephemeral")
system.recall("ephemeral")  # Consolidated!

# Semantic recall works (from long-term)
print(system.recall("temporary"))  # Finds "ephemeral"!

# Sleep (batch consolidation)
system.sleep()  # All STM → LTM

Examples

Run the examples to see it in action:

# Simple alarm assistant
python examples/alarm_assistant.py

# Full dual memory demonstration
python examples/human_memory_demo.py

How It Works

3-Stage Memory Lifecycle

All memories progress through three stages:

LEARNING (0-5 uses):

Pure recall (no averaging)
Maximum protection from pruning
Like learning a new word from a movie

REINFORCEMENT (5-20 uses):

Confidence boost (1.5x → 1.0x)
High protection
Like practicing the word in conversation

MATURE (20+ uses):

Standard retrieval
Can be consolidated/pruned
Like an established vocabulary word

Architecture Diagram

User Input
    ↓
DualMemorySystem
    ├── ShortTermMemory (Dict: 7±2 items, 30s decay)
    │   └── O(1) lookup → Fast!
    │
    └── LongTermMemory (Vector DB: unlimited, permanent)
        └── Semantic search → RAG retrieval
            ↓
        KeyValueMemory (3-stage lifecycle)
            ↓
        FrozenEmbedder (no training!)

API Reference

DualMemorySystem (Recommended)

from hsokv import DualMemorySystem, SentenceBERTEmbedder, MemoryConfig

embedder = SentenceBERTEmbedder()
config = MemoryConfig(
    learning_phase_duration=5,
    reinforcement_phase_duration=20,
    device='cuda'  # or 'cpu'
)

system = DualMemorySystem(
    embedder=embedder,
    config=config,
    stm_capacity=7,          # Short-term capacity
    stm_decay_seconds=30     # Decay time
)

# Learn
system.learn(word, definition, emotionally_significant=False)

# Recall (tries short-term first, then long-term)
answer = system.recall(word)

# Sleep (consolidate all STM → LTM)
system.sleep()

# Forget (apply time decay)
system.forget()

# Statistics
stats = system.get_stats()
# Returns: {
#   'short_term': {'size': 3, 'capacity': 7, 'items': [...]},
#   'long_term': {'size': 10, 'learning': 2, 'reinforcement': 3, 'mature': 5}
# }

MemorySystem (Simple)

from hsokv import MemorySystem, SentenceBERTEmbedder

embedder = SentenceBERTEmbedder()
system = MemorySystem(embedder)

# Learn
system.learn(query, answer)

# Recall
answer = system.recall(query, return_details=False)

# Statistics
stats = system.get_stats()

Configuration

from hsokv import MemoryConfig

config = MemoryConfig(
    # Capacity
    max_entries=1000,

    # 3-stage lifecycle
    learning_phase_duration=5,       # LEARNING → REINFORCEMENT
    reinforcement_phase_duration=20,  # REINFORCEMENT → MATURE

    # Confidence
    initial_confidence=0.7,
    confidence_threshold=0.3,

    # Retrieval
    similarity_threshold=0.15,
    top_k=10,

    # Protection
    protect_learning=True,
    protect_reinforcement=True,

    # Device
    device='cuda'  # or 'cpu'
)

Embedders

# Text embeddings
from hsokv import SentenceBERTEmbedder
embedder = SentenceBERTEmbedder(
    model_name='all-MiniLM-L6-v2',
    device='cpu'
)

# Image + text embeddings
from hsokv import CLIPEmbedder
embedder = CLIPEmbedder(
    model_name='openai/clip-vit-base-patch32',
    device='cuda'
)

Why It Works

No Catastrophic Forgetting

Traditional AI (fails):

train_task_1()  # Train weights
train_task_2()  # Train again → embeddings drift → FORGETS task 1 ❌

HSOKV (works):

system.learn("task 1")  # Store in memory (frozen embeddings)
system.learn("task 2")  # Store in memory (frozen embeddings)
system.recall("task 1")  # Still works! ✓

Key: Frozen embeddings never change → no drift → no forgetting

Fast Dual Retrieval

# Short-term: O(1) dict lookup
system.recall("ephemeral")  # Instant! (if in STM)

# Long-term: Semantic search
system.recall("temporary")  # Finds "ephemeral" via similarity

Human-Like Behavior

Capacity limits: 7±2 items in working memory (neuroscience-accurate)
Time decay: 30s without rehearsal → forgotten
Consolidation: Rehearsal → long-term storage
Semantic recall: Find related concepts, not just exact matches

Project Structure

hsokv/
├── README.md                     # This file
├── UNDERSTANDING_GUIDE.md        # Code reading guide
├── setup.py                      # Installation
├── requirements.txt              # Dependencies
│
├── examples/                     # 2 examples
│   ├── alarm_assistant.py        # Simple usage
│   └── human_memory_demo.py      # Dual memory demo
│
└── hsokv/                        # Core package (7 files)
    ├── __init__.py               # Public API
    ├── config.py                 # Configuration (60 lines)
    ├── lifecycle.py              # 3-stage logic (120 lines)
    ├── embedders.py              # Frozen embedders (120 lines)
    ├── memory.py                 # KeyValueMemory (330 lines)
    ├── memory_system.py          # Simple API (180 lines)
    └── dual_memory.py            # Dual memory API (460 lines)

Total: 13 files, ~1,300 lines of clean code

Comparison to Other Approaches

Method	Retention	Speed	Drift	Capacity Limit
Fine-tuning	~30%	Slow	✗ Severe	None
EWC/PackNet	~50%	Slow	✗ Some	None
Replay buffers	~60%	Slow	✗ Some	None
HSOKV Simple	>90%	Fast	✓ Zero	Optional
HSOKV Dual	>90%	Very Fast	✓ Zero	7±2 (STM)

Revolutionary Features

1. Like "Attention is All You Need"

Transformers (2017):

"You don't need recurrence, attention is enough"

HSOKV (2024):

"You don't need retraining, memory is enough"

2. Neuroscience-Inspired

Short-term memory ≈ Prefrontal cortex (dict storage)
Long-term memory ≈ Hippocampus → Cortex (vector DB)
Consolidation ≈ Memory replay during sleep
3-stage lifecycle ≈ Memory formation process

3. Zero Catastrophic Forgetting

Frozen embeddings (never change)
Monday embedding = Wednesday embedding
Old memories always retrievable

Getting Started

Read the code guide:
```
cat UNDERSTANDING_GUIDE.md
```

Run examples:

python examples/alarm_assistant.py
python examples/human_memory_demo.py

Study the core modules (in order):
- hsokv/config.py - Settings
- hsokv/lifecycle.py - 3-stage logic
- hsokv/embedders.py - Frozen embedders
- hsokv/memory.py - Core storage
- hsokv/memory_system.py - Simple API
- hsokv/dual_memory.py - Advanced API

License

MIT

Citation

@software{hsokv2024,
  title={HSOKV: Human-like Sequential Knowledge with Vocabulary},
  author={HSOKV Team},
  year={2024},
  url={https://github.com/PlanetDestroyyer/hsokv}
}

The future of AI memory is here.

✓ No catastrophic forgetting ✓ Human-like dual memory ✓ Frozen embeddings (no drift) ✓ Fast O(1) short-term retrieval ✓ Semantic long-term search ✓ Clean, understandable code

Name		Name	Last commit message	Last commit date
Latest commit History 111 Commits
benchmarks		benchmarks
examples		examples
hsokv		hsokv
tests		tests
.gitignore		.gitignore
INTEGRATION_GUIDE.md		INTEGRATION_GUIDE.md
KV1_QUICKSTART.md		KV1_QUICKSTART.md
README.md		README.md
UNDERSTANDING_GUIDE.md		UNDERSTANDING_GUIDE.md
requirements.txt		requirements.txt
run_benchmarks.py		run_benchmarks.py
run_realworld_benchmark.py		run_realworld_benchmark.py
run_tests.py		run_tests.py
setup.py		setup.py

Folders and files

Latest commit

History

Repository files navigation

HSOKV - Human-like Sequential Knowledge with Vocabulary

🧠 KV-1: The First Immortal Personal Intelligence (2025)

Core Innovation: Dual Memory System

Short-Term Memory (Working Memory)

Long-Term Memory

Consolidation Process

Installation

Quick Start

Simple Memory System

Dual Memory System (Human-like)

Examples

How It Works

3-Stage Memory Lifecycle

Architecture Diagram

API Reference

DualMemorySystem (Recommended)

MemorySystem (Simple)

Configuration

Embedders

Why It Works

No Catastrophic Forgetting

Fast Dual Retrieval

Human-Like Behavior

Project Structure

Comparison to Other Approaches

Revolutionary Features

1. Like "Attention is All You Need"

2. Neuroscience-Inspired

3. Zero Catastrophic Forgetting

Getting Started

License

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages