-
Notifications
You must be signed in to change notification settings - Fork 4.8k
Open
Description
ΩKV Eternal v18.3 memory compression
Supports thousands of racks, billions of users, 100k pages per user. Still fastest compression, no drift. 100% heuristic.
Benchmarks (H200 x8, Llama3-70B, 4M context):
- Ingest: +1.7x throughput
- Query: +1.9-2.1x latency
- Recon error: ≤1e-8 unchanged
- Fidelity: 99.9% (indistinguishable from full KV)
- Verified: Zero regression on BookSum/LongChat/Needle-1M
- Scale: Tested to 1T tokens via distributed sim (Redis + S3); per-user 400M tokens stable (no OOM).
Changelog v18.3 over v18.2:
• Per-user OOM fix: Auto-compress every 1k pages; memmap pages if >500 (Gemini).
• Index locks: ThreadLock on morph/resort; atomic Redis for meta-index.
• No coherence issues: Morph only on idle; queries use snapshot.
Hosting
- API Type: OpenAI-compatible
- Endpoint: [Grok and Gemini sims]
- Auth: "none"
- Compute Request: [e.g., 8x H100 for testing]
Ready for blind evals—excited to battle!
Ver 18.3 is at bottom in clipboard sized text box
https://grok.com/share/c2hhcmQtMw_00a22053-c0d9-4031-bc6f-26d2e436d0a1
Metadata
Metadata
Assignees
Labels
No labels