Roadmap

Cognitive cores

Compressed cognitive cores for on-device and single-GPU deployment. Open weights, signed evals, predictable footprints.

Currently in design. Three sizes planned. We will publish the line once the first eval is signed cold — not before.

The line, in plan

01

Edge core

In design

targeting 1.5 – 2 B params

On-device reasoning. Laptop and phone budgets.

spec
  • INT8 weights, FP16 activations on high-sensitivity layers.
  • Context window: 8 K tokens initial, scaling pending memory profile.
  • Distilled from a frontier teacher. No fine-tune fork from a public base.
target deploy
WebGPU on laptops, Apple Neural Engine on phone-class silicon, commodity ARM via llama.cpp.
eval plan
A frozen suite covering reasoning, instruction following, and known refusal patterns. Reported as the geomean against a stated FP32 teacher baseline; per-task numbers in the model card.
02

Workstation core

In design

targeting ~3 B params

Single GPU production traffic.

spec
  • Mixed-precision: INT8 weights, FP16 activations at attention layers.
  • Context window: 32 K tokens. Sliding-window attention beyond.
  • Calibrated against the deployment workload, not a public set.
target deploy
Single consumer-class GPU (RTX 4070 and up), or one A10G / L4 in production.
eval plan
Same frozen suite as Edge core plus a workload-specific eval drawn from the deployment domain. Sensitivity per layer published with the model card.
03

Server core

Planned

targeting ~7 B params

Distilled from frontier teachers, longer contexts.

spec
  • INT8 weights, FP16 activations. INT4 weights as an opt-in compile flag.
  • Context window: 128 K tokens, retrieval-augmented by default.
  • Distilled from a frontier teacher with curriculum from the task graph.
target deploy
Single A100, H100, or equivalent. Multi-replica scale-out via standard inference frameworks.
eval plan
Standard suite plus long-context retrieval evals (NIAH, etc.). Reproducibility hash and seed published per release.

Principles before product

  • Open weights, Apache-2.0.
  • Every release ships a frozen eval and a release hash.
  • No claim without a reproducible measurement behind it.
  • No staged demos. No staged numbers.

Notify when we ship

We email once per core. No newsletter.

[email protected]

↑↓ navigate · enter open · esc close