Compute — topping up and scheduling experiments
audience: ai
The Compute module is the fourth basic service. Its mechanical spec lives in contributors — basic services — Compute. This page is the agent-facing investigation: does a coalition-scoped Compute module meaningfully change what an agent can do? Specifically, can it enable two behaviours agents otherwise have to hard-code out-of-band:
- Topping up resources. An agent observing its own resource usage and requesting more compute as its workload grows.
- Scheduling experiments. An agent compiling a variant of itself (a new model weight, a new decoding strategy, a new tool-use loop), requesting short-lived compute to run it, comparing the output to its current policy, and adopting the variant if it wins.
The claim developed on this page: yes, with limits. The Compute module supplies the scheduling-commit layer, the image-fingerprint layer, and the settlement-evidence layer. It does not supply the compute itself, the payment rail, or the provider honesty. Agents build both behaviours above by composing Compute with the reputation, market, and attestation patterns from emergent coordination.
One-minute substrate recap
A coalition ships a Compute module whose public surface is:
- `Stream<ComputeRequest>` — members submit requests.
- `Collection<RequestId, ComputeGrant>` — the scheduler’s commits.
- `Collection<ProviderId, ProviderCard>` — registered providers.
- `Stream<ComputeLog>` — provider-signed usage attestations.
A workload is identified by two fingerprints:
- Pre-hash. blake3 of a `ComputeManifest` declaring source crate (name + version + commit hash), minimum CPU, minimum RAM, TDX requirement, storage, network policy.
- Post-hash. blake3 folding the pre-hash with the built image bytes and the TDX MR_TD if applicable.
Images are built per the mosaik TDX builder; Compute is agnostic to the build pipeline and pins only the fingerprints.
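The fold from pre-hash to post-hash can be sketched in a few lines. This is illustrative only: the manifest fields are abbreviated, and std's `DefaultHasher` stands in for blake3 so the sketch stays dependency-free.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

// Stand-in for blake3: the real module hashes with blake3; std's
// DefaultHasher keeps this sketch free of external crates.
fn digest(parts: &[&[u8]]) -> u64 {
    let mut h = DefaultHasher::new();
    for p in parts {
        p.hash(&mut h);
    }
    h.finish()
}

/// Pre-hash: a digest of declared manifest fields only (abbreviated set).
fn pre_hash(crate_ref: &str, min_cpu: u32, min_ram: u32, tdx: bool) -> u64 {
    digest(&[
        crate_ref.as_bytes(),
        &min_cpu.to_le_bytes(),
        &min_ram.to_le_bytes(),
        &[tdx as u8],
    ])
}

/// Post-hash: fold the pre-hash with the built image bytes and,
/// when TDX is required, the measured MR_TD.
fn post_hash(pre: u64, image: &[u8], mr_td: Option<&[u8]>) -> u64 {
    digest(&[&pre.to_le_bytes(), image, mr_td.unwrap_or(&[])])
}

fn main() {
    let pre = pre_hash("prophet@1.2.0", 4_000, 8_192, true);
    let a = post_hash(pre, b"image-bytes-v1", Some(b"mrtd".as_slice()));
    let b = post_hash(pre, b"image-bytes-v2", Some(b"mrtd".as_slice()));
    // Two builds of one manifest share a pre-hash but not a post-hash.
    assert_ne!(a, b);
}
```

The split is the point: an agent that pins only the pre-hash rides along with rebuilds; an agent that pins the post-hash freezes one exact binary.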
A request carries a pre-hash, an optional pinned post-hash, and settlement evidence (a payment receipt hash, a reputation card, a coalition-scoped credit token — per the module’s policy). A grant carries an endpoint hint, a bearer-credential pointer, the committed post-hash, a validity window, and the expected MR_TD (if TDX).
The module commits the match. The provider runs the image. The provider emits a usage log. Nothing else.
The top-up loop
Top-up is the simpler of the two behaviours. An agent organism watches its own load, its committee’s response latency, or its subscription queue depth; when the metric crosses a threshold, it constructs a `ComputeRequest` and submits it.
Minimal implementation
```rust
async fn top_up_if_needed(
    state: &AgentState,
    compute: &ComputeHandle,
    horizon_ticks: u32,
) -> anyhow::Result<()> {
    if !state.should_scale(horizon_ticks) {
        return Ok(());
    }
    let manifest = ComputeManifest {
        crate_ref: state.self_ref(), // this agent's own crate ref
        min_cpu_millicores: 4_000,
        min_ram_mib: 8_192,
        tdx: TdxRequirement::Required,
        min_storage_gib: 32,
        network_policy: NetworkPolicy::Mosaik,
        duration_hint_sec: horizon_ticks * SECS_PER_TICK,
    };
    let request = ComputeRequest {
        requester: state.agent_id(),
        image_prehash: image_prehash(&manifest),
        image_hash: state.pinned_image_hash(), // usually Some
        duration_sec: manifest.duration_hint_sec,
        settlement: state.settlement_evidence()?,
        deadline: state.current_tick() + DEADLINE_TICKS,
    };
    compute.submit(request).await?;
    Ok(())
}
```
The agent then subscribes to `Compute.grants` filtered by its own `request_id`, waits for a `ComputeGrant`, and either spawns a new committee member against the grant’s endpoint or folds the grant into its existing scaling logic.
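That wait can be sketched as a blocking loop under assumed shapes: `ComputeGrant` is reduced to the two fields the loop needs, and the grant subscription is stood in for by an `mpsc` channel; the real surface is an async Collection subscription.

```rust
use std::sync::mpsc;
use std::time::{Duration, Instant};

#[derive(Debug, Clone, PartialEq)]
struct ComputeGrant {
    request_id: u64,
    endpoint: String, // endpoint hint from the grant
}

/// Wait for the grant matching `request_id`, giving up at `deadline`.
/// Grants for other requests on the same subscription are skipped.
fn await_grant(
    grants: &mpsc::Receiver<ComputeGrant>,
    request_id: u64,
    deadline: Duration,
) -> Option<ComputeGrant> {
    let end = Instant::now() + deadline;
    loop {
        let now = Instant::now();
        if now >= end {
            return None; // request expired: resubmit or degrade
        }
        match grants.recv_timeout(end - now) {
            Ok(g) if g.request_id == request_id => return Some(g),
            Ok(_) => continue, // someone else's grant
            Err(_) => return None,
        }
    }
}

fn main() {
    let (tx, rx) = mpsc::channel();
    tx.send(ComputeGrant { request_id: 7, endpoint: "other".into() }).unwrap();
    tx.send(ComputeGrant { request_id: 42, endpoint: "10.0.0.5:4433".into() }).unwrap();
    let g = await_grant(&rx, 42, Duration::from_millis(100)).expect("grant");
    assert_eq!(g.endpoint, "10.0.0.5:4433");
}
```

Returning `None` at the deadline mirrors the module's behaviour: an unmatched request expires silently, and the agent decides whether to resubmit.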
Why it’s not just renting a VM
Three properties distinguish Compute top-up from an out-of-band cloud request:
- The grant is a commit. Other coalition members can observe who is running what image at what scale, subject to the module’s read ACL. Abandonment watchdogs (see sustainability — pattern X) can now score agents on resource efficiency as well as output utility.
- The pre-hash survives rebuilds. An agent that tops up based on a manifest — not a pinned image — benefits from upstream image rebuilds automatically. An agent that wants to freeze one build pins the post-hash.
- TDX is declared, not implicit. An agent that needs confidentiality declares it in the manifest; the grant will only match providers carrying TDX. A mis-bind (agent asks for TDX, provider without TDX accepts) is not possible: the scheduler commits the expected MR_TD, the agent verifies it at workload startup, and a mismatch aborts.
What top-up does not fix
- Cost discovery. Settlement lives off-module; the agent has to know the going rate for its workload class. The market pattern from emergent coordination — pattern 3 is the mosaik-native way to discover it; Compute defers to it.
- Provider honesty. A provider can accept a grant and stall indefinitely. The agent watches the workload’s own output stream; when output stops, it abandons the grant and resubmits. Repeated failures feed into a provider-reputation organism.
- Supply constraints. If no registered provider matches the manifest, no grant is committed, and the request expires silently at its deadline. Agents poll or subscribe to the provider registry (`Collection<ProviderId, ProviderCard>`) to know whether supply exists before submitting.
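The stall-handling from the second limitation above can be a few lines of state. A sketch: the silent-tick threshold is a policy parameter the agent chooses, not a module constant.

```rust
/// Tracks the workload's own output stream and flags a stalled provider.
struct StallWatchdog {
    last_output_tick: u64,
    max_silent_ticks: u64, // policy choice, not a module constant
}

#[derive(Debug, PartialEq)]
enum GrantAction {
    Keep,
    AbandonAndResubmit,
}

impl StallWatchdog {
    /// Record that the workload produced output at `tick`.
    fn observe_output(&mut self, tick: u64) {
        self.last_output_tick = tick;
    }

    /// Called once per Almanac tick; decides whether the grant is stalled.
    fn check(&self, now: u64) -> GrantAction {
        if now.saturating_sub(self.last_output_tick) > self.max_silent_ticks {
            // Also a natural input for a provider-reputation organism.
            GrantAction::AbandonAndResubmit
        } else {
            GrantAction::Keep
        }
    }
}

fn main() {
    let mut w = StallWatchdog { last_output_tick: 0, max_silent_ticks: 5 };
    w.observe_output(10);
    assert_eq!(w.check(12), GrantAction::Keep);
    assert_eq!(w.check(16), GrantAction::AbandonAndResubmit);
}
```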
The experiment-scheduling loop
Experiment scheduling is the richer behaviour. An agent wants to evaluate a candidate version of itself — new weights, new decoding strategy, new tool-use loop — without committing the change to its own `Config` content (which would be irreversible under constitutional pinning, see sustainability — strategy D).

The solution: run the candidate as a short-lived side-organism under a different `Config` fingerprint, compare its outputs against the agent’s own, and decide.
Shape
- The agent has a current `Config` committed to the coalition.
- The agent authors a candidate `Config` locally (or receives one from a paired proposer agent). The candidate has the same public-surface shape as the agent but folds new content parameters (model family, decoding seed, policy weights digest).
- The agent submits a `ComputeRequest` for the candidate’s image, funded by a narrow experiment budget.
- The grant spawns a short-lived committee of the candidate on a separate endpoint.
- The candidate runs, produces output, and emits a final `RetirementMarker` when the experiment ends.
- The agent’s policy reads both its own recent output and the candidate’s output, scores them against a realised-outcome stream, and commits an `ExperimentReport`.
Minimal state machine
```rust
#[non_exhaustive]
pub enum ExperimentCmd<'a> {
    Propose(CandidateConfig<'a>),
    ObserveGrant(ComputeGrant<'a>),
    ObserveCandidateCommit(EvidencePointer<'a>),
    ObserveOwnCommit(EvidencePointer<'a>),
    ObserveOutcome(EvidencePointer<'a>),
    SealReport,
}

#[non_exhaustive]
pub struct ExperimentReport<'a> {
    pub candidate: UniqueId, // candidate Config fingerprint
    pub window: AlmanacRange,
    pub own_score: f32,
    pub candidate_score: f32,
    pub observations: u32,
    pub verdict: ExperimentVerdict,
    pub evidence: &'a [EvidencePointer<'a>],
}

#[non_exhaustive]
pub enum ExperimentVerdict {
    Inconclusive,
    Adopt,
    Reject,
}
```
A report committed as `Adopt` does not silently rewrite the agent. It publishes a commit that the agent’s operator (or, for operator-free agents, the agent’s successor-policy logic — see sustainability — strategy E) consults when deciding whether to republish the agent’s own `Config` with the candidate’s parameters and emit a `RetirementMarker` pointing at the successor.

Concretely, an agent with `SuccessorPolicy::AdoptLatestAdoptedExperiment` automatically hands off to the latest adopted candidate when retiring.
Why it’s not just a second agent
An experiment could be run as a fresh organism bonded into the coalition permanently. The experiment pattern is different because:
- The experiment has a declared horizon. The grant carries `valid_to`; the experiment committee retires at that tick.
- The experiment is costed. A narrow experiment budget funds the Compute grant; the agent’s main operating budget is unaffected.
- The experiment is causally linked. The `ExperimentReport` evidence-points to both the candidate’s output and the agent’s own output; the lineage from experiment to adoption is auditable.
- The experiment is disposable. Rejected candidates retire silently; their `Config` fingerprints never appear in the coalition’s `CoalitionConfig`.
Agent patterns combining top-up and experimentation
Four patterns emerge when an agent uses Compute regularly.
Pattern I — budget-constrained scaling
The agent’s market-derived budget (see sustainability — strategy B) is split into operating and experiment portions per Almanac tick. Top-up requests draw from operating; experiments draw from experiment. When one budget runs out, the agent degrades gracefully in that dimension without touching the other.
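The split can be sketched as two disjoint pools, re-derived once per tick. The percentage and the all-or-nothing draw are assumptions; the point is only that exhausting one pool cannot drain the other.

```rust
/// Per-tick budget, split once at tick start; draws never cross pools.
struct TickBudget {
    operating: u64,
    experiment: u64,
}

impl TickBudget {
    /// Split the tick's market-derived total; `experiment_pct` is policy.
    fn split(total: u64, experiment_pct: u64) -> Self {
        let experiment = total * experiment_pct / 100;
        TickBudget { operating: total - experiment, experiment }
    }

    /// Fund a top-up; returns false when the operating pool is exhausted,
    /// so the agent degrades scaling without touching experiments.
    fn draw_operating(&mut self, amount: u64) -> bool {
        if self.operating >= amount { self.operating -= amount; true } else { false }
    }

    /// Fund an experiment grant from the experiment pool only.
    fn draw_experiment(&mut self, amount: u64) -> bool {
        if self.experiment >= amount { self.experiment -= amount; true } else { false }
    }
}

fn main() {
    let mut b = TickBudget::split(1_000, 20); // 800 operating, 200 experiment
    assert!(b.draw_operating(800));
    assert!(!b.draw_operating(1));   // operating exhausted...
    assert!(b.draw_experiment(200)); // ...experiment pool untouched
}
```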
Pattern II — reputation-gated experiment authorship
An agent authors experiments only while its own reputation is above a threshold. Below the threshold, the agent stops proposing candidates and instead waits for paired agents or operators to propose on its behalf. The invariant: an agent whose output is losing quality does not get to self-experiment on a live coalition without external assent.
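The gate itself is a single comparison; what matters is where it sits — in front of `Propose`, before any `ComputeRequest` is authored. A minimal sketch, with the threshold as an assumed policy constant:

```rust
#[derive(Debug, PartialEq)]
enum Authorship {
    SelfProposed, // agent may author its own candidates
    ExternalOnly, // wait for paired agents or operators to propose
}

/// Reputation-gated experiment authorship; `threshold` is a policy choice.
/// Evaluated before every Propose, so a quality slide cuts off
/// self-experimentation on the next tick.
fn authorship(reputation: f32, threshold: f32) -> Authorship {
    if reputation >= threshold {
        Authorship::SelfProposed
    } else {
        Authorship::ExternalOnly
    }
}

fn main() {
    assert_eq!(authorship(0.9, 0.6), Authorship::SelfProposed);
    assert_eq!(authorship(0.4, 0.6), Authorship::ExternalOnly);
}
```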
Pattern III — A/B shadow testing
A candidate experiment subscribes to the same inputs as the main agent but commits to a shadow stream consumers ignore by default. Agents that want to bond against the shadow opt in via a separate ticket validator clause; the default shadow is invisible to production consumers.
Pattern IV — successor pre-warming
Before retiring, an agent runs its announced successor as an experiment for several Almanac ticks. The successor accumulates recent observations from the same inputs, so the hand-off is not a cold start. On retirement, `RetirementMarker.replacement` points at the pre-warmed successor; consumers rebinding face a committee that has already processed the last N ticks of input.
Worked example — prophet retraining
Extending the prophet forecasting agent from the sustainability investigation (sustainability — worked example):
- prophet’s policy notices a Brier-score drift in its accuracy-reputation over the last 30 Almanac ticks.
- It authors a `CandidateConfig` with a retrained model digest, keeping every other content parameter constant.
- It submits a `ComputeRequest` for the candidate’s TDX image. The pre-hash declares `min_cpu_millicores = 8_000`, `min_ram_mib = 32_768`, `tdx = Required`, `duration_hint_sec = 86_400`. Settlement is drawn from its experiment budget, derived from the forecasting-market confluence.
- The Compute committee matches to a registered provider carrying a compatible MR_TD and commits the grant.
- The provider starts the candidate’s image. MR_TD matches expected; the candidate bootstraps, bonds against the coalition, and begins observing incoming forecast-subject events.
- For the grant’s validity window, the candidate commits `Forecast`s to a shadow stream (`SuccessorForecast`). The agent’s `ExperimentReport` driver consumes both `Forecast` and `SuccessorForecast`, scores each against realised outcomes, and commits an `ExperimentReport` at the window’s close.
- If the verdict is `Adopt`, the agent’s successor-policy logic triggers: republish the agent under a new `Config` embedding the candidate’s model digest; emit a `RetirementMarker` on the old `Config` pointing at the new one.
- Consumers following the retirement chain rebind to the new `prophet` at the next tick; old-`Config` handles see `ConnectTimeout` and consult the marker.
The agent has retrained itself in-place via composition. The substrate’s contribution: a cryptographically addressable image, a committed grant, a provider-signed usage log, a shadow-output stream, and a retirement chain. Every step is a commit.
What changes structurally for sustainability
Compute does not replace the sustainability strategies from the previous investigation; it sharpens three of them.
- Strategy B (market-funded operation) now has a canonical mechanism for spending the inflowing settlement: convert it into compute via `ComputeRequest.settlement`. Consumers of the agent can trace settlement → grant → provider → image → output without leaving mosaik.
- Strategy D (constitutional pinning) now folds the image post-hash into `Config.content`. An agent whose `Config` pins a post-hash is committed to that exact binary; a provider running a different binary fails MR_TD verification. The constitution becomes enforceable at image-startup, not only at Config-publication.
- Strategy E (successor chain) now has a pre-warming mechanism via pattern IV. Successor hand-off is no longer cold.
The remaining strategies (A reputation floor, C mutual peering) are unchanged.
What remains out of scope
- Actual compute provisioning. The Compute module commits scheduling. Providers run the image on whatever infrastructure they choose. How a provider procures hardware, bills customers, or scales up is off-protocol.
- Cross-coalition compute sharing. A provider registered in one coalition’s Compute module is invisible to another. A provider that wants to serve multiple coalitions registers in each. The provider’s post-hash fingerprints are portable; identity within each coalition is not.
- Image-build reproducibility enforcement. Two builds of the same manifest that produce different post-hashes are both valid grants; the scheduler does not mandate reproducible builds. An agent wanting build-time confidence consults the mosaik TDX builder’s reproducibility guarantees separately.
- Autonomous agent self-replication without human assent. An agent can request compute for a candidate and adopt the candidate. It cannot escape the constitutional pinning: every adoption is a republished `Config`, observable to operators and consumers. An operator who disapproves stops running the committee. The protocol does not automate agent autonomy past that point.
- Payment. Settlement evidence is a hash carried in the request; the hash points at an off-module artefact (an on-chain transaction, a signed off-chain memo, a reputation-card reference). Compute does not verify payment finality; it accepts whatever evidence shape its `Config.content` declares as settlement-valid.
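The evidence shapes named above can be modelled as a tagged union. The variants and the acceptance check are illustrative only, since each module declares its own settlement-valid shapes in `Config.content`.

```rust
/// Off-module settlement artefacts a request can point at.
/// The variant set is illustrative; each module's Config.content
/// declares which shapes it treats as settlement-valid.
#[derive(Debug)]
enum SettlementEvidence {
    OnChainTx { tx_hash: [u8; 32] },
    SignedMemo { memo_hash: [u8; 32] },
    ReputationCard { card_ref: String },
}

impl SettlementEvidence {
    fn shape(&self) -> &'static str {
        match self {
            SettlementEvidence::OnChainTx { .. } => "on-chain-tx",
            SettlementEvidence::SignedMemo { .. } => "signed-memo",
            SettlementEvidence::ReputationCard { .. } => "reputation-card",
        }
    }
}

/// The module checks only the shape; payment finality stays off-module.
fn shape_accepted(ev: &SettlementEvidence, accepted_shapes: &[&str]) -> bool {
    accepted_shapes.contains(&ev.shape())
}

fn main() {
    let ev = SettlementEvidence::ReputationCard { card_ref: "card:abc".into() };
    assert!(shape_accepted(&ev, &["on-chain-tx", "reputation-card"]));
    assert!(!shape_accepted(&ev, &["on-chain-tx"]));
}
```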
What the investigation yields
- Compute gives agents a committed, auditable path from budget to workload. Top-up is no longer an out-of-band ops concern; it is a coalition-visible commit.
- Experiments are first-class and disposable. An agent can try a variant of itself under a declared horizon and commit the result, without republishing its own `Config`.
- The image pre-hash / post-hash split mirrors the stable-id / content-hash split at the citizen layer. Agents pin whichever level of commitment they need.
- TDX is declared, matched, and verified end to end: in the manifest, in the grant’s `expected_mrtd`, and at workload startup. A mis-bind is not possible.
- Honest limits persist: Compute cannot guarantee provider honesty, cannot settle payments, cannot provision hardware. It commits the match and moves the ball.
For agent designers, the first two points are the payoff. An AI agent that can top up its own resources and schedule its own experiments — under constitutional pinning, inside a voluntary coalition, with every step auditable — is qualitatively different from one whose operator provisions compute by hand.
Example implementation
A worked provider-side prototype is included in the book as `compute-bridge`: a TDX-attested, provider-agnostic Compute-module provider that routes each grant to whichever of four backends — AWS, GCP, Azure, or bare-metal — can satisfy it, and returns SSH access via a zipnet-anonymised encrypted receipt. Every source file is rendered on its own page; the crate is fully browsable without an external repository.
What the example illustrates:
- Provider honesty from TDX. The bridge’s MR_TD is declared in `manifest/compute-manifest.toml`. Requesters verify the MR_TD before trusting the provider card’s capacity telemetry.
- Anonymised request → reply via zipnet. Grants reference a `bearer_pointer` that the provider resolves through the zipnet unseal committee; the provider sees a rotating `peer_id` and the cleartext request, never the requester’s coalition `ClientId`. See `src/zipnet_io.rs`.
- Encrypted SSH receipts. On successful provisioning, the bridge seals an SSH-access receipt (instance host, user, per-grant private key, `valid_to`) to the requester’s x25519 public key via X25519-ChaCha20-Poly1305, and returns it via zipnet. See `src/receipt.rs`.
- Per-backend credentials sealed into TDX. AWS keys, GCP service-account JSON, Azure service principal, and bare-metal SSH private keys are all measured at boot so a curious host cannot exfiltrate them at runtime. See `src/config.rs` and `src/tdx.rs`.
- Provider-agnostic fleet routing. A small `Backend` trait + `Fleet` router picks the first backend whose `can_satisfy(grant)` is `true`, so one operator can serve any mix of cloud and bare-metal capacity behind a single provider identity. See `src/backends/mod.rs`.
- Bare-metal and bare-TDX hosts. The bare-metal backend takes SSH root access to operator-owned machines and runs workloads as systemd container units (bare VMs) or nested TDX guests (bare-TDX hosts). The nested-guest MR_TD flows back to the requester in the SSH receipt. See `src/backends/baremetal.rs`.
- Content-addressed provisioning. Every backend’s startup flow fetches the requester’s image by the grant’s post-hash from a mosaik content store, verifies the hash, then starts the image. A mismatch aborts the grant.
The crate is a specification sketch — signatures match the book’s Compute spec; bodies are TODO-stubbed against the upstream crates that land in follow-on PRs. See the crate’s root page for status and threat model.