Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.remyx.ai/llms.txt

Use this file to discover all available pages before exploring further.

Roadmap

This page tracks the current state of Remyx’s capabilities. We publish it so customers, collaborators, and researchers can see where we’re going well before each capability ships. For the architectural through-line, see Causal Intelligence. For how these capabilities map to the customer journey, see Maturity Progression.

Status legend

IndicatorStatusMeaning
ShippedAvailable in production today
In developmentActively being built
PlannedCommitted to the roadmap, not yet started
ResearchExploring feasibility, not yet committed

ExperimentOps

These capabilities are live in production today. They form the Stage 2 foundations and are documented in ExperimentOps and the platform pages.
StatusCapabilityDescription
ShippedExperiment captureFull lifecycle from origin through decision, with target metric, hypothesis, and decision rationale captured directly on the experiment record
ShippedCross-experiment patternsTag-based clustering identifies which directions consistently produce results
ShippedResource discoverySemantic search across papers, repos, models, and datasets, matched to your team’s experiment history
ShippedPortfolio viewLeadership-facing view of every initiative with health indicators, hit rates, and metric trends
ShippedConnector frameworkBidirectional sync with GitHub, Linear, Jira, Slack, Claude Code MCP
ShippedMCP serverProgrammatic access to Remyx capabilities from Claude Code and other MCP clients
In developmentStandalone Stage 1 product surfaceMilestone-driven recommendations for early-dev teams without production traffic

Causal intelligence, evidence layer

Stage 2. The evidence layer feeding the causal model.
StatusCapabilityDescription
PlannedObservational log ingestionPluggable adapters for Datadog, Honeycomb, structured JSON, OpenTelemetry. Customers connect existing telemetry, and Remyx populates the evidence layer without instrumentation changes
PlannedCommit-correlated regime boundary detectionExtension of the existing repo integration to identify regime changes in the data-generating process from commit history
PlannedQuasi-experiment identificationCombine observational logs with regime boundaries to produce identified causal effects via difference-in-differences, interrupted time series, or regression discontinuity
PlannedCausal discoveryBootstrap a partial causal graph from observational data, supplemented by regime-boundary structure. Discovered structure is human-validated before being treated as the working model
PlannedCausal graph engineVersioned causal graph as a top-level object, supporting interventional, counterfactual, and mediation-aware graphical models. Semi-Markovian formulation to handle latent confounders
PlannedCausal data fusionCombine evidence from multiple sources into one coherent posterior. Conflict resolution, graph refinement proposals, continuous incremental updates

Causal intelligence, query and interaction

StatusCapabilityDescription
PlannedIdentification dispatcherGiven a question, classify it by required identification layer, route to evidence sources with appropriate identification, and dispatch to estimation logic
PlannedNatural language query layerCustomer-facing interface that takes natural language questions, parses into formal estimands, and returns natural language answers with identification status and recommendations

Causal intelligence, Stage 3

StatusCapabilityDescription
PlannedA/B test integration frameworkConnectors for Statsig, Eppo, LaunchDarkly. A/B test results become an evidence source feeding the causal model

Causal intelligence, Stage 4

StatusCapabilityDescription
PlannedShadow-decision SDK (log-only mode)Python SDK that wraps decision points in your AI system. The first version captures natural policy output without applying overrides
PlannedShadow-mode audit infrastructureDedicated product surface for the shadow-mode adoption phase. Audit trail viewer, override proposal review, and compliance reporting
PlannedCTF-RAND override policiesExtension of the SDK with counterfactual randomization. Trajectory-consistent semantics by default. Per-decision-point semantics opt-in for mediation analysis
PlannedETT and NDE estimationCounterfactual estimation procedures for effect-of-treatment-on-the-treated and natural direct effect

Hypothesis triage

The triage layer matures with the customer. Stage 2 customers see quasi-experimental recommendations. Stage 3 customers see A/B test recommendations. Stage 4 customers see CTF-RAND recommendations.
StatusCapabilityDescription
PlannedHypothesis ranking and evidence path recommendationRank hypotheses by expected information gain. Identify the cheapest evidence path to an answer for each
PlannedOrchestration schedulerCoordinate active interventions (CTF-RAND randomizations, A/B tests, quasi-experiment analyses) for maximum concurrency without compromising estimate validity
PlannedIdentification-enabling intervention proposalsProactively propose CTF-RAND or A/B interventions that would make currently-unidentifiable hypotheses estimable

How we sequence

Foundational evidence-layer capabilities ship first. The causal model and query layer come next. A/B integration follows. Counterfactual perturbations ship last. The dependency hot path through the architecture follows this order.
  1. Evidence schema.
  2. Observational log ingestion.
  3. Quasi-experiment identification.
  4. Causal graph engine and data fusion.
  5. Identification dispatcher and natural language query.
  6. A/B integration.
  7. Shadow-decision SDK.
  8. CTF-RAND override policies.
  9. ETT and NDE estimation.
This roadmap is shared publicly. We update it as priorities shift.