What is Expanso and how does it work?

Expanso is a managed platform for deploying intelligent data pipelines at the edge. It processes data where it's generated, which reduces bandwidth, latency, and costs. You deploy lightweight agents on your infrastructure, build pipelines using our visual builder or YAML, and control everything from a central SaaS platform.

Can I run AI/ML models directly in my data pipelines?

Yes! Expanso supports running ONNX, TensorFlow Lite, and other models as native pipeline steps. Execute low-latency inference on streaming data, enrich events with model outputs (like risk scores), and make decisions at the edge without cloud round-trips.

How many pre-built components are available?

Expanso provides 200+ pre-built components including inputs (Kafka, HTTP, files), processors (transformations, filtering, PII masking, aggregations), and outputs (S3, Snowflake, Datadog, Splunk). Browse the complete catalog in our Component Reference.

Do I need to write code to build pipelines?

No. Use our drag-and-drop visual pipeline builder to create sophisticated pipelines without code. For advanced use cases, you can also write pipelines in YAML or use the Bloblang transformation language for complex data mappings.

How does Expanso help with data governance and compliance?

Expanso includes built-in governance features: automatic PII detection and masking, policy enforcement at the edge, RBAC, SSO integration, and comprehensive audit trails. Mask sensitive data before it ever leaves your network.

api.PutJobRequest

dry_run boolean

Validate without applying

force boolean

Force apply even if unchanged

spec object required

Job specification

config object

Config contains type-specific configuration for the workload. The structure depends on the job type (e.g., pipeline config, query parameters).

property name* any

description string

Description is an optional human-readable description of the job.

labels object

Labels is used to associate arbitrary labels with this job. Labels can be used for filtering and selection.

property name* string

meta object

Meta is used to associate arbitrary metadata with this job. Keys with the prefix "expanso.io/" are reserved for system use.
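A client could check for reserved keys before submitting a job. The sketch below is only an illustration: the helper name `reserved_meta_keys` is hypothetical, and this page does not say how the server treats user-supplied keys under the reserved prefix.

```python
# Prefix taken from the field description above.
RESERVED_PREFIX = "expanso.io/"


def reserved_meta_keys(meta: dict) -> list:
    """Return the keys in `meta` that use the reserved system prefix (hypothetical helper)."""
    return sorted(key for key in meta if key.startswith(RESERVED_PREFIX))


user_meta = {"team": "platform", "expanso.io/owner": "alice"}
print(reserved_meta_keys(user_meta))
```

A caller might reject or strip the returned keys before building the request.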

property name* string

name string

Name is the logical name of the job used to refer to it. Submitting a job with the same name as an existing job will result in an update to the existing job.

namespace string

Namespace is the namespace this job is running in.

priority integer

Priority defines the scheduling priority of this job. Higher values indicate higher priority.

restart_policy string

RestartPolicy controls restart behavior when executions exit.

"on-failure" (default): Restart on non-zero exit, complete on success
"always": Restart on any exit (current daemon behavior)
"never": No restart, one-shot execution

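The three restart_policy values can be read as a small decision function. This is a sketch of the documented behavior, not the scheduler's actual code. The name `should_restart` is hypothetical, and treating an empty value as the default "on-failure" is an assumption.

```python
def should_restart(policy: str, exit_code: int) -> bool:
    """Decide whether an exited execution is restarted under `policy` (hypothetical helper)."""
    policy = policy or "on-failure"  # assumption: an unset policy falls back to the documented default
    if policy == "always":
        return True                  # restart on any exit
    if policy == "never":
        return False                 # one-shot execution
    if policy == "on-failure":
        return exit_code != 0        # restart only on non-zero exit
    raise ValueError(f"unknown restart_policy: {policy!r}")


print(should_restart("on-failure", 1))
print(should_restart("on-failure", 0))
```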
rollout object

Rollout defines how to roll out the job.

auto_promote boolean

Auto-promote canary rollouts

canary_count integer

Number of canary nodes (canary-specific setting)

canary_percent integer

Percentage of canary nodes

health_check object

HealthCheck defines health check configuration (required for rolling/canary, ignored for immediate)

deadline integer<int64>

Deadline is the maximum time to wait for an execution to become healthy (required)

Possible values: [-9223372036854776000, 9223372036854776000, 1, 1000, 1000000, 1000000000, 60000000000, 3600000000000, 10000000000]
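The listed values are consistent with durations expressed in nanoseconds (1 = 1 ns, 1000000000 = 1 s, 60000000000 = 1 min, 3600000000000 = 1 h, 10000000000 = 10 s). Assuming that reading is correct, which this page does not state explicitly, a client could build duration values as follows. The constant names and the `seconds_to_ns` helper are hypothetical.

```python
# Assumption: int64 duration fields are nanosecond counts.
NANOSECOND = 1
MICROSECOND = 1_000 * NANOSECOND
MILLISECOND = 1_000 * MICROSECOND
SECOND = 1_000 * MILLISECOND
MINUTE = 60 * SECOND
HOUR = 60 * MINUTE


def seconds_to_ns(seconds: float) -> int:
    """Convert seconds to an integer nanosecond count (hypothetical helper)."""
    return round(seconds * SECOND)


health_check = {
    "deadline": 5 * MINUTE,          # maximum time to wait for an execution to become healthy
    "interval": seconds_to_ns(10),   # matches the documented 10s default
}
print(health_check)
```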

failure_threshold integer

FailureThreshold is the number of consecutive unhealthy intervals before the execution is considered unhealthy (optional, default: 3)

interval integer<int64>

Interval is the duration of each health evaluation window (optional, default: 10s). Error rate is calculated per interval, not over the execution's lifetime.

Possible values: [-9223372036854776000, 9223372036854776000, 1, 1000, 1000000, 1000000000, 60000000000, 3600000000000, 10000000000]

max_error_rate number

MaxErrorRate is the maximum error rate allowed during health checks (optional, default: 0.10). Omit the field to use the default; an explicit 0.0 is treated as a real value, not as unset.

success_threshold integer

SuccessThreshold is the number of consecutive healthy intervals before the execution is considered healthy (optional, default: 2)
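The interplay of max_error_rate, failure_threshold, and success_threshold can be sketched as a streak counter over per-interval error rates. This is an illustration under assumptions, not the platform's implementation. The `HealthCheck` class and `evaluate` function are hypothetical. Treating a rate equal to the maximum as healthy is an assumption, and so is counting an interval with no traffic as healthy.

```python
from dataclasses import dataclass


@dataclass
class HealthCheck:
    max_error_rate: float = 0.10   # documented default
    failure_threshold: int = 3     # documented default
    success_threshold: int = 2     # documented default


def evaluate(check: HealthCheck, intervals: list) -> str:
    """Return 'healthy', 'unhealthy', or 'pending' from (errors, total) pairs, one per interval."""
    healthy_streak = 0
    unhealthy_streak = 0
    for errors, total in intervals:
        # Error rate is computed per interval, not over the lifetime.
        rate = errors / total if total else 0.0
        if rate <= check.max_error_rate:
            healthy_streak += 1
            unhealthy_streak = 0
        else:
            unhealthy_streak += 1
            healthy_streak = 0
        if healthy_streak >= check.success_threshold:
            return "healthy"
        if unhealthy_streak >= check.failure_threshold:
            return "unhealthy"
    return "pending"


print(evaluate(HealthCheck(), [(0, 100), (5, 100)]))
print(evaluate(HealthCheck(), [(50, 100)] * 3))
```

In this sketch a single bad interval resets the healthy streak, which is one reading of "consecutive" in the field descriptions.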

max_failed_nodes integer

MaxFailedNodes is the maximum number of failed nodes before stopping (optional, default: 10)

max_failed_nodes_percent number

MaxFailedNodesPercent is the maximum percentage of failed nodes before stopping (optional, default: 10.0)

max_parallel integer

MaxParallel is the maximum percentage of nodes to update in parallel (0-100).

For the immediate strategy, this value is ignored (all nodes are updated simultaneously).
For rolling/canary, it controls the wave size as a percentage of total nodes (default: 10 if not specified).

Examples: 10 = 10% of nodes per wave, 50 = 50% of nodes per wave, 100 = all nodes at once.
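As a worked example, the wave size for a given node count could be computed as below. The function name `wave_size` is hypothetical. Rounding up, keeping at least one node per wave, and treating 0 as "not specified" are assumptions; this page does not document them.

```python
from typing import Optional


def wave_size(total_nodes: int, strategy: str, max_parallel: Optional[int] = None) -> int:
    """Estimate how many nodes are updated per wave (hypothetical helper)."""
    if total_nodes <= 0:
        return 0
    if strategy == "immediate":
        return total_nodes             # max_parallel is ignored: all nodes at once
    percent = max_parallel or 10       # assumption: None or 0 means the documented default of 10
    if not 0 < percent <= 100:
        raise ValueError("max_parallel must be between 0 and 100")
    # Integer ceiling division, with a floor of one node per wave (assumption).
    return max(1, (total_nodes * percent + 99) // 100)


print(wave_size(200, "rolling", 10))
print(wave_size(7, "canary", 50))
```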

no_auto_rollback boolean

NoAutoRollback disables automatic rollback on rollout failure (default: false = auto-rollback enabled)

strategy types.RolloutStrategyType (string)

Strategy selects how the rollout proceeds: immediate, rolling, or canary.

Possible values: [immediate, rolling, canary]

selector object

Selector defines which nodes to run the job on

match_expressions string[]

MatchExpressions selects nodes using label selector expression strings. Each expression is evaluated independently and all must match (AND logic). Supported syntax:

Equality: "key=value" or "key==value"
Inequality: "key!=value"
Set inclusion: "key in (value1,value2,...)"
Set exclusion: "key notin (value1,value2,...)"
Existence: "key"
Non-existence: "!key"

Examples:

"region=us-east"
"tier in (premium,standard)"
"environment!=prod"
"gpu"
"!debug"

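To make the expression syntax concrete, here is a minimal client-side evaluator for the six forms listed above. It is a sketch, not the server's parser. The function names are hypothetical. One assumption is borrowed from common label-selector conventions: a node without the key satisfies "!=" and "notin".

```python
import re

# Matches "key in (a,b)" and "key notin (a,b)".
_SET_EXPR = re.compile(r"^(\S+)\s+(in|notin)\s*\(([^)]*)\)$")


def matches_expression(expr: str, labels: dict) -> bool:
    """Evaluate one selector expression against a node's labels (hypothetical helper)."""
    expr = expr.strip()
    set_match = _SET_EXPR.match(expr)
    if set_match:
        key, op, raw_values = set_match.groups()
        values = {value.strip() for value in raw_values.split(",")}
        is_member = key in labels and labels[key] in values
        return is_member if op == "in" else not is_member
    # Check "!=" before "==" and "=" so the longer operators win.
    for op in ("!=", "==", "="):
        if op in expr:
            key, value = (part.strip() for part in expr.split(op, 1))
            equal = labels.get(key) == value
            return not equal if op == "!=" else equal
    if expr.startswith("!"):
        return expr[1:].strip() not in labels   # non-existence
    return expr in labels                       # existence


def node_matches(expressions: list, labels: dict) -> bool:
    """All expressions must match (AND logic)."""
    return all(matches_expression(expr, labels) for expr in expressions)


node = {"region": "us-east", "tier": "premium", "environment": "staging", "gpu": "a100"}
examples = ["region=us-east", "tier in (premium,standard)", "environment!=prod", "gpu", "!debug"]
print(node_matches(examples, node))
```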
match_ids string[]

MatchIDs selects specific nodes by their IDs. If specified, the job will only run on nodes whose ID is in this list.

match_labels object

MatchLabels selects nodes with labels that exactly match all specified key-value pairs. All labels must match (AND logic). Example: {"region": "us-east", "tier": "compute"}

property name* string
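Exact-match selection with match_labels reduces to a subset check over key-value pairs. The helper name `matches_labels` below is hypothetical. An empty match_labels matching every node is an assumption that follows from the AND logic.

```python
def matches_labels(match_labels: dict, node_labels: dict) -> bool:
    """True when every selector pair is present on the node with the same value."""
    return all(node_labels.get(key) == value for key, value in match_labels.items())


selector = {"region": "us-east", "tier": "compute"}
print(matches_labels(selector, {"region": "us-east", "tier": "compute", "gpu": "a100"}))
print(matches_labels(selector, {"region": "us-east", "tier": "storage"}))
```

Extra labels on the node do not prevent a match; only the pairs named in the selector are compared.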

timeouts object

Timeouts defines timeout configurations for the job

execution_timeout integer

ExecutionTimeout is the maximum amount of time a task is allowed to run in seconds. Zero means no timeout, such as for a daemon task.

queue_timeout integer

QueueTimeout is the maximum amount of time a task is allowed to wait in the orchestrator queue in seconds before being scheduled. Zero means no timeout.

total_timeout integer

TotalTimeout is the maximum amount of time a task is allowed to complete in seconds. This includes the time spent in the queue, the time spent executing and the time spent retrying. Zero means no timeout.
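The three timeouts and the "zero means no timeout" rule can be illustrated with a small check. This is a sketch only: the function names are hypothetical, and the order in which the timeouts are checked is an assumption.

```python
from typing import Optional


def exceeded(limit_s: int, elapsed_s: float) -> bool:
    """A limit of zero disables the timeout."""
    return limit_s > 0 and elapsed_s > limit_s


def first_exceeded(timeouts: dict, queued_s: float, running_s: float,
                   retrying_s: float = 0.0) -> Optional[str]:
    """Return the name of the first timeout that has been exceeded, or None."""
    if exceeded(timeouts.get("queue_timeout", 0), queued_s):
        return "queue_timeout"
    if exceeded(timeouts.get("execution_timeout", 0), running_s):
        return "execution_timeout"
    # total_timeout covers queueing, executing, and retrying together.
    if exceeded(timeouts.get("total_timeout", 0), queued_s + running_s + retrying_s):
        return "total_timeout"
    return None


timeouts = {"queue_timeout": 30, "execution_timeout": 0, "total_timeout": 600}
print(first_exceeded(timeouts, queued_s=10, running_s=10_000))
```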

type string

Type specifies what kind of workload this job runs (e.g. "pipeline", "query", "update", "config"). The scheduling behavior is derived from this type.

api.PutJobRequest
{
  "dry_run": true,
  "force": true,
  "spec": {
    "config": {
      "input": {
        "file": {
          "paths": [
            "/var/log/app/*.log"
          ]
        }
      },
      "output": {
        "stdout": {}
      },
      "pipeline": {
        "processors": [
          {
            "mapping": "root = this\nroot.processed_at = now()\n"
          }
        ]
      }
    },
    "description": "Processes application logs from edge nodes",
    "labels": {
      "env": "production",
      "region": "us-west",
      "version": "v1.2.0"
    },
    "name": "log-processor",
    "namespace": "production",
    "priority": 50,
    "selector": {
      "match_labels": {
        "env": "production",
        "role": "app-server"
      }
    },
    "type": "pipeline"
  }
}
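A request body like the one above could be assembled programmatically, for instance to validate a job with dry_run before applying it. The sketch below stops at serialization because this page does not give the endpoint or authentication details. The helper name `build_put_job_request` is hypothetical.

```python
import json


def build_put_job_request(spec: dict, dry_run: bool = False, force: bool = False) -> dict:
    """Assemble an api.PutJobRequest body; `spec` is the only required field."""
    if not spec:
        raise ValueError("spec is required")
    body = {"spec": spec}
    if dry_run:
        body["dry_run"] = True   # validate without applying
    if force:
        body["force"] = True     # apply even if unchanged
    return body


spec = {
    "name": "log-processor",
    "namespace": "production",
    "type": "pipeline",
    "priority": 50,
    "selector": {"match_labels": {"env": "production", "role": "app-server"}},
    "config": {
        "input": {"file": {"paths": ["/var/log/app/*.log"]}},
        "pipeline": {"processors": [{"mapping": "root = this\nroot.processed_at = now()\n"}]},
        "output": {"stdout": {}},
    },
}

payload = json.dumps(build_put_job_request(spec, dry_run=True), indent=2)
print(payload)
```

Because submitting a job with an existing name updates that job, a dry run first is a low-risk way to check a changed spec.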