What is Expanso and how does it work?

Expanso is a managed platform for deploying intelligent data pipelines at the edge. It processes data where it's generated - reducing bandwidth, latency, and costs. You deploy lightweight agents on your infrastructure, build pipelines using our visual builder or YAML, and control everything from a central SaaS platform.

Can I run AI/ML models directly in my data pipelines?

Yes! Expanso supports running ONNX, TensorFlow Lite, and other models as native pipeline steps. Execute low-latency inference on streaming data, enrich events with model outputs (like risk scores), and make decisions at the edge without cloud round-trips.

How many pre-built components are available?

Expanso provides 200+ pre-built components including inputs (Kafka, HTTP, files), processors (transformations, filtering, PII masking, aggregations), and outputs (S3, Snowflake, Datadog, Splunk). Browse the complete catalog in our Component Reference.

Do I need to write code to build pipelines?

No - use our drag-and-drop visual pipeline builder to create sophisticated pipelines without code. For advanced use cases, you can also write pipelines in YAML or use the Bloblang transformation language for complex data mappings.

How does Expanso help with data governance and compliance?

Expanso includes built-in governance features: automatic PII detection and masking, policy enforcement at the edge, RBAC, SSO integration, and comprehensive audit trails. Mask sensitive data before it ever leaves your network.

Pipeline Stuck in Degraded State

Sometimes a pipeline gets stuck in a degraded state and won't recover on its own. This usually happens when the executor has a stale process that ignores restart signals from the scheduler.

Diagnose the Problem

First, check if your execution is actually stuck:

# List executions by state
expanso-cli execution list --state degraded

# Check how long execution has been degraded
expanso-cli execution describe <execution-id>

# If the execution has been degraded longer than your retry backoff period
# (e.g., 10+ minutes when backoff is 5 minutes), it's likely stuck

Solutions

Stop and restart the job

# Stop the job (terminates stuck executions)
expanso-cli job stop <job-id>

# Restart the job
expanso-cli job deploy your-pipeline.yaml

Restart the edge agent

If stopping the job doesn't work:

# On the edge node
sudo systemctl restart expanso-edge

Monitoring and Prevention

Catch stuck executions automatically:

# Check for executions stuck in Degraded for >10 minutes
expanso-cli execution list --state degraded --format json | \
  jq '.[] | select(.updated_at < (now - 600))'

To prevent this in the future:

Set alerts for executions stuck in Degraded state
Use health checks and automatic rollback when available
Design pipelines with error handling and retries

Diagnose the Problem​

Solutions​

Stop and restart the job​

Restart the edge agent​

Monitoring and Prevention​

Diagnose the Problem

Solutions

Stop and restart the job

Restart the edge agent

Monitoring and Prevention