
Components

Components are the building blocks of pipelines. Expanso includes 200+ components for connecting to messaging systems, databases, file and object storage, HTTP services, and more.

Component Types

Inputs

Sources where data enters the pipeline.

Categories:

  • Messaging: Kafka, NATS, RabbitMQ, MQTT
  • Databases: PostgreSQL, MySQL, MongoDB
  • Files: Local files, S3, SFTP, Azure Blob
  • HTTP: Webhooks, REST APIs
  • Streams: TCP, UDP, WebSocket
  • Special: Generate (testing), Stdin

Example:

input:
  kafka:
    addresses: ["localhost:9092"]
    topics: ["events"]
    consumer_group: "my-app"
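
Here consumer_group lets multiple pipeline instances join the same Kafka consumer group and split the topic's partitions between them.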

Browse all 61 inputs →


Processors

Transform, filter, and enrich data as it flows through the pipeline.

Categories:

  • Transformation: Mapping (Bloblang), JQ, JavaScript
  • Parsing: JSON, CSV, XML, Avro, Protobuf, Grok
  • Filtering: Conditional routing, deduplication
  • Enrichment: HTTP requests, cache lookups, SQL queries
  • Aggregation: Batching, windowing, grouping

Example:

pipeline:
  processors:
    - mapping: |
        root.user = this.userId
        root.timestamp = now()
        root.message = this.msg.uppercase()
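
For instance, given the hypothetical input message below, this mapping renames userId to user, adds a processing timestamp, and uppercases msg (the timestamp value shown is illustrative):

{"userId": "u-123", "msg": "hello world"}

becomes:

{"user": "u-123", "timestamp": "2025-01-01T00:00:00Z", "message": "HELLO WORLD"}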

Browse all 73 processors →


Outputs

Destinations where processed data is sent.

Categories:

  • Messaging: Kafka, NATS, RabbitMQ, MQTT
  • Databases: PostgreSQL, MySQL, MongoDB, Elasticsearch
  • Storage: S3, GCS, Azure Blob, local files
  • HTTP: REST APIs, webhooks
  • Observability: Datadog, Prometheus, StatsD
  • Special: Stdout (debugging), Drop (discard)

Example:

output:
  s3:
    bucket: processed-data
    path: "year=${!timestamp_year()}/data.json"
    region: us-east-1
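
The ${!timestamp_year()} interpolation in path is resolved at write time, so objects land under date-partitioned prefixes such as year=2025/.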

Browse all 69 outputs →


Using Components

Basic Usage

  1. Choose a component from the catalog
  2. Configure it using YAML
  3. Add it to the appropriate section of your pipeline (input, pipeline, or output), as in the sketch below
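
Putting the steps together, a minimal end-to-end sketch built from the snippets above (stdout: {} is assumed to accept an empty config, as is typical for a debug output):

input:
  kafka:
    addresses: ["localhost:9092"]
    topics: ["events"]
    consumer_group: "my-app"

pipeline:
  processors:
    - mapping: |
        root = this
        root.processed_at = now()

output:
  stdout: {}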

Configuration

Each component has:

  • Required fields: Must be specified
  • Optional fields: Have sensible defaults
  • Advanced options: For fine-tuning
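
For example, annotating the kafka input from above (treating addresses and topics as required and consumer_group as optional is an assumption; each component's reference page is authoritative):

input:
  kafka:
    addresses: ["localhost:9092"]  # required: brokers to connect to
    topics: ["events"]             # required: topics to consume
    consumer_group: "my-app"       # optional: a default applies if omitted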

Finding Components

By category:

  • Browse the catalog grouped by the categories listed above (messaging, databases, storage, HTTP, and more)

By search:

  • Use the component catalog search
  • Filter by type, status, difficulty

Component Features

Batching

Many components support batching for efficiency:

output:
  s3:
    bucket: logs
    batching:
      count: 100
      period: 10s
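
With this configuration, a batch is typically flushed as soon as either limit is reached: 100 buffered messages or 10 elapsed seconds, whichever comes first.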

Error Handling

Handle failures gracefully:

output:
  kafka:
    addresses: ["localhost:9092"]
    topic: "events"
    retry:
      max_retries: 3
      backoff:
        initial_interval: 1s
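
When retries are exhausted, failed messages can be rerouted rather than dropped. A minimal sketch, assuming a Benthos-style fallback output is available (check the output catalog to confirm), which tries each destination in order:

output:
  fallback:
    - kafka:
        addresses: ["localhost:9092"]
        topic: "events"
    - stdout: {}  # assumed last-resort destination for undeliverable messages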

Metrics

All components export metrics:

  • Throughput (messages/sec, bytes/sec)
  • Latency
  • Errors
  • Success/failure counts
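
A minimal sketch of wiring these metrics to an exporter, assuming a Benthos-style top-level metrics section with a Prometheus endpoint (the exact section name is an assumption; check the configuration reference):

metrics:
  prometheus: {}  # assumed: exposes component metrics for Prometheus scraping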

What's Next?

👉 Browse Components - Explore all 200+ components

👉 Edge Processing - Learn about the benefits of processing data at the edge

👉 Examples - See real pipeline examples