What is Expanso and how does it work?

Expanso is a managed platform for deploying intelligent data pipelines at the edge. It processes data where it's generated - reducing bandwidth, latency, and costs. You deploy lightweight agents on your infrastructure, build pipelines using our visual builder or YAML, and control everything from a central SaaS platform.

Can I run AI/ML models directly in my data pipelines?

Yes! Expanso supports running ONNX, TensorFlow Lite, and other models as native pipeline steps. Execute low-latency inference on streaming data, enrich events with model outputs (like risk scores), and make decisions at the edge without cloud round-trips.

How many pre-built components are available?

Expanso provides 200+ pre-built components including inputs (Kafka, HTTP, files), processors (transformations, filtering, PII masking, aggregations), and outputs (S3, Snowflake, Datadog, Splunk). Browse the complete catalog in our Component Reference.

Do I need to write code to build pipelines?

No - use our drag-and-drop visual pipeline builder to create sophisticated pipelines without code. For advanced use cases, you can also write pipelines in YAML or use the Bloblang transformation language for complex data mappings.

How does Expanso help with data governance and compliance?

Expanso includes built-in governance features: automatic PII detection and masking, policy enforcement at the edge, RBAC, SSO integration, and comprehensive audit trails. Mask sensitive data before it ever leaves your network.

Distributed Analytics

Overview

Organizations operating across multiple regions face the challenge of collecting, processing, and analyzing data while maintaining compliance with regional data sovereignty laws. Traditional approaches either sacrifice centralized analytics for compliance or risk violations by centralizing raw data. Edge processing enables both: regional data processing that meets compliance requirements while providing centralized analytics and insights.

Expanso's Approach to Distributed Analytics

Expanso Edge processes data regionally at the source, applying local transformations, aggregations, and compliance rules before sending analytics-ready data to centralized systems. Each region operates independently while contributing to a unified analytical view.

Key capabilities:

Regional Processing: Run complete analytics pipelines in each geographic region, ensuring data is processed locally before any cross-border transmission.
Compliance-First Aggregation: Apply regional privacy rules, PII redaction, and data minimization at the edge before creating analytics datasets.
Centralized Analytics: Aggregate pre-processed, compliant data from all regions into a single analytics platform for global insights.
Local-First Operations: Maintain full operational capability in each region even during network outages or compliance restrictions.
Flexible Routing: Send different data types to different destinations - raw events stay regional, aggregated metrics go global.

Benefits of Distributed Edge Analytics

Compliance & Governance

Meet GDPR, CCPA, and data sovereignty requirements through regional processing
Automatic PII redaction and anonymization before data leaves regions
Maintain audit trails showing where and how data was processed
Ensure data residency requirements without sacrificing analytics value

Performance & Scale

Reduce central analytics load by pre-aggregating data at regional edges
Faster queries on smaller, pre-processed datasets vs. raw global data
Scale processing horizontally across regions rather than centralizing
Maintain regional responsiveness with local processing

Operational Resilience

Regional operations continue during network partitions
Local analytics and dashboards remain available regardless of central connectivity
Automatic backfill when connectivity to central systems is restored
No single point of failure in analytics infrastructure

Common Patterns

Regional Aggregation with Global Rollup Process and aggregate data within each region (EU, US, APAC), sending only aggregated metrics and anonymized datasets to global analytics platforms. Raw data never crosses regional boundaries.

Tiered Data Sovereignty Keep personally identifiable data and sensitive business information within regions while sharing anonymized trends, aggregates, and insights globally.

Multi-Region Compliance Apply region-specific privacy rules automatically: GDPR redaction in EU, CCPA compliance in California, industry regulations in specific countries - all from centralized pipeline management.

Federated Querying Run analytics queries that combine results from multiple regional processing systems, each enforcing local compliance rules before contributing to the unified result.

Local-First Dashboards Serve operational dashboards from regional edge systems for immediate insights, while background processes sync aggregated data to central business intelligence platforms.

Example Use Cases

Global retail chains processing transaction data regionally to comply with local privacy laws while maintaining centralized sales analytics and inventory optimization
Healthcare networks analyzing patient data within regional hospital systems, sharing only aggregated clinical insights for research while maintaining HIPAA compliance
Financial services performing fraud detection regionally with local models, aggregating threat intelligence globally while keeping customer data in-region
Multi-national SaaS platforms processing user analytics regionally per GDPR/CCPA requirements while providing product teams with global usage insights

Production Examples

Data RoutingBeginner

Content Routing

Conditional routing and message filtering based on content.

View example →

Data RoutingBeginner

Fan-Out Pattern

Multi-destination broadcast patterns for parallel processing.

View example →

Data TransformationIntermediate

Transform Formats

JSON/CSV/Avro conversion and format transformation.

View example →

Data SecurityIntermediate

Remove PII

GDPR-compliant data redaction and anonymization.

View example →

Next Steps

Quick Start Guide: Build your first distributed analytics pipeline
Bloblang Transformations: Learn data transformation and anonymization techniques
Branch Processor: Route data to different regional destinations
Mapping Processor: Apply regional compliance rules
Kafka Output: Aggregate regional data to central systems

Overview​

Expanso's Approach to Distributed Analytics​

Benefits of Distributed Edge Analytics​

Common Patterns​

Example Use Cases​

Production Examples​

Content Routing

Fan-Out Pattern

Transform Formats

Remove PII

Next Steps​

Overview

Expanso's Approach to Distributed Analytics

Benefits of Distributed Edge Analytics

Common Patterns

Example Use Cases

Production Examples

Next Steps