Fluid ForgeFluid Forge
Home
Get Started
  • Local (DuckDB)
  • GCP (BigQuery)
  • Snowflake Team Collaboration
  • Declarative Airflow
  • Orchestration Export
  • Jenkins CI/CD
  • Universal Pipeline
CLI Reference
  • Overview
  • Architecture
  • GCP (BigQuery)
  • AWS (S3 + Athena)
  • Snowflake
  • Local (DuckDB)
  • Custom Providers
  • Roadmap
GitHub
GitHub
Home
Get Started
  • Local (DuckDB)
  • GCP (BigQuery)
  • Snowflake Team Collaboration
  • Declarative Airflow
  • Orchestration Export
  • Jenkins CI/CD
  • Universal Pipeline
CLI Reference
  • Overview
  • Architecture
  • GCP (BigQuery)
  • AWS (S3 + Athena)
  • Snowflake
  • Local (DuckDB)
  • Custom Providers
  • Roadmap
GitHub
GitHub
  • Introduction

    • /
    • Getting Started
    • Snowflake Quickstart
    • Vision & Roadmap
  • Walkthroughs

    • Walkthrough: Local Development
    • Walkthrough: Deploy to Google Cloud Platform
    • Walkthrough: Snowflake Team Collaboration
    • Declarative Airflow DAG Generation - The FLUID Way
    • Generating Orchestration Code from Contracts
    • Jenkins CI/CD for FLUID Data Products
    • Universal Pipeline
  • CLI Reference

    • CLI Reference
    • init Command
    • validate Command
    • plan Command
    • apply Command
    • verify Command
    • generate-airflow Command
  • Providers

    • Providers
    • Provider Architecture
    • GCP Provider
    • AWS Provider
    • Snowflake Provider
    • Local Provider
    • Creating Custom Providers
    • Provider Roadmap
  • Advanced

    • Blueprints
    • Governance & Compliance
    • Airflow Integration
    • Built-in And Custom Forge Agents
    • FLUID Forge Contract GPT Packet
    • Forge Copilot Discovery Guide
    • Forge Copilot Memory Guide
  • Project

    • Contributing to Fluid Forge
    • Fluid Forge v0.7.1 - Multi-Provider Export Release

Vision & Roadmap

The Problem

Data engineering is too hard. Building a production analytics pipeline today means:

  • Hundreds of lines of cloud SDK boilerplate per provider
  • Deep expertise in BigQuery and Athena and Snowflake APIs
  • Manually crafting IAM policies, then rewriting them for each cloud
  • Copy-paste infrastructure with no reusable abstractions
  • Weeks of setup before a single row of data flows

For every new data product, teams repeat the same laborious process. The industry has solved this for infrastructure (Terraform), containers (Kubernetes), and configuration (Ansible). Data products deserve the same treatment.


The Solution

Fluid Forge introduces Data Products as Code — a declarative approach where you write a YAML contract describing what you want, and the engine figures out how to build it.

# One contract. Every cloud.
fluidVersion: "0.7.1"
kind: DataProduct
id: analytics.customers
name: Customer Analytics

metadata:
  owner: { team: data-engineering }

exposes:
  - exposeId: customers_table
    kind: table
    binding:
      platform: gcp            # Change to aws or snowflake — same contract
      resource:
        type: bigquery_table
        dataset: analytics
        table: customers
        partitioning:
          type: time
          field: created_at
          granularity: DAY
    contract:
      schema:
        - name: id
          type: INTEGER
          required: true
        - name: email
          type: STRING
          sensitivity: pii

Behind the scenes, Fluid Forge:

  • Creates datasets, tables, and schemas with optimal configuration
  • Sets up IAM roles, service accounts, and RBAC
  • Generates Airflow DAGs for orchestration
  • Validates schema compatibility and detects configuration drift
  • Enforces governance policies and data sovereignty rules

All from one contract file.


Core Principles

1. Declarative First

You declare the desired state. Fluid Forge plans the execution, handles errors, ensures idempotency, and converges toward that state. No imperative scripts. No manual steps.

2. Developer Experience

pip install fluid-forge          # Install
fluid init my-project --quickstart  # Scaffold
fluid apply contract.fluid.yaml --yes  # Deploy

If a workflow isn't delightful, it's not done. Zero boilerplate, maximum productivity.

3. Multi-Cloud Native

GCP, AWS, and Snowflake are production-ready today. Azure and Databricks are on the roadmap. The key insight: same contract, same commands, different cloud. Switching providers is a one-line change.

4. Production Ready

Enterprise features out of the box:

  • Built-in governance and compliance (GDPR, SOC2)
  • Automated testing and contract validation
  • Drift detection and remediation
  • Comprehensive audit trails
  • Multi-environment support (dev → staging → prod)

5. Open and Extensible

  • Open source (Apache 2.0) and community-driven
  • Custom providers — build one in ~40 lines of Python
  • LLM integration — plug in any AI model for copilot-powered generation
  • Open standards — export to ODPS v4.1, ODCS v3.1, data mesh catalogs

The Data Product Lifecycle

Fluid Forge covers the full journey:

Design

fluid wizard                              # Interactive guided setup
fluid forge --mode copilot                # AI-powered generation
fluid blueprint list --category analytics # Browse templates

Validate

fluid validate contract.yaml              # Schema + semantic checks
fluid contract-tests contract.yaml        # Contract test suites
fluid policy-check contract.yaml          # Governance compliance

Plan

fluid plan contract.yaml                  # Preview changes (no side effects)
fluid viz-graph contract.yaml             # Visualize data lineage
fluid diff contract.yaml --env prod       # Compare environments

Deploy

fluid apply contract.yaml --yes           # Execute against target provider
fluid verify contract.yaml                # Post-deployment verification
fluid generate-airflow contract.yaml      # Generate orchestration

Operate

fluid diff contract.yaml --exit-on-drift  # Monitor drift in CI/CD
fluid policy-apply policy.yaml            # Enforce governance changes
fluid export-opds contract.yaml           # Export to open data standards

How It Compares

DevOps ConceptFluid Forge Equivalent
Infrastructure as Code (Terraform)Data Products as Code
GitOpsDataOps — contracts in version control
CI/CD PipelinesAutomated data deployments
Policy as Code (OPA/Sentinel)Governance as Code
ObservabilityDrift detection + contract verification

Roadmap

MilestoneWhat's IncludedTimeline
v0.7.1 (current)GCP + AWS + Snowflake production, Airflow/Dagster/Prefect export, governance engine✅ Released
Azure ProviderSynapse Analytics, Data Lake Gen2, Azure FunctionsQ3 2026
Databricks ProviderDatabricks SQL, Delta Lake, MLflow, Unity CatalogQ4 2026
Data Marketplace v2Publish, discover, and compose data products across teams2027

Get Involved

Fluid Forge is open source and built in the open.

  • Star & Fork — github.com/Agentics-Rising/forge-cli
  • Report Issues — Issue Tracker
  • Contribute — Contributing Guide
  • Discussions — GitHub Discussions

Ready to Build?

Get Started →


Copyright 2025-2026 Agentics Transformation Pty Ltd · Open source under Apache 2.0

Edit this page on GitHub
Last Updated: 4/4/26, 3:15 PM
Contributors: khanya_ai, fas89
Prev
Snowflake Quickstart