Distributed DataFusion documentation

Index

Distributed DataFusion is a library that brings distributed capabilities to DataFusion. It provides a set of execution plans, optimization rules, configuration extensions, and new traits to enable distributed execution.

This user guide walks you through using the tools in this project to set up your own distributed DataFusion cluster.

  • Concepts

  • Getting Started

  • Building a ChannelResolver

  • Building a TaskEstimator

  • Building an Arrow Flight endpoint

  • How a distributed plan is built
