Introduction

What is Kubeflow Pipelines?

Kubeflow Pipelines (KFP) is a platform for building and deploying portable and scalable machine learning (ML) workflows using Docker containers.

With KFP you can author components and pipelines using the KFP Python SDK, compile pipelines to an intermediate representation YAML, and submit the pipeline to run on a KFP-conformant backend such as the open source KFP backend or Google Cloud Vertex AI Pipelines.

The open source KFP backend is available as a core component of Kubeflow or as a standalone installation. Follow the installation instructions and Hello World Pipeline example to quickly get started with KFP.

Why Kubeflow Pipelines?

KFP enables data scientists and machine learning engineers to:

Author end-to-end ML workflows natively in Python
Create fully custom ML components or leverage an ecosystem of existing components
Easily manage, track, and visualize pipeline definitions, runs, experiments, and ML artifacts
Efficiently use compute resources through parallel task execution and through caching to eliminating redundant executions
Maintain cross-platform pipeline portability through a platform-neutral IR YAML pipeline definition

What is a pipeline?

A pipeline is a definition of a workflow that composes one or more components together to form a computational directed acyclic graph (DAG). At runtime, each component execution corresponds to a single container execution, which may create ML artifacts. Pipelines may also feature control flow.

Next steps

Feedback

Was this page helpful?

Glad to hear it! Please tell us how we can improve.

Sorry to hear that. Please tell us how we can improve.

Last modified April 4, 2023: add kfp v2 keywords to pipelines documentation (#3488) (7c98c9d)

You are viewing documentation for Kubeflow 1.7

Introduction

Why Kubeflow Pipelines?

What is a pipeline?

Next steps

Feedback