π Getting Started with FlowyML
Welcome to FlowyML! In the next 5-10 minutes, you'll go from zero to running your first production-ready pipeline. No prior MLOps experience required.
π― What You'll Build
A complete ML pipeline with data loading, processing, context injection, and real-time monitoring. These patterns scale from quick prototypes to enterprise deployments.
π¦ Installation
FlowyML requires Python 3.9 or higher.
π§ Basic Installation
π‘ Pro Tip
Use a virtual environment (venv or conda) to avoid dependency conflicts with other projects.
π Full Installation (Recommended)
Includes UI support and common ML dependencies:
What this gets you: The web dashboard, Keras integration, cloud storage backends, and everything you need for production deployments. Start with this unless you have size constraints.
β Verify Installation
You should see the version number. If not, check that your Python PATH is configured correctly.
π Your First Project
Let's create a new project using the CLI.
This creates a directory structure like this:
ποΈ Why this structure?
It separates code (src/), configuration (flowyml.yaml), and dependencies (requirements.txt) β exactly what you need for clean version control and team collaboration.
π§ͺ Creating a Pipeline
Open src/pipeline.py and replace its content with this simple example:
π¬ Understanding What Just Happened
Let's break down the key concepts:
-
@stepdecorator: Turns any Python function into a pipeline step. Theoutputs=["data"]tells FlowyML what this step produces. -
Data flow: The
@step(inputs=["data"], ...)onprocess_dataautomatically connects it tofetch_data's output. No manual wiring needed. -
Pipeline assembly:
pipeline.add_step()builds your DAG. FlowyML figures out the execution order based on data dependencies. -
Execution:
pipeline.run()executes all steps in the right order and returns a result object with status and outputs.
π― Why this matters
This same pattern works whether you have 3 steps or 300. The complexity doesn't grow with your pipeline.
βΆοΈ Running the Pipeline
Execute the script:
You should see output indicating the steps are executing:
β‘ Caching in Action
Pipelines are idempotent by default. Run it again and watch how caching kicks in β steps that haven't changed won't re-execute.
π₯οΈ Visualizing with the UI
Now, let's see your pipeline in the FlowyML UI β this is where the magic happens for debugging and monitoring.
Step 1: Start the UI server
You'll see:
π§ What's running
A lightweight FastAPI server that displays your pipeline runs, DAG visualizations, and artifact inspection β all in real-time.
Step 2: Run your pipeline (in a separate terminal)
Step 3: Watch it live!
Open your browser to http://localhost:8080. You'll see:
- π Pipeline DAG: Visual graph showing step dependencies
- β‘ Real-time execution: Steps highlight as they run
- π Artifact inspection: Click any step to see its inputs/outputs
- π Run history: Compare different runs side-by-side
π Why the UI matters
Imagine debugging a failed step at 3 AM in production. Instead of grep'ing through logs, you see exactly: which step failed, what its inputs were, the full error traceback, and what downstream steps were skipped.
ποΈ Adding Context & Parameters
Let's make the pipeline configurable using context β one of FlowyML's killer features.
Update your pipeline:
Run it again:
Output:
π‘ The Power of Context Injection
π Why this is revolutionary
You just separated configuration from code. The same pipeline can run with different configs for Dev (small dataset), Staging (medium dataset), and Production (full dataset). Change the context, not the code.
π Next Steps
Congratulations! You've built a complete pipeline with monitoring. Here's where to go next based on your goals:
π― Production Pipelines
- Projects & Multi-Tenancy β Organize teams & environments
- Scheduling β Cron-style automation
- Versioning β Track & rollback changes
π Performance & Scale
- Caching Strategies β Save compute costs
- Parallel Execution β Run steps concurrently
- Performance Guide β Benchmarks & tuning
π¬ Advanced ML Features
- Assets & Lineage β Typed artifacts
- Model Registry β Version models
- LLM Tracing β GenAI costs & tracing
π§ Deep Dive into Concepts
β Core Concepts: Pipelines β Master pipeline design patterns
β Core Concepts: Steps β Learn step best practices
β Core Concepts: Context β Advanced context injection techniques
π¨ Integrate with Your Stack
β Keras Integration β Automatic experiment tracking
β GCP Integration β Deploy to Google Cloud
β Custom Components β Extend FlowyML
Questions or stuck? Check out the Resources page for community links, tutorials, and support channels.
Ready to dive deeper? The User Guide is your next stop for production-grade patterns.