Dask logo


Runtime engine

Dask is a flexible library for parallel computing in Python.

Use it when

  • You are working with large-scale data analytics.
  • You want a familiar interface that mimics NumPy, Pandas, and PySpark for drop-in replacement.
  • You want to run the same Pandas or NumPy code locally or on a cluster.

Watch out

  • It does not support deep learning.

Example stacks

Airflow + MLflow stack


python -m pip install "dask[complete]"