Git LFS logo


Data versioning

Git LFS is an open-source tool that allows you to version large files with Git.

Use it when

  • You want to version large files such as datasets.
  • You want to use the same commands and interface as versioning code with Git.

Watch out

  • Source code repository tools may impose a file size limit (2GB for GitHub Free)
  • You must download training data when cloning a remote repository. It requires large Internet bandwidth and can be very slow.

Example stacks

Airflow + MLflow stack


git lfs install