datalad
« Back to VersTracker
Description:
Data distribution geared toward scientific datasets
Type: Formula  |  Latest Version: 1.2.3@1  |  Tracked Since: Dec 17, 2025
Links: Homepage  |  @datalad  |  formulae.brew.sh
Category: Developer tools
Tags: data-science scientific-computing version-control research reproducibility
Install: brew install datalad
About:
Datalad is a distributed data management system that tracks and shares datasets across multiple storage locations. It leverages Git and Git-Annex to provide a robust version control layer for large scientific data, enabling seamless collaboration and data provenance tracking.
Key Features:
  • Distributed data versioning with Git and Git-Annex
  • Reproducible data analysis workflows
  • Access to a global network of shared datasets
  • Supports large files and diverse storage backends
Use Cases:
  • Managing and versioning large scientific datasets
  • Ensuring reproducibility in computational research
  • Sharing data across institutions with distributed storage
Alternatives:
  • Git LFS – Datalad offers more powerful data distribution and management features on top of Git, whereas Git LFS focuses primarily on handling large files within a single repository.
  • DVC – DVC is heavily optimized for machine learning pipelines, while Datalad is a more general-purpose tool for any type of scientific data management and distribution.
License: MIT
Dependencies: certifi, cryptography, git-annex, p7zip, python@3.14
Bottles available for: arm64_tahoe, arm64_sequoia, arm64_sonoma, sonoma, arm64_linux, x86_64_linux
Version History
Detected Version Rev Change Commit
Dec 13, 2025 3:07pm 1 VERSION_BUMP 9c13dc9b
Sep 15, 2025 6:39am 0 VERSION_BUMP 314d8b11
Dec 15, 2024 2:27am 0 VERSION_BUMP 30efd6a2
Nov 18, 2024 10:39pm 0 VERSION_BUMP d505e428
Nov 18, 2024 9:34pm 0 VERSION_BUMP 993bb9f9
Oct 12, 2024 8:34pm 0 VERSION_BUMP aa42e154
Oct 12, 2024 3:12pm 0 VERSION_BUMP 1fc44f87