Make GitHub your single source of truth for ML

Make GitHub your single source of truth for ML

Review dataset and model diffs in GitHub

Review dataset and model diffs in GitHub

The PRs you love, now scaled to support ML. The XetData app upgrades your GitHub experience.

When you use pull requests to review models alongside your code and data, the XetData app will show difference visualizations of  model architecture changes via Netron for improved understandability.

Track your ML models and deployments with Git hashes

Track your ML models and deployments with Git hashes

Stop saving models with complicated naming conventions that combine folder names, dates, and version number. Let Git do the heavy lifting and rely on Git hashes to tell you what was made when.

Instead of reinventing the wheel, we opted for the same approach software teams have been using for decades.

Only upload changes to your large files

Only upload changes to your large files

Our block-level deduplication algorithm minimizes time spent waiting for file uploads and downloads while saving on storage costs.

It's so novel that we wrote a paper on it (CIDR'23). The takeaway? We fast. Read more about our performance against Git LFS, DVC, and LakeFS in our benchmark blog post.

Frequently Asked Questions

What does the XetData integration do?

How is the XetData integration different from Git LFS or DVC?

How is the XetData integration different from XetHub?

Can I try this on my repository now?

How much does this cost?

Works with your existing data tools

Keep your existing file formats, libraries, ML frameworks, and IDEs.

Stop git-ignoring your data

Install the XetData integration for GitHub today to version your data and models alongside your code.

Stop git-ignoring your data

Install the XetData integration for GitHub today to version your data and models alongside your code.

Stop git-ignoring your data

Install the XetData integration for GitHub today to version your data and models alongside your code.