ChaiMind
Dataset Mind

India's largest AI dataset repository

High-quality datasets across text, speech, vision and tabular modalities — cleaned, documented and ready to train.

Explore Datasets

Discover, publish, and collaborate on datasets powering the next generation of AI.

Preview data

Browse rows and samples before you download.

Stream & download

Parquet, WebDataset and direct API access.

Version history

Every revision tracked and reproducible.

Dataset metrics

Quality, coverage and bias dashboards.

No datasets published yet

Published datasets — including GitHub-backed datasets — will appear here.