Bespoke data solutions for your AI models
We are the world's first data cooperative enabling economic opportunities for rural Indians.


Over 3 million paid digital tasks have been completed on the Karya platform.
Our goal at Karya is simple: bring dignified, digital work to rural Indians. By providing high-quality data annotation services for AI/ML models, we create economic opportunities for rural communities.
Building Speech Datasets
Our platform builds speech corpora across major Indian languages, powering language models and conversational AI in low-resource languages.
Document Digitisation
Karya workers digitise documents in English and local Indian languages. Research on this work was presented at CHI in Glasgow.
Image Datasets
Collect diverse, regionally specific image corpora, such as the 10,000 Bengali signboards we gathered for Microsoft OCR.
Image Annotation
Bounding boxes, classification and labelling at scale. We are currently annotating a corpus of 100,000 newspaper photos.