Nemotron-Personas-India: Synthesized Data for Sovereign AI
NVIDIA released Nemotron-Personas-India, a dataset of 98,000 synthetic human interactions in 12 Indian languages. The dataset aims to support development of AI models tailored to Indian languages and cultural contexts. You can access the dataset via Hugging Face. This release supports the growth of sovereign AI capabilities in India.
- 98,000 synthetic human interactions in 12 Indian languages.
- Dataset available on Hugging Face for model training.
- Supports development of India-specific AI models.