About BharatGen

BharatGen is a multimodal large language model initiative, developing advanced generative AI models tailored to India's linguistic, cultural, and socio-economic diversity. At its core is Bharat Data Sagar, a vast repository of India-centric data that ensures the AI models are deeply rooted in the country’s unique context. By integrating text, speech, and images, BharatGen builds accessible AI technologies that foster innovation across key sectors like agriculture, education, and healthcare, ensuring inclusivity for India’s diverse population.

BharatGen is a public / private partnership to build foundational AI technologies in Bharat and with a Bharatiya perspective. The program is launched as a non-profit company with funding from the Government of India, Department of Science and Technology. The program has 4 primary goals...

The creation of Bharat centric foundational LLM models with a focus on our languages, culture and tradition.
The creation of an ambitious data repository called Bharat Data Sagar, that will aim to centralize high quality Bharatiya data necessary for model building
The creation of an ecosystem of companies that can leverage this work to launch products and applications for India and the global south at large.
Upskilling the Indian workforce to become creators of cutting edge AI and not just consumers.

The company is hosted at IIT-Bombay with academic partners across other IITs and IIMs. We are super excited about this launch and happy to share that we have already created an LLM with promising results. As a part of this work we are collaborating with industry partners including Reliance (Jio), & Tata and so on, and also working with government organizations to provide AI based solutions.