AI4Bharat, the AI research lab associated with IIT Madras, recently launched Airavata, an instruction-tuned model tailored to the Hindi language. The model, fine-tuned from Sarvam AI's OpenHathi, aims to improve performance on assistant-style tasks by training on a range of Hindi instruction tuning datasets.
The Airavata development approach
AI4Bharat emphasizes a sustainable approach to the development of Airavata. The model is trained on human-curated, license-friendly instruction tuning datasets, and the team avoids data generated by commercial models such as GPT-4. This keeps costs down and, because the data carries no restrictive licensing terms, allows unrestricted use in downstream applications.
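The selection rule described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `DatasetInfo` fields, the `PERMISSIVE_LICENSES` set, and the dataset names are hypothetical and do not reflect AI4Bharat's actual tooling or dataset list.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical allow-list of license identifiers; the real criteria
# used by AI4Bharat are not stated in this article.
PERMISSIVE_LICENSES = {"apache-2.0", "mit", "cc-by-4.0", "cc0-1.0"}


@dataclass
class DatasetInfo:
    """Illustrative metadata record for a candidate dataset."""
    name: str
    license: str
    from_commercial_model: bool  # True if generated by a model such as GPT-4


def select_datasets(candidates: List[DatasetInfo]) -> List[DatasetInfo]:
    """Keep datasets that are permissively licensed and not model-generated."""
    return [
        d for d in candidates
        if d.license.lower() in PERMISSIVE_LICENSES
        and not d.from_commercial_model
    ]


if __name__ == "__main__":
    # Made-up entries, used only to exercise the filter.
    candidates = [
        DatasetInfo("human_written_qa", "Apache-2.0", False),
        DatasetInfo("synthetic_chat", "cc-by-4.0", True),
        DatasetInfo("restricted_corpus", "proprietary", False),
    ]
    for d in select_datasets(candidates):
        print(d.name)
```

Filtering on both license and provenance matters here because a dataset can be openly licensed and still consist of outputs from a commercial model.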
Tackling the scarcity of Hindi data
Using IndicTrans2, an open-source machine translation model for Indian languages, the team translates high-quality English supervised instruction tuning datasets into Hindi. This addresses the scarcity of Hindi instruction data and is in line with AI4Bharat's commitment to advancing Indic language models.
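As a rough sketch of this translate-then-train idea, the snippet below maps a translation function over the text fields of each instruction example. The field names (`instruction`, `input`, `output`) and the `toy_translate` stand-in are assumptions for illustration; in practice the `translate` argument would wrap a real machine translation model such as IndicTrans2, whose API is deliberately not reproduced here.

```python
from typing import Callable, Dict, List, Sequence

Example = Dict[str, str]


def translate_examples(
    examples: List[Example],
    translate: Callable[[str], str],
    fields: Sequence[str] = ("instruction", "input", "output"),
) -> List[Example]:
    """Return copies of the examples with the given text fields translated.

    `translate` is any English-to-Hindi function supplied by the caller.
    Empty or missing fields are left untouched.
    """
    translated = []
    for ex in examples:
        new_ex = dict(ex)  # copy so the source data is not modified
        for field in fields:
            text = ex.get(field, "")
            if text.strip():
                new_ex[field] = translate(text)
        translated.append(new_ex)
    return translated


def toy_translate(text: str) -> str:
    """Hypothetical stand-in translator: a tiny word-level glossary lookup."""
    glossary = {"hello": "नमस्ते", "world": "दुनिया"}
    return " ".join(glossary.get(word.lower(), word) for word in text.split())


if __name__ == "__main__":
    english = [{"instruction": "Hello world", "input": "", "output": "hello"}]
    for row in translate_examples(english, toy_translate):
        print(row)
```

Keeping the translator as a plain function argument means the same loop works with any translation backend, and the original English examples stay available for comparison or quality checks.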
Airavata full release
AI4Bharat has released not only Airavata itself but also the instruction tuning datasets used to train it. This encourages innovation in Indic language modeling by letting researchers and developers build on and contribute to Hindi language models.
The broader context
AI4Bharat's release comes at a time of growing worldwide interest in large language models. Most recent attention has gone to English-centric models, leaving a gap in support for Indian languages. The collaboration with Sarvam AI on OpenHathi laid the foundation, and with Airavata, AI4Bharat is taking a significant step toward meeting the need for Hindi language models.
Looking to the future
As AI4Bharat continues to push the boundaries of AI research, Airavata is a testament to the lab's commitment to innovation and sustainability. The model's performance on natural language understanding (NLU) tasks is noteworthy and points to potential applications across a range of domains.
Our opinion
The launch of Airavata is a milestone for AI4Bharat and paves the way for further advances in Indic language models. It is in line with the global shift toward more inclusive language models that look beyond English-centric approaches. Airavata's impact on Hindi language processing could herald further progress in the broader landscape of AI language models.