Tech Mahindra Launches 'Project Indus' LLM, Phase-1 Designed for the Hindi and Its 37+ Dialects

Tech Mahindra has just introduced Project Indus, a large language model (LLM) designed to converse in a multitude of Indian languages and dialects.

Project Indus stands out due to its focus on Indic languages and dialects, making it a unique and valuable addition to the language model landscape.

To give a comparative perspective, the well-known multilingual models like BERT, XLM are trained on a mix of languages, including English, but may not perform optimally for specific Indic languages. Project Indus, on the other hand, is tailored specifically for Indic languages, ensuring better accuracy and understanding.

Similarly, powerful LLMs like GPT-3 and GPT-4 are primarily trained on English and other major languages. Project Indus focuses on Indic linguistic diversity, addressing nuances and dialects that these models might miss.

Existing Indic-specific models such as TALNet and IndicBERT are valuable but may lack the scale and versatility of Project Indus.

In summary, Project Indus bridges the gap by offering a robust, scalable, and context-aware solution for Indian languages. Its focus on dialects and industry applications makes it a promising addition to the AI landscape.

Moreover, Tech Mahindra is collaborating with Dell Technologies and Intel to implement the project’s ‘GenAI in a box’ framework globally. As part of this collaboration, Tech Mahindra will also leverage Intel® Gaudi®AI Accelerators and AI training assets to train the future generation of Indus models as well as skill up its employees on Intel product portfolio (hardware and software) to provide GenAI expertise to its wide network of global customers across industries.

1. Foundational Model for Indic Languages:

  • Project Indus is an indigenous LLM developed by Tech Mahindra.
  • The first phase of Indus LLM focuses on the Hindi language and its 37+ dialects.
  • It aims to provide advanced AI solutions that enable enterprises to scale rapidly.

2. Innovative Deployment Framework: GenAI in a Box:

  • The Indus LLM will be implemented using an innovative framework called 'GenAI in a box'.
  • This solution simplifies the deployment of advanced AI models for enterprises.
  • It leverages Dell Technologies' high-performance computing solutions, storage, and networking capabilities.

3. Intel Collaboration:

  • The LLM also adopts Intel-based infrastructure solutions, including Intel® Xeon® Processors and OneAPI software.
  • Future generation products leveraging CPU features like Intel® Advanced Matrix Extensions (AMX) are used.
  • Tech Mahindra collaborates with Intel to train the future generation of Indus models and skill up its employees on Intel product portfolio.

4. Industry Applications:

  • Project Indus aims to redefine AI-driven solutions across various industries.
  • Use cases include customer support, experience, content creation, and more in sectors like healthcare, rural education, banking, finance, agriculture, and telecom.

5. Dell Technologies' Perspective:

Denise Millard, Chief Partner Officer at Dell Technologies, emphasizes the importance of accessibility and scalability for organizations adopting AI.

The Dell AI Factory supports LLMs like Project Indus, promoting growth, productivity, and innovation.

Tech Mahindra has been making significant strides in offering next-gen solutions to enterprises worldwide. The company recently announced that it is building an LLM to preserve Bahasa Indonesia, the official and national language of Indonesia and its dialects. This collaboration further demonstrates Tech Mahindra's commitment to enabling enterprises to scale rapidly with technological advancements, building a future where AI solutions are accessible, scalable, and responsible.
Advertisements

Post a Comment

Previous Post Next Post
Like this content? Sign up for our daily newsletter to get latest updates.