Rooted in Language.
Built for Impact.

A community-driven research collective focused on NLP and low-resource technologies for the languages of Meghalaya, including Khasi and Garo.

NLP Low-Resource Community Research

Our Mission

Bridging the gap between academic research and practical deployment for indigenous languages.

Research: Advance NLP research specifically for low-resource and indigenous languages.
Data: Create gated datasets and reproducible models to standardize benchmarks.
Deployment: Connect academic findings to real-world deployment.
Ethics: Encourage ethical, transparent, and community-led AI development.

Areas of Focus

Targeted initiatives to digitize and strengthen the languages of Meghalaya.

🧠

Machine Translation

Developing Neural Machine Translation (NMT) systems specifically for Khasi, Garo, and related dialects.

📝

Documentation

Language documentation, corpus creation, and building robust annotation pipelines.

🛠

Applied AI

Creating NLP systems designed for public good and real-world utility.

⚖️

Tooling

Building open tools for easier data collection and community annotation.

What You’ll Find

We build, curate, and release assets on Hugging Face.

📚 Datasets

High-quality data for training and evaluation.

  • Parallel Corpora
  • Annotated Text
  • Speech Resources
  • QA Datasets
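
As an illustration of what working with a parallel corpus can involve, here is a minimal sketch that parses sentence pairs from JSON Lines text and assigns each pair to a train or test split by hashing, so the split stays the same across runs. This is not a description of our actual dataset schema: the field names (`source`, `target`, `source_lang`, `target_lang`), the `ParallelPair` class, both helper functions, and the placeholder sentences are hypothetical. The codes `kha` and `eng` are the ISO 639-3 codes for Khasi and English, used here only as example labels.

```python
import hashlib
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class ParallelPair:
    """One aligned sentence pair. Field names are hypothetical."""
    source: str       # sentence in the source language
    target: str       # its translation
    source_lang: str  # language label, e.g. an ISO 639-3 code
    target_lang: str


def parse_jsonl(text: str) -> list[ParallelPair]:
    """Parse JSON Lines text, skipping blank lines and pairs with an empty side."""
    pairs = []
    for line in text.splitlines():
        if not line.strip():
            continue
        record = json.loads(line)
        pair = ParallelPair(
            source=record["source"].strip(),
            target=record["target"].strip(),
            source_lang=record["source_lang"],
            target_lang=record["target_lang"],
        )
        if pair.source and pair.target:
            pairs.append(pair)
    return pairs


def assign_split(pair: ParallelPair, test_percent: int = 10) -> str:
    """Assign a pair to 'train' or 'test' by hashing its text (stable across runs)."""
    key = f"{pair.source}\t{pair.target}".encode("utf-8")
    bucket = int(hashlib.sha256(key).hexdigest(), 16) % 100
    return "test" if bucket < test_percent else "train"


# Placeholder records only; real sentences would come from a released corpus.
sample = "\n".join([
    json.dumps({"source": "<Khasi sentence 1>", "target": "<English sentence 1>",
                "source_lang": "kha", "target_lang": "eng"}),
    "",  # blank line, skipped
    json.dumps({"source": "<Khasi sentence 2>", "target": "  ",
                "source_lang": "kha", "target_lang": "eng"}),  # empty target, skipped
    json.dumps({"source": "<Khasi sentence 3>", "target": "<English sentence 3>",
                "source_lang": "kha", "target_lang": "eng"}),
])

pairs = parse_jsonl(sample)
for pair in pairs:
    print(assign_split(pair), pair.source)
```

Hashing the sentence text rather than shuffling with a random seed means a pair keeps its split even when new data is appended, which is one simple way to keep benchmarks comparable between releases.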

🤖 Models

State-of-the-art architectures adapted for low-resource languages.

  • Fine-tuned Transformers
  • Experimental NLP Models
  • Reproducible Checkpoints

🧪 Spaces

Interactive ways to test our technology.

  • Live Demos
  • Evaluation Tools
  • Interactive Experiments

🧭 Governance

Tynrai currently operates as a community organization. Roles and access are managed by the org admins. As the project evolves, governance structures may be formalized to ensure long-term sustainability.

📜 License

Unless otherwise specified, our content is released under a permissive open-source license (Apache-2.0). We believe in open science while respecting data sovereignty.