Code & Resources
All our research code is available on GitHub.
Featured Repositories
LinguaFlow
Neural Architecture for Multilingual NLP
A comprehensive toolkit for building and evaluating multilingual language models with cross-lingual transfer capabilities.
GitHub Repository | Documentation | Paper
Stars: 2.4k | Forks: 380
DialogueMaster
End-to-End Dialogue Systems
Open-source framework for building task-oriented and open-domain conversational AI systems.
GitHub Repository | Demo | Paper
Stars: 1.8k | Forks: 290
KnowledgeGraph-QA
Question Answering over Knowledge Graphs
State-of-the-art models for semantic parsing and question answering over large-scale knowledge bases.
GitHub Repository | Datasets | Paper
Stars: 950 | Forks: 180
FairNLP
Fairness Evaluation Toolkit
Tools and benchmarks for evaluating and improving fairness in NLP models.
GitHub Repository | Leaderboard | Paper
Stars: 720 | Forks: 140
Datasets
| Dataset | Description | Size | Download |
|---|---|---|---|
| MultiLang-QA | Multilingual QA benchmark | 500K examples | Link |
| DialogueBench | Dialogue evaluation dataset | 100K conversations | Link |
| FairText | Bias evaluation corpus | 50K sentences | Link |
Pre-trained Models
We release pre-trained models on Hugging Face:
- LinguaFlow-Base - Multilingual encoder (12 layers, 768 hidden)
- LinguaFlow-Large - Multilingual encoder (24 layers, 1024 hidden)
- DialogueMaster-GPT - Dialogue generation model
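
To give a feel for the relative size of the two encoders listed above, here is a back-of-the-envelope sketch that estimates the parameter count of the transformer layers from the layer count and hidden size alone. It assumes a standard encoder block with a 4x feed-forward expansion, and it ignores embeddings, biases, and layer norms; none of these details are stated for the LinguaFlow models, so the function name `approx_encoder_params` and the `ffn_multiplier` default are illustrative assumptions, not the released configuration.

```python
def approx_encoder_params(num_layers: int, hidden_size: int, ffn_multiplier: int = 4) -> int:
    """Rough weight count for a stack of standard transformer encoder layers.

    Assumptions (not taken from the model cards): each layer has four
    hidden x hidden attention projections (Q, K, V, output) and a two-matrix
    feed-forward block with inner size ffn_multiplier * hidden_size.
    Embeddings, biases, and layer norms are excluded.
    """
    attention = 4 * hidden_size * hidden_size
    feed_forward = 2 * ffn_multiplier * hidden_size * hidden_size
    return num_layers * (attention + feed_forward)


if __name__ == "__main__":
    # Layer counts and hidden sizes as listed for the two encoders.
    for name, layers, hidden in [
        ("LinguaFlow-Base", 12, 768),
        ("LinguaFlow-Large", 24, 1024),
    ]:
        millions = approx_encoder_params(layers, hidden) / 1e6
        print(f"{name}: ~{millions:.0f}M parameters in encoder layers")
```

Under these assumptions the Base encoder comes to roughly 85M layer parameters and the Large encoder to roughly 302M; the actual checkpoints will be larger once the multilingual embedding table is included.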