Repositories list
41 repositories
GlotOCR-bench (Public): GlotOCR Benchmark
GlotLID (Public): [EMNLP 2023] 💬 Language Identification with Support for More Than 2000 Labels
Glot500 (Public): [ACL 2023] Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages
GlotWeb (Public): [WWW 2026] 🕸 GlotWeb: Web Indexing for Minority Languages
GlotScript (Public): [LREC 2024] 🖋 Resource and Tool for Writing System Identification
cisnlp.github.io (Public): Homepage of cisnlp
- This is the codebase for "Large Reasoning Models Are (Not Yet) Multilingual Latent Reasoners"
multypo (Public): A Multilingual Keyboard Layout-Based Typo Generator
KLAR-CLC (Public)
Language-Mixing (Public)
manchu-in-context-mt (Public)
MIB-circuit-track (Public)
spatial_intuitions (Public)
- [EMNLP 2025] Tracing Multilingual Factual Knowledge Acquisition in Pretraining
MEXA (Public): [ACL 2025] 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment
GlotCC (Public): [NeurIPS 2024] 🕸 GlotCC Dataset and Pipeline
code-specific-neurons (Public): [ACL 2025] 💻🔍 How Programming Concepts and Neurons Are Shared in Code Language Models
oscar-io (Public)
ungoliant (Public)
oscar-tools (Public)
LangSAMP (Public): LangSAMP: Language-Script Aware Multilingual Pretraining
analogical_reasoning (Public)
Transliteration-PPA (Public): Breaking the Script Barrier in Multilingual Pre-Trained Language Models with Transliteration-Based Post-Training Alignment
lohoravens-webpage (Public)
MaskLID (Public): [ACL 2024] 💬 MaskLID: Code-Switching Language Identification through Iterative Masking
Taxi1500 (Public)
TransMI (Public): TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data
TransliCo (Public): TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models