CommonLID
Language Identification for the Real Web
CommonLID is a language-identification (LID) benchmark project for noisy web text. It provides an official Python package and CLI for evaluating LID models on the CommonLID benchmark and complementary datasets.
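As a rough illustration of what evaluating a language-identification model involves, the sketch below scores predicted language labels against gold labels using accuracy and macro-averaged F1. It is not the CommonLID package's API: the function name `evaluate_lid`, the choice of metrics, and the short language codes are all assumptions made for this example.

```python
from collections import Counter


def evaluate_lid(gold, pred):
    """Score predicted language labels against gold labels.

    Hypothetical helper for illustration only; not part of the CommonLID package.
    Returns (accuracy, macro_f1, per_label_f1).
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    if not gold:
        raise ValueError("at least one example is required")

    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1  # predicted label p was wrong
            fn[g] += 1  # gold label g was missed

    # Score every label seen in either list, so spurious predictions count too.
    labels = sorted(set(gold) | set(pred))
    per_label_f1 = {}
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        per_label_f1[label] = 2 * tp[label] / denom if denom else 0.0

    accuracy = sum(tp.values()) / len(gold)
    macro_f1 = sum(per_label_f1.values()) / len(per_label_f1)
    return accuracy, macro_f1, per_label_f1


if __name__ == "__main__":
    # Toy labels (assumed to be short language codes), not benchmark data.
    gold = ["en", "en", "fr", "de"]
    pred = ["en", "fr", "fr", "de"]
    accuracy, macro_f1, per_label_f1 = evaluate_lid(gold, pred)
    print(f"accuracy={accuracy:.3f} macro_f1={macro_f1:.3f}")
    for label, score in per_label_f1.items():
        print(f"  {label}: F1={score:.3f}")
```

Macro averaging gives every language equal weight regardless of how many examples it has, which is one common choice when the label distribution is skewed; the metrics the official tooling actually reports may differ.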