Translate text between multiple languages
Translate Japanese text to English
Generate responses using a language model
Traduz textos entre mais de 100 idiomas usando IA
Traduz texto do inglês para múltiplas línguas
Translate text between Western Armenian and English
Translate text between languages
Translate text into multiple languages
Transform informal Persian text to formal
Translate text from English to multiple languages
Translate Russian to Circassian
Translate English text to Spanish
Translate text from Chinese to English or German
Helsinki-NLP/tatoeba_mt is a multilingual machine translation model developed by Helsinki-NLP. It is designed to translate text between multiple languages efficiently, leveraging the Tatoeba dataset, which is a large collection of example sentences and translations. This model is particularly useful for low-resource languages and provides high-quality translations for a wide range of language pairs.
• Multilingual Support: Translate between multiple languages, including low-resource languages.
• High-Quality Translations: Fine-tuned on the Tatoeba dataset for accurate and natural translations.
• Open-Source: Accessible for research, development, and customization.
• Efficient Inference: Optimized for both speed and quality in translation tasks.
• Flexible Integration: Can be integrated into various applications for translation needs.
pip install transformers
from transformers import MarianMTModel, MarianTokenizer
Helsinki-NLP/tatoeba_mt
) to load the pre-trained model and tokenizer.
model_name = "Helsinki-NLP/tatoeba_mt"
tokenizer = MarianTokenizer.from_pretrained(model_name)
model = MarianMTModel.from_pretrained(model_name)
def translate_text(text, source_lang, target_lang):
batch = tokenizer([text], return_tensors="pt")
gen = model.generate(**batch)
return tokenizer.decode(gen[0], skip_special_tokens=True)
translated = translate_text("Hello, how are you?", "en", "fr") # Example: French translation
print(translated)
1. What languages does Helsinki-NLP/tatoeba_mt support?
The model supports a wide range of languages, including many low-resource languages. You can check the specific language pairs by referring to the Hugging Face model card.
2. How accurate are the translations?
The translations are highly accurate, especially for language pairs with sufficient training data. However, accuracy may vary for very low-resource languages.
3. Can I use Helsinki-NLP/tatoeba_mt for commercial purposes?
Yes, the model is open-source and can be used for both research and commercial applications under the Apache 2.0 license.