![]() The deformatter removes formatting markup (HTML, RTF, etc.) that should be kept in place but not translated.Source language text is passed into Apertium for translation.The diagram displays the steps that Apertium takes to translate a source-language text (the text we want to translate) into a target-language text (the translated text). This is an overall, step-by-step view how Apertium works. Translation methodology Pipeline of Apertium machine translation system Several companies are also involved in the development of Apertium, including Prompsit Language Engineering, Imaxin Software and Eleka Ingeniaritza Linguistikoa. Language data developed for it (in collaboration with the Universidade de Vigo, the Universitat Politècnica de Catalunya and the Universitat Pompeu Fabra) currently support (in stable version) the Arabic, Aragonese, Asturian, Basque, Belarusian, Breton, Bulgarian, Catalan, Crimean Tatar, Danish, English, Esperanto, French, Galician, Hindi, Icelandic, Indonesian, Italian, Kazakh, Macedonian, Malaysian, Maltese, Northern Sami, Norwegian ( Bokmål and Nynorsk), Occitan, Polish, Portuguese, Romanian, Russian, Sardinian, Serbo-Croatian, Silesian, Slovene, Spanish, Swedish, Tatar, Ukrainian, Urdu, and Welsh languages. ![]() ![]() To create a new machine translation system, one just has to develop linguistic data (dictionaries, rules) in well-specified XML formats. It was originally designed to translate between closely related languages, although it has recently been expanded to treat more divergent language pairs. Being an open-source project, Apertium provides tools for potential developers to build their own language pair and contribute to the project.Īpertium originated as one of the machine translation engines in the project OpenTrad, which was funded by the Spanish government, and developed by the Transducens research group at the Universitat d'Alacant. Įxisting machine translation systems available at present are mostly commercial or use proprietary technologies, which makes them very hard to adapt to new usages furthermore, they use different technologies across language pairs, which makes it very difficult, for instance, to integrate them in a single multilingual content management system.Īpertium uses a language-independent specification, to allow for the ease of contributing to Apertium, more efficient development, and enhancing the project's overall growth.Īt present (December 2020), Apertium has released 51 stable language pairs, delivering fast translation with reasonably intelligible results (errors are easily corrected). Constraint Grammar taggers are also used for some language pairs (e.g. It is free software and released under the terms of the GNU General Public License.Īpertium is a shallow-transfer machine translation system, which uses finite state transducers for all of its lexical transformations, and hidden Markov models for part-of-speech tagging or word category disambiguation. POSIX compatible and Windows NT (limited support)Īpertium is a free/open-source rule-based machine translation platform. 1 November 2022 10 months ago ( 1 November 2022)
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |