Enca is an Extremely Naive Charset Analyser. It detects character set and encoding of text files and can also convert them to other encodings using either a built-in converter or external libraries and tools like libiconv, librecode, or cstocs.
Currently it supports Belarusian, Bulgarian
... [More], Croatian, Czech, Estonian, Hungarian, Latvian, Lithuanian, Polish, Russian, Slovak, Slovene, Ukrainian, Chinese, and some multibyte encodings independently on language. [Less]
Task of the project is a semantic annotation of texts using NLP tools.
Czsem Mining Suite is mainly a GATE plugin that allows to use Treex and TectoMT tools inside GATE. Bsides that is also a Information Extraction tool based on dependency liguistics. It si capable to learn tree queries
... [More] (dependecy based extraction rules) using Inducive Logic Programming. [Less]
This site uses cookies to give you the best possible experience.
By using the site, you consent to our use of cookies.
For more information, please see our
Privacy Policy