Development of language technology changes the relationship we have with the computer. Nowadays we expect that the computer can find relevant documents, even if searching in images and videos, extract information from such documents or produce reasonably understandable translations from a range of languages.
Fundamentally all these possibilities are based on research into how words are used in large collections of text, how their use differs across languages, how human intentions can be integrated with data available in texts, and so on. Our research in computational linguistics addresses these topics. Our research has implications for language theory — for example, it provides reliable information on which linguistic features or their configurations are common across languages — as well as practical applications, for example in machine translation and language teaching.
In the process of our research, we have assembled large collections of corpora for a wide range of languages from the web, developed new tools for their automatic tagging and parsing, investigated their composition in terms of domains and genres using methods of machine learning, studied their use in such tasks as terminology extraction or machine translation, and developed applications for their use in language teaching, interpretation and subtitling, and other practical areas.