Corpora and corpus tools

The Centre for Translation Studies has developed and hosts a range of large representative corpora in a variety of languages including English, Arabic, Chinese, French, German, Italian, Japanese, Spanish, Polish and Russian. Some corpora are available in-house only, while others can be accessed freely. For more information, go to http://corpus.leeds.ac.uk/.

The School of Computing has developed and hosts a number of Arabic corpora, including the Corpus of Contemporary Arabic (http://www.comp.leeds.ac.uk/eric/latifa/research.htm) and the Quranic Arabic Corpus (http://corpus.quran.com/).

For a range of corpus processing tools developed at Leeds, go to http://corpus.leeds.ac.uk/tools/.