Pelcra

Polish & English Language Corpora for Research & Applications

OSW Polish-English Parallel Corpus (CC-BY-NC)

This corpus contains texts from the Centre for Eastern Studies (OSW), sentence-aligned with the mALIGNa aligner using the Gale & Church algorithm. The original texts were downloaded from the OSW website and are available under the CC-BY-NC license.

 

Source language   Target language   Texts   Source words   Target words   Alignment level   Alignment type
Polish            English           796     635,352        796,947        sentence          statistical

 

The corpus is available for download at http://pelcra.pl/resources/parallel/pelcra_par_4.tgz.
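
As a minimal sketch of how the archive might be fetched and inspected, the Python snippet below downloads the file and lists the regular files it contains. The helper names download and list_archive are hypothetical, and the snippet assumes only that the .tgz file is a gzip-compressed tar archive; it makes no assumption about the layout or format of the files inside.

```python
import tarfile
import urllib.request
from pathlib import Path
from typing import List

# URL taken from the corpus description above.
CORPUS_URL = "http://pelcra.pl/resources/parallel/pelcra_par_4.tgz"


def download(url: str, dest: Path) -> Path:
    """Fetch url into dest, skipping the transfer if dest already exists."""
    if not dest.exists():
        urllib.request.urlretrieve(url, str(dest))
    return dest


def list_archive(path: Path) -> List[str]:
    """Return the names of the regular files in a gzip-compressed tar archive."""
    with tarfile.open(path, mode="r:gz") as archive:
        return [member.name for member in archive.getmembers() if member.isfile()]


# Example usage (requires network access):
#   archive = download(CORPUS_URL, Path("pelcra_par_4.tgz"))
#   for name in list_archive(archive):
#       print(name)
```

Listing the members before extracting is a cautious first step, since the internal file names and formats are not documented here.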