Polish & English Language Corpora for Research & Applications


DiaBiz is a newly released corpus of Polish call center dialogues, comprising phone-based interactions in several business domains. It currently contains:

  1. 3,766 conversations amounting to 385 hours and over 3 million words
  2. dialogues between 5 professional call-center agents and 189 participants acting as customers
  3. data from 8 business domains with high commercial demand for conversational analytics and automation solutions
  4. dialogues based on 200 real-life interaction scenarios.
Current information on DiaBiz is available here.