Hate speech corpus overview

The DP-R|EX hate speech corpus comprises messages from 127 public Telegram channels classified as right-wing extremist. The data cover the period from January 2023 to September 2024 and include more than 2.5 million messages. A sample of 1,800 messages was annotated using an established hate speech classification scheme (Dahn) and target group categories. The corpus thus provides an empirical basis for analysing right-wing extremist online communication, forms and narratives of hate speech, and radicalisation dynamics in the German-speaking context.


The corpus contains Telegram messages together with basic metadata (e.g. forwards, timestamps, links, interactions). For annotation, message texts were extracted and a random sample was classified manually. The annotation records five forms of hate speech and the target groups affected.


The data enable analyses including:


  • forms and intensities of hate speech
  • addressed target groups
  • communication dynamics of right-wing extremist online milieus
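
As a rough illustration of the first two kinds of analysis, the following sketch tabulates hate speech forms against target groups for a handful of annotated records. The record layout, the field names (channel, form, target) and all labels are hypothetical placeholders, not the corpus's actual schema or categories, and the sketch assumes one form and one target group per record.

```python
from collections import Counter

# Hypothetical annotated records. Field names and labels are placeholders
# chosen for illustration; they are NOT the corpus's real schema.
records = [
    {"channel": "channel_1", "form": "form_A", "target": "group_X"},
    {"channel": "channel_1", "form": "form_B", "target": "group_X"},
    {"channel": "channel_2", "form": "form_A", "target": "group_Y"},
    {"channel": "channel_2", "form": "form_A", "target": "group_X"},
]


def crosstab(rows, row_key, col_key):
    """Count how often each pair of values of two annotation fields co-occurs."""
    return Counter((r[row_key], r[col_key]) for r in rows)


def shares(rows, key):
    """Return the relative frequency of each value of one annotation field."""
    counts = Counter(r[key] for r in rows)
    total = sum(counts.values())
    return {value: n / total for value, n in counts.items()}


form_by_target = crosstab(records, "form", "target")
form_shares = shares(records, "form")

for (form, target), n in sorted(form_by_target.items()):
    print(f"{form} x {target}: {n}")
```

The same two helpers could be applied per channel or per month to approximate communication dynamics, provided the real data expose comparable fields.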