The Russian protest framing dictionary was created to analyse how Russian state-controlled media cover street protests. Each keyword in the dictionary carries a continuous score, which allows computer programs to place Russian-language news stories on a scale running from social disorder to freedom to protest.
The dictionary was constructed using a technique called Latent Semantic Scaling. It is based on a 27-million-word corpus of Russian newspaper articles and TV transcripts published in state-controlled media sources in 2011-2014 (NTV, Russia 1, Channel 1, Izvestia, Russian Gazette and Komsomolskaya Pravda). The dictionary captures the framing of protest on a par with human coders. Nevertheless, caution is needed when applying it to news stories from other time periods or to other types of media content.
Using the dictionary is straightforward and follows the same procedure as other forms of dictionary-based content analysis. Document scores should be calculated ignoring words not found in the dictionary: a document's score is the sum of the scores of its dictionary words divided by the number of dictionary words it contains, not by the total number of words in the document. Please see the sample code for more detail.
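The scoring rule described above can be illustrated with a minimal Python sketch. It is not the sample code distributed with the dictionary. The function name `score_document`, the two toy entries and their scores are hypothetical and chosen only to make the arithmetic easy to follow. The sketch also assumes that dictionary entries can be matched against lower-cased surface tokens; real use may require the same preprocessing that was applied when the dictionary was built.

```python
import re


def score_document(text, dictionary):
    """Mean score of the dictionary words found in `text`.

    Words that are not in the dictionary are ignored, so the divisor is
    the number of matched words, not the total number of words.
    Returns None when the text contains no dictionary words.
    """
    # Simple tokenisation (an assumption): lower-case, split on non-word characters.
    tokens = re.findall(r"\w+", text.lower())
    matched = [dictionary[t] for t in tokens if t in dictionary]
    if not matched:
        return None
    return sum(matched) / len(matched)


# Hypothetical entries and scores, for illustration only.
toy_dictionary = {"беспорядки": -1.0, "свобода": 2.0}

# Five tokens, two of which are dictionary words: (2.0 + -1.0) / 2
print(score_document("Свобода и беспорядки на улицах", toy_dictionary))
```

In this example the divisor is 2, the number of matched words, rather than 5, the total number of tokens. Returning `None` for a text with no dictionary words is a design choice of this sketch, so that unscored documents are not confused with documents that score exactly zero.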