A multilingual dictionary for the recognition of macro-regions and countries mentioned in daily press news
This data paper presents a reproducible method for identifying national and supranational geographical entities mentioned in international news published by daily newspapers. The approach relies on a multilingual dictionary covering macro-regions (part of the world and political organisations) and nation-states in four languages (French, German, English, Turkish). The dictionary has been tested and validated on a corpus of news titles from five daily newspapers in France, Tunisia, Germany, the United Kingdom, and Turkey, spanning the period from April 2013 to March 2023. This corpus serves as an example of the dictionary’s application for elaborating geographical networks and geopolitical agendas.

