Acompanhamento, Análise e Pesquisa de Opinião Pública e Sentimentos
Acompanhamento, Análise e Pesquisa de Opinião Pública e Sentimentos
This project has two main goals. The first is to design an opinion mining system capable of measuring, almost in real-time, sentiments vis-à-vis parties and political actors and the economy in the contents of both conventional web-based media (online newspapers) and the so-called social media (blogs and micro-blogs). The second is to use the data collected in such a way to explore and explain the relationship between trends in sentiments as expressed in the conventional media, the social media and the public opinion polls and surveys in Portugal.
The concept of "public opinion" that prevails in modern social scientific research consists of the aggregation of individual attitudes, preferences and beliefs, as captured by polls and surveys using randomly selected samples. This, however, should not lead us to believe that surveys can be the only theoretically and empirically relevant source of data for the study of "public opinion." On the one hand, as means to capture mass beliefs and attitudes, they have been facing increasing challenges in dealing with sources of bias in inferences, caused by coverage problems in telephone polls - the rise of "cell only" individuals and households - and rising non-response rates. On the other hand, understanding and explaining mass public opinion has always required more than the use of surveys. Media content is not only a relevant object of study on its own for communication scholars, but can also provide crucial insights into the very sources of mass public opinion. Are mass attitudes somehow explained by the media messages to which individuals are exposed? Are purely news-based or editorial contents equally influential, and is that distinction clear cut? Is "public opinion" driven by the cues and frames provided by "published opinion"? Or instead, are the views conveyed by elites and media agents affected by the preferences of mass publics? Answering such questions requires the collection of data beyond the provided by survey research.
The relationship between political and economic events, how citizens come to apprehend them and how they react to them has become more complex with the rise of the social media. Blogs and micro-blogs (such as Twitter) perform several functions in this relationship. First, they can constitute additional sources of politically relevant information and stimuli for citizens. Second, they are themselves sources of relevant information and stimuli for journalists and political elites, raising the possibility that social media messages and conversations indirectly influence public opinion way beyond that the size of their readership might suggest. Finally, they may provide a window into mass public opinion itself: although bloggers, micro-bloggers and those who engage in online communication are certainly not a representative cross-section of the population at large, the small but rapidly increasing research on the content of social media messages suggests that their frequency and tone provide valid indications of trends and even, in some cases, work as a leading indicator of electoral results.
As Drezner and Farrell note, one of the problems faced by scholars in this regard is that "the proper exploitation of this data requires skills and expert knowledge of a kind that social scientists frequently don't have". This project addresses this problem by constituting a truly multidisciplinary team composed by computers engineers, linguists, political scientists and economists with the technical and theoretical expertise required to meet the project's main goals.
Project POPSTAR - Public Opinion and Sentiment Tracking, Analysis and Research - PTDC/CPJ-CPO/116888/2010 - Financed by FCT
Public Opinion;
Social Web Mining;
Online Sentiment;
Time series
This project has two main goals. The first is to design an opinion mining system capable of measuring, almost in real-time, sentiments vis-à-vis parties and political actors and the economy in the contents of both conventional web-based media (online newspapers) and the so-called social media (blogs and micro-blogs). The second is to use the data collected in such a way to explore and explain the relationship between trends in sentiments as expressed in the conventional media, the social media and the public opinion polls and surveys in Portugal.
The concept of "public opinion" that prevails in modern social scientific research consists of the aggregation of individual attitudes, preferences and beliefs, as captured by polls and surveys using randomly selected samples. This, however, should not lead us to believe that surveys can be the only theoretically and empirically relevant source of data for the study of "public opinion." On the one hand, as means to capture mass beliefs and attitudes, they have been facing increasing challenges in dealing with sources of bias in inferences, caused by coverage problems in telephone polls - the rise of "cell only" individuals and households - and rising non-response rates. On the other hand, understanding and explaining mass public opinion has always required more than the use of surveys. Media content is not only a relevant object of study on its own for communication scholars, but can also provide crucial insights into the very sources of mass public opinion. Are mass attitudes somehow explained by the media messages to which individuals are exposed? Are purely news-based or editorial contents equally influential, and is that distinction clear cut? Is "public opinion" driven by the cues and frames provided by "published opinion"? Or instead, are the views conveyed by elites and media agents affected by the preferences of mass publics? Answering such questions requires the collection of data beyond the provided by survey research.
The relationship between political and economic events, how citizens come to apprehend them and how they react to them has become more complex with the rise of the social media. Blogs and micro-blogs (such as Twitter) perform several functions in this relationship. First, they can constitute additional sources of politically relevant information and stimuli for citizens. Second, they are themselves sources of relevant information and stimuli for journalists and political elites, raising the possibility that social media messages and conversations indirectly influence public opinion way beyond that the size of their readership might suggest. Finally, they may provide a window into mass public opinion itself: although bloggers, micro-bloggers and those who engage in online communication are certainly not a representative cross-section of the population at large, the small but rapidly increasing research on the content of social media messages suggests that their frequency and tone provide valid indications of trends and even, in some cases, work as a leading indicator of electoral results.
As Drezner and Farrell note, one of the problems faced by scholars in this regard is that "the proper exploitation of this data requires skills and expert knowledge of a kind that social scientists frequently don't have". This project addresses this problem by constituting a truly multidisciplinary team composed by computers engineers, linguists, political scientists and economists with the technical and theoretical expertise required to meet the project's main goals.
Project POPSTAR - Public Opinion and Sentiment Tracking, Analysis and Research - PTDC/CPJ-CPO/116888/2010 - Financed by FCT