Basic information on COVID-19 mobile contact tracing apps from tweets

We have been monitoring the Twitter social media platform for tweets relevant to European COVID-19 mobile contact tracing apps for a year (from July 2020 to June 2021). We used the official streaming Twitter APIs to collect the relevant tweets, and we opened a tweet streaming for each contract tracing app. The table below shows the analysed contact tracing apps and the search keys used for streaming.

Keywords used for streaming on Twitter about European COVID-19 mobile contact tracing apps

For every individual tweet, a set of predetermined analyses were conducted in a real-time way combining the Tweepy library, Apache kafka and Elasticsearch. We developed a system capable of processing different streams of tweets and extracting place names, opinions, hashtags, entities and quite a number of aggregations.

Relevant information extracted from the collected tweets

The table below shows the number of tweets collected for each mobile contact tracing app, ordered by the number of tweets. Moreover, the table reports the percentage of tweets with opinions, geographic (geo) information and the number of detected relevant EMM news.

The opinions were extracted for the languages English, French, German, Spanish and Italian, which are the more frequently used languages in the our dataset. For the language detection we used the standard Twitter engine for language identification and evaluation. In the Sentiment analysis section you can find more information and our relevant observations.

Tweets that contained any kind of geo information were identified using the geoparser library Mordecai. The table shows the percentage of tweets which contain a place name in the tweet text or in the user profile.

A cross checking was performed between the dedicated  EMM channel for Contact tracing apps and the tweet-linked pages in order to detect meaningful tweets.

Finally, we considered the hashtags that are a very good source of information. In the section Hashtags  there are more analyses regarding hashtags with related results and observations.

Covid apps and temporal evolution

The stacked area chart below shows the user activities about the main European COVID-19 mobile contact tracing apps. We can notice a high activity during the second wave of the pandemic between the October and November months. Furthermore, the chart shows some peak-areas with a temporal extension of around five days that usually represent relevant activities, like, for example software releases, news and any kind of relevant events.

Geolocalized Tweets

As already anticipated in the table about COVID-19 apps and temporal evolution, geoinformation has been extracted from the tweet texts and the user profiles. The map below shows the density of the geolocalised tweets (geo-tweets) around Europe, considering only the place names extracted from the tweet text. Some European capitals, such as Paris, London, Dublin and Madrid are the main areas of interest detected by the geo-tweets in their countries. In Italy and Germany, though, the geo-tweet density is more scattered along theirs cities.

EMM news about covid-19 apps

The plot chart below shows the number of the relevant EMM-news detected by cross checking with the linked pages, and the background shows the trend of the tweets.

Sentiment analysis and Hashtags

As the two analyses return related results, dedicated sections have been created. The first section includes a deep analysis about  opinions  and the second one an exploratory analysis using hashtags and network analysis.