ABSTRACT
In this article, we present a collection of tweet data from five major cities of Pakistan (the provincial capitals and the federal capital) for the year 2021. The data were collected using the Python library Tweepy, and tweets were filtered using a set of related keywords. The dataset comprises five sub-datasets, one per city. Each sub-dataset contains tweet data on a daily basis and can be analyzed separately. Pakistan experienced two COVID-19 waves in 2021, so these datasets also include tweets reflecting people's behavior from the third to the fourth wave. The datasets can be used to compare the mental health and sentiments of people across the different cities. They may also be of interest to researchers in fields such as data science, sentiment analysis, natural language processing, and psychology.