Vaccination Drives and the Inequalities Underneath

Eshan Gujarathi , IIT Gandhinagar, eshan.rg@iitgn.ac.in

Hitarth Gandhi , IIT Gandhinagar, hitarth.g@iitgn.ac.in

Vishal Soni , IIT Gandhinagar, jayesh.s@iitgn.ac.in

Repo

Covid Vaccination Drive Analysis and the Inequalities Underneath

Introduction

The Covid-19 pandemic, of which we are all too aware, has had a devastating impact on the world in recent years. It is much more than just a health threat and has affected each individual in some way or the other. With lockdowns and emergencies in all parts of the world, it has changed the way everything used to function. The economic and social disruption caused by the pandemic is huge, and has caused crisis even in well developed countries. Millions of people lost their job, and are at risk of extreme poverty. Lot of enterprises reached a state of existential threat. But all of this can be improved in the future. Something we cannot change is the dramatic loss of human life worldwide and the effect it has had on the health of people.

The best way to really understand how the coronavirus pandemic has affected the world is using statistics. Let us first start with analyzing its spread and the number of deaths it caused.

Analysis of the Covid Spread around the world

A lot of data in the dataset is missing, and thus we drop the rows which have missing data in the necessary columns.

Note: Interpolation techniques cannot be used here to fill the missing data since interpolating features like 'iso_code', 'location', 'continent', and 'date' does not make sense.

Following is the visualization of the data we will be using. Some of the relavant columns in our data for each country include:

The spread of cases around the world

The following plot denotes the number of 'new cases' in a day. We can move on the slider timeline bar to get the plot for a particular day. We can thus get a idea of the time period when a country was hit by a covid wave. We can hover over the countries in the world map to get the exact number of cases.

For example, if you check for around April 2021, we observe that India has the most rise in cases. This was the time India was going through the second wave of the pandemic, and thus we had most new cases in a day.

The next plot shows the 'total number of cases' with time for each country. We can move on the slider timeline bar to see the rise of cases around the world.

Effect of Covid-19 in terms of human loss

The most major and devastating loss due to Covid-19 was in terms of human life. Let us now visualize the number of deaths.

We first drop the rows which have missing data in the relevant columns.

The following plot shows the total number of deaths for some of the countries with time. We have chosen the countries from all three categories, developed, developing, and under-developed. The x-axis represents the date and the y-axis represents the number of deaths.

We can choose to visualize the plot for a particular country by double clicking on that country. We can add the plots for countries to compare with a single click on the country we wish to add. Now if we want to go back and visualize plots for all countries, we can double click on any unselected country.

We can also hover on the graph to get the exact number of deaths of a particular country till that date.

Does the above plot mean that US, Brazil, and India are the worst affected countries due to Covid? No! This is because each country has different population and thus we should not compare them with the total deaths. One possible solution would be to compare the deaths per million. To understand it better and prove the above, we can take a simple example.

Below is a similar plot, but here, instead of the total deaths, we have the plot for total deaths per million. If you select India and Georgia in the above plot, we can see that the total deaths in India (514k) is a lot more than that of Georgia (16k). Now, select the same two countries in the below plot. The total deaths per millions for India as observed is around 350, but on the other hand, the total deaths per million for Georgia is around 4000!

This is a big misconception that a lot of poeple have, and thus small contries like Georgia don't make it to the news even if they are more severely affected, and don't get the attention and help they should.

To get a better insight, we should view the statistics for each continent separately as each of them have different resources. Below is the plot for the top 5 countries in Asia with the most and least deaths.

We can give other continents as input in the python function defined below to view the statistics of that continent.

We saw how much damage Covid has done to the world. What is the solution? Vaccines!

Covid Vaccination Analysis:

Introduction:

To bring this long running pandemic to an end, an efficient and inclusive distribution of Covid-19 vaccines could be our next most prospective step. If order to take action along these lines, we first need to understand how the current covid vaccination drives are runnning. If we are to interpret this data, we will be able to identify any underlying inequalities that might be happening during the distribution of covid vaccinations. So, our task is to understand Covid-19 vaccinations data worldwide and draw inferences from the same to understand how Covid-19 vaccination drives are going. We also plan to understand the underlying inequalities across the world.

Covid Vaccine Distribution

As mentioned, let us compare the distribution of Covid-19 vaccines in different countries. For better understanding the inequalities in the distribution, we compare the statistics for 3 countries, Canada, India, and Chad. Canada is a developed country, India is a developing country, and Chad is an under-developed country.

In each plot, we show three trends, distribution of first dose, second dose and the booster dose. For comparing the three, we have plotted the percentage of poeple in the country who have received the doses. We can hover on the plots to see the percentage of vaccinated people at a particular point of time.

We have also created a custom python function in which we can input the country for which we need to see the trends.

We can clearly observe that Canada is the most vaccinated country with around 81% of people doubly vaccinated, and also the trends show that vaccination drive was organized in a planned and timely manner. This shows that developed countries were able to vaccinate a majority of their population in a short span of time. The vaccination drive for booster dose started quite early in the developing countries and can be seen from the line plots.

In India, around 57% of the population is fully vaccinated. It is evident from the curve of the graph that developing countries like India took a lot of time to complete their vaccination drives whereas developed countries have a steep vaccination curve. Moreover, the vaccination drive in India started a couple of months later as compared to Canada, this adds to the delayed vaccination drive in India for the booster dose.

Underdeveloped countries like Chad are in a very bad situation with only 1% of poeple fully vaccinated and the drive for booster dose not even started. Such countries need a lot of attention and help as they have a weak economy and lack the resources needed to carry out planned vaccination drives.

Covid Vaccinations by different manufacturers

Now let us have a look at how there was a rise in the number of doses of different vaccines in various countries. The type of vaccine and thus the manufacturer played an important role in the vaccination drives due to their cost and success rate.

The data contains the number of total doses of different vaccines with time in each country.

Following is the plot for the country Argentina. The x-axis represents the time series and the y-axis contains the number of total doses of that particular vaccine. We can observe how certain vaccines saw a sudden rise in their production.

We can also create a interactive plot in the following way. When run in python, this gives us a dropdown to select the country for which we need to analyse the number of doses of different vaccines used in that country.

area = widgets.Dropdown(
    options=v_by_manu["location"].unique(),
    value='Argentina',
    description='Country',
)

def plotit(area):
    v_arg=v_by_manu[v_by_manu["location"]==area]
    x_ticks = [v_arg["date"].min(),v_arg["date"].max()]
    for i in v_arg["vaccine"].unique():
        v_arg_spu=v_arg[v_arg["vaccine"]==i]
        plt.plot(v_arg_spu["date"],v_arg_spu["total_vaccinations"], label=i)
    x_labels = x_ticks
    plt.xticks(ticks=x_ticks, labels=x_labels)
    plt.legend()
    plt.xlabel('Date')
    plt.ylabel('Total Vaccinations till Date')
    plt.title(area)

interactive(plotit, area=area)

Understanding the Inequalities

What would be the reason for different percentage of vaccinated people in different countries? There are two main factors:

Vaccination vs Economy

First, let's understand the co-relation of vaccination with economy of the country. We have plotted a scatter plot for the percentage of people fully vaccinated against the GDP of the country. We have grouped the countries of a continent with the same colour to understand trends between continents. We have also plotted the trendline that fits the data.

We can hover on the points to get the country name, GDP and vaccination percentage for that country.

Based on the above plot, we can make the following inferences:

  1. We see a positive co-relation between GDP and vaccination percentage. Countries with higher GDP have more percentage of vaccination as expected and we can conclude that they had planned vaccination drives. This is also evident from the trendline.
  2. Africa has the worst percentage of vaccination and its countries have relatively low GDP.
  3. On the other hand, some countries of Asia like Bhutan and Combodia have a high vacciantion percentage inspite of having low GDP

Vaccination vs Human Development Index

The second factor that might affect vaccination drives is the Human Development Index. HDI is a composite measure of a country's life expectancy, education and per capita income. It indicates the overall development of a country. Below is the scatter plot for percentage of poeple fully vaccinated against the HDI. Like the previous plot, we have grouped countries of a continent with the same colour and also plotted the trendline fitting the data.

Based on the above plot we can make the following observations:

  1. Even HDI has a positive co-relation with percentage of fully vaccinated poeple, as shown by the trendline. Countries with high HDI have better vaccination percentage.
  2. African countries have a low HDI, and thus low vaccination percentage. These countries need support to organize vaccination drives and need to speed up the process.
  3. European and some of North-American and Asian countries have a very good HDI and thus are thus more developed as compared to others. They were thus able to organize vaccination drives in a planned and organized way.
  4. Similar to the previous plot, Asian countries Bhutan and Combodia go against the trend, and have high vaccination percentage in spite of having low HDI.

Conclusion

From the analysis made above, we get a very good insight of the effects of Covid-19, Vaccination drives, and the inequalities beneath. Data we see is often misleading, and under-developed severely affected countries with low population don't get the help they actualy need. The economy and the Human Development Index of a country have a very important role in the vaccination drives. Countries with good HDI and economy have better vaccination percentages and were able to organize planned vaccination drives in less amount of time. We can also infer that HDI and ecomony are directly proportional. The booster dose vaccination is yet to be started in many of the under-developed countries and even some of the developing countries. The only way to end the coronavirus pandemic is getting vaccinated. The entire world needs to fight in this together and this can only happen by breaking the inequalities between the developed, developing and under-developed countries.