A brief discussion on Confusion Matrix – As the COVID-19 pandemic continues to ravage the world, India has done remarkably well. Even as global infections have crossed 1.2 million, India’s number at around 4,000 seem model, given we are home to one-sixth of the world’s population. In per capita terms, only 3 in a million people in India are infected by COVID-19, vs 156 in a million people globally (as of 5th April 2020).
A number of reasons have been suggested including a smaller number of tests, BCG vaccine usage in India, Indian strain being less virulent and Indians being more resistant to infections.
In this context, I seek to use the confusion matrix (also called error matrix and matching matrix in unsupervised learning) to discuss the impact of higher testing.
Students of my Quantitative Methods class would be familiar with the chart below. The confusion matrix tends to confuse a lot of the students. Its implications are underappreciated by many practitioners of statistics.
1. True Positive: A true positive is the correct affirmation of the presence of a condition
For example, concluding a pregnant lady is pregnant would be “True Positive”
2. False Negative: A false negative is an error in which a test result improperly indicates no presence of a condition (the result is negative), when in reality, the condition is present.
For example, concluding a pregnant lady is not pregnant would be “False Negative”
3. False Positive: A false positive is the error in affirmation of the presence of a condition
For example, concluding a man is pregnant would be “False Positive”
4. True Negative: A true negative is the correct affirmation that the condition is not present
For example, concluding a man is not pregnant would be “True Negative”
For any tests, some error will occur. Some infected people will be assumed to be uninfected (false negatives) and some uninfected people will show up being infected (false positives). What are the true positive rates (also called sensitivity) for the COVID-19 tests? We don’t know yet.
Now lets assume India has the money, resources, time and effort to get everyone in India tested. Yes, all 1.3 billion Indians. What would that mean? Think about this before you peep into the solution.
1. True Positive:
99.9% will be correctly diagnosed or 12, 987 infected Indians will be confirmed to have been infected
2. False Negative:
0.1% (or 13 Indians) will be incorrectly diagnosed as being uninfected while they are infected
1,29,99,87,000 are uninfected. However, the false positive rate is 0.01%. As a result, 129,999 people who do not have the infection would have been diagnosed as being infected. This is around 10 times the actual number of infected. As a result, if the medical test leads to even small false positives, excessive testing may overwhelm the medical system and make it difficult for the actual patents to get correct treatment.
99.99% of the 1,29,99,87,000 uninfected (or 1,29,98,57,001) will be true negatives.
However, we realize how excessive testing and even small false positives can impact us in ways we do not realize. Currently, 3% of COVID-19 tests lead to confirmation of infections. We are testing 33x more than actual infections which is better than most nations and in line with South Korea, among the best in testing its citizens. India is already testing 97% of the people who end up not being infected. We may infer that we are testing more likely and possible cases than most western nations already.
Are we testing enough? Should we test more? I am not a medical practitioner, and will let more capable minds decide on the testing rates.
What do you think? Would love to hear your thoughts and comments.
Disclaimer: I offer my views, with the knowledge that medicine, and health is not my area of expertise. Also, given that many discussions on the topic have been polarized by political leanings and viewpoints, I would like to stress that these are not to promote any ideology.
Fill in the details to know more
What Are SOC and NOC In Cyber Security? What’s the Difference?
February 27, 2023
Fundamentals of Confidence Interval in Statistics!
February 26, 2023
A Brief Introduction to Cyber Security Analytics
Cyber Safe Behaviour In Banking Systems
February 17, 2023
Everything Best Of Analytics for 2023: 7 Must Read Articles!
December 26, 2022
Best of 2022: 5 Most Popular Cybersecurity Blogs Of The Year
December 22, 2022
Metaverse: The Virtual Universe and its impact on the World of Finance
April 13, 2023
The Portal Podcast Transcription – Episode 3 – Analytics in HR Management With Sayantani Pyne
March 18, 2023
Podcast Transcript Episode 2: Product Thinking For Entrepreneurs With Mr. Praveen Udupa, Co-founder, eedge.ai
March 13, 2023
“The Power of SQL in Driving Business Success”
March 8, 2023
Exploring the Potential of Artificial Intelligence & Machine Learning for Improving Program Management
February 28, 2023
Add your details:
By proceeding, you agree to our privacy policy and also agree to receive information from UNext through WhatsApp & other means of communication.
Upgrade your inbox with our curated newletters once every month. We appreciate your support and will make sure to keep your subscription worthwhile