Science journalism, expertise and common sense

On March 27, the Johns Hopkins University said an article published on the website of the Centre For Disease Dynamics, Economics and Policy (CDDEP), a Washington-based think tank, had used its logo without permission and distanced itself from the study, which had concluded that the number of people in India who could test positive for the new coronavirus could swell into the millions by May 2020. Soon after, a basement of trolls latched onto CDDEP founder-director Ramanan Laxminarayan’s credentials as an economist to dismiss his work as a public-health researcher, including denying the study’s conclusions without discussing its scientific merits and demerits.

A lot of issues are wound up in this little controversy. One of them is our seemingly naïve relationship with expertise.

Expertise is supposed to be a straightforward thing: you either have it or you don’t. But just as specialised knowledge is complicated, so too is expertise.

Many of us have heard stories of someone who’s great at something “even though he didn’t go to college” and another someone who’s a bit of a tubelight “despite having been to Oxbridge”. Irrespective of whether they’re exceptions or the rule, there’s a lot of expertise in the world that a deference to degrees would miss.

More importantly, by conflating academic qualifications with expertise, we risk flattening a three-dimensional picture to one. For example, there are more scientists who can speak confidently about statistical regression and the features of exponential growth than there are who can comment on the false vacua of string theory or discuss why protein folding is such a hard problem to solve. These hierarchies arise because of differences in complexity. We don’t have to insist only a virologist or an epidemiologist is allowed to answer questions about whether a clinical trial was done right.

But when we insist someone is not good enough because they have a degree in a different subject, we could be embellishing the implicit assumption that we don’t want to look beyond expertise, and are content with being told the answers. Granted, this argument is better directed at individuals privileged enough to learn something new every day, but maintaining this chasm – between who in the public consciousness is allowed to provide answers and who isn’t – also continues to keep power in fewer hands.

Of course, many questions that have arisen during the coronavirus pandemic have often stood between life and death, and it is important to stay safe. However, there is a penalty to think the closer we drift towards expertise, the safer we become — because then we may be drifting away from common sense and accruing a different kind of burden, especially when we insist only specialised experts can comment on a far less specialist topic. Such convictions have already created a class of people that believes ad hominem is a legitimate argumentative ploy, and won’t back down from an increasingly acrimonious quarrel until they find the cherry-picked data they have been looking for.

Most people occupy a less radical but still problematic position: even when neither life nor fortune is at stake, they claim to wait for expertise to change one’s behaviour and/or beliefs. Most of them are really waiting for something that arrived long ago and are only trying to find new ways to persist with the status quo. The all-or-nothing attitude of the rest – assuming they exist – is, simply put, epistemologically inefficient.

Our deference to the views of experts should be a function of how complex it really is and therefore the extent to which it can be interrogated. So when the topic at hand is whether a clinical trial was done right or whether the Indian Council of Medical Research is testing enough, the net we cast to find independent scientists to speak to can include those who aren’t medical researchers but whose academic or vocational trajectories familiarised them to some parts of these issues as well as who are transparent about their reasoning, methods and opinions. (The CDDEP study is yet to reveal its methods, so I don’t want to comment specifically on it.)

If we can’t be sure if the scientist we’re speaking to is making sense, obviously it would be better to go with someone whose words we can just trust. And if we’re not comfortable having such a negotiated relationship with an expert – sadly, it’s always going to be this way. The only way to make matters simpler is by choosing to deliberately shut ourselves off, to take what we’re hearing and, instead of questioning it further, running with it.

This said, we all shut ourselves off at one time or another. It’s only important that we do it knowing we do it, instead of harbouring pretensions of superiority. At no point does it become reasonable to dismiss anyone based on their academic qualifications alone the way, say, Times of India and OpIndia have done (see below).

What’s more, Dr Giridhar Gyani is neither a medical practitioner nor epidemiologist. He is academically an electrical engineer, who later did a PhD in quality management. He is currently director general at Association of Healthcare Providers (India).

Times of India, March 28

Ramanan Laxminarayanan, who was pitched up as an expert on diseases and epidemics by the media outlets of the country, however, in reality, is not an epidemiologist. Dr Ramanan Laxminarayanan is not even a doctor but has a PhD in economics.

OpIndia, March 22

Expertise has been humankind’s way to quickly make sense of a world that has only been becoming more confusing. But historically, expertise has also been a reason of state, used to suppress dissenting voices and concentrate political, industrial and military power in the hands of a few. The former is in many ways a useful feature of society for its liberating potential while the latter is undesirable because it enslaves. People frequently straddle both tendencies together – especially now, with the government in charge of the national anti-coronavirus response.

An immediately viable way to break this tension is to negotiate our relationship with experts themselves.

On India’s path to community transmission

There’s a virus out there among many, many viruses that’s caught the world’s attention. This virus came into existence somewhere else, it doesn’t matter where, and developed a mutation at some point that allowed it to do what it needs to do inside the body of one specific kind of animal: Homo sapiens. And once it enters one Homo sapiens, it takes advantage of its new surroundings to produce more copies of itself. Then, its offspring wait for the animal to cough or sneeze – acts originally designed to expel irritating substances – to exit their current home and hopefully enter a new one. There, these viruses go through the same cycle of reproduction and expulsion, and so forth.

This way, the virus has infected over 210,000 people in the last hundred days or so. Some people’s bodies have been so invaded by the virus that their immune systems weren’t able to fight it off, and they – nearly 9,000 of them – succumbed to it.

Thus far, the virus has reportedly invaded the bodies of at least 282 people in India. There’s no telling how the virus will dissipate through the rest of the population – if it needs to – except by catching people who have the virus early, separating them from the rest of the population for long enough to ensure they don’t have and/or transmit the virus or, if they do, providing additional treatment, and finally reintegrating them with the general population.

But as the virus spreads among more and more people, it’s going to become harder and harder to tell how every single new patient got their particular infection. Ultimately, a situation is going to arise wherein too many people have the virus for public-health officials to be able to say how exactly the virus got to them. The WHO calls this phase ‘community transmission’.

India is a country of over 1.3 billion people, and is currently on the cusp of what the Indian Council of Medical Research (ICMR) has called ‘stage 3’ – the advent of community transmission. It’s impossible to expect a developing country as big and as densely populated as India to begin testing all 1.3 billion Indians for the virus as soon as there is news of the virus having entered the national border because the resource cost required to undertake such an exercise is extremely high, well beyond what India can generally afford. However, this doesn’t mean Indians are screwed.

Instead of testing every Indian, ICMR took a different route. Consider the following example: there’s a population of red flecks randomly interspersed with yellow flecks. You need to choose a small subset of flecks from this grid (shown below) such that checking for the number of yellow flecks in the subset gives you a reliable idea of the number of yellow flecks overall.

The ideal subset would be the whole set, of course, so there is one more catch: you have a fixed amount of money to figure out the correct answer (as well as for a bunch of other activities), so it’s in your best interests to keep the subset as small as possible. In effect, you need to balance the tension between two important demands: getting to a more accurate answer while spending less.

Similarly, ICMR assumed that the virus is randomly distributed in the Indian population, and decided to divide the population into different groups, for example by their relative proximity to a testing centre. That is, each testing centre would correspond to the group of all people who live closer to that testing centre than any other. Then, ICMR would pick a certain number of people from each group, collect their nasal and throat samples and send it to the corresponding labs for tests.

Say group size equals 100. For a Bernoulli random variable with unknown probability p, if no events occur in n independent trials, the maximum value of p (at 95% confidence) is approximately 3/n. In our case, n = 100 and p at 95% confidence is 3/100, which is 3%. Since this is the upper bound, it means less than 3% of the population has the ‘event’ which didn’t occur in n trials – which in our case is the event of ‘testing positive’. Do note, this is what is safe to say; it’s not what may actually be happening on the ground. So by increasing the sample size n as much as possible, ICMR can ascertain with higher and higher confidence as to whether the corresponding group has community transmission or not.

Thus far, ICMR has said there is no community transmission in India based on these calculations. Independent experts have been reluctant to take its word, however, because while ICMR has publicised what the sample size and the number of positives are, there is very little information about two other things.

First: we don’t know how ICMR selected the samples that it did for testing. While the virus’s distribution in the population can be considered to be random, especially if community transmission is said to have commenced, the selection of samples needs to have an underlying logic. What is that logic?

Second: we don’t know the group sizes. It’s important for the sample size to be proportionate to the group size. So without knowing what the group size underlying each sample is, it becomes impossible to tell if ICMR is doing its job right.

On March 17, one ICMR scientist said that some testing centres had admitted fewer people with COVID-19-like symptoms and the source of whose infections was unknown (i.e. community transmission) than the size of the sample chosen from their corresponding group. She was suggesting that ICMR’s choice of samples from each group was large enough to not overlook community transmission. To translate in terms of the example above: she was saying ICMR’s subset size was big enough to catch at least one yellow fleck – and didn’t.

As it happens, on March 20, ICMR announced that it would begin testing for a potential type of community-transmission cases even though its sampling exercise had produced 1,020 negative results in 1,020 samples (distributed across 51 testing centres).

The reasons for this are yet unclear but suggests that ICMR suspects there is community transmission of the virus in the country even though its methods – which ICMR has always stood by – haven’t found evidence of such transmission. This in turn prompts the following question: why not test for all types of community transmission? The answer is the same as before: ICMR has limited resources but at the same time has been tasked with discovering how many yellow flecks are there in the total population.

The virus is not an intelligent creature. In fact, it’s extremely primitive. Each virus is in its essence a packet of chemical reactions, and when each reaction happens depends on a combination of internal and external conditions. Other than this, the virus does not harbour any intentions or aspirations. It simply responds to stimuli that it cannot manipulate or affect in any way.

The overarching implication is that beyond how good the virus is at spreading from person to person, a pandemic is what it is because of human interactions, and because of human adaptation and mitigation systems. And as more and more people get infected, and their groups verge towards the WHO’s definition of ‘community transmission’, the virus’s path through the population becomes less and less obvious, but at the same time a greater depth of transmission opens the path to better epidemiological modelling.

When such transmission happens in a country like India, the body responsible for keeping the people safe – whether the Union health ministry, ICMR or any other entity – faces the same challenge that ICMR did. This is also why direct comparisons of India’s and South Korea’s testing strategies are difficult to justify, especially of the number of people tested per million: India has nearly 26-times as many people but spends 11.5-times less on healthcare per capita.

At the same time, ICMR isn’t making it easy for anyone – least of all itself – when it doesn’t communicate properly, and leaves itself open to criticism, which in turn chips away at its authority and trustworthiness in a time as testing as this. Demonetisation taught us very well that a strategy is only as good as its implementation.

But on the flip side, it wouldn’t be amiss to make a distinction here: between testing enough to get a sense of the virus’s prevalence in the population – in order to guide further action and policy – and the fact that the low expenditure on public healthcare is always going to incentivise India to skew towards a sampling strategy instead of an alternative that requires mass-testing. ICMR and the Union health ministry haven’t inspired confidence on the first count but it’s important to ensure criticism of the former doesn’t spillover into criticism of the latter as well.

Anyway, the corresponding sampling strategy is going to have to be based on a logic. Why? Because while the resources for the virus to spread exist abundantly in nature (in the form of humans), the human response to containing the spread requires resources that humans find hard to get. Against the background of this disparity, sampling, testing and treatment logics – such as Italy’s brutal triaging policy – help us choose better sampling strategies; predict approximately how many people will need to be quarantined in the near future; prepare our medical supplies; recruit the requisite number of health workers; stockpile important drugs; prepare for economic losses; issue rules of social conduct for the people; and so forth.

A logic could even help anticipate (or perpetuate, depending on your appetite for cynicism) ‘leakages’ arising due to, say, caste or class issues. Think of it like trying to draw a circle with only straight lines of a fixed length: with 200 strokes, you could technically draw a polygon with 200 sides that looks approximately like a circle – but it will still have some discernible edges and vertices that won’t exactly map on a circle, leaving a small part of the latter out. Similarly, using a properly designed technique that can predict which person might get infected and who might not can still catch a large number of people – but the technique won’t catch all of them.

One obvious way to significantly improve the technique’s efficacy as it stands is to account for the fact that more than half of all Indians are treated at private hospitals whereas you can be tested for COVID-19 only at a government facility, and not all VRDLs receive samples from all private hospitals in their respective areas.

Ultimately, the officials who devise the logics must be expected to justify how the combination of all logics can – even if only on paper – uncover most, if not all, cases of the virus’s infection in India.

Dehumanising language during an outbreak

It appears the SARS-CoV-2 coronavirus has begun local transmission in India, i.e. infecting more people within the country instead of each new patient having recently travelled to an already affected country. The advent of local transmission is an important event in the lexicon of epidemics and pandemics because, at least until 2009, that’s how the WHO differentiated between the two.

As of today, the virus has become locally transmissible in the world’s two most populous countries. At this juncture, pretty much everyone expects the number of cases within India to only increase, and as it does, the public healthcare system won’t be the only one under pressure. Reporters and editors will be too, and they’re likely to be more stressed on one front: their readers.

For example, over the course of March 4, the following sentences appeared in various news reports of the coronavirus:

The Italian man infected 16 Italians, his wife and an Indian driver.

The infected techie boarded a bus to Hyderabad from Bengaluru and jeopardised the safety of his co-passengers.

Two new suspected coronavirus cases have been reported in Hyderabad.

All 28 cases of infection are being monitored, the health ministry has said.

Quite a few people on Twitter, and likely in other fora, commented that these lines exemplify the sort of insensitivity towards patients that dehumanises them, elides their agency and casts them as perpetrators – of the transmission of a disease – and which, perhaps given enough time and reception, could engender apathy and even animosity towards the poorer sick.

The problem words seem to include ‘cases’, ‘burden’ and ‘infected’. But are they a problem, really? I ask because though I understand the complaints, I think they’re missing an important detail.

Referring to people as if they were objects only furthers their impotency in a medical care setup in which doctors can’t be questioned and the rationale for diagnoses is frequently secreted – both conditions ripe for exploitation. At the same time, the public part of this system has to deal with a case load it is barely equipped for and whose workers are underpaid relative to their counterparts in the private sector.

As a result, a doctor seeing 10- or 20-times as many patients as they’ve been trained and supported to will inevitably precipitate some amount of dehumanisation, and it could in fact help medical workers cope with circumstances in which they’re doing all they can to help but the patient suffers anyway. So dehumanisation is not always bad.

Second, and perhaps more importantly, the word ‘dehumanise’ and the attitude ‘dehumanise’ can and often do differ. For example, Union home minister Amit Shah calling Bangladeshi immigrants “termites” is not the same as a high-ranking doctor referring to his patient in terms of their disease, and this doctor is not the same as an overworked nurse referring to the people in her care as ‘cases’. The last two examples are progressively more forgivable because their use of the English language is more opportunistic, and the nurse in the last example may not intentionally dehumanise their patients if they knew what their words meant.

(The doctor didn’t: his example is based on a true story.)

Problematic attitudes often manifest most prominently as problematic words and labels but the use of a word alone wouldn’t imply a specific attitude in a country that has always had an uneasy relationship with the English language. Reporters and editors who carefully avoid potentially debilitating language as well as those who carefully use such language are both in the minority in India. Instead, my experiences as a journalist over eight years suggest the majority is composed of people who don’t know the language is a problem, who don’t have the time, energy and/or freedom to think about casual dehumanisation, and who don’t deserve to be blamed for something they don’t know they’re doing.

But by fixating on just words, and not the world of problems that gives rise to them, we risk interrogating and blaming the wrong causes. It would be fairer to expect journalists of, say, the The Guardian or the Washington Post to contemplate the relationship between language and thought if only because Western society harbours a deeper understanding of the healthcare system it originated, and exported to other parts of the world with its idiosyncrasies, and because native English speakers are likelier to properly understand the relationship between a word, its roots and its use in conversation.

On the other hand, non-native users of English – particularly non-fluent users – have no option but to use the words ‘case’, ‘burden’ and ‘infected’. The might actually prefer other words if:

  • They knew that (and/or had to accommodate their readers’ pickiness for whether) the word they used meant more than what they thought it did, or
  • They knew alternative words existed and were equally valid, or
  • They could confidently differentiate between a technical term and its most historically, socially, culturally and/or technically appropriate synonym.

But as it happens, these conditions are seldom met. In India, English is mostly reserved for communication; it’s not the language of thought for most people, especially most journalists, and certainly doesn’t hold anything more than a shard of mirror-glass to our societies and their social attitudes as they pertain to jargon. So as such, pointing to a reporter and asking them to say ‘persons infected with coronavirus’ instead of ‘case’ will magically reveal neither the difference between ‘case’ or ‘infected’ the scientific terms and ‘case’ or ‘infected’ the pejoratives nor the negotiated relationship between the use of ‘case’ and dehumanisation. And without elucidating the full breadth of these relationships, there is no way either doctors or reporters are going to modify their language simply because they were asked to – nor will their doing so, on the off chance, strike at the real threats.

On the other hand, there is bound to be an equally valid problem in terms of those who know how ‘case’ and ‘infected’ can be misused and who regularly read news reports whose use of English may or may not intend to dehumanise. Considering the strong possibility that the author may not know they’re using dehumanising language and are unlikely to be persuaded to write differently, those in the know have a corresponding responsibility to accommodate what is typically a case of the unknown unknowns and not ignorance or incompetence, and almost surely not malice.

This is also why I said reporters and editors might be stressed by their readers, rather their perspectives, and not on count of their language.


A final point: Harsh Vardhan, the Union health minister and utterer of the words “The Italian man infected 16 Italians”, and Amit Shah belong to the same party – a party that has habitually dehumanised Muslims, Dalits and immigrants as part of its nationalistic, xenophobic and communal narratives. More recently, the same party from its place at the Centre suspected a prominent research lab of weaponising the Nipah virus with help from foreign funds, and used this far-fetched possibility as an excuse to terminate the lab’s FCRA license.

So when Vardhan says ‘infected’, I reflexively, and nervously, double-check his statement for signs of ambiguity. I’m also anxious that if more Italian nationals touring India are infected by SARS-CoV-2 and the public healthcare system slips up on control measures, a wave of anti-Italian sentiment could follow.

Distracting from the peer-review problem

From an article entitled ‘The risks of swiftly spreading coronavirus research‘ published by Reuters:

A Reuters analysis found that at least 153 studies – including epidemiological papers, genetic analyses and clinical reports – examining every aspect of the disease, now called COVID-19 – have been posted or published since the start of the outbreak. These involved 675 researchers from around the globe. …

Richard Horton, editor-in-chief of The Lancet group of science and medical journals, says he’s instituted “surge capacity” staffing to sift through a flood of 30 to 40 submissions of scientific research a day to his group alone.

… much of [this work] is raw. With most fresh science being posted online without being peer-reviewed, some of the material lacks scientific rigour, experts say, and some has already been exposed as flawed, or plain wrong, and has been withdrawn.

“The public will not benefit from early findings if they are flawed or hyped,” said Tom Sheldon, a science communications specialist at Britain’s non-profit Science Media Centre. …

Preprints allow their authors to contribute to the scientific debate and can foster collaboration, but they can also bring researchers almost instant, international media and public attention.

“Some of the material that’s been put out – on pre-print servers for example – clearly has been… unhelpful,” said The Lancet’s Horton.

“Whether it’s fake news or misinformation or rumour-mongering, it’s certainly contributed to fear and panic.” …

Magdalena Skipper, editor-in-chief of Nature, said her group of journals, like The Lancet’s, was working hard to “select and filter” submitted manuscripts. “We will never compromise the rigour of our peer review, and papers will only be accepted once … they have been thoroughly assessed,” she said.

When Horton or Sheldon say some of the preprints have been “unhelpful” and that they cause panic among the people – which people do they mean? No non-expert person is hitting up bioRxiv looking for COVID-19 papers. They mean some lazy journalists and some irresponsible scientists are spreading misinformation, and frankly their habits represent a more responsible problem to solve instead of pointing fingers at preprints.

The Reuters analysis also says nothing about how well preprint repositories as well as scientists on social media platforms are conducting open peer-review, instead cherry-picking reasons to compose a lopsided argument against greater transparency in the knowledge economy. Indeed, crisis situations like the COVID-19 outbreak often seem to become ground zero for contemplating the need for preprints but really, no one seems to want to discuss “peer-reviewed” disasters like the one recently publicised by Elisabeth Bik. To quote from The Wire (emphasis added),

[Elisabeth] Bik, @SmutClyde, @mortenoxe and @TigerBB8 (all Twitter handles of unidentified persons), report – as written by Bik in a blog post – that “the Western blot bands in all 400+ papers are all very regularly spaced and have a smooth appearance in the shape of a dumbbell or tadpole, without any of the usual smudges or stains. All bands are placed on similar looking backgrounds, suggesting they were copy-pasted from other sources or computer generated.”

Bik also notes that most of the papers, though not all, were published in only six journals: Artificial Cells Nanomedicine and BiotechnologyJournal of Cellular BiochemistryBiomedicine & PharmacotherapyExperimental and Molecular PathologyJournal of Cellular Physiology, and Cellular Physiology and Biochemistry, all maintained reputed publishers and – importantly – all of them peer-reviewed.