SECTION 2 Background to medicine 2.1 Science in medicine When, how, and what 33 2.1 Science in medicine: When, how, and what 33 ESSENTIALS Science has always been part of Western medicine, although what counts as scientific has changed over the centuries, as have the con- tent of medical knowledge, the tools of medical investigation, and the details of medical treatments. This brief overview develops a his- torical typology of medicine since antiquity. It divides the ‘kinds’ of medicine into five sections: (1) Bedside medicine, developed by the Hippocratic doctors in classical times, has its modern counterpart in primary care. (2) Library medicine, associated with the scholastic mentality of the Middle Ages, still surfaces in the problems of information storage and retrieval in the computer age. (3) Hospital medicine, central to French medicine of the early 19th century, placed the diagnostic and therapeutic functions of the modern hospital centre stage in care and teaching. (4) Social medicine is about prevention, both communal and in- dividual, and is especially visible in our notion of ‘lifestyle’ and its impact on health. (5) Laboratory medicine has its natural home in the research es- tablishment and is a critical site for the creation of medical know- ledge, setting the standards for both medical science and scientific medicine. Introduction At least since the Hippocratic era, Western medicine has always as- pired to be scientific. What has changed is not so much the aspir- ations but what it has meant to be ‘scientific’. ‘Science is the father of knowledge, but opinion breeds ignorance’, opined the Hippocratic treatise The Canon, and Hippocratic practi- tioners developed an approach to health, disease, and its treatment based on systematic observation and cumulative experience. Even the word ‘physic’, whence physician as well as physicist, derives from the Greek for ‘nature’. Further, Hippocratic medicine was experi- mental, that word stemming from the same classical roots which gave us ‘experience’. Words, however, can be slippery, as philosophers as divergent as Francis Bacon and Ludwig Wittgenstein have stressed. The science and experiment of the Hippocratics can still inspire, but they are not our science and experiment. During the past two or three centuries, an armoury of sciences and technologies has come to underpin medical practice. This essay briefly describes these, within the con- text of distinctive and perennial features of medical practice (i.e. suf- fering individuals whose problems and diseases demand attention). A historical typology of Western medicine The history of Western medicine can be divided into five ‘kinds’ of medicine: bedside, library, hospital, social, and laboratory, with the latter extended to include technology (Table 2.1.1). Each approach to medical care and knowledge emerged at a particular historical period, but each still has relevance to us today. Bedside medicine Bedside medicine can be equated with the vision of the Hippocratics, with its emphasis on the individual patient, a tendency towards holism, and an abiding concern with the patient within his or her own unique environment. These are some of the reasons why Hippocrates (Fig. 2.1.1) is still claimed as the dominant father figure by both orthodox and alternative medical practitioners. Library medicine What can be called ‘library’ medicine dominated in the Middle Ages, when learned medicine retreated into the universities and scholars sometimes assumed that everything worth discovering had been uncovered by the ancients, and everything worth being revealed could be found in the Bible. The millennium between the sacking of Rome and the discovery of the New World is often dismissed as a sterile period scientifically, but the physicians of the period, linguistically erudite and philosophically inclined, would have been surprised to be described as unscientific. They simply believed that the road to knowledge was through the book, and—​were they able to—​would point out that this so-​ called sterile period gave us the hospital and university. 2.1 Science in medicine: When, how, and what William F. Bynum 34 SECTION 2  Background to medicine These medical men also sometimes engaged with nature, although it is undeniable that nature rather than words became an increasing source of truth and knowledge during the Scientific Revolution, a period stretching roughly from just before Andreas Vesalius (1514–​ 1564) to Isaac Newton (1642–​1727). Around 1600, it was becoming apparent to many that the Greeks had not left behind a complete and accurate account of the nature of the world, and that scientific knowledge was cumulative. This ‘Battle of the Books’, the debate over whether the ancients or the moderns knew more, was decided in fa- vour of the moderns. Many of the outstanding scientific achievements of the era were in astronomy and physics, but medicine, both in its theory and its practice, was also affected. Theory has always been easier to change than practice, of course, and it was famously remarked that William Harvey’s discovery of the circulation of the blood had no impact on therapeutics. Harvey (1578–​1657) also notori- ously lamented that his practice fell off mightily following the discovery, his patients fearing that he was ‘crack-​brained’. The fear that too close an identification with science was detrimental to patient confidence recurs in medical history, and is still part of the delicate negotiations between the profession and its public, and to the status of academic medicine. Within the discipline of medicine itself there have always been individuals—​some of them, like Thomas Sydenham (1624–​1689), eminently successful—​who believed that experimental science had little to offer to patient care. But these ‘artists’ of medicine could still invoke the authority of Hippocrates, with its older connotations of knowledge and experience. Sydenham himself did not demur from his being dubbed ‘the English Hippocrates’. During the early modern period, the whole spectrum of the sciences—​mathematics, physics, chemistry, the life sciences (not yet called biology)—​made their ways into formulations of health and disease. Iatrophysics, iatromathematics, and iatrochemistry all had their advocates in the 17th and 18th centuries as approaches to medical theory and practice. That these systems tended to encourage speculation to run ahead of evidence was recognized at the time, and this was part of the reason why ‘hospital medicine’ had little recourse to those discip- lines we now call ‘basic medical sciences’. The founders of French hospital medicine, Xavier Bichat (1771–​1802), J.  N. Corvisart (1755–​1821), and R. T. H. Laennec (1781–​1826), often referred to chemistry, physiology, and the like as sciences that were ‘accessory’ to medicine. Hospital medicine The medicine that developed in the Paris hospitals, after the re- opening in 1794 of the medical schools closed by the Revolution a couple of years earlier, emphasized above all the study of disease in the sick patient. In a sense, this was Hippocratic medicine writ large, but with some significant differences. First, the hospital offered the curious doctor a vast arena for observing disease. The equivalent of a lifetime’s experience of a lone practitioner in the community could be experienced in a few Fig. 2.1.1  A statue of Hippocrates, originally from Kos Odeion but now in the Archaeological Museum of Kos. This is a late Hellenistic period copy of a classical prototype. No contemporary likeness of Hippocrates exists, but several busts and statues were created later on in the classical period. Copyright © D. A. Warrell. Table 2.1.1  A historical typology of Western medicine CHARACTERISTICS Object of inquiry Form and site of education Goal Example T Y P E S Bedside Whole patient Apprenticeship Therapy Hippocrates (c.460–​370 bce) Library Text Scholastic, linguistic, University Preservation, recovery, commentary Constantine the African (d. before 1098) Hospital Patient, organ, lesion Hospital Diagnosis R. T. H. Laennec (1781–​1826) Social Population, statistic Community Prevent John Simon (1816–​1904) Laboratory Animal model Laboratory Understand Claude Bernard (1813–​1878) Technology Molecule Automated laboratory The above + profit Kary Mullis (b. 1944) Adapted from W. F. Bynum, History of Medicine: a very short introduction, OUP, 2008. 2.1  Science in medicine: When, how, and what 35 months of hospital work. Hospitals offered the possibility of defining disease on the basis of hundreds of cases. Secondly, Hippocratic humoralism gradually disappeared as the dominant explanatory framework of health and disease, replaced by the primacy of the lesion, located in the solids: the organs and tissues, and by the mid-​19th century, cells. In this new orientation, disease was literally palpable, its lesions to be discovered in life by the systematic use of physical examination—​Corvisart rediscovered percussion, Laennec invented the stethoscope—​and these findings to be correlated after death by routine autopsy. French high priests of hospital medicine brought diagnosis to a new stage and replaced the older symptom-​based nosologies with a more objective, demon- strable one of lesions. The third feature of hospital medicine was what Pierre Louis (1787–​1872) called the numerical method, the use of numbers to guide both disease classification and therapeutic evaluation. The philosophy underlying early 19th-​century French medicine was most systematically expounded by one of the many American students who studied in Paris, Elisha Bartlett, in his Philosophy of Medical Science (1844). The medical science whose philosophy he chronicled was one of facts. Bartlett argued that all systems of medicine, past and present, were speculative, vague, and useless. Cullen, Brown, Broussais, and Hahnemann were all consigned to the historical dustbin. The new medicine was one of systematic observation and collection of facts, which, when properly com- pared and organized, could provide an objective understanding of disease and a rational basis for its treatment. Bartlett’s philosophy was essentially undiluted Baconian inductivism applied to medi- cine. Unsurprisingly, he counted Hippocrates as well as Pierre Louis among his heroes. One consequence of the lesion-​based medicine was the recogni- tion that not much of what doctors did actually altered the natural history of disease. Therapeutic scepticism, or even nihilism, flour- ished among doctors whose lives were spent, as Laennec put it, ‘among the dead and dying’. It was less likely to be expressed among doctors concerned with earning a living treating private, paying pa- tients, but the concern with medicine’s therapeutic impotency also fuelled the movement to prevent disease. Social medicine The fourth kind of medicine, social, also flourished in the 19th cen- tury. Just as hospitals existed long before ‘hospital medicine’, so epi- demics and preventive measures were not invented by the public health movement of the 1830s. Nevertheless, the preventive infra- structures developed partly in response to the cholera pandemics still exist, although of course much changed. The chief architect of the British public health movement, Edwin Chadwick (1800–​1890), was a lawyer who thought that, on the whole, doctors were overrated (he was neither the first nor the last lawyer to hold that opinion). He held that filth spread via the foul smells (miasma) of rotting organic matter caused epidemic dis- eases. His solutions were engineering ones—​clean water and effi- cient waste disposal—​which he argued would leave the world an altogether more pleasant and healthier place. His ideas were formed during the 1830s and early 1840s, and they remained more or less fixed for the rest of his long life, which extended well into the bac- teriological age. Nevertheless, Chadwick also invoked science in his public health reform programme, above all the science of statistical investigation. His use of statistics can easily be shown to have been naive, but it was ardent. In his own sphere of enquiry, Chadwick was as much in awe of the unadorned ‘fact’ as was his contemporary Bartlett. A later generation of Medical Officers of Health and others concerned with disease prevention (or containment) would develop new investigative techniques, more sophisticated statistics and, es- pecially, new theories of disease causation and transmission. But the early public health movement was firmly based on the science of its time. Laboratory medicine The final locus of medicine, the laboratory, was also largely a product of the 19th century, though of course laboratories (a place where one worked, especially to mutate lead into gold) had existed for much longer. A leading exponent of the laboratory, and one of its most thoughtful philosophers, had experienced Paris hospital medicine as a medical student. Claude Bernard’s Introduction to the Study of Experimental Medicine (1865) is at once an intriguing account of his own brilliant career and a sophisticated analysis of the philosophy of experimentation within the life sciences (Fig. 2.1.2). Hospitals, he argued, are merely the gateways to medical knowledge, and bed- side clinicians can be no more than natural historians of disease. To understand the causes and mechanisms of disease, it is necessary to go into the sanctuary of the laboratory, where experimental con- ditions can be better controlled. There are in nature no uncaused causes:  determinism is the iron law of the universe, extending equally to living systems and inorganic ones. However, organ- isms present special experimental problems, and it is only through isolating particular features, and holding other parameters as con- stant as possible, that reliability and reproducibility can be achieved. Bernard identified three primary branches of experimental medi- cine:  physiology, pathology, and therapeutics. His own research programme touched all three pillars: his research on the roles of the liver and pancreas in sugar metabolism contributed to under- standing normal physiology as well as diseases such as diabetes; his investigations of the sites of action of agents such as curare and carbon monoxide foreshadowed structural pharmacology and drug receptor theory; his work on the functions of the sympathetic nerves buttressed his own more general notion of the constancy of milieu interieur as the precondition to vital action (and freedom), a pre- cursor of Walter Cannon’s concept of homeostasis. Bernard stands supreme as the quintessential advocate of the laboratory. Who was the first modern medical experimentalist? When Bernard wrote, experimental medical science was still a fledgling activity, best developed in the universities of the German states and principalities. The German university ideal of medical education was to be extolled by the American educational re- former Abraham Flexner (1866–​1959) in the early 20th century. It was in the reformed and newly created German universities that the forms of modern scientific research were established. Research careers were created; copublication in specialist journals became common; scientific societies flourished. The microscope became the symbol of the medical scientist even as the stethoscope was becoming the hallmark of the forward-​looking clinician. In the 36 SECTION 2  Background to medicine hands of scientists like Schwann, Virchow, and Weismann, the modern cell theory was developed and applied to medicine and biology more generally. These researchers established the drive to push units of analysis further and further. Eduard Buchner’s identification of cell-​free ferments in 1897 firmly established the importance of subcellular functions. Pasteur, Koch, Ehrlich, von Behring, and others advanced new notions of the causes of disease, the body’s response to infection, and the possibilities of new drugs to combat disease. Any of these scientists might arguably be the answer to the parlour-​game question: Who was the first modern medical scientist? The German-​speaking lands perfected the modern forms of sci- entific research, but a good case can be made for a Frenchman to be crowned the first thoroughly modern experimentalist within medicine. François Magendie (1783–​1855) (Fig. 2.1.3) was a child of the Enlightenment and product of the French Revolution. One of several eminent individuals (Thomas Malthus was another) raised according to the anarchic principles espoused by Jean-​ Jacques Rousseau, Magendie did not learn to read or write until he was 10. His subsequent precocity was such that he was ready for medical studies by the age of 16, learned anatomy and surgery as an apprentice, and made his way through the Paris hospital system. Although he never lost interest in practical medical issues, his reputation was established primarily within the laboratory. Fig. 2.1.2  Claude Bernard and his pupils. Oil painting, 1889 by Léon Augustin Lhermitte. Courtesy of the Wellcome Collection, Creative Commons Attribution (CC BY 4.0). Fig. 2.1.3  François Magendie. Lithograph by N. E. Maurin. From Burgess R (1973). Portraits of doctors and scientists in the Wellcome Institute, London, no. 1870.2, by courtesy of the Wellcome Library, London. 2.1  Science in medicine: When, how, and what 37 His monographs on physiology and pharmacology marked new beginnings, and his life manifests three emblematic qualities which make him one of us. First, he valued facts above theories, evidence above rhetoric. However, he went beyond Bartlett and the high priests of hospital medicine in insisting that in experiment, and not simply observa- tion, lay the real future of medical knowledge. Like his pupil Claude Bernard, Magendie was a deft experimentalist. He used animals (and occasionally patients) to probe into a whole range of prob- lems in physiology, pathology, and pharmacology:  the functions of the spinal nerves, the physiology of vomiting, important facets of absorption, digestion, circulation, nutrition, and the actions of drugs and poisons. He described anaphylaxis a century before it was named. He was as philosophically naive as Bernard was sophisti- cated: of course he had theories, but his image of himself as a rag-​ picker with a spiked stick, gathering isolated experimental facts where he found them, is a telling one. Secondly, he was modern in sometimes backing the wrong horses. He judged cholera and yellow fever to be non​contagious, was suspi- cious of anaesthesia, and sometimes claimed more than we might for his newly introduced therapeutic substances, such as strychnine and veratrine. Magendie could often be mistaken in his beliefs; so can we. Thirdly, Magendie was the scientist who first expunged the double-​faced Janus from the medical mentality. William Harvey worshipped Aristotle, Albrecht von Haller was steeped in his- tory, and Isaac Newton popularized the pious conceit of pygmies standing on the shoulders of giants. Until the 19th century, doc- tors routinely looked to the past, not simply for inspiration but for useful information. Magendie looked only in one direction: the future. He had no sense of history and no use for it. He meant what he said when he insisted that most physiological ‘facts’ had to be verified by new experiments, and he undertook to provide a beginning. He made the laboratory the bedrock of medicine. With Magendie, the history of medicine became an antiquarian discipline. What happened next? Like everyone, Magendie was of his time. Nevertheless, his values were symptomatic of important themes within 19th-​century medi- cine and medical science. By the beginning of the First World War, most of the structures and the fundamental concepts of modern medicine were in place. Of course, both medical science and med- ical practice have been utterly transformed since. But the impulse of experimentation and its variable translation into practice were there. We have gone far beyond the cell in our analytical procedures, and our medical, surgical, and therapeutic armamentaria are vastly more sophisticated and powerful. Technological medicine Our medicine is fundamentally different in one important respect, even if the trend was already evident in the 19th century:  the fusion of science and technology. Science and technology have become so intertwined that the older distinctions between them are blurred. Technology made a real but minimal impact on 19th-​century medicine. Some instruments, such as Helmholtz’s ophthalmoscope, came into clinical medicine through the labora- tory; and German experimental scientists were eager to exploit the latest equipment such as kymographs, sphygmographs, and the profusion of artefacts (Petri dishes, autoclaves, and so on), which Koch and his colleagues devised for the bacteriological laboratory. Most important of all was probably X-​rays, discovered by Roentgen in late 1895. This made an immediate impact on med- ical diagnosis, and the associated science of radioactivity soon was felt within therapeutics (Fig. 2.1.4). Significantly, perhaps, the pi- oneers of the radioactive phenomena—​Roentgen, Becquerel, the Curies—​got their Nobel Prizes in physics or chemistry. Hounsfield and Cormack got theirs for computer-​assisted tomography in medicine or physiology. More recently, Kary Mullis’s Nobel Prize was for a technological development within molecular biology. Both medical science and medical practice are now inseparably rooted in technology. So is modern life, another reflection of a per- ennial historical truth: medical knowledge and medical practice are products of wider social forces with unique historical individualities. Fig. 2.1.4  Photograph of a radiograph (x-ray negative) taken on 22 December 1895 by Wilhelm Röntgen, the discover of x-rays. It shows his wife’s left hand; she wears a large ring on her 4th digit. Courtesy of the Science Museum, London, Creative Commons Attribution (CC BY 4.0). 38 SECTION 2  Background to medicine FURTHER READING Ackerknecht EH (1967). Medicine at the Paris hospital, 1784–​1848. Johns Hopkins University Press, Baltimore, MD. Bynum WF (1994). Science and the practice of medicine in the nine- teenth century. Cambridge University Press, Cambridge. Bynum WF (2008). History of medicine:  a very short introduction. Oxford University Press, Oxford. Bynum WF, Bynum H (eds) (2007). Dictionary of medical biography. Greenwood, Westport, CT. Bynum WF, Porter R (eds) (1993). Companion encyclopedia of the his- tory of medicine. Routledge, London. Bynum WF, et al. (2006). The Western medical tradition, 1800 to 2000. Cambridge University Press, Cambridge. Conrad LI, et al. (1995). The Western medical tradition, 800 bc to ad 1800. Cambridge University Press, Cambridge. Cooter R, Pickstone J (eds) (2000). Medicine in the 20th century. Harwood Academic Publishers, Amsterdam. King LS (1982). Medical thinking:  a historical preface. Princeton University Press, Princeton, NJ. Reiser SJ (1978). Medicine and the reign of technology. Cambridge University Press, Cambridge. Weatherall DJ (1995). Science and the quiet art: medical research and patient care. Oxford University Press, Oxford. 2.10 Medicine quality, physicians, and patients 12 2.10 Medicine quality, physicians, and patients 124 ESSENTIALS Poor-​quality medicines negate the enormous advantages of modern pharmaceuticals and lead to avoidable morbidity and mortality; loss of confidence in medicines and healthcare systems; economic losses for patients, their families, governments, and the pharmaceutical in- dustry; and—​for anti-​infectives—​engender pathogen drug resistance. There are many examples of both falsified (due to fraudulent pro- duction) and substandard (due to in-​factory negligence) medicines, vaccines, diagnostic tests, and devices. The evidence base informing our understanding of their epidemiology is poor, but it is becoming clearer that the distribution of poor-​quality medical products varies greatly around the world in both time and place. Formal chem- ical analysis is highly sophisticated, expensive, and rarely available in many countries. New portable diagnostic devices offer hope that they could empower medicine inspectors and pharmacists to obtain ob- jective evidence for selection of suspicious samples for formal analysis. Caveat emptor (buyer beware) is vital: patients, pharmacists, nurses, and physicians should observe their medicines and be alert when the packaging, medicine taste, or side effects change; when they are sur- prisingly inexpensive; and when they are associated with unexpected treatment failure and puzzling clinical syndromes. Much greater inter- national effort is needed for the sharing of data about poor-​quality medicines so that detection can be acted upon and patients protected. Introduction Most patients, doctors, and pharmacists tend to assume that the medicines they take or provide are of good quality and that poor-​ quality medicines are a problem of the distant past or are far away. However, there have been numerous problems with deliberate falsi- fication of medicines and poor-​quality production since medicines were first manufactured. Attempts to ensure that countries have a safe and efficacious medicine supply have led to great improvements in medicine regulation in the wealthier world over the last century. However, poor-​quality medicines are still with us. There is growing, but belated, concern that much of the developing world’s supply of medical products, including medicines, vaccines, diagnostic tests and devices, in particular its supply of anti-​infective drugs, are of poor quality. There are also less frequent but important problems in wealthier countries. The enormous investment in pharmaceuticals, research, development, registration, clinical trials, and trade will ultimately be pointless and harmful if the medicines patients take contain insufficient, incorrect, or dangerous ingredients. Definitions There has been considerable debate and confusion over definitions for the different types of poor-​quality medicines. This has arisen be- cause of poor science and because of tension between the defence of commercial interests and the public health importance of enhanced access to good quality medicines. Here, we use the term falsified to mean poor-​quality medicines that are deliberately and fraudulently produced, substandard for those that are poor quality because of un- intentional but negligent errors in factory production and degraded for those that left the factory as good quality but deteriorated through not following Good Pharmacy Practice in the distribution chain. The distinction is important as the underlying human motivation and regu- latory actions needed for falsified and substandard medicines differ. The term falsified is preferred to ‘counterfeit’ as the latter is argued to be primarily an intellectual property and trade term rather than concerned with patient outcome and public health. The use of falsi- fied and substandard terminology was agreed at the World Health Assembly in 2017. Substandard medicines usually contain either too little or too much active pharmaceutical ingredient (API) or are of poor bio- availability, leading to low blood levels. Falsified medicines often, but not always, contain no API or the wrong API. The amount of ac- tive ingredient is not sufficient information to determine accurately whether a medicine is falsified. Inspection of the packaging, that re- veals deception for falsified medicines, is usually required. Epidemiology Products of both the innovative and generics industry, in the private and public sectors, for both expensive and inexpensive medicines 2.10 Medicine quality, physicians, and patients Paul N. Newton 2.10  Medicine quality, physicians, and patients 125 are circulating. There is considerable confusion over the extent of the problem. Many of the data have been collected using ‘conveni- ence’ sampling and have often been interpreted and extrapolated un- critically. In order to be able to estimate, with confidence intervals, the proportion of the medicine supply in a community or area that is poor quality and to follow change through time and assess the impact of interventions, random sampling is required. Random lot quality assurance sampling may provide a cost-​effective alternative to estimate whether the prevalence of poor-​quality medicines ex- ceeds a given threshold. The available evidence, although it should be viewed with caution, suggests that there is an important and neglected global problem, but that it is focal through time and space. Hotspots of poor-​quality medicines will inevitably lead to focal excess mortality and morbidity. An epidemic of falsified oral artesunate containing no, or minimal, artesunate afflicted mainland Southeast Asia until recently and there remain severe problems with falsified artemisinin combination therapy across Central and West Africa. The WorldWide Antimalarial Resistance Network tabulates and maps accessible reports of the quality of antimalarials (http://​www.wwarn.org/​aqsurveyor). In the United States, over 700 patients were infected with Exserohilum rostratum, by being given glucocorticoid injections for chronic musculoskeletal pain from one compounding phar- macy, contaminated with this fungus. Sixty-​one patients died. Some c.500 children died from renal failure after ingesting paracetamol containing the renal toxin diethylene glycol in south Asia, Africa, and Central America. In Singapore, 150 non​diabetic patients were admitted with severe hypoglycaemia because they had taken fal- sified erectile dysfunction medicines containing sulfonylureas. Contaminated substandard magnesium sulphate led to Serratia septicaemia in United States and gentamicin-​resistant Pseudomonas aeruginosa in gentamicin eye drops in Mauritius led to severe eye infections, probably due to errors in manufacture rather than fraud. The national visceral leishmaniasis control programme in Bangladesh used ‘miltefosine’ that contained no miltefosine. Vaccines have also been affected by both falsification and sub- standard production, including falsified rabies and influenza vac- cines. Expiry date tampering, to falsely lengthen the duration that medicines and diagnostic tests can be legitimately sold, also occurs. Diagnostic tests have been falsified, as have surgical implants, such as cardiac stents, and insecticide-​treated bednets that contained no insecticide. Dental and surgical equipment and electronic compo- nents have been falsified, causing patient harm. Detection The formal evaluation of the content and dissolution of medicines re- quires sophisticated equipment and consumables with highly trained staff and quality assurance/​quality control (QA/​QC). Medicine regu- latory authority (MRA) laboratories in the wealthy world are likely to be equipped with mass spectrometry and high-​performance liquid chromatography equipment with dissolution apparatus, for example. However, in most low- or middle-income countries such equipment, along with the necessary technical capacity, consumables, and QA/​ QC, are not available. In 2018 in Southeast Asia and Africa there are only 5 and 7 countries, respectively, with World Health Organization (WHO) prequalified laboratories for medicine quality analysis. Forensic techniques such as pollen analysis are rarely available. Over the last decade a plethora of portable devices have been in- vented and developed for the rapid evaluation of medicine quality in the field. These are based on principles such as Raman, near infrared, and image analysis. However, they have not yet been independently compared for diagnostic accuracy and cost-​effectiveness, so the jury is still out. However, the hope is that these will empower medicine in- spectors and pharmacists to screen medicine quality in the distribu- tion chain and objectively select which samples should be escalated for formal analysis, rather than just relying on eyes and experience. Impact Poor-​quality medicines can lead to serious individual health con- sequences, such as treatment failure due to suboptimal dosing, ad- verse reactions, and increased morbidity and mortality. They can undermine public confidence in health programmes and waste scarce resources. Patients might also suffer from surprising ad- verse effects from unexpected active ingredients. For example, fal- sified co-​trimoxazole containing diazepam, metronidazole made of chloroquine, diazepam containing haloperidol, oseltamivir con- taining vitamin C, ceftazidime made of reused vials containing streptomycin and ofloxacin made of penicillin have been described. Therefore, it is crucial to check a patient’s medication when faced with unexpected clinical syndromes. The world clearly has a problem, but the extent is uncertain, in part because the collation of such publicly accessible data has only recently started. There are clear examples where patients have been harmed by poor-​quality medicines. However, objective estimates of impact on patient outcome have been difficult. Logically, if a medi- cine has been shown to reduce mortality by 30% but the medicine taken has zero efficacy (because it contains no API), mortality will not be reduced by that 30%. In order to reliably conclude that a poor-​ quality medicine has harmed, evidence is needed that (a) a patient has suffered demonstrable harm, (b) proof that the patient took a particular medicine and (c)  chemical analysis of that medicine demonstrating that it does not contain what it is stated to contain (or contains toxins). In much of the world, such linked-​up evidence has been difficult to obtain. The first evidence-​based estimate from modelling of the excess mortality suggested 122 350 under-​five malaria deaths per year were associated with consumption of poor-​quality antimalarials, repre- senting 3.75% of all under-​five deaths in a sample of 39 countries. This is a first approximation and will be refined as the objective evi- dence improves. Poor-​quality medicines also have wider consequences on public health with loss of confidence in healthcare, spread of drug resistant pathogens, economic losses for producers and traders in the genuine articles, and increased burden for health workers. Modelling strongly suggests that underdosing is an important contributor to antimicrobial resistance. Therefore, if patients con- sume medicines containing low %API (usually substandard medi- cines) that result in blood levels within the window of selection, the risks of engendering resistance are high. Medicines containing none of the stated API (usually falsified medicines) may also contribute to drug resistance for some pathogens, such as malaria if they result in hyperparasitaemia and high gametocyte burdens. 126 section 2  Background to medicine Clinical trials have naively been regarded as immune from the issues of poor-​quality medicines and medical devices but there is growing realization that their quality should be checked and docu- mented or trials risk wrongly informing policy. Interventions For the individual health worker and patient—​caveat emptor is vital. Patients may notice when their medications change in appearance, taste, or side effects. Greater awareness of the problem and suspicion in the face of puzzling clinical syndromes, treatment failure, and medicines that seem surprisingly inexpensive are needed. The problems of poor-​quality medicines cannot be viewed in iso- lation. They are enmeshed with many other complex health system problems, especially the affordability and accessibility of medi- cines and the often-​limited capacity of MRAs in L/​MICs. MRAs are the keystones for important interventions but only 20% of WHO member states are reported to have well-​developed drug regulation and 30% have ‘either no drug regulation or a capacity that hardly functions’. Strengthening MRAs, improving the oversight and QA/​ QC of medicine manufacturing collaboration with other key stake- holders such as procuring agencies, and facilitating patient access to affordable medicines are likely to be key factors in improving quality. A key neglected issue has been the woeful sharing of informa- tion between and within countries. There is no global system for the mandatory reporting, assessment, and dissemination of infor- mation on suspicious medicines. The more functional MRAs issue alerts and the revitalized WHO’s new Rapid Alert System (http://​ www.who.int/​medicines/​publications/​drugalerts/​en/​) facilitates in- formation sharing on poor-​quality medicines between MRAs. The e-​drug (http://​lists.healthnet.org/​mailman/​listinfo/​e-​drug) and e-​ med (http://​lists.healthnet.org/​mailman/​listinfo/​e-​med) systems re- port medicine quality problems in English and French, respectively. In most of the world there is no legal duty for health workers, aca- demics, or the pharmaceutical industry to report suspicions of poor-​ quality medicines to the national MRA. There is clearly an ethical duty to do so but this needs to be become a legal duty. The Access to Medicine Index (http://​www.accesstomedicineindex.org/​) includes monitoring of the reporting of falsified medicines by the pharma- ceutical industry to WHO and MRAs. Areas of uncertainty and controversy There are many other related important global issues that need to be addressed, such as the lack of an international coding system for identifying pills and capsules, poor-​quality instruction sheets with medicines in small fonts and in user-​unfriendly language. There is an urgent need for clear understanding that generic medicines per se are not counterfeit, and a need for an international convention making medicine falsification an international, extra- ditable, crime. We need better understanding as to how to engage with societies about medicine quality problems so that they do not inappropriately stop good quality medicines in the face of warnings of poor-​quality products. More evidence is needed to understand how best to empower drug inspectors with the optimal new portable devices. It will be key to have an international capacity building and financial mechanism to support MRAs that lack resources to pro- tect the medicine supply and better objective-​collated data to inform policy. FURTHER READING Attaran A, et al. (2012). How to achieve international action on falsified and substandard medicines: a consensus statement. BMJ, 345, e7381. Caudron JM, et al. (2008). Substandard medicines in resource-​poor settings: a problem that can no longer be ignored. Trop Med Int Hlth, 13, 1062–​72. Fernandez FM, et al. (2011). Poor quality drugs: grand challenges in high throughput detection, countrywide sampling, and forensics in developing countries. Analyst, 136(15), 3073–​82. Godlee F (2018). Why aren't medical devices regulated like drugs? BMJ, 363, k5032. Newton PN, et al. (2006). Counterfeit anti-​infective medicines. Lancet Infect Dis, 6, 602–​13. Newton PN, et al. (2014). Falsified medicines in Africa and public health—​‘No Action—​Talk Only’. Lancet Global Health, 2, e509–​10. Vickers S, et al. (2018). Field detection devices for medicines quality screening: a systematic review. BMJ Global Health, 3, e000725. World Health Organization (2017). WHO Global Surveillance and Monitoring System for substandard and falsified medical prod- ucts. World Health Organization: Geneva. http://apps.who.int/ medicinedocs/en/m/abstract/Js23373en/ 2.11 Preventive medicine 127 2.11 Preventive medicine 127 ESSENTIALS Causes of death Most deaths before age 80  years are preventable but preventive medicine cannot offer immortality. Childhood and early adult life Deaths from infectious diseases and trauma usually reflect poverty and political instability. Prevention requires political action to reduce the risk of war and improve the supply of food, clean water, sanitation, and shelter. Preventive medicine can help by controlling spread of infection through vaccination, health education, control of insect vectors, and treatment of disease carriers to prevent onward transmission. Middle age The commonest cause of premature death globally is vascular disease—​mainly heart attacks and stroke. The main causes are poor diet, obesity, and tobacco smoking. Political action is important to make it easy for people to take exercise and eat healthily, and to make it difficult to buy and smoke tobacco. Preventive medicine Preventive medicine can identify and treat people at increased risk of death from vascular disease (particularly those with diabetes, high blood pressure, and high blood lipids) and will save many lives by doing this effectively, but the need for medicines to treat vascular disease in individuals is a measure of public health failure. The co- existence of obesity and starvation as major causes of preventable mortality in many countries is a growing public health challenge. Many effective preventive interventions such as legislation to make seatbelts compulsory or tax tobacco should be targeted at the whole population, whereas preventive medicine provided by clinicians must target individuals. This often requires screening to detect early signs of disease (e.g. HIV infection, cancer) or markers of risk of disease (e.g. high blood pressure, intrauterine growth delay), but clinicians must be clear that something effective can be done to ameliorate the condition detected before any screening is undertaken. Failure of evidence-​based preventive interventions The usual reason for failure is lack of effective implementation, with the three most important issues being: (1) poor population coverage, meaning that only a small proportion of the at-​risk population re- ceive the intervention; (2) inadequate staff training; and (3) inad- equate quality control. Preventive medicine and curative medicine Preventive medicine is an important and integral part of good curative medicine. All doctors have a responsibility to think about why someone is ill. Whatever disease is diagnosed, the question of whether it could have been prevented, and whether the risk of pro- gression can be reduced, must be addressed. For example, every clinician who diagnoses a stroke must ask themselves whether a pre- vious clinical opportunity to measure and control blood pressure has been missed, and reflect on this in regard to their future practice re- lating to other patients. Introduction In his millennium address, Nelson Mandela reminded the world that ‘we close the century with most people still languishing in poverty, subjected to hunger, preventable disease, illiteracy, and insufficient shelter’. The health gap between rich and poor nations remains shameful. For example, overall life expectancy in Sierra Leone is currently estimated as 46 years compared to 81 years in the United Kingdom. But even in the economically developed world, many people still die prematurely. In 2013, approximately 23% (114 740 out of 506 790) of all deaths registered in England and Wales were from causes considered avoidable through good-​ quality healthcare or wider public health interventions. Again, there is a marked gap between rich and poor. Men in the most advantaged areas of England lived on average 9 years longer, and experienced 19 more years of good health, than men living in the least advantaged areas. It is naive to think that medicine will remedy this situation. The fundamental step in achieving good health remains the elimination of poverty, with consequent access to food, sanitation, education, and shelter. The power of medicine lies in the scientific understanding it provides of the disease pro- cess. Preventive medicine uses this understanding both to try to reduce the risk of disease and to detect and treat appropriately emergent disease before it does damage. 2.11 Preventive medicine David Mant 128 section 2  Background to medicine What is the scope for prevention? Fig. 2.11.1 shows the number expected to die at different ages if 10 000 men were subject to the age-​specific death rates in the United Kingdom in 2010 compared to 1870. The dramatic fall in deaths during childhood and early adult- hood demonstrates unequivocally that such deaths are preventable. Moreover, most deaths before age 80 in high-​income countries are from diseases which are amenable to preventive intervention. Migrant studies show that genetic advantage does not explain the high average life expectancy in Japan (81 years in men and 87 years in women). It therefore seems reasonable to expect that effective pre- ventive medicine can make death before age 80 years uncommon. However, the proportion of men or women who can expect to live over 100 years remains very low (1.1% in England in 2011/​12). The achievable objective therefore seems to be better quality of life before death, and avoidance of premature death, rather than extension of life beyond 100 years. Preventive medicine cannot offer immortality. Preventive strategies Identifying and reducing risk The main difference between preventive and curative medicine is the focus on risk. The leading global risk factors for preventable mor- tality in 2013, estimated by the Global Burden of Disease study, are shown in Table 2.11.1. The most disturbing fact remains the occur- rence of both starvation and obesity on the same list. Although there has been progress in some areas since 1990 (e.g. the number of at- tributable deaths due to child starvation, unclean water, and unsafe sex have more than halved), other issues such as deaths due to air pollution, substance abuse (including alcohol and tobacco), and oc- cupational hazards have increased. Although there has been a shift from environmental towards individually mediated risks, individual behaviour is always constrained by sociocultural and economic fac- tors largely outside individual control. On a global scale, the dietary and behavioural risk factors leading to chronic disease now predominate, whether risk is measured in terms of deaths or disability. The geographical differences in risk factors for ill health mainly reflect national wealth. The individual behaviours which promote hypertension and cardiovascular disease (eating an unhealthy diet, smoking tobacco, taking too little exer- cise) are now the most important risk factors in middle-​income as well as high-​income countries. However, Table 2.11.2 shows that lack of food, unsafe water, and unsafe sexual practices still remain the top three causes of years of healthy life lost in the low-​income countries of sub-​Saharan Africa. The prevention paradox Preventive medicine aims to reduce the risk of disease (or the risk of further morbidity and mortality in those who develop disease) so its benefit is the absence of disease in the future rather than the present. Absence of something is a difficult benefit to champion, particularly to the individual. As Geoffrey Rose pointed out many years ago, not only is the benefit intangible but many people must take precautions in order to prevent illness in only a few. Even in a country where diphtheria is common, several hundred children must be immunized to prevent one death. Rose called this the ‘pre- vention paradox’—​a preventive measure, which brings large benefits to the community may offer little to each participating individual. 0 50 100 150 200 250 300 Number of deaths Age at death Age at death of 10 000 people born in the UK experiencing the age- specific death rates of 1870 and 2010 0–5 6–10 11–15 16–20 21–24 25–30 31–35 36–40 41–45 46–50 51–55 56–60 61–65 65–70 71–75 76–80 81–85 86–90 91–95 96–100 100–105 105–110 1870 2010 Fig. 2.11.1  Numbers dying at different ages if 10 000 men were subject from birth to the UK mortality rates in 1870 and 2010. Source of data: Office of National Statistics Life Tables, United Kingdom; the figure is based on one originally drawn by Doll R in British Medical Journal 1982; 286: 445–​53. 2.11  Preventive medicine 129 The risk paradox The importance of the risk paradox is best illustrated by an example. One of the leading global risk factors for vascular disease is a high level of cholesterol in the blood. Fig. 2.11.2 shows historical data from the United Kingdom on the prevalence of high blood chol- esterol, the death rate associated with each cholesterol level, and the proportion of all deaths attributable to cholesterol occurring at each level. The risk paradox is that although those with a blood cholesterol greater than 7.5 mmol/​litre are at highest individual risk of disease, they account for only 8% of total deaths. The group of people in whom most deaths occur (22%) is that with only a mod- estly increased cholesterol level of 5.5–​6.0 mmol/​litre. This is simply because of the number of people involved. There are far fewer in the high-​risk group than in the moderate risk group. Targeting Table 2.11.1  Leading risk factors for global mortality in 2013 Risk factor Attributable deaths each year (millions) Comment Environmental Ambient air pollution 3.1 Particulates 2.9 m; ozone pollution 0.2 m; overall 33% increase in attributable deaths since 1990 Indoor smoke from solid fuels 2.9 No reduction since 1990 Unsafe water, sanitation, and handwashing 1.3 The most important of these issues remains unsafe water (1.2 m attributable deaths) but the number of attributable deaths from this cause has almost halved since 1990 Occupational risks 0.7 Top three issues: 1. air quality (gases/​fumes/​dust); 2. asbestos; 3. industrial injuries. Overall 27% increase in attributable deaths since 1990 Dietary Unhealthy diet 11.3 Top three issues: 1. High sodium; 2. Lack of fruit; 3. Lack of whole grain cereals Child and maternal malnutrition 1.7 Top three issues: 1. undernutrition (starvation); 2. suboptimal breastfeeding; 3. iron deficiency Obesity (high BMI) 4.4 Overeating is compounded by low physical activity (attributable deaths 2.2 m) Other behavioural Tobacco smoke 6.1 Smoking 5.8 m, passive smoking 0.3 m; overall 18% increase in attributable deaths from tobacco use since 1990 Alcohol and drug use 3.2 Alcohol 2.8 m, other drugs 0.4 m; overall 51% increase in attributable deaths since 1990 Unsafe sex and sexual abuse 1.8 Unsafe sex 1.5 m, abuse of partner or child 0.3 m; deaths from unsafe sex more than halved since 1990 Metabolic High systolic blood pressure 10.3 Main modifiable risk factors for high blood pressure are dietary (especially salt intake) High blood cholesterol 4.4 Mean cholesterol levels vary between WHO Regions, but not more than 2.0 mmol/​litre in any age group High fasting blood glucose 4.0 The main risk factor for diabetes is obesity Data from: Global Burden of Disease Study 2013. Published in The Lancet online 11 September 2015 (http://​dx.doi.org/​10.1016/​50140-​6736(15)00128-​2). Table 2.11.2  Top 10 risk factors for ill health (disability-​adjusted life years attributable to level 2 risk factors) in 2013 in sub-​Saharan Africa and globally Rank Sub-​Saharan Africa Global 1 Child and maternal nutrition Dietary risks 2 Unsafe sex High systolic blood pressure 3 Unsafe water sanitation and hand washing Child and maternal nutrition 4 Air pollution Tobacco smoke 5 Alcohol and drug use Air pollution 6 High systolic blood pressure High body mass index 7 Dietary risks Alcohol and drug use 8 High body mass index High fasting blood glucose 9 High fasting blood glucose Unsafe water, sanitation, and hand washing 10 Tobacco smoke Unsafe sex Data from: Global Burden of Disease Study 2013. Published in The Lancet online 11 September 2015 (http://​dx.doi.org/​10.1016/​50140-​6736(15)00128-​2). CHD deaths/1000/6 year Prevalence 20 10 0 4 5 6 7 8 4% 8% 17% 22% 19% 13% 9% 8% 0 10 20 Serum cholesterol (mmol) Fig. 2.11.2  Proportion of coronary heart disease deaths attributable to raised serum cholesterol (percentages above columns). Columns and right axis show population distribution of cholesterol levels. The broken line and left axis show the attributable mortality. Reproduced from Rose G. Rose’s Strategy of Preventive Medicine, 2008. With permission from Oxford University Press, Oxford. 130 section 2  Background to medicine preventive medicine at just the high-​risk group will often have rela- tively little impact on the total number of deaths in the population. Primary and secondary prevention Use of these two terms is sometimes inconsistent (e.g. antenatal care is usually characterized as secondary prevention although preg- nancy is not a disease) but usually they can be interpreted as follows: • Primary prevention—​interventions to reduce the risk of disease in healthy people (e.g. use of seat belts to prevent injury in car accidents; tobacco control to prevent the occurrence of smoking-​ related disease; immunization against infectious disease). • Secondary prevention—​interventions to prevent avoidable mor- bidity in people with disease (e.g. treatment of vascular disease with aspirin; screening for early cancer). Primary prevention could be perceived as always better than sec- ondary prevention in the same way that prevention is better than cure. However, this is oversimplistic for three reasons: (1) many ap- parently healthy people will have undetected disease; (2) some inter- ventions fall into both categories (e.g. stopping smoking reduces the progression as well as onset of many smoking-​related diseases); (3) primary prevention may be less effective than secondary preven- tion, particularly when the intervention is aimed at the individual. Strategic choices In choosing priority strategies for risk prevention, the World Health Organization (WHO) has recommended that in general it is more effective: • to focus on population-​based rather than individual interventions • to prioritize primary rather than secondary prevention • to control distal before proximal risks to health This advice makes good sense from a public health perspective—​it is self-​evidently better to prioritize provision of clean water before fo- cusing on early recognition and treatment of dehydration from diar- rhoea caused by dirty water. However, making strategic choices about preventive medicine aimed at individual patients is less straightfor- ward. It is much easier to persuade individuals to change their behav- iour when they perceive themselves at high risk (e.g. to convince them to stop them smoking after they develop a smoking-​related illness). Primary prevention is seldom delivered cost-​effectively to healthy people by doctors or nurses in routine clinical practice. The one ob- vious exception is vaccination, where involvement of clinicians is often important in delivering vaccines and ensuring high coverage of the population at risk. Clinicians are also in an ideal position to re- inforce some key public health messages, such as stopping smoking and hand washing, with brief opportunistic advice during routine consultations (see next). However, several large-​scale clinical trials have shown that it is not cost-​effective for doctors and nurses to ex- pend scarce resources trying to convince otherwise healthy individ- uals to change eating or exercise habits determined primarily by the cultural and socioeconomic context in which they live. Defining and identifying the at-​risk population Public and individual intervention Some public health interventions do not require anything other than a broad geographical definition of risk. Interventions are targeted at whole populations and do not require identification of individuals within that population—​for example, road accidents can be reduced by seatbelt legislation and tobacco consumption by taxation without identifying the individual driver or smoker. However, some primary preventive strategies (e.g. vaccination) and most examples of secondary prevention (e.g. screening) have to be delivered to specific individuals at risk. You not only need to know that people with high blood pressure are at risk of stroke, you need to know who has high blood pressure. These individuals can be defined in one of three ways: by demographic, phenotypic, or familial characteristics. Within each group of at-​risk people so defined, further subpopulations may be identifiable as at particu- larly high risk. Demographic risk This is the most common way to define the target group for pre- ventive medicine services. For example, the United States Preventive Task Force defines the target group for most of the preventive services it recommends in terms of age. The other demographic risk characteristic commonly used by the Task Force to define the target population is gender. It is obvious why it recommends that breast and cervical cancer screening should only be targeted at women, but gender-​specific recommendations are also made for aortic an- eurysm (based on differential risk) and gonorrhoea (based on the differential likelihood of asymptomatic infection). Some preventive programmes may also target specific ethnic or racial groups—​for example, in Australia, interventions to reduce the risk of rheumatic fever associated with group A streptococcal infection of the throat have been restricted to the indigenous Aboriginal population who are at exceptionally high risk. Phenotypic risk A phenotype is a set of observable characteristics of an individual or group. Many epidemiological risk factors for disease are phys- ical characteristics (e.g. obesity, hyperlipidaemia); other phenotypic categories often used to define at-​risk target populations are behav- iour (e.g. smoking, driving) and disease states (diabetes mellitus, an- gina). As some phenotypic risks interact (e.g. smoking and exposure to asbestos interactively increase risk of mesothelioma), multiple risk assessment is an increasingly common practice. Several clin- ical tools have been developed to help estimate multifactorial risk in everyday practice, such as the New Zealand risk charts for cardio- vascular disease. Familial risk Recent advances in genetic technology have increased our ability to characterize familial risk accurately, and further advance is likely. At present, most preventive medicine programmes in this area use genetic assessment to refine assessment of individual risk in pheno- typically identified high-​risk individuals or families (e.g. cystic fibrosis, neurofibromatosis) or demographically defined popula- tions (e.g. pregnant women at risk of giving birth to a child with Down’s syndrome). However, the characterization of risk based on population-​based genetic screening is already technically feasible in the economically developed world, increasing the potential power of preventive medicine but also raising important ethical issues about how society deals with accurate predictions of high disease risk, par- ticularly when evident at or before birth. 2.11  Preventive medicine 131 Registration, screening, and case-​finding It is much easier to identify specific individuals at risk when uni- versal health registration is in place and its accuracy is systematically maintained. It not only allows efficient provision of primary preven- tion strategies, such as vaccination, but screening for disease risk is also much more efficient when based on an accurate population register. In the absence of a population health register, it is neces- sary to rely on case-​finding. This requires identification of at-​risk individuals during routine clinical work (normally in clinical con- sultations, but sometimes through contact or family tracing). It is less efficient than systematic population screening, but sometimes provides better access to socially disadvantaged groups who may re- spond poorly to screening invitations or have no registered address. It also allows some interventions to be given at a particularly appro- priate moment (e.g. smoking cessation advice at a consultation for cough or contraceptive advice after termination of pregnancy). Interventions to modify risk The importance of public health The marked improvements in health which have been achieved in economically developed countries over the past 150 years are not attributable to medicine. Life expectancy has doubled mainly be- cause of environmental control of infectious pathogens (through sanitation and control of insect vectors) and a lifestyle that reduces individual susceptibility to infectious disease (better food, shelter, and education). So although medical science can play an important role in guiding public health policy by improving understanding of the mechanisms of disease, and specific medical interventions allow us to treat disease when it occurs, the role of preventive medicine should not be overestimated. In particular, the medical profession should not take upon itself responsibilities for public health which are more appropriately assumed by governments and other social and environmental agencies. The preventive responsibilities of all doctors Preventive medicine is an important and integral part of good cura- tive medicine. All doctors have a responsibility to think about why someone is ill. Whatever cause is identified (physiological, social, or psychological), the question about whether the cause can be pre- vented (and the risk of future disease reduced) should be addressed. Clinicians should be held professionally accountable if they can be shown to have missed a previous clinical opportunity to measure blood pressure in an individual who subsequently develops a stroke. Doctors who work in a primary care role (particularly those with a registered population) have the added responsibility to ask them- selves whether the risk should be addressed at a population rather than just an individual patient level. Changing behaviour Most of the leading risk factors for preventable mortality cited earlier can be influenced by individual behaviour: smoking, diet, exercise, alcohol consumption, hand washing, and unsafe sex. The most effective way to influence such behaviour is usually through public health policy, but individual practitioners can play an im- portant complementary role. People do listen to their doctors, and several clinical trials have shown brief advice on behaviour modi- fication to be cost-​effective, even though the impact may be small (e.g. in most studies only about 1 in 30 smokers given brief advice to stop smoking actually quit). Brief advice is most effective if prac- tical in nature (giving guidance on how change can be achieved) and if backed up by written advice to take home (see Chapters 26.6.1, 26.6.2, and 26.6.3 for further discussion). More intensive inter- ventions may sometimes be more effective but tend to be less cost-​ effective. In each case, it is important to take account of the scientific evidence about effectiveness and the local socioeconomic context. Immunization Vaccination is a very effective preventive strategy. Vaccination against smallpox has led to global eradication of the disease; eradi- cation of polio is a feasible global objective. Vaccination against many diseases, particularly diseases of childhood such as measles, diphtheria, and polio, has led to rapid and dramatic falls in disease incidence. Fig. 2.11.3 shows the impact of introduction of the Hib vaccine in 1992 on the incidence of Haemophilus influenzae infec- tion in England and Wales. Table 2.11.3 shows the current routine vaccine schedule in the United Kingdom. Every individual in the population is routinely vaccinated against 14 different organisms. At-​risk groups may be re- quired to have specific vaccines to protect themselves (e.g. health workers against hepatitis B), and the general population and trav- ellers to other countries can choose and pay for a range of vaccines for other diseases including hepatitis A, typhoid, and rabies. It is not uncommon for a United Kingdom resident to have received vaccin- ation against 15–​20 microorganisms during their lifetime. Several new and important vaccines are on the horizon (e.g. against malaria), but the existence of an effective vaccine does not guarantee the success of an immunization programme. This depends on the ef- fective delivery of the vaccine to the at-​risk population. Programmes are often limited in their effect by affordability (some vaccines are unaffordable without external funding in many low-​income coun- tries), acceptance (parental anxiety about the adverse effects often limits uptake), and delivery (vaccines may lose potency if stored outside a refrigerator). There are also potential problems with the Number of reports 250 200 150 100 50 0 Years Hib vaccine introduced 1995 1994 1993 1992 1991 1990 1989 up to 1 year of age all other ages Fig. 2.11.3  Effect of introduction of Haemophilus influenzae type b (Hib) vaccination in the United Kingdom on laboratory reports of Hib infection. Reproduced by permission of the Controller, Her Majesty’s Stationery Office. 132 section 2  Background to medicine antigenic variability of organisms over time (e.g. influenza) and the difficulty of immunizing at an age young enough to prevent mor- bidity but old enough to stimulate an immune response (e.g. mea- sles). Nevertheless, immunization is probably the most important medical contribution to primary disease prevention. Screening The issue of screening is dealt with in Chapter 2.12. Two-​thirds of the preventive services for adults recommended for implementation by the United States Preventive Services Task Force involve screening. The purpose of screening is to identify disease at an early and curable stage. The most important criterion that has to be met for screening to be ethical is that the condition identified can be treated. Good intention is not enough. Screening will do harm if it identifies con- ditions that we can do nothing about, either because of lack of ef- fective interventions or lack of resources. It is particularly important to assess the effectiveness of screening interventions in randomized trials, as people whose disease was detected by screening will appear to clinicians to do well even if the screening is ineffective. This is because early diagnosis will lead to longer survival irrespective of treatment, and illness detected by screening will always tend to be more benign in its natural history than illness detected clinically. Prophylactic treatment Although most people think of medicines as cures for current illness, many medicines are prescribed with a view to preventing future illness. Antibiotics are given before surgery to prevent postoperative infection, antimalarials to prevent malaria in travel- lers, anticoagulants to prevent stroke in people with atrial fibrilla- tion, and lipid-​lowering agents to prevent heart attacks in people at high risk of cardiovascular disease. The duration of treatment may also be extended beyond the initial treatment phase to achieve a pre- ventive effect. Antidepressants are continued after cure to prevent relapse, angiotensin converting enzyme (ACE) inhibitors to prevent worsening of ventricular dysfunction, and uricosuric agents to pre- vent further episodes of gout. It must be clear from these examples that many drugs have the po- tential to be used for prevention as well as cure. In some cases (e.g. treatment of diabetes) the distinction between prevention and cure is unhelpful: treatment aims to prevent morbidity in both the short and long terms. However, in all the examples given, prescribing is limited to a defined high-​risk group. Prophylactic treatment with drugs is less helpful when a high-​risk population cannot be easily defined. It is usually inappropriate to use prophylactic treatment to reduce population risk for three reasons: the strategy is seldom cost-​ effective, increasing the reliance of the population on medicine is an adverse social outcome, and uncommon adverse effects can easily outweigh any benefit. However, a case can be made for treating high-​ risk populations defined only by age and gender with a ‘polypill’ combining certain drugs with a low side-​effect profile that have been shown to be effective in preventing cardiovascular disease. Environmental change Most environmental causes of disease are best modified on a public health rather than an individual basis. Such factors include the safety of the workplace, environmental pollution, transport safety, food hygiene, and provision of clean water. However, some diseases have environmental causes, which need to be recognized and avoided by the individual patient. On a global scale, avoidance of insect and other disease vectors (e.g. by netting) and attention to nutritional hygiene (e.g. by hand washing and filtering water) are probably the most important. In economically developed countries, the most common diseases amenable to individual environmental interven- tion are those associated with atopy, such as asthma and eczema. Not all patients have an identifiable allergenic cause for their symp- toms and, even if one is identified, avoidance (e.g. of house dust mite in asthma) may not be easy. But dramatic improvement can occur, and treating contact dermatitis without giving advice on contact Table 2.11.3  Routine immunization programme in the United Kingdom according to age 2 months 3 months 4 months 1 year 2 years, then annually until 16 years 3 ½ years Girls 12–​14 years 14 years 65 years 65 years then annually 70 years Diphtheria ✔ ✔ ✔ ✔ ✔ Tetanus ✔ ✔ ✔ ✔ ✔ Polio ✔ ✔ ✔ ✔ ✔ Pertussis ✔ ✔ ✔ ✔ H. influenzae B ✔ ✔ ✔ ✔ Rotavirus ✔ ✔ Pneumococcus ✔ ✔ ✔ ✔ Meningococcus C ✔ ✔ Measles ✔ ✔ Mumps ✔ ✔ Rubella ✔ ✔ Influenza ✔ ✔ Human Papillomavirus ✔ Shingles ✔ Data from: Department of Health/​Public Health England. Immunisation against infectious disease (The Green Book) Chapter 11. HMSO, London (updated February 2015). https://​ www.gov.uk/​government/​uploads/​system/​uploads/​attachment_​data/​file/​400554/​2902222_​Green_​Book_​Chapter_​11_​v2_​4.pdf 2.11  Preventive medicine 133 avoidance, or treating louse bites without giving advice on how to rid clothes of lice, is bad medical practice. What interventions work? Public health interventions It is impossible to list here all the public health initiatives that are known to be effective. There is no doubt that provision of clean water and sanitation; avoidance of war; progressive taxation; pro- vision of education; fiscal policy to reduce tobacco and alcohol use; food policies to reduce community intake of salt, saturated fat, and excess calories; and education to promote safe sex, are all ef- fective public health interventions. Their effective implementation depends on political will at a national and international level. In general, the effectiveness of public health interventions will reflect government attitude to regulation (e.g. whether the presumption of market-​led development and free trade dominates individual and societal health concerns) and the understanding of health risk by the general public, politicians, and health practitioners. At an inter- national level, the World Health Organization cites the Framework Convention for Tobacco Control as an exemplar of a very effective government-​led international public health initiative. This covers advertising, regulation, taxation, and smoke-​free zones, as well as the individual treatment of addiction. Interventions aimed at the individual As with public health interventions, it is impossible to list here all the preventive interventions targeted at individuals which have been shown to be effective. Many are in any case better seen as part of good routine clinical care (and are included in the relevant chapters on specific diseases). Nevertheless, one accessible source of regularly updated evidence on individual interventions is the United States Preventive Services Task Force and Table 2.11.4 lists Table 2.11.4  Preventive interventions recommended for adults by the United States Preventive Services Task Force as providing important health benefit on the basis of research evidence rated as A (high certainty of substantial net benefit) or B (high certainty of moderate net benefit or moderate certainty of moderate to substantial net benefit) Men Women Timing and target groups Screening interventions Alcohol misuse screening and counselling ✔ ✔ Counselling about safe-​drinking for all; specific help for heavy drinkers identified by screening Aortic aneurysm ✔ Ultrasound at age 65–​75 in ever smokers BRCA-​related cancer ✔ Women with family history suggestive of BRCA1 or BRCA2 gene mutations Breast cancer ✔ Biennial mammography from ages 50 to 74 years Cervical cancer ✔ Cytology every 3 years age 21–​65 years or combined cytology/​HPV testing every 5 years Chlamydia infection ✔ All sexually active women age <25 years (after age 25 only high-​risk groups) Colorectal cancer ✔ ✔ Age 50–​75 years using faecal occult blood (FOB) testing, sigmoidoscopy, or colonoscopy Depression ✔ ✔ Recommendation states that screening is appropriate only when accurate diagnosis, treatment, and follow-​up possible Diabetes mellitus ✔ ✔ Adults with sustained hypertension (BP >135/​80) Risk of falling in older adults ✔ ✔ Community-​dwelling adults 65+ years; physical therapy and/​or vitamin D supplements recommended for high-​risk groups Gonorrhoea infection ✔ At-​risk groups Hepatitis C infection ✔ ✔ At-​risk groups and those born between 1945 and 1965 High blood pressure ✔ ✔ All adults age 18+ years HIV infection ✔ ✔ At-​risk groups Intimate partner violence ✔ Women of childbearing age (guideline also includes advice on identifying risk of abuse of elderly) Lipid disorders ✔ ✔ All men aged 35+; women and younger men 20+ if high cardiovascular disease (CVD) risk Lung cancer ✔ ✔ Heavy smokers (current or within 15 years of cessation) Obesity ✔ ✔ Assuming availability of intensive multifactorial behavioural intervention when BMI 30+ Osteoporosis ✔ Women aged 65+ (plus women at equivalent risk for reasons other than age) Syphilis infection ✔ ✔ At-​risk groups Other interventions Aspirin ✔ ✔ Adults at increased CVD risk Breast cancer chemoprevention ✔ Women age 35+ years at high risk of breast cancer Dietary counselling ✔ ✔ Adults at high risk of CVD Tobacco use counselling ✔ ✔ All who use tobacco (assuming all patients are asked about tobacco use opportunistically at clinical consultations) Data from: Recommendations of the U.S. Preventive Services Task Force. US Department of Health and Human Services, Washington, DC; October 2015 (http://​www. uspreventiveservicestaskforce.org/​Page/​Name/​uspstf-​a-​and-​b-​recommendations/​Individual interventions). 134 section 2  Background to medicine the conditions for which evidence of effectiveness is rated by them as A (high certainty of substantial net benefit) or B (high certainty of moderate net benefit or moderate certainty of moderate to sub- stantial net benefit). In interpreting this evidence, it is important to recognize that the recommendations reflect the specific context of the United States; differences in the prevalence of the target condition and the availability and effectiveness of interventions to ameliorate risk identified impact on their relevance to other countries. The recommended preventive interventions (other than vac- cination) for two important at-​risk groups—​pregnant women and children—​are listed separately in Table 2.11.5. Again, in interpreting this evidence it is important to consider its local ap- plicability. For example, the appropriateness of treating individual children with fluoride to prevent dental caries depends both on whether the local diet promotes caries and whether fluoride is added to drinking water as a public health measure. Similarly, screening for sickle cell disease and counselling about skin cancer prevention is only appropriate in at-​risk ethnic groups, and screening for depression in adolescents is only appropriate if re- sources are available to make an accurate diagnosis and manage the problem effectively. Implementation issues Cultural constraints Most behaviour aimed at preventing disease has a strong sociocul- tural component and reflects prevalent attitudes and norms in so- ciety. Preventive interventions are severely constrained by this social context. Convincing people to stop smoking, eat less salt, drink less beer, or drive more slowly is difficult if everyone else is doing the op- posite. For example, the data in Fig. 2.11.4 (showing that the mean blood cholesterol level in Finland in the 1990s was almost twice that in Japan), supported by migrant studies showing that this dif- ference was dietary rather than genetic in origin, led some enthu- siastic policy makers to assume that medical advice to individuals to modify their diet could cut cholesterol levels by half. However, even in clinical trials, dietary advice from health professionals in a community setting has seldom achieved a reduction in blood chol- esterol of more than 3–​5%. Studies of salt restriction (to lower blood pressure) show a similar result. Intensive intervention and support are needed for an individual patient to achieve a physiologically significant reduction in intake and many find such a diet unpal- atable. Countercultural change is difficult to achieve. The adverse Table 2.11.5  Preventive interventions for pregnant women and children (other than vaccination) recommended by the US Preventive Services Task Force as providing important health benefit on the basis of research evidence rated as A (high certainty of substantial net benefit) or B (high certainty of moderate net benefit or moderate certainty of moderate to substantial net benefit) Children Pregnant women Timing and target groups Alcohol misuse screening and counselling ✔ First antenatal visit Asymptomatic bacteriuria screening ✔ 12–​16 weeks’ gestation Breastfeeding promotion ✔ During pregnancy and after delivery Congenital hypothyroidism screening ✔ Newborn Deafness ✔ Newborn Dental caries prevention (fluoride) ✔ From first tooth eruption to 5 years Depression (major depressive disorders) screening ✔ Children aged 12+ if diagnostic and treatment provision in place Folic acid supplementation ✔ All women planning or capable of pregnancy Gestational diabetes screening ✔ After 24 weeks’ gestation Gonococcal ophthalmia neonatorum prophylaxis (antibiotic drops) ✔ Newborn Infection screening (chlamydia, gonorrhoea, Hep B, Hep C, HIV, syphilis) ✔ ✔ All at-​risk of specific infections (all pregnant women for HIV and syphilis); Iron deficiency anaemia screening and dietary iron supplementation ✔ ✔ Screen all pregnant women; supplement at-​risk children 6–​12 months Obesity ✔ Children age 6+ years Pre-​eclampsia prevention (aspirin) ✔ After 12 weeks’ gestation in women at high risk PKU (phenylketonuria) screening ✔ Newborn Rhesus (D) incompatibility screening ✔ First antenatal visit and repeat at 24–​28 weeks if Rh (D) negative Sexually transmitted infection, behavioural counselling ✔ Sexually active adolescents Sickle cell disease screening ✔ Newborn Skin cancer avoidance counselling ✔ Children and young adults age 10–​24 years with fair skin Tobacco use counselling ✔ ✔ School age children and pregnant women who smoke Visual-​impairment screening ✔ Ages 3–​5 years Data from: Recommendations of the U.S. Preventive Services Task Force. US Department of Health and Human Services, Washington, DC; October 2015 (http://​www. uspreventiveservicestaskforce.org/​Page/​Name/​uspstf-​a-​and-​b-​recommendations/​Individual interventions). 2.11  Preventive medicine 135 consequence is the common use of medication (in this case statins) to treat what is effectively a sociocultural problem. Time constraints One lesson we often forget is that things change over time. The North Karelia project is of more than historical interest. It was a large-​scale, long-​term programme to reduce mortality from cardiovascular disease in northern Finland, started in 1972, which involved both public health and individual intervention. Fig. 2.11.5 compares mortality from cardiovascular disease in North Karelia with that in 10 other provinces in Finland before and during the intervention by plotting two regression lines. The difference in slopes of these two lines shows that the intervention was to some extent effective. However, far more impressive in magnitude is the absolute fall in mortality over time both in North Karelia and in the other prov- inces. The lessons for preventive medicine are twofold: the effect of medical intervention may be small compared to the effect of other economic and social influences; and the change in baseline risk and social context over time may be so rapid that it will substantially in- fluence the absolute benefit of any preventive intervention. Programme effectiveness Many of the interventions cited earlier are known to work because they have been tested in clinical trials. However, clinical trials are often done in settings far removed from everyday life. Participants are compliant, those delivering the intervention are highly trained, the technology is of high specification, and quality control is rigorous. These conditions will not hold under ordinary working conditions. When recommended preventive interven- tions fail, the most common reason is lack of effective implemen- tation of the programme, rather than lack of effectiveness of the intervention itself. The three most important implementation issues that determine programme effectiveness are: • Coverage—​the population at risk needs to receive the intervention. • Delivery—​the way the intervention is delivered (e.g. maintenance of equipment, the training of staff, and the storage of biological materials) must be high quality. • Quality assurance—​performance indicators (e.g. the maximum number of cases missed by screening) need to be predefined and monitored. Failure to implement any one of these issues can stop a poten- tially effective intervention delivering any benefit. Two well- ​documented examples from the United Kingdom are the resur- gence in whooping cough when vaccine population coverage fell (after media publicity about potential adverse effects of the pertussis vaccine) and the missed cases of uterine cervix which occurred before adequate quality standards were defined and monitored for cervical sampling and cytological assessment in the national screening programme. Conclusion Preventive medicine is an integral part of clinical practice for all doctors. It is our responsibility as clinicians not only to cure the presenting illness but also to take action where possible to prevent future morbidity. However, we must display both humility and as- sertiveness in our approach. We need to be humble in our approach to patients and to recognize that medicine is not the main deter- minant of health. At the same time, we must display assertiveness in our advocacy of prevention. In the United Kingdom, the Royal College of Physicians’ reports, the campaigning of medical charities, and the decision by virtually all doctors to stop smoking have all played an important part in influencing both public and political opinion against tobacco use. As a profession, we can make a unique and powerful contribution to the prevention of premature death by identifying and publicizing the existence and causes of ill health. We also have a unique and powerful responsibility to act as advocates for our patients in ensuring that these causes are addressed and the risk to their health is ameliorated. Good clinical practice entails pre- ventive medicine, but good preventive medicine is more than just good clinical practice. Total serum cholesterol (mg/100 ml) 80 60 40 20 0 100 200 300 400 500 South Japan East Finland Relative frequency (%) Fig. 2.11.4  Distribution of serum cholesterol in southern Japan and eastern Finland. Reproduced from Rose G. Rose’s Strategy of Preventive Medicine, 2008. With permission from Oxford University Press, Oxford. 1969 Year 1000 Mortality/1000 900 800 700 600 500 400 82 81 80 79 78 77 76 75 74 73 72 71 70 North Karelia Ten other provinces Fig. 2.11.5  The North Karelia project. Age-​standardized annual mortality from cardiovascular disease in men aged 35–​64 years in Finland, 1969–​1982. Redrawn from original data published by Tuomilehto J et al., British Medical Journal 1986: 293: 1068–​71. 136 section 2  Background to medicine FURTHER READING Department of Health/​Public Health England (2015). Immunisation Against Infectious Disease (The Green Book). HMSO, London (https://​www.gov.uk/​government/​collections/​ immunisation-​against-​infectious-​disease-​the-​green-​book) Rose G (1992). The strategy of preventive medicine. Oxford University Press, Oxford. US Preventive Services Task Force (2014). Guide to Clinical Preventive Services—​Recommendations of the US Preventive Services Task Force. (http://​www.uspreventiveservicestaskforce. org/​) World Health Organization (2003). World Health Report 2002. Reducing risks, promoting healthy life. World Health Organization, Geneva. http://​www.who.int/​whr/​2002 World Health Organization (2013). Global Action Plan for the Prevention and Control of Non-​communicable Diseases 2013–​2020. (http://​ apps.who.int/​iris/​bitstream/​10665/​94384/​1/​9789241506236_​eng. pdf?ua=1) 2.12 Medical screening 137 2.12 Medical screening 137 ESSENTIALS Medical screening is the systematic application of a test or inquiry to identify individuals at sufficient risk of a specific disorder to benefit from further investigation or direct preventive action (these individ- uals not having sought medical attention on account of symptoms of that disorder). Key to this definition is that the early detection of disease is not an end in itself; bringing forward a diagnosis without altering the prognosis is useless and may be harmful. Criteria for screening Before a potential screening test is introduced into practice it must be shown to prevent death or serious disability from the disease to an ex- tent sufficient to justify the human and financial costs. To this end, three screening parameters need to be determined: (1) the detection rate (sensitivity); (2) the false-​positive rate (equivalent to the specificity); and (3) the odds of being affected given a positive screening result (equiva- lent to the positive predictive value). Where a detection rate cannot be directly determined (e.g. in cancer screening, or if the efficacy of the intervention is uncertain), a randomized trial is needed to show that screening and subsequent treatment reduce disease-​specific mortality. Circumstances where screening is not appropriate Screening tests should not be practised simply because they seem intuitively useful: chest radiography to screen for lung cancer and manual breast self-​examination to screen for breast cancer were as- sumed to be worthwhile, but randomized trials showed they did not significantly reduce mortality from the cancer. Screening for prostate cancer is widely practised, yet it does harm (from hazardous treat- ment) with evidence of a relatively modest reduction in mortality from the disease. Causal risk factors, even important ones like serum cholesterol and blood pressure for cardiovascular disease, usually dis- criminate poorly between individuals who will and will not develop the disease they cause, because most of the population is ‘exposed’. Particular disorders where screening is justified The number of disorders for which medical screening has been shown to be worthwhile is perhaps surprisingly small, but includes: (1) antenatal screening (e.g. various single-​gene disorders, Down’s syndrome, neural tube defects, and some infections such as hepa- titis B and HIV that may be asymptomatic in the mother but cause disease when transmitted to the fetus); (2)  neonatal screening (e.g. congenital hypothyroidism, certain inborn errors of me- tabolism such as phenylketonuria, and congenital deafness); (3) adult screening—​in respect of cancer this has been shown to be worthwhile for only three cancers—​breast, cervical, and colorectal; screening individuals with diabetes mellitus prevents blindness from retinopathy; screening men around the age of 65 prevents death from ruptured abdominal aortic aneurysm; and screening young women for chlamydia infection prevents pelvic inflamma- tory disease and its complications (including infertility). Future prospects Tests that arise out of technological development in the absence of a clear case of medical need (e.g. whole-​body scanning using MRI or CT), should not be ‘sold’ to the public in the belief that they are helpful. As with all screening methods, their value needs to be shown before they are introduced into practice. Determining when medical screening is an effective method of preventing serious dis- ease and disability is one of the most challenging areas in medical research. Introduction There is scarcely a medical discipline that does not include some aspect of screening. It has made significant inroads into the pre- vention of disease, but is often used inappropriately in circum- stances where there is insufficient evidence that it benefits health. Determining when screening is an effective method of prevention is one of the most challenging areas in medical research today, re- quiring an understanding of the principles of screening, the path- ology, natural history, and epidemiology of the diseases concerned, and quantitative information on the efficacy of the screening tests and the remedies available. 2.12 Medical screening Nicholas Wald and Malcolm Law1 1  © The author. 138 section 2  Background to medicine Medical screening contains three elements: 1. Identifying individuals at sufficiently high risk of having or developing a specific disorder to benefit from further investiga- tion or direct preventive action. 2. It is systematically offered to a population that has not sought medical attention for symptoms of the relevant disease. It is usu- ally initiated by medical authorities, not the patient. 3. Its purpose is to benefit screened individuals. On this basis, mass testing activities such as surveillance for HIV infection or pre-​ employment examinations to test fitness for work are not classi- fied as medical screening. The following definition has been widely used and encapsulates these three elements: Medical screening is the systematic application of a test or inquiry, to identify individuals at sufficient risk of a specific disorder to benefit from further investigation or direct preventive action, among those who have not sought medical attention on account of symptoms of that disorder. Worthwhile screening aims to prevent death or disability from specific disorders. Screening that simply brings forward a diag- nosis without altering the prognosis is useless and may be harmful, prompting needless anxiety, and possibly hazardous interven- tions. The early detection of disease is not an end in itself. As with any medical treatment, screening needs to be shown to offer medical benefit and to be acceptably safe. Many medical disorders are not candidates for screening, because they are too trivial or because treatment is no more effective following screening than following clinical presentation. The value of a screening test, in which the benefits are considered in the light of the human and financial costs, needs to be determined before it is introduced into practice. Requirements for a worthwhile screening test See Box 2.12.1. The disorder The disorder needs to be clinically well defined and should, wher- ever possible, be specified independently of the screening test. The disorder should not be an ‘abnormal’ value of the screening test being offered, such as a value lying outside the 95% range. This cre- ates a circularity (‘tautological screening’) that makes it impossible to determine whether screening is genuinely preventing disease or is just causing overdiagnosis. It is necessary to know the distribution of values of the screening test in individuals who have (or will develop) the clinical disease and in individuals who do not, in order to assess the value of the test. An example is hypertension, or high blood pressure, an asymp- tomatic condition that increases risk of a heart attack or stroke. If hypertension were regarded as the medical disorder being screened for, then all the ‘hypertensives’ would have blood pressure above the cut-​off and all the ‘non​hypertensives’ below it (i.e. a perfect test). The apparent screening perfection is a tautological misconception. A high blood pressure measurement is the result of a screening test (blood pressure measurement) for the clinical diseases (stroke and myocardial infarction) caused by high blood pressure. In fact, blood pressure measurement, although widely practised, is not a good screening test for stroke or myocardial infarction. Many people who will not have a stroke or myocardial infarction have high blood pressure, and many who do will not. This is considered in further detail next. Prevalence or incidence To derive an estimate of the odds of being affected among indi- viduals with a positive screening result (see next), the prevalence, or incidence, of the disorder needs to be known. Prevalence is the number of cases of a disorder in a defined population at a given point in time, incidence is the number of new cases occurring in a de- fined population over a specified period. If the disorder is very rare screening may not be justifiable, unless it can easily be incorporated into an existing screening protocol. If the disorder is very common (e.g. heart attacks and strokes), screening may be pointless and a population-​wide preventive strategy may be needed. Natural history Screening should be restricted to disorders that are medically important (i.e. associated with serious morbidity or premature mortality). Remedy A remedy or treatment must be available that is more effective or ac- ceptable following screening than at clinical presentation. Offering an effective treatment is insufficient; the treatment must be more effective or acceptable if delivered early. Screening test The screening test should be simple and safe. Some screening tests are so simple or performed so routinely that they are not recognized as such. For example, asking a woman’s age was once the antenatal screening test for Down’s syndrome. A routine blood count includes Box 2.12.1  Requirements for a worthwhile screening test 1 Disorder: well defined 2 Prevalence/​incidence: known 3 Natural history: medically important disorder 4 Remedy or treatment: more effective or acceptable than at clinical presentation 5 Screening test: simple and safe 6 Test performance specified: (a) Detection rate can be determined:  detection rate and false-​ positive rate known and acceptable. For a quantitative screening test, the distributions of test values in affected and unaffected individuals should be known, the extent of overlap sufficiently small, and a suitable cut-​off level defined (b) Detection rate cannot be determined: randomized trial evidence shows that the combined effect of screening and treatment is sufficiently effective in preventing death and disability from the disease being screened for, with an acceptably low proportion of individuals requiring further investigation 7 Financial: overall cost acceptable to achieve the health benefit 8 Facilities: available or can easily be installed, including for diagnosis and treatment 9 Acceptability:  procedures following a positive result are generally agreed and acceptable to the screening authorities and the screened individuals 2.12  Medical screening 139 the antenatal screening test for β-​thalassaemia (mean corpuscular volume), so the issue is not one of introducing the test but of system- atically interpreting a test already carried out. The purpose of testing generally defines whether it is a screening or diagnostic test. If the aim is to identify a high-​risk group for fur- ther investigation or preventive treatment, it is a screening test; if it is to make a diagnosis, it is a diagnostic test. Screening tests indicate a probability of having or developing a disorder, whereas diagnostic tests usually indicate whether an individual is affected or unaffected. The accuracy of each type of test does not itself define what type of test it is. Sometimes mass testing, perceived as screening, is in fact diagnosis (e.g. obstetric ultrasonography used routinely to diagnose anencephaly). Screening tests usually apply to healthy populations, but this is not always the case (e.g. screening for retinopathy among people with diabetes). Screening test performance It is useful to separate screening tests for which detection rates can be determined from screening tests for which this is not possible. Detection rate can be determined The performance of screening and diagnostic tests is defined by three parameters: (1) the detection rate; (2) the false-​positive rate; and (3) the odds of being affected given a positive result (OAPR). Detection rate The detection rate of a test (or test sensitivity) is the proportion of affected individuals with positive test results (Table 2.12.1). An advantage of the term detection rate over sensitivity is that it avoids confusion with the usage of sensitivity in analytical biochem- istry, where it means the minimum detectable amount in an assay. In cancer screening, ‘detection rate’ is often taken to mean the preva- lence of detected cancers at a screening examination. False-​positive rate The false-​positive rate is the proportion of unaffected individuals with positive test results (Table 2.12.1). The complement of the false-​positive rate is the specificity, which is 100% minus the false-​positive rate (e.g. a false-​positive rate of 3% is the same as a specificity of 97%). Advantages of the term false-​ positive rate over specificity are that (1) it is more easily understood and remembered; (2) it focuses attention on the group to be offered further medical intervention; and (3) a 10% false-​positive rate is twice as ‘bad’ as one of 5%, whereas this is concealed within the cor- responding specificity values of 90% and 95%. Odds of being affected given a positive result (OAPR) The OAPR is the ratio of the number of affected to unaffected indi- viduals among those with positive test results (i.e. true positives:false positives in the population in question). The OAPR in Table 2.12.1 would be a:b if the numbers in Table 2.12.1 came directly from screening everyone in a study popu- lation. In practice this is uncommon because the disorder being screened for is rare and so it is sensible to estimate the detection rate on all the affected individuals but only a small sample of un- affected individuals. Because of this sampling difference, tables like Table 2.12.1 cannot be used to estimate the OAPR. It is best esti- mated indirectly using estimates of the prevalence of the disorder from one source and estimates of the detection rates and false-​ positive rates of the screening test from another source. This can be done using a flow diagram such as that in Fig. 2.12.1, in which the detection rate (80%) is applied to the number of affected individ- uals (prevalence 1%) and the false-​positive rate (4%) to the number of unaffected. Then the ratio of true-​positive to false-​positive tests performed will be an unbiased estimate of the OAPR in a total population. The OAPR is 1:5 after the screening test, and 38:1 after the diagnostic test (detection rate 95%, false-​positive rate 0.5%). If the prevalence of the disorder were halved (0.5%) the OAPRs would be halved to 1:10 and 19:1, respectively. Thus, the less common the disorder, the less likely people with positive results will be affected. The OAPR can be expressed as a probability (‘true positives/​all positives’) which is known as the positive predictive value (PPV). In the example in Fig. 2.12.1) the OAPR, 80:400 = 1:5, is equivalent to a predictive value of 1/​(1+5) = 1/​6 ​-​ 17%. The OAPR is more useful than the PPV because it is numerically easier to compute when tests are performed in sequence (Fig. 2.12.1), and it provides a better impression of the relative performance of tests. In the example, the OAPR of 38:1 is equivalent to a PPV of 97% (38/​39). If the detection Table 2.12.1  Algebraic summary of detection and false-​positive rates of qualitative tests or quantitative tests using a specified cut-​off Test result Affected Unaffected Positive TRUE POSITIVES a FALSE POSITIVES b Negative FALSE NEGATIVES c TRUE NEGATIVES d Total a + c b + d Detection rate (sensitivity) a a+ c False-​positive rate (1 –​ specificity) b b+ d 10000 individuals 100 affected Screening test Diagnostic test DR = 80% DR = 95% 80+ve 400+ve OAPR = 80:400 1:5 76:2 38:1 9900 unaffected 76+ve 2+ve FPR = 0.5% FPR = 4% Fig. 2.12.1  Flowchart to show the performance of screening and diagnostic tests. The critical first step in constructing such a flowchart is to separate individuals into affected and unaffected, not into screen-​positive and screen-​negative. Reproduced from Wald, N. An Introduction to Epidemiology in Medicine. London: Royal Society of Medicine Press, 2004. 140 section 2  Background to medicine rate of the screening test were halved (to 40%) the OAPR would also be halved (to 19:1) but the PPV, 95% (19/​20), appears only a little lower. A good screening test has a high detection rate, a low false-​positive rate, and a high OAPR (e.g. 1:10 is better than 1:50). Stating the de- tection rate for a test is uninformative unless a false-​positive rate (or specificity) is also stated. Screening performance is assessed by spe- cifying the detection rate for a given false-​positive rate, or specifying the false-​positive rate for a given detection rate. The detection rates and false-​positive rates are independent of the prevalence of the disease for tests that measure a consequence of the disease (e.g. the antenatal markers of Down’s syndrome) but may not be independent for a screening test that is a measure of a cause of the relevant disease. For example, when screening for an autosomal recessive disease such as cystic fibrosis by testing for a known DNA mutation in the gene for the disease, a higher gene prevalence will necessarily be linked to a higher disease prevalence. The OAPR is always dependent on the prevalence. The higher the prevalence the higher the OAPR, even if the detection rate and false-​ positive rate are constant. Estimates of the detection rate and false-​positive rate can be ap- plied from one population to others because they are generally inde- pendent of the prevalence of the disorder. This is not the case with the OAPR, which depends on the prevalence. For a qualitative (or categorical) test, such as the presence or ab- sence of a cystic fibrosis mutation among a given panel of muta- tions tested for, there is only one detection rate and false-​positive rate. This is not the case with quantitative (or non​categorical) tests, such as maternal serum α-​fetoprotein (AFP) for open spina bifida screening, which yield numerical results. In such cases, the detection rate and false-​positive rate depend on the screening cut-​off level used to distinguish positive from negative results. No single pair of detec- tion and false-​positive rates summarizes the performance of tests; both will vary as the cut-​off is changed. For example, at cut-​off level A in the relative frequency distributions in Fig. 2.12.2 the test will have a detection rate given by the area under the curve for affected subjects to the right of cut-​off level A (95%), and a false-​positive rate given by the area under the curve for unaffected subjects to the right of the same cut-​off level (10%). The higher the cut-​off level (say, B or C) the lower the detection rate and false-​positive rate. It is common to summarize the performance of a test as a receiver–​ operator characteristic (ROC) curve. This is a plot of the detection rate against the false-​positive rate, with both scales plotted from 0% to 100%. In such a graph a useless test is represented by the diagonal, indicating that the detection rate and the false-​positive rate are al- ways the same. As the screening test improves, the ROC curve bows out from the diagonal towards the axes. A perfect screening test clings to the detection rate axis up to 100% while the false-​positive rate remains at zero. The area under a ROC curve is sometimes used to indicate the performance of a screening test, but it is not a satis- factory measure of this. It is better to state detection rate for specified false-​positive rate or vice versa. A weakness of a ROC curve is that for screening tests that are potentially useful, the area of the graph that is informative is restricted to a small portion, namely the part covering false-​positive rates up to about 10% and detection rates (from 40% to 100%) between about 50% and 100%. Fig. 2.12.3 illus- trates detection rates plotted against false-​positive rates (from 0% to 10%) in multiple marker antenatal screening for Down’s syndrome, showing the improvements in screening that have been made over the past 20 years. Good screening tests are usually early manifestations of the dis- ease being screened for, while causes of a disease that are highly prevalent in a community are usually poor screening tests. Causal risk factors such as blood pressure for stroke are important aetio- logically and account for a large proportion of the disease they cause because they are usually common (e.g. most adults over 55 can be said to have a high blood pressure) yet many escape the conse- quences (e.g. a stroke). This means that causal risk factors usually do not discriminate well between individuals who will and who will not develop the disease. Table 2.12.2 shows the detection rate for a 5% false-​positive rate (DR5) for various risk ratio estimates between the top and bottom fifths of the distribution of a risk factor. Even a ‘strong’ risk factor with a fivefold risk ratio between the top and bottom quintile groups (fifths) of the distribution (typical of low-​density lipoprotein (LDL) cholesterol and myocardial infarction) has only a 14% detection rate Affected A Unaffected Cut-off A at 6 units DR FPR 95% 10% Cut-off B at 6.5 units DR FPR 90% 2.5% Cut-off C at 7 units DR FPR 80% 1% Unaffected Unaffected Test variable (arbitrary units) 2 3 4 5 6 7 8 9 10 11 B C Affected Affected Fig. 2.12.2  Hypothetical example of the detection rate and false-​ positive rate of a screening test at three different cut-​off levels. The implied vertical axis is the percentage of individuals at different levels of the screening test variables, considered separately for affected and unaffected individuals. Reproduced from Wald, N. An Introduction to Epidemiology in Medicine. London: Royal Society of Medicine Press, 2004. 2.12  Medical screening 141 for a 5% false-​positive rate. An interquintile risk ratio of around 1000 is necessary to achieve a detection rate of at least 75% for a 5% false-​positive rate. The OAPR can be determined by using the flowchart method il- lustrated in Fig. 2.12.1. It can also be determined using the likeli- hood ratio (LR) which is a measure of the ‘concentrating’ power of a test (Fig. 2.12.4). For a group of people with values of the screening variable above a specified cut-​off (i.e. all screen positives), this is the proportion of the area for ‘affecteds’ to the right of the cut-​off div- ided by the proportion of the area for ‘unaffecteds’ to the right of the cut-​off. Fig. 2.12.4a, which is equivalent to the detection rate divided by the false-​positive rate (DR/​FPR). It is the number of times indi- viduals with positive results are more likely to have the disorder for which they are being tested compared with the general population (individuals who have not been tested). That is, the OAPR is the likelihood ratio multiplied by the preva- lence of the disorder (expressed as an odds): OAPR LR prevalenceasanodds. × So (see Fig. 2.12.4a for example), if the detection rate is 80% and the false-​positive rate is 1%, then the LR is 80%/​1%, or 80. If the prevalence of the disorder were 1:1000, then OAPR 80 1:1000 80 :1000 1:1000 / 80 1:12.5. × = For an individual with the screening variable at some specific value, the likelihood ratio is the height of the relative distribution curve for ‘affecteds’ at the test value for that individual divided by the height of the curve for ‘unaffecteds’ at the same test value. So, for example, an 100 90 80 60 50 40 0 1 2 3 4 5 False-positive rate (%) Integrated test 9.4 8.6 Serum integrated test 8.7 7.3 Combined test 8.6 7.2 Quadruple test 8.3 6.4 Triple test 6 7 8 9 10 70 Detection rate (%) 7.7 5.6 Fig. 2.12.3  Antenatal screening for Down’s syndrome: detection rates and false-​positive rates for specified screening tests. The integrated test consists of the ultrasound marker nuchal translucency and pregnancy-​ associated plasma protein A (PAPP-​A) measured in the first trimester, and AFP, unconjugated oestriol (uE3), human chorionic gonadotropin (hCG), and inhibin-​A in the second trimester. The serum integrated test is the same as the integrated test without nuchal translucency. The combined test consists of nuchal translucency, PAPP-​A, and hCG in the first trimester. The quadruple test consists of AFP, uE3, hCG, and inhibin-​A in the second trimester. The triple test is the same as the Quadruple test without inhibin-​A. All the tests include maternal age. Reproduced from Wald NJ, et al. (2004). SURUSS in perspective. Br J Obstet Gynaecol, 111, 521–​31, with permission from Wiley-​Blackwell. Table 2.12.2  Detection rate for a 5% false-​positive rate (DR5) according to relative risk between top and bottom fifths of the distribution in unaffected individuals Relative risk between top and bottom fifths of the distribution in unaffected individuals DR5 (%) 1 5 2 8 3 11 5 14 10 20 40 36 80 45 800 71 2000 79 10 000 89 Reproduced from Wald NJ, Hackshaw AK, Frost CD, ‘When can a risk factor be used as a worthwhile screening test?’, BMJ, 319, 1562–​65 © 1999 with permission from BMJ Publishing Group. Affected Affected a b Unaffected Unaffected (a) Likelihood ratio for groups (b) Likelihood ratio for individuals DR = 80% FPR = 1% LR = 80%/1% = 80 FPR = 1% LR = a/b = 12/1 = 12 DR = 80% Test variable (arbitrary units) 2 3 4 5 6 7 8 9 10 11 12 Fig. 2.12.4  Likelihood ratio for groups and for individuals. Reproduced from Wald, N. An Introduction to Epidemiology in Medicine. London: Royal Society of Medicine Press, 2004. 142 section 2  Background to medicine individual with a test result of 7 (arbitrary units) in Fig. 2.12.4b has a likelihood ratio of 12, and so OAPR 12 1:1000 12 :1000 1:1000 /12 1: 83. × = In this way the likelihood ratio is used to estimate the risk for an individual. Fig. 2.12.5, showing the distribution of diastolic blood pressure in men who did and did not subsequently die of a stroke, illustrates how a particular blood pressure measurement of, say, 105 mm Hg in a 70-​year-​old man, can be converted into a risk of developing a stroke. At a diastolic blood pressure of 105 mm Hg the likelihood ratio is 3. The annual risk of a fatal stroke in all 70-​year-​old men regardless of blood pressure is 2:1000 (about 0.2%), so if his dia- stolic blood pressure is 105 mm Hg the risk is 3 × 2:1000 or 6:1000 (about 0.6%). To establish whether a quantitative screening test is worthwhile, the overlapping distributions of the values of the screening test in people with and without the disorder must be examined. If the two distributions are widely separated, as in the example in Fig. 2.12.6 (ultrasound measurement of the diameter of the abdominal aorta as a screening test for aneurysm likely to rupture), the test is good. If they substantially overlap, as in the example in Fig. 2.12.7 (serum cholesterol as a screening test for future death from ischaemic heart disease or blood pressure as a screening test for stroke, Fig. 2.12.5), it is not. Detection rate cannot be determined Determining the detection rate is straightforward when all indi- viduals can be found to be either affected or unaffected. This is not always possible, notably in cancer screening, because if a lesion is found and a treatment carried out, one cannot know if that lesion would have become a clinical case had treatment not been given, or if it is ‘overdiagnosis’. The problem arises for any progressive dis- order in which the clinical outcome is not determined in a uniform way among all individuals, as would be the case if all screening re- search were initially observational without intervention dependent on the result of the screening test. Sometimes such an observational approach is possible, for example, by storing serum samples in a population of adults (without testing them at the time of collection), and later identifying those who did and did not develop a cancer, retrieving the serum samples, and testing them on a case–​control basis; this provides an unbiased estimate of the screening perform- ance of the test. Such an approach is not practical with tests based on imaging, such as mammograms, which could not ethically be taken and stored without being examined at the time. In such circum- stances, it may never be possible to know the screening performance of the test. The solution is to perform a randomized trial of screening (and treatment) versus no screening. If this shows that mortality from the disease is reduced, the combined effect of screening and treatment is known, though the relative contributions of the two in achieving the health benefit may not be. Cancer screening must prolong survival (the time between diag- nosis and death) to be effective, but because of two biases, prolonged survival alone is insufficient evidence that screening genuinely Men who die of stroke 3 1 Men who do not die of stroke Diastolic blood pressure (mm Hg) 40 50 60 70 80 90 100 110 120 130 140 150 Fig. 2.12.5  Likelihood ratio of a fatal stroke in a man with a diastolic blood pressure of 105 mm Hg. Reproduced from Wald, N. An Introduction to Epidemiology in Medicine. London: Royal Society of Medicine Press, 2004. Ruptured aneurysms (n = 163) Unruptured aortas (n = 3897) Maximum aortic diameter (cm) 0 2 4 6 8 10 12 14 16 Fig. 2.12.6  Aortic diameter and ruptured aortic aneurysm. The distribution of less than 2 cm is not real but simply represents the lower half of the Gaussian distribution of more than 2 cm, which is based on data. Data from Law MR, Morris J, Wald NJ (1994). Screening for abdominal aortic aneurysms. J Med Screen, 1, 110–​116. Died of IHD Did not die of IHD Serum cholesterol (mmol/litre) 15% 5% 2 3 4 5 6 7 8 9 10 11 Fig. 2.12.7  Relative distributions of serum cholesterol in men who subsequently died of ischaemic heart disease (IHD) and in men who did not. Reproduced from ‘A strategy to reduce cardiovascular disease by more than 80%’, NJ Wald & MR Law, British Medical Journal 2003, 326; 7404 with permission from BMJ Publishing Group Ltd. 2.12  Medical screening 143 improves prognosis. The first bias, lead time bias, is the prolongation of survival from bringing forward the date of diagnosis, even though the date of death is unchanged. The second bias, length time bias, arises because cancer screening involves periodic examinations (say 3-​yearly). So screening will detect slowly growing tumours more readily than rapidly growing ones because rapidly growing ones are more likely to develop and proceed to clinical presentation within the interval between two consecutive screening examinations, and thereby escape detection at screening. Survival with such rapidly growing screen-​detected cancers will inevitably be shorter than average. This is biased sampling. Both biases can be avoided by com- paring mortality from the specific cancer (the number of deaths divided by the number of people at risk) between screened and unscreened groups rather than comparing survival once a cancer is diagnosed. The biases are avoided because mortality measures deaths, whereas survival measures time. Disease-​specific mortality could be subject to bias if the screened and unscreened groups were at different risks of developing the disease. For example, women of higher socioeconomic status may be more likely to develop breast cancer and more likely to ac- cept screening. So breast cancer mortality could still be higher in screened women even if screening were effective. The only way to reliably avoid such selection bias is to carry out a randomized con- trolled trial to be sure that like is compared with like. Financial considerations Having determined that the first six requirements for a worthwhile screening test are met (see Box 2.12.1), the financial considerations need to be assessed. Screening programmes should seek to minimize the cost for a given outcome (i.e. to maximize the cost-​effectiveness). If the most medically effective form of screening is also the most cost-​ effective, it should be the programme of choice, provided it is affordable. If the best screening policy is not the most cost-​effective, a judgement is needed on whether the extra health gain justifies the extra cost. Facilities Medical screening effectively ‘creates’ patients by identifying indi- viduals at sufficient risk of a disorder to be offered further tests or treatment when they had no prior suggestion that they may have the disorder. This necessarily creates anxiety and a demand for medical attention, and an obligation to ensure that facilities exist for the ne- cessary investigation, treatment, and support. Screening should not be implemented until such arrangements have been made. Screening therefore needs to be offered in the context of programmes that are capable of meeting all the related needs of the people being screened. Acceptability Medical screening, including the treatment or remedy, must be ac- ceptable to the population concerned and to the professional staff involved. The purpose, the benefits, and the limitations of screening need to be understood and regarded as important from the perspec- tive of each individual who is offered screening. The decision not to be screened needs to be respected and programmes should not be driven by targets that set high uptake rates, though of course, if the rates are very low it would call into question the need for the screening programme. A  key element in the acceptability of screening is individual choice set against a justifiable trust in the medical system that offers screening. Requirements for a worthwhile public health screening programme See Box 2.12.2. Screening is a public health activity that should meet certain re- quirements that arise from a professional responsibility to achieve a collective health benefit. It is not the provision of a consumer commodity. Its purpose is to improve the health of individuals and thereby the health of the community. Once the requirements for a worthwhile screening test shown in Box 2.12.1 are met, there are four additional requirements for a worthwhile screening programme implemented as a public health service. These are summarized in Box 2.12.2. In public health terms, interventions that reduce exposure to the causes of disease should have priority over screening to detect early disease and offer treatment, but they are not mutually exclusive. A population approach correcting or reversing adverse risk factors is often more effective. For example, the human papillomavirus (HPV) vaccine is expected to steadily replace screening for cervical cancer after the next 40 years. Screening for specific disorders Antenatal screening See Table 2.12.3. Much of antenatal care is screening—​looking for problems be- fore they arise clinically. Detecting rises in blood pressure to warn of the risk of pre-​eclampsia (which may cause perinatal death and serious illness in the mother), and detecting maternal anti-​D anti- bodies to warn of rhesus haemolytic disease of the newborn, are two examples. The purpose of such screening is usually the welfare of the mother and fetus, but in antenatal screening there is the unusual situation in which some fetal disorders are so severe or potentially disabling to justify screening and diagnosis, and the offer of a ter- mination of pregnancy. Antenatal screening for open neural tube defects, Down’s syndrome, severe congenital heart malformations, and severe, incurable single-​gene disorders are examples. Screening for four infections is worthwhile (syphilis, HIV, hepa- titis B, and bacteriuria) because they may not be clinically apparent in the mother but can cause serious preventable illness in the neo- nate (either immediately or in later life). Prognosis is substantially improved if the infection can be detected in the mother and Box 2.12.2  Requirements for a screening programme implemented as a public health service 1 Equitable: equal access to screening services 2 Organized: individuals are offered screening in an organized manner according to a specified protocol and with relevant information pro- vided to permit an informed choice 3 Comprehensive: screening is the first step in a programme of service and care that includes counselling screen positives, diagnosis, support, and treatment 4 Monitored and auditable:  key aspects of the programme should be monitored so that remedial steps can be taken if they are below standard 144 section 2  Background to medicine Table 2.12.3  Summary of antenatal screening tests of proven value Disorder Approximate natural birth prevalence (per 10 000) in UK Primary screening test Secondary screening test(s) (if available) Detection rate (%) False- positive rate (%) Odds of being affected given a positive result Diagnostic test Intervention 1°ry screening test 2°ry screening test Autosomal or sex-linked recessive disorders Cystic fibrosis 4 Test for CF mutation in both parents (‘couple screening’) 72 0.09 1:3 – CVS or amniocentesis a Sickle cell disease 3 Ethnic origin enquiry (Afro-Caribbean) Sickling test; Hb electrophoresis in mother, and in father if positive in mother 99 3 1:100 1:3 CVS or amniocentesis a β-Thalassaemia 6 Red cell MCV or MCH in mother Hb A2 assay in mother, and in father if positive in mother 89 7 1:125 1:3 CVS or amniocentesis a Tay–Sachs disease 0.04 Ethnic origin enquiry (Ashkenazi Jew) Hexoseaminidase assays in father, and mother if positive in father 50 1 1:3600 1:3 CVS or amniocentesis a Haemolytic disease of the newborn (D-antigen of Rh system) 40 Rh grouping and test for antibody in mother Rh grouping of father; quantitation of maternal antibody 100 16 1:31 1:26 CVS or amniocentesis Intrauterine transfusion, early delivery with exchange transfusion Haemophilia 0.5 Recognition of affected male relative (carrier detection) Test for mutation in mother 55 <0.01 1:35 1:3 CVS or amniocentesis a Chromosomal disorders Down’s syndrome (Trisomy 21) 18 Integrated 1st and 2nd trimester 89 1.0 1:5 CVS or amniocentesis a 1st trimester alone 77 1.0 1:6 CVS or amniocentesis a 2nd trimester alone 70 1.0 1:6 CVS or amniocentesis a Reflex DNA screening 91 0.05 4:1 Amniocentesis Trisomy 18 2.3 Integrated 1st and 2nd trimester 92 0.2 1:3 CVS or amniocentesis 1st trimester alone 89 0.2 1:3 CVS or amniocentesis 2nd trimester alone 58 0.2 1:5 CVS or amniocentesis Reflex DNA screening 89 0.05 1:1 Amniocentesis Trisomy 13 1.4 Integrated 1st and 2nd trimester 72 0.2 1:12 CVS or amniocentesis 1st trimester alone 73 0.2 1:11 CVS or amniocentesis 2nd trimester alone 20 0.2 1:42 CVS or amniocentesis Reflex DNA screening 79 0.05 1:3 Amniocentesis 2.12  Medical screening 145 Other congenital malformations Spina bifida (open) 8.5 Maternal serum AFP assay Ultrasound 87 0.5 1:4 Amniotic fluid acetylcholinesterase + repeat ultrasound a Anencephaly 10 Ultrasound 100 0 1:0 – Independent conf irmation a Severe cardiac malformations 20 Ultrasound 46 ≤0.6 ≥1:6 – Independent conf irmation a Infections transmitted from mother to fetus Congenital rubella syndromeb 0.12 Absent antibodies in mother 90 1.6 <1:1300 – None Vaccinate mother after delivery to protect subsequent pregnancies Congenital syphilis 0.2 VDRL test or flocculation test in mother Specific treponemal test in mother 90 0.2 1:100 1:50 None Penicillin AIDS 1 ELISA test for IgG antibody in mother (repeated on same sample if positive) ELISA test on repeat sample 99.9 0.13 1:13 1:<5 None Antiretroviral drugs to mother and infant Hepatitis B causing hepatoma and chronic liver disease 1.4 ELISA test for HBsAg in mother (repeated if positive) ≥98 0.14 1:10 None Recombinant vaccine to neonate, hepatitis B immunoglobin at birth except when mother has antibodies to e antigen Maternal bacteruria causing pyelonephritisc 200 Urine culture 76 4 1:4 None Antibiotics to mother Noninfectious maternal disease affecting fetus Maternal high blood pressure/ pre-eclampsia causing perinatal death 93 (rate of all perinatal deaths) Maternal blood pressure measurement Test for proteinuria 38 (of all perinatal deaths) 30 1:77 1:41 None Blood pressure lowering drugs a Information on disorder and its prognosis, counselling, termination of pregnancy, or preparation for birth of affected child, advice on risk of recurrence. b Worthwhile only with low uptake of childhood rubella vaccination in community. c May cause low birthweight or fetal death. AFP, α-fetoprotein; CVS, chorionic villus sampling. Adapted from Wald NJ, Leck I, eds. Antenatal and Neonatal Screening (2nd ed). (2000) Oxford University Press, Oxford. 146 section 2  Background to medicine appropriate treatment given to the mother before birth, the neonate at birth, or both. Routine screening for rubella syndrome in preg- nancy is generally not worthwhile because it cannot prevent the dis- order in the pregnancy screened; it can only lead to vaccination after birth in women without rubella antibodies. The preferred method of prevention is childhood vaccination. In recent decades antenatal screening has taken on a scientific methodology and rigour that has permitted the development of screening programmes that are now standard throughout the world. The first such initiative arose with antenatal screening for open neural tube defects, first by measurement of maternal serum AFP and later by ultrasonography, which is used with AFP in many places and has replaced it in some. Screening now detects virtually all cases of anencephaly with scarcely any false positives, and 87% of cases of open spina bifida with a false-​positive rate of less than 1%. The birth prevalence of neural tube defects in Britain has declined by over 90% from 1 in 250 births in the early 1970s to less than 1 in 2500 now, due in part to screening, in part to an increase in folate intake through food and vitamin supplements. Until the 1980s, antenatal screening for Down’s syndrome (trisomy 21) was based on maternal age. In 1988 the triple test was described, based on combining second trimester serum markers with maternal age. Fig. 2.12.3 shows the subsequent improvement in screening per- formance as the number of available markers increased over time. The integrated test can detect about 85% of affected pregnancies for a false-​positive rate of only 0.9%; the low false-​positive rate is important because women with positive results usually have an amniocentesis, which may induce the miscarriage of a healthy fetus. Combining markers to obtain a single test result for an individual involves the multiplying of the likelihood ratios for each marker in that individual (as in Fig. 2.12.4a), allowing for any correlation between them (con- sidered separately among affected and unaffected individuals). So, for example, in the simple situation of three independent screening markers that correspond to likelihood ratios of 3, 4, and 5, the com- bined likelihood ratio is 60 (3 × 4 × 5). To determine the screening performance of tests based on multiple markers, a hypothetical popu- lation of screened individuals is generated and the combined like- lihood ratio for each individual calculated and converted to risk by multiplying it by prevalence expressed as an odds. The overlapping distributions of risk in affected and unaffected individuals are plotted, determining detection rates for specified false-​positive rates in the same way as for a single screening marker. Then risk itself becomes the screening variable—​which is convenient, because it is exactly what is needed in reporting results to screened individuals. Most screening markers associated with Down’s syndrome vary with gestational age, so a high level at one gestational age could be low at another. A widely used advance in screening is to express all values as ‘multiples of the median’ (MoM) for unaffected (or all) screened in- dividuals at a specified gestational age, so that 1.0 MoM represents the median value (‘normal’), 2.0 MoM is twice ‘normal’, and 0.5 MoM is half ‘normal’. The MoM has the advantages that as a ratio it is unitless and so avoids the need to specify the original units of measurement (which vary from centre to centre), that it automatically adjusts for gestation, and that it indicates how high or low a particular value is. In pregnancy, fragments of placental DNA (which reflect the DNA of the fetus) are shed into the maternal circulation and mix with fragments of maternal DNA shed from maternal cells in an approximate ratio of 1:10. DNA analysis, such as counting the number of cell-​free DNA fragments that map to chromosome 21, has been used as an antenatal screening test for Down’s syndrome, with a high screening perform- ance (detection rate 98–​99%, false-​positive rate 0.2%), but there is a test failure rate of a few per cent, partly due to a lack of placental DNA in the maternal plasma. At present the test is costly and labour intensive which has tended to preclude its use as a universal screening test. DNA analysis has become part of routine antenatal screening for Down’s syndrome and other chromosome disorders (trisomy 18 and 13). A cost-effective screening strategy that has been introduced is antenatal reflex DNA screening in which women who accept screening have a conventional first trimester test. Women are screen-negative unless the first trimester markers yield a risk of having a pregnancy with trisomy 21, 18 or 13 above a certain level (eg ≥ 1 in 800 comprising about 10% of the women) and then automatically have a DNA sequencing ana- lysis using stored plasma from the original blood sample (i.e. a reflex analysis), thereby avoiding the need to recall them for counselling and another blood collection. The conventional test markers and the DNA analysis are considered together as a single test. Neonatal screening Neonatal screening for phenylketonuria, one of the first population-​ wide screening programmes to be introduced, has proved to be ef- fective and worthwhile in spite of the rarity of the disorder (about 1 per 10 000 births). A low-​phenylalanine diet prevents severe mental retard- ation in affected infants. Neonatal screening has prevented cretinism, which is now extremely rare. Additional screening tests could be added to the blood already collected for phenylketonuria and hypothyroidism screening (e.g. MCADD; see Table 2.12.4), and may be justified for other inborn errors of metabolism, given that much of the cost and ef- fort is already spent. However, a line needs to be drawn; tandem mass spectrometry can identify over 40 disorders, but only a handful jus- tify screening as defined. Neonatal screening for congenital deafness is worthwhile and has recently been introduced in the United Kingdom using technology that does not rely on voluntary subject response to noise, thus making it possible to test for hearing deficit in infancy. Screening for congenital dislocation of the hip has been widely prac- tised for many years, without good evidence of efficacy. Galactosaemia (an autosomal recessive inborn error of metabolism) may cause ser- ious illness in the neonate, including septicaemia and encephalop- athy, and cognitive impairment in later life, but it has not been shown that neonatal screening prevents these effects. Neonatal screening for cystic fibrosis has been introduced in some places without evidence that screening reduces the incidence or severity of the associated lung disease, the main cause of disability and death from cystic fibrosis. Screening in childhood Children are examined routinely to see if they are gaining weight and height as expected and to assess their hearing and vision. There is no evidence that systematic examination of children achieves greater health benefits than encouraging parents to take their child to a doctor if they are concerned, but nonetheless much such activity has taken place. In spite of the lack of formal evidence, it is probably sensible to check the visual acuity of children on starting school, as is current practice in many places. Unfortunately, the lack of evidence to sup- port screening in childhood is often camouflaged in the term ‘child- hood surveillance’. As with all screening, evidence of benefit should be sought before acceptance, even if this requires large-​scale studies. One disorder that merits screening, is screening for familial hyper- cholesterolaemia, an inherited disorder with a prevalence of about 4 per 1000 that leads to early cardiovascular disease. Parents can be 2.12  Medical screening 147 Table 2.12.4  Summary of neonatal screening tests of proven value Disorder Approximate natural prevalence (per 10 000 births) in UK Primary screening test Secondary screening test(s) Detection rate (%) False-​positive rate (%) Odds of being affected given a positive result Diagnostic test Intervention 1°ry screening test 1°ry and 2°ry screening tests Congenital hypothyroidism 3 T4 or TSH assay before hospital discharge TSH and T4 at 5–​7 days 100 20 1:668 1:19 Clinical examination, T4 , free T4 , TSH, thyroid scan Thyroxine Phenylketonuria 1 Serum phenylalanine assay Repeated serum phenylalanine assay 100 0.2 1:22 1:0.05 High plasma phenylalanine (>16.5 mg/​dl) using quantitative technique; exclusion of biopterin defects Diet low in phenylalanine Medium chain acyl CoA dehydrogenase deficiency (MCADD) 1 Tandem mass spectrometry (together with PKU) 100 0 1:0 Repeat test Avoidance of fasting, prompt treatment of minor illnesses Deafness 14 Transient evoked otoacoustic emissions (TEORE) Automated auditory brainstem response (AABR) 80 0.6% 1:5 Repeat test Hearing aid or cochlear implant Adapted from Wald NJ, Leck I, eds. Antenatal and Neonatal Screening (2nd ed). (2000) Oxford University Press, Oxford. 148 section 2  Background to medicine screened at the same time in child-parent screening for familial hyper- cholesterolaemia. The method uses the timing of childhood vaccin- ation as a convenient turnstile when a cholesterol measurement is most discriminatory for familial hypercholesterolaemia (1–2 years of age) and a cholesterol measurement used in combination with a DNA mutation analysis. The parents of affected children are offered testing because as the disorder is inherited as an autosomal dominant disorder one parent of an affected child will also be affected. The af- fected parent can be offered preventive statin therapy immediately and the child after the age of ten. The method leads naturally to cas- cade testing in which close relatives are tested: half of all first degree relatives being affected. This protocol has been shown to be feasible and acceptable in a national demonstration project. This method of screening is being considered by various public health agencies. Adult screening Perhaps surprisingly, only a few disorders justify medical screening in adults. These are summarized in Table 2.12.5. Cancers Three cancers meet the screening requirements: breast, cervical, and colorectal. Cervical cancer screening illustrates the principle that effective adult screening programmes require a population age–​sex register. Everyone in the appropriate age–​sex group for screening can then be identified and sent written invitations at appropriate intervals. Formerly, cervical screening was carried out ‘oppor- tunistically’ when women happened to consult doctors, and such screening failed because younger women, at lower risk, see doc- tors more frequently than older women, at higher risk, so cervical smears were carried out on the low-​risk group, and at more frequent intervals than necessary for effective screening. It was only with the introduction of a systematic screening programme based on age–​sex registers that most older women were screened and cervical cancer mortality fell appreciably in the United Kingdom and other Western countries. Now women are invited 3-​yearly between the ages of 25 and 49, and 5-​yearly between the ages of 50 and 64. Screening in the United Kingdom is based on testing a cervical brush sample for human papillomavirus (HPV) followed by cytological examination on the same sample if the HPV test is positive (reflex testing). The evidence on efficacy comes from non-​randomized studies: screening reduces mortality from cervical cancer by about 80%. Mammographic breast cancer screening is offered in the United Kingdom at 3-​yearly intervals to women aged 50 to 70 (though the age range may soon extend down to age 47 and up to 73). Randomized trials have shown that it reduces breast cancer mortality, by about a fifth in a population offered screening or a third in women who accept screening. Manual breast self-​examination by women to screen for breast cancer has been shown in randomized trials not to significantly reduce mortality, an observation which illustrates that screening for cancer and other diseases should not be practised simply because it seems intuitively useful: rigorous evidence on efficacy is needed. A colorectal cancer screening programme based on 2-​yearly faecal occult blood testing in men and women aged 60 to 70 has been shown in randomized trials to reduce colorectal cancer mortality by about 15% in a population offered screening. Population screening has been introduced in the United Kingdom. A second screening procedure further reduces colorectal cancer mortality. Two randomized trials of once-​only flexible sigmoid- oscopy versus no intervention, with identification and resection of colonic polyps, showed a reduction in colorectal cancer mortality of about 30% in a population offered screening, or about 40% in people who attended for screening. There are no published randomized trials of colonoscopy but case–​control (observational) studies indi- cate that the mortality reduction is little or no greater than with flex- ible sigmoidoscopy (as fewer cancers occur in the ascending colon and few of the ones that do seem to be prevented). A meta-​analysis of cross-​sectional studies in people who had both CT colonography and colonoscopy showed that CT detected 48%, 70%, and 84% of all polyps less than 6, 6–​9, and more than 9 mm, with a false-​positive rate of 7% (i.e. 7% of unaffected people require colonoscopy). This suggests that CT may be an acceptable surrogate for flexible sigmoid- oscopy or colonoscopy: it may be a little less effective in preventing colorectal cancer, but it is less invasive. Chest radiography to screen for lung cancer has been shown in randomized trials not to significantly reduce mortality. However, low-​dose spiral computed tomographic screening has been shown in a randomized trial to reduce mortality from lung cancer, by about 20%. The screened group had yearly CT over three years with 6.5 years follow-​up; the control group had single view chest X-​rays, but since these have been shown to be ineffective the trial can be interpreted as CT versus no screening. However, the false-​positive rate was high; about 22% of people without lung cancer had positive screening tests. Much of the diagnostic uncertainty was resolved with further imaging examinations, but about 4% of participants without lung cancer had bronchoscopy, thoracoscopy, or thoracotomy. There is also uncertainty as to whether screening is cost-​effective, and con- cern that smokers may derive false reassurance from negative screens and continue to smoke. Launching lung cancer population screening programmes by identifying smokers and offering them a spiral CT examination, while effective, remains an issue for discussion. In systematic population-​based cancer screening programmes, only people within a relatively narrow age range are invited for tests (effectively, age is used as the initial screening enquiry). Cancer screening tends to be most effective around the age of 60 in terms of cost per year of life saved; the lower incidence of cancer in younger people, and the shorter life expectancy in older people, mean that fewer years of life will be gained for the same number of people screened. The justification for a narrow age range is economic. Older people are not turned away, however, and the age range over which women are invited for mammographic screening, for example, widened over time (it was originally 50–​64). Usually it is not ap- propriate to stop inviting people for screening examinations above a certain age; if they are fit enough and willing to attend for screening examinations, they are suitable candidates for screening. Cancer screening is generally conducted at 2-​ to 3-​yearly intervals; in prin- ciple, more frequent screening would detect more cancers but the yield per 1000 screening examinations would be lower. Screening for prostate cancer, mainly through measurement of serum prostate-​specific antigen (PSA) was introduced into medical practice with no evidence of reduction in mortality. PSA can distin- guish between individuals who will and will not die of prostate cancer. However, discrimination weakens as the interval between the PSA test and clinical presentation or death from the cancer lengthens. By the time the PSA test is highly discriminatory, the disease may be too far advanced for treatment to be effective. The usual cut-​off levels pro- posed for PSA screening (c.4 ng/​ml) lead to a high proportion of older men being positive. A prostate biopsy in these individuals is often positive, because 25% of prostates in men aged 70 have histological 2.12  Medical screening 149 Table 2.12.5  Summary of adult screening for selected disorders Disorder Prevalence Screening procedure Age range Subsequent investigation Detection rate Positive rate Odds of disorder in screen positives Uptake of screening Treatment Reduction in disease Breast cancer 4% of all deaths (women) Mammography 2–​3-​yearly 50+ Further imaging; fine needle biopsy Not applicable 8% first screen, 4% subsequent; biopsy rate 0.8% 1:6 (2:1 among women biopsied) 70–​80% Surgery (± chemotherapy, radiotherapy) 24% reduction in mortality at age 50–​74; 16% at age 40–​49 (from meta-​analyses of randomized trials) Colorectal cancer 3% of all deaths (men and women) Faecal occult blood testing 2-​yearly 60+ Colonoscopy ± barium enema Not applicable 2–​3% 1:10 50–​60% Surgery 15–​18% reduction in mortality (from two randomized trials) Cervical cancer 0.5% of all deaths (women) Cervical smear ± HPV testing 3–​5 yearly 25+ Repeat smear in 6 months (mild dyskaryosis); colposcopy (moderate/​severe dyskaryosis) Not applicable 5–​10% (higher in younger than older women), lower with HPV test and smear –​ 80% Local ablation or excision (rarely hysterectomy) 90% reduction in mortality (from case–​control studies) Diabetic retinopathy Proliferative retinopathy 50% IDDM 50% NIDDM Macular oedema 15% IDDM 10% NIDDM Retinal photography with mydriatic yearly All Assessment by ophthalmologist 78% 0 50% if done in hospital clinics Photocoagulation Reduction in blindness >90% (proliferative retinopathy) 65% (macular oedema) Abdominal aortic aneurysm rupture Men aged 65+ 2% of all deaths 7% have aortic diameter ≥3.0 cm Ultrasound scan 65 (men) CT or MRI 86% 0.6% 75% Open surgery Chlamydia trachomatis genital infection (subsequently causing PID) Chlamydia: 5% among women under 25 PID 2% Nucleic acid amplification test on urine sample <25 (sexually active) –​ 90–​95% <1% 64% Doxycycline or azithromycin 56% reduction in PID HPV, human papillomavirus; IDDM, type 1 diabetes; NIDDM, type 2 diabetes; PID, pelvic inflammatory disease (Ashton et al., 2002). 150 section 2  Background to medicine evidence of cancer even though only a small minority of these men will suffer from or die of the disease. Such ‘overdiagnosis’ (the diag- nosis of cancers that would otherwise never have come to clinical at- tention) is a potentially serious problem in cancer screening. These cancers are best never diagnosed; once diagnosed, anxiety and un- necessary hazardous investigation and treatment will ensue. In 2009, two randomized trials of PSA screening for prostate cancer were reported, one showing a significant (p = 0.01) 20% re- duction in prostate cancer mortality in men invited for screening (27% in those who were screened) and the other one showing a non​significant increase, but consistent with a 15% reduction. Both trials showed a high rate of overdiagnosis; in the larger of the two trials, for every one prostate cancer death prevented, 1410 men were screened, of whom 16% (230) had a biopsy, identifying 49 prostate cancers of which 48 were treated unnecessarily. Taking the two trials together, screening for prostate cancer by PSA testing probably does reduce prostate cancer mortality. Subsequent smaller trials have shown similar results. The reduction, however, may not be judged sufficient to warrant the level of overdiagnosis which leads to many men receiving unnecessary hazardous treatment. A randomized trial of ovarian cancer screening using ultrasound examination of the ovaries and measurement of a serum protein marker (CA125), found no statistically significant difference be- tween women randomized to screening and those randomized to ‘no screening’. While the efficacy of such screening cannot be com- pletely excluded the evidence suggests that, if there is an effect, it is small. A cancer screening programme that is currently under inves- tigation is screening for future stomach cancer by identifying people with Helicobacter pylori infection of the stomach. Non​malignant diseases Screening for abdominal aortic aneurysms that, in the absence of surgery, are likely to rupture, by the ultrasound measurement of the aortic diameter, is worthwhile. The test is very discriminatory (see Fig. 2.12.6). Ruptured abdominal aortic aneurysms account for 2% of all deaths in men over 65, but are rare when the maximal aortic diameter is less than 5 cm. In the United Kingdom a screening pro- gramme based on abdominal aortic diameter using ultrasound is in progress for men aged 65 (rupture is rare in younger men). Over all ages ruptured abdominal aortic aneurysm is about twice as common in men as in women. Mortality rates for women for women are similar to those in men about 10 years younger. Most men will need only a single scan in the year in which they reach 65. Screening people with diabetes for retinopathy using retinal pho- tography is very effective; it has been shown in randomized trials to reduce blindness by 90% with proliferative retinopathy and 65% with macular oedema. A national screening programme operates in the United Kingdom, based on inviting people from diabetic regis- ters held in general practice. Chlamydia infection in young women causes pelvic inflammatory disease (which may be complicated by chronic pelvic pain, ectopic pregnancy, and tubal infertility and, when giving birth, causes neo- natal eye and lung damage). Screening for chlamydia infection based on urine samples is followed by short term antibiotic treatment and is effective. Screening women under 25 has been recommended but no systematic screening programme has been introduced in the United Kingdom. Much screening activity falls under the category of ‘risk factor screening’ and such screening tends to be ineffective (e.g. cholesterol testing in screening for future ischaemic heart disease events; see Fig. 2.12.7), blood pressure measurement in screening for future stroke, and bone density measurement as a screening test for future hip fractures. The problem arises because, for the reasons given here, risk factors that may be important causes of disease are usually poor screening tests. Most adults have high serum cholesterol and high blood pressure relative to levels in young adults (say at age 20), and all postmenopausal women have low bone density relative to pre- menopausal women, so nearly all older adults are ‘exposed’. Fig. 2.12.8 shows the effect of combining different markers on the detection rate and false-​positive rate where several markers that each have a detection rate for a 5% false-​positive rate (DR5) of 10%, 15%, or 20% and the standard deviation is the same in affected and unaffected individuals. Only when tests individually have a DR5 of about 20% or greater will multiple marker screening become a real- istic proposition. For example, combining five relatively weak inde- pendent markers, each with a DR5 of 15%, yields only a 40% overall detection rate for a 5% false-​positive rate, and combining ten yields a 60% detection rate for the same 5% false-​positive rate. At present, screening for future coronary disease and most other diseases using causal risk factors is not effective because even in combination they are not sufficiently discriminatory. Hypothyroidism in adults is widely regarded as a preventable cause of lethargy and depression. This has prompted attempts at screening for this disorder by measuring levels of thyroxine (T4) or thyroid-​stimulating hormone (TSH) and classifying individ- uals as positive if TSH is above or T4 below the relevant reference range (which is usually the 95th centile range in the population). This is an example of the ‘tautological screening’ that arises from defining a disorder in terms of the test used to screen for it (see earlier). The solution to this circularity is to identify individuals from a population with TSH or T4 outside specified TSH or T4 limits and then offer each, in random order, thyroxine or placebo to determine whether thyroxine treatment relieves the symptoms more often than can be explained by chance. Each person is therefore their own control, and the response to treatment defines the clinical disorder. Such a cross-over randomized trial has been 100% 80% Number of screening markers 0 5 Detection rate for multiple markers 10 15 20 25 30 35 40 60% 40% 20% 0% Fig. 2.12.8  Overall screening performance from combining individual screening markers: detection rate for a 5% false-​positive rate (DR5) according to the number of screening markers combined that individually have a DR5 of 10% or 15% or 20%. Reproduced from Wald N, Morris J and Rish S, ‘The efficacy of combining several risk factors as a screening test’, J Med Screen 2005; 12: 197–​201. London: Royal Society of Medicine Press, 2005. 2.12  Medical screening 151 done and showed that screening for hypothyroidism is worthwhile. In screening the same approach should be used to identify which individuals will benefit from treatment. Clarity of terminology and purpose Certain terms used in screening are probably best avoided because they lack clarity. The term ‘carrier screening’ implies that carriers of autosomal recessive disorders (e.g. cystic fibrosis) themselves have a disease; they do not. The goal of such screening is to identify couples who are both carriers. ‘Couple screening’ involves collecting samples from both parents and reporting a positive result only when both are carriers. The term ‘genetic screening’ lacks clarity and tends to imply screening for inherited disorders even though some genetic disorders that are screened for (e.g. Down’s syndrome) are usually not inherited. The term creates a false impression that something special is being offered that other forms of screening lack. For many people genes and consequently all things genetic are seen as highly determinant, even inevitable, influences, which is usually not the case. Genetic markers of a disease are in most instances too insensitive and too non​specific for screening purposes. The term ‘case finding’ often implies the identifica- tion of cases of the disorder being screened for, while in fact it identifies individuals with a positive screening test for that disorder. For example, a case of ‘hypertension’ relates to the test result (high blood pressure), not the diseases it causes. The term ‘opportunistic screening’ is a eu- phemism for non​systematic and non​organized screening. The purpose of medical screening is clear—​to avoid disability and premature death at an acceptable level of safety. Determining efficacy is essential. Many screening tests are effective and should be part of public health practice. But particular care is needed in evaluating tests that arise out of technological development in the absence of a clear case of medical need. Whole-​body scanning using MRI and fetal ultra- sound examination are examples. Such screening, without defining the specific disorders being screened for, detects ‘incidentalomas’ (so-​ called ‘abnormal’ findings with little or no knowledge of their med- ical significance). There is no place for such screening in responsible medical practice. For example, total body MRI scanning is now ad- vertised to the public as a screening test with little attention paid to whether it prevents serious disability or death, or meets the criteria set out in Box 2.12.1. A routine fetal anomalies scan at about 18 weeks of pregnancy has some proven specific applications (e.g. the detection of anencephaly, severe congenital heart disease, and placenta praevia ex- tending to cover the internal cervical os), but the term ‘fetal anomaly screening’ lacks specificity. The challenge in performing a scan is to seek these specific anomalies but not to report other ‘incidentalomas’ which will undoubtedly lead to parental anxiety and further investigation but for which early detection has not been shown to be worthwhile. Under the ambiguous heading of genetic screening, so-​called ‘gene chips’ have been developed, that can detect in one test many hundreds of genetic mutations with little or no evidence that knowledge of these will lead to useful medical intervention that will improve the health and quality of lives of the people being so tested. Medical screening needs to be driven by the medical need, not the technological capacity. Doctors have a professional responsibility to discourage techno- logically driven screening and to ensure that all screening meets the requirements set out in Box 2.12.1. Screening promoted only in terms of the application of a particular technology should not be part of medical practice. FURTHER READING Abu-Helalah M, Law MR, Bestwick JP, Monson JP, Wald NJ (2011). A randomized double blind cross-over trial to investigate the efficacy of screening for adult hypothyroidism. J Med Screen, 17, 164–9. Ashton HA, et al. (2002). The Multicentre Aneurysm Screening Study (MASS) into the effect of abdominal aortic aneurysm screening on mortality in men: a randomised controlled trial. Lancet, 360, 1531–​9. Atkins WS, et al. (2010). Once-​only flexible sigmoidoscopy screening in prevention of colorectal cancer: a multicentre randomised con- trolled trial. Lancet, 375, 1624–​33. Breslow N, et al. (1977). Latent carcinoma of prostate at autopsy in seven areas. Int J Cancer, 20, 680–​88. Holland WW, Stewart S (2005). Screening in disease prevention. Nuffield Trust, Radcliffe Publishing, Oxford. Journal of Medical Screening:  screening briefs (1994a, 1994b, 1995, 1996, 1997). J Med Screen, 1, 73; 1, 255; 2, 126; 3, 110; 4, 54. Law MR, Morris J, Wald NJ (1994). Screening for abdominal aortic aneurysms. J Med Screen, 1, 110–​16. McKeown T (1968). Validation of screening procedures. In: Screening in medical care. Reviewing the evidence. Nuffield Provincial Hospital Trust, Oxford University Press, Oxford. Mulhall BP, Veerappan GR, Jackson JL (2005). Meta-​analysis: com- puted tomographic colonography. Ann Intern Med, 142, 635–​50. National Lung Screening Trial Research Team (2011). Reduced lung-​ cancer mortality with low-​dose computed tomographic screening. N Engl J Med, 365, 395–​409. Palomaki GE, et  al. (2012). DNA sequencing of maternal plasma reliably identifies trisomy 18 and trisomy 13 as well as Down syndrome:  an international collaborative study. Genet Med, 14, 296–​305. Scoen RE, et  al. (2012). Colorectal-​cancer incidence and mortality with screening flexible sigmoidoscopy. N Engl J Med, 366, 2345–​57. Thorner RM, Remein QR (1961). Principles and procedures in the eval- uation of screening for disease. Public Health Monograph No 67. Public Health Service Publication No 846. US Department of Health Education and Welfare, Washington, DC. Wald DS, Bestwick JP, Morris JK, Whyte K, Jenkins L, Wald NJ (2016). Child-parent familial hypercholesterolemia screening in primary care. N Engl J Med, 375, 1628–37. Wald NJ (1994). Guidance on terminology. J Med Screen, 1, 76. Wald NJ (2004). The epidemiological approach: an introduction to epi- demiology in medicine, 4th edition. Wolfson Institute of Preventive Medicine/​Royal Society of Medicine Press, London. Wald NJ, Cuckle H (1989). Reporting the assessment of screening and diagnostic tests. Br J Obstet Gynecol, 96, 389–​96. Wald NJ, Hackshaw AK, Frost CD (1999). When can a risk factor be used as a worthwhile screening test? BMJ, 319, 1562–​5. Wald NJ, Leck I (eds) (2000). Antenatal and neonatal screening, 2nd edition. Oxford University Press, Oxford. Wald NJ, Law MR (2003). A strategy to reduce cardiovascular disease by more than 80%. BMJ, 326, 1419–​23. Wald NJ, et al. (2004). SURUSS in perspective. Br J Obstet Gynaecol, 111, 521–​31. Wald NJ, Morris JK, Rish S (2005). The efficacy of combining several risk factors as a screening test. J Med Screen, 12, 197–​201. Wald NJ, et al. (2018). Prenatal reflex DNA screening for trisomies 21, 18, and 13. Genet Med, 20, 825–30. Wilson JMS, Jungner G (1968). Principles and practice of screening for disease. WHO Public Health Paper No. 34, World Health Organization, Geneva. 2.13 Health promotion 152 2.13 Health promotion 152 ESSENTIALS The ‘Ottawa Charter for Health Promotion’ (1986) remains a bench- mark for the global health promotion community, but the context for health promotion has changed with increasing recognition of the significance of inequities in health. Health promotion is a key strategy to deal with the social determinants of health that create these inequities. Attention has shifted from the mere recognition that all public policies may impact on health to active strategies and actions to move health concerns into all policies. Clinicians are key actors in shaping social and cultural priorities and beliefs: they should be committed to the reduction of health inequity, with health promotion as a core commitment and responsibility. ‘Why treat people, and then send them back to the conditions that made them sick?’ (Sir Michael Marmot). Introduction Investing in better health and greater resilience of patients, families, and communities is a key responsibility of the health professional. This is a challenge that is quite different from providing services and interventions to return sick individuals to health, or engage in pri- mary prevention efforts (see Chapter 2.11) for those at immediate risk of developing disease. Building the health of individuals and populations has tradition- ally been the remit of public health services, the evolution of which has been described as having happened in waves, but rather than a new development superseding and replacing an earlier one, in public health the different ideas have built on, and complemented, each other. Fig. 2.13.1 shows the five public health developments since the early 1800s. Through the structural stage (e.g. installation of sanitation hard- ware) and biomedical (e.g. surveillance and infection control) as well as clinical (e.g. vaccination, screening) stages, public health—​by the end of the 20th century—​embraced health as a social value and a social and economic resource, rather than just an individual attribute. Some parts of the global community have not wholly advanced along all these five waves. In remote areas, even in wealthy countries, the full social and cultural dimensions of health often have not at- tained the same stature as they have in urban environments. Cities generally, even in middle-​ and low-​income countries, offer oppor- tunities for health that are better than elsewhere. With the growth of the health industry (in many OECD countries the most important contributor to local economies) it seems we have moved even beyond this ‘social model of health’ and are embracing a ‘health society’. In this society the pursuit of health is a lofty goal and one that yields significant commercial and political interest. Health has become culture. Yet, at the same time not everyone bene- fits equitably from this health society. In fact, the health inequity gap is growing, and remains persistent even in the most egalitarian societies. This chapter considers the components of modern health promo- tion that respond to and address the health society. Health promo- tion, as a concept, emerged from a more behaviourist approach to health development called health education. In many countries and cultures, the two concepts are still considered synonymous. The Ottawa Charter for Health Promotion The evolution of public health as described earlier, along with secular social change, created impetus for the World Health Organization, the Canadian Public Health Association, and Health Canada (the Canadian federal ministry of health) to convene an international conference in 1986. This conference on health promotion, with the subtitle ‘the move towards a new public health’ (Fig. 2.13.2) de- scribed priorities for health promotion in an ‘Ottawa Charter for Health Promotion’. This Charter has remained a benchmark for the global health promotion community. Follow-​up global conferences (https://www.who.int/healthpromotion/conferences/en/) fine-​tuned the components of the Charter and updated them to new contexts. Historical developments The development of the idea of health promotion was the result of changes in society, and our thinking about society, since the 1960s. Social movements emerged that claimed greater control over the plight and destinies of certain populations. The women’s health movement, for instance, put gender on social and clinical agendas 2.13 Health promotion Evelyne de Leeuw 2.13  Health promotion 153 and expected full involvement and control of women in their own health management. Since the early 1980s the scourge of HIV/​AIDS created new forms of health activism. Greater attention was paid to critical analyses of the workings of society and its core insti- tutions, including clinical medicine, for which terms like the ‘medical-​industrial complex’ were coined. This was reflected in philosophy and social science, and applying such gazes to the world of health led, for instance, to the ‘discovery’ of iatrogenesis (chal- lenges to health, or exacerbated disease conditions, created by the actions of the disease care system). In medical sociology a breakthrough development was Aaron Antonovsky’s work on ‘salutogenesis’, the phenomenon in which health is built (rather than disease prevented) by shaping condi- tions for greater ‘sense of coherence’, which has three components. First, comprehensibility, the sense of understanding events in life and feeling able to predict what is likely to happen in the future. Secondly, manageability, a belief that you will be able to take care of things that might happen. Thirdly, meaningfulness, a belief that things are worthwhile and that there is good reason to care about what happens. Antonovsky found, in his research with Holocaust survivors, that individuals with a greater sense of coherence suc- cumb less easily to disease and chronic health conditions. In the health promotion field, this is seen as ‘positive health’. It suggests that clinical intervention to ‘fix’ disease or infirmity and restore health is only the first stage on a continuum towards positive health. Investing in better social, economic, and natural capital would create individuals and communities with greater resilience. In some realms of the prevention community, there was also dis- satisfaction with the degree to which behavioural interventions yielded limited health gain. In fact, some saw that misguided be- havioural interventions in fact amplified adverse health conditions through victim-​blaming and social exclusion. It became increas- ingly recognized that different kinds of interventions (communica- tive, facilitative, and regulatory) should complement each other and in fact create synergy. For instance, mere knowledge provision on seatbelt use would only work to advance health if cars are equipped with such devices, and legislation mandates and sanctions their use. This logic applies throughout the health spectrum: the account of smallpox eradication shows that it was not just the clinical evidence that surveillance and control made eradication possible. This feat was only accomplished by a fortuitous alignment between clinical competence, technological advance (the bifurcated needle), strict rules, as well as social, political, and financial resourcing of the effort. The authority and sovereignty of governments and professionals were also challenged. Globalization started to be seen as a threat to the livelihood of local communities. Views on the role of government changed to what some have called ‘laissez-​faire’ governance (and others Thatcherism, Reaganomics, or New Public Management) since the late 1970s. Many scholars and activists felt that things like universal health coverage was eroded, sustainability and ecological balance suffered, and social justice was tested. It was time for a new vision on public health, and the Ottawa Charter provided it. Ottawa Charter action areas The Charter was developed over four days in the spirit that led to the need for the conference: ‘experts’ from all walks of life, including clinicians, scholars, and community representatives (albeit more Structural 1800 1850 1920 1970 2000 Biomedical Clinical Social Cultural Fig. 2.13.1  Five stages of modern public health evolution since the 1800s. 154 section 2  Background to medicine dominantly male and from richer countries than female and from lower income states) worked to continually draft a simple, two-​page message on priorities in action for the new public health. Enable, mediate, advocate The Charter recognizes that health is a resource for everyday life, and that its conditions are created in everyday life. ‘Settings’ such as school, work, sports clubs, the home, neighborhoods and cities, may all challenge and enhance health. There is a role for those who wish to promote health to enable individuals, groups, and communities to control the determinants of their health; to me- diate between groups and organizations for health; and to advocate for health at all levels of policy and practice. This is a broader social objective, not just a role for one discipline. Reorient health services The conference considered that most health services are in fact disease services, and do very little to invest in positive health. Maybe worse, in the 1980s most health services were not too proficient at patient education and counselling and some had primitive ideas about how Fig. 2.13.2  The cover of the Ottawa Charter for Health Promotion, including its logo describing action areas. Reprinted from The Ottawa Charter for Health Promotion, First International Conference on Health Promotion, Ottawa, 21 November 1986. 2.13  Health promotion 155 to secure best patient outcomes. The dominant model adopted the ‘Trust me, I’m a doctor’ position. The Ottawa Charter identified that health services, be they primary care posts, specialist hospitals, or ter- tiary care facilities, are pivotal social institutions in the communities they serve. Not just the health of their patients (or clients), but also of staff, visitors, and their ecological footprint could be considered important. Rather than disease cure institutions, they could become health-​promoting health services including better, health enhancing architectural design, community hubs, and healthy workplaces. Supportive environments The emerging discipline of social epidemiology had provided evidence since the 1970s that the social, physical, natural, and economic envir- onment impacts significantly on individual and community health. Like Sir Michael Marmot, many years later (2008) at the launch of the report of the WHO Commission on Social Determinants of Health said: ‘Why treat people, and then send them back to the conditions that made them sick?’, the Ottawa Charter urges individuals, groups, communities, practitioners, and politicians to see how these envir- onments contribute to making healthier choices the easier choices. For instance, in most countries fat meat is cheaper than lean meat and changing the commercial environment to facilitate the healthier choices (e.g. through different slaughter procedures, subsidies, but also advocacy) would yield easier access to healthy food. The suc- cessful Heartbeat Wales programme inspired various aspects of the health promotion vision, including the effective partnerships be- tween diverse partners in the food chain to accomplish exactly this. Heartbeat Wales significantly inspired the Charter from the early 1980s onward. Personal skills and community action There is no denying that health is mostly an individual attribute (al- though René Dubos already in 1959 explicitly included society in his definition: ‘Health is the expression of the extent to which the individual and the social body maintain in readiness the resources required to meet the exigencies of the future’—​a view that aligns well with positive health). But health behaviour is not necessarily individual, it is shaped through social norms and community op- portunity. The Charter identifies skills (such as a prescient view on health literacy) to deal with health challenges and opportunities, in- dividually and collectively, as key to the success of health promotion. Build healthy public policy Many of the options and opportunities outlined here depend on collective choice, or (as Nancy Milio analysed) on public policy out- side the health sector. This idea had started to take hold in Canada in the early 1970s with the Lalonde Report that advocated for ac- tion on elements of the ‘health field’ outside medicine. This idea was adopted by the Swedish government in the 1980s (Fig. 2.13.3) when they identified how action in one government sector could well have added effect across several health conditions: it has taken decades of deliberate health promotion action to allow the country to effectively implement its comprehensive health strategy. Milio systematically reviewed the policy literature and showed health impacts from all sectors under the influence of public policy (e.g. including agriculture, food, and nutrition; education and literacy; transportation and mobility; defence; economic development, and so on). The Ottawa Charter was straightforward in its call (Box 2.13.1) that making the healthier choice should also be the easier choice for all politicians. A health promotion for the 21st century The Ottawa Charter remains an important benchmark for global health promotion. It has influenced policies and practices around the world. On its 25th and 30th anniversaries the legacy of the state- ment was reviewed. Progress has been made in virtually all action areas, either in practice or in scholarship. There is, however, also evi- dence that health services are still facing problems to redefine them- selves as ‘health-​promoting environments’. Strong correlation Social upbringing environment Mental illness Tumours Injuries Respiratory diseases Cardiovascular diseases skeletomuscular disease Some correlation Social work environment and unemployment Diet Tobacco Alcohol and drugs Traffic Air/water pollutants Physical living environment Social living environment Physical work environment Fig. 2.13.3  The Swedish government’s map of public sector impacts on health and disease (HS90—​the Swedish Health Services for the 1990s, 1984). Box 2.13.1  Ottawa Charter for Health Promotion view of Healthy Public Policy Building healthy public policy Health promotion goes beyond healthcare. It puts health on the agenda of policymakers in all sectors and at all levels, directing them to be aware of the health consequences of their decisions and to accept their respon- sibilities for health. Health promotion policy combines diverse but complementary ap- proaches including legislation, fiscal measures, taxation, and organiza- tional change. It is coordinated action that leads to health, income, and social policies that foster greater equity. Joint action contributes to en- suring safer and healthier goods and services, healthier public services, and cleaner, more enjoyable environments. Health promotion policy requires the identification of obstacles to the adoption of healthy public policies in non​health sectors, and ways of ­removing them. The aim must be to make the healthier choice the easier choice for policymakers as well. Reprinted from The Ottawa Charter for Health Promotion, First International Conference on Health Promotion, Ottawa, 21 November 1986. 156 section 2  Background to medicine The context for health promotion also has changed. The debate around health equity has intensified, with many countries around the world (led by WHO) having inequalities in health on national and local agendas. Health promotion is still seen as a key strategy to deal with the social determinants of health that create these in- equities. Attention has shifted from the mere recognition that all public policies may impact on health to active strategies and actions to move health concerns into all policies. This is called ‘Health in All Policies’ and cunning leadership of the health sector and its profes- sionals is identified as critical to the future success of health promo- tion and global health. A clinical commitment and challenge For public health doctors it will be clear what their roles and commit- ments in health promotion are. They mediate, advocate, and enable diverse groups of stakeholders that shape environments for health to be explicit about the health impacts and choices available to the in- dividuals, groups, and communities they serve. In this way they will contribute to the development of Health in All Policies, and support people in individual and community action for health. For the clinician dealing with individual patients, the responsibility may be less clear. The concepts discussed in this chapter may be con- sidered abstract and divorced from the individual complaint voiced in the surgery, clinic, or ward. However, clinicians are key actors in shaping social and cultural priorities and beliefs, and their commit- ment to the reduction of health inequity should be in no way less than those of public health specialists. The British Medical Association responded to this challenge and presented its members, and the rest of the world, with a grid that sees levels of clinical action, both geographically (local and international) and in terms of performance (treating patients—​ community leadership—​advocacy—​research)—​see Table 2.13.1. To contribute to better health of individuals and communities, and to reduce inequities in health between groups, there are im- portant roles for doctors: • Practice holistic medicine from a patient-​centred perspective • Embrace and exploit the social and community stature of the medical profession in the community to bring together partners for health • Advocate for, do not just treat, marginalized and socially excluded individuals and their communities Health promotion is a core commitment and responsibility of the clinician. As the Declaration of Geneva states: I will not permit con- siderations of age, disease or disability, creed, ethnic origin, gender, na- tionality, political affiliation, race, sexual orientation, social standing, or any other factor to intervene between my duty and my patient. FURTHER READING Baum F, Lawless A, Williams C (2013). Health in All Policies from international ideas to local implementation: policies, systems and organizations. Health promotion and the policy process: practical and critical theories, 188–217. British Medical Association (2011). Social determinants of health—​ what doctors can do. British Medical Association, London. de Leeuw E (2011). The boulder in the stream. Editorial, ii157–​ii160. In:  de Leeuw E (ed) The Ottawa Charter 25 Years On. Health Promotion International, 26(suppl 2), ii157–​ii272. de Leeuw, E (2017). Engagement of sectors other than health in in- tegrated health governance, policy, and action. Annual Review of Public Health, 38, 329–49. de Leeuw E, Ståhl T, Tang KC (eds) (2014). The eighth global confer- ence on health promotion. Helsinki, June 2013. Health Promotion International, 29(suppl 1), i1–​i151. Dubos R (1959). Mirage of health:  utopia, progress, and biological change. Doubleday, Garden City, NY. Eriksson M, Lindström B (2006). Antonovsky’s sense of coherence scale and the relation with health: a systematic review. J Epidemiol Community Health, 60(5), 376–​81. Huber M, et al. (2011). How should we define health? BMJ, 343, d4163. Milio N (1981). Promoting health through public policy. FA Davis Co, Philadelphia, PA. Newby L, Denison N (2014). ‘If you could do one thing.’ Nine local actions to reduce health inequalities. The British Academy, London. Pelikan JM, et al. (2005). 18 core strategies for health promoting hos- pitals (HPH). In: Groene O, Garcia-​Barbero M (eds) Health promo- tion in hospitals: evidence and quality management. World Health Organization, Copenhagen. Ståhl T, et al. (2006). Health in all policies. Prospects and potentials. Finnish Ministry of Social Affairs and Health, Helsinki. Ziglio E, Simpson S, Tsouros A (2011). Health promotion and health systems: some unfinished business. Health Promot Int, 26(suppl 2), ii216–​ii25. Table 2.13.1  What doctors can do in health promotion to address social determinants of health and health equity (BMA, 2011) Treating patients Community leadership Advocacy Research Local A B C D Regional E F G H National I J K L International M N O P 2.14 Deprivation and health 157 2.14 Deprivation and health 157 ESSENTIALS The health of an individual is influenced by the circumstances in which he or she lives. Individuals who live in poverty are more likely to be unhealthy and die younger than individuals who are wealthy. The mechanism by which complex social circumstances cause health inequalities might be a failure to create capacity to manage life’s challenges. Most agree that well-​being is created where indi- viduals have an optimistic outlook, a sense that they are in control of their own lives, a sense of purpose and meaning in life, confidence in their ability to deal with problems, a supportive network of friends and a nurturing family. If society is serious about tackling health inequalities, action needs to happen across the life course. Children from families who have no expectation of success need encouragement and, when they fall into addiction or criminality, society needs to think about rehabilitation rather than punishment. Inclusion into society and building empathy is the basis for narrowing inequality. Early observations The industrial revolution was marked by increasing disparity in wealth across society. As the health of the rich improved, the health of the working classes was often damaged by the conditions in which they worked. In 1842, the social reformer Edwin Chadwick pub- lished a study in which he noted that labourers, on average, died around 20 years younger than members of the professional classes. Three years later, Friedrich Engels published a description of life in cities in Northern England. He reported that the annual death rate in mill towns such as Manchester and Liverpool was significantly higher than the national average (around 1 in 32 compared with an average of 1 in 45). In Glasgow, a city at the heart of the industrial revolution, city officials published in 1861 data linking infant mortality with levels of affluence. In the poorest areas of the city, infant mortality was 18 times higher than that seen in the wealthiest families. Over the last 150  years, it has become apparent that the rela- tionship between poverty and health is complex and involves many aspects of society. It has now become customary to talk of ‘socioeconomic determinants of health’ and, where these determin- ants are unequally distributed, to see the resultant health inequality as a matter requiring action. The World Health Organization defines the social determinants of health as: ‘the conditions in which people are born, grow, work, live, and age, and the wider set of forces and systems shaping the conditions of daily life. These forces and systems include economic policies and systems, development agendas, social norms, social policies and political systems.’ Health inequalities in the 21st century There are inequalities in life expectancy between countries, within countries, and within individual towns and cities. Internationally, the gap between industrialized countries and the low-​income countries narrowed between 1960 and 1990 from around 23  years to approximately 12  years. The gap between the two groups in infant mortality was 123 (per 1000 live births) in 1960 and this narrowed to 61 by 1990. The exception to this improving picture was in some African countries where mortality from AIDS caused a fall in life expectancy By 2010, the OECD countries had attained an average life expect- ancy at birth for the whole population of 79.7 years, a gain of almost 10 years since 1970. Japan had the highest life expectancy and most OECD countries had attained a life expectancy at birth of at least 80 years. Within the United Kingdom, life expectancy in England, Scotland, and Wales continues to improve, although there are differences between the three countries with Scotland having the lowest life expectancy for both men and women (Table 2.14.1). Despite increasing life expectancy across the United Kingdom, significant variations between districts are still apparent. In Kensington and Chelsea, the wealthiest part of London, a man can expect to live to 88 years, while a few kilometres away in one of London’s poorer areas, Tottenham Green, male life expectancy is 71. 2.14 Deprivation and health Harry Burns 158 section 2  Background to medicine The causes of health inequality Health in any society is determined by the complex interaction of many factors, hence attempts to attribute inequality to any one factor are likely to be an oversimplification. However, several explanations have received attention over the years. Access to healthcare One commonly held belief is that reduced life expectancy among the poor is due to inequality in access to healthcare. In 1980, the UK gov- ernment published a report by Sir Douglas Black. He had been asked to answer the question of why, after 30 years of a National Health Service, providing universal, free access to healthcare, there were persistent inequalities in health in the UK population. Black was able to discount this theory, saying healthcare was primarily to treat illness when it occurred, not to prevent its occurrence. However, it remains possible that, in countries without universal, free health services, this explanation could contribute to inequalities. Other, more plausible explanations for inequalities in the United Kingdom were considered in Black’s Report. These included the following. Natural or social selection Could it be that, rather than social position affecting health, health actually determines social position? People with poor health might be less effective in the workplace and gradually drift down the occu- pational hierarchy. However, it was generally accepted that this was not a credible explanation for the pattern of inequality observed in the United Kingdom at that time. Cultural or behavioural factors This explanation focuses on the choices made by individuals about their health. Use of alcohol, tobacco, lack of exercise, and consump- tion of unhealthy foods are more common at the lower end of the social scale. The resulting poor health is often seen as the consequence of a lack of education or simply a lack of interest in remaining healthy. However, a cohort study that followed British civil servants over a long period of time, collecting data on risk factors such as body weight, cholesterol, smoking, and blood pressure, found that risk factors could only explain one-​third of the observed variation in health. Material factors Poor people are more likely to live in overcrowded homes of poor quality. They may work in noisy, polluted environments with inad- equate safety regulations. They will be less financially secure and may be stressed by debt and difficulty paying bills. All of these fac- tors can adversely affect health. Psychosocial factors If we feel overwhelmed by events around us, we will be stressed, and stress has behavioural and biological consequences. This makes it difficult to separate behavioural, structural, and psychosocial factors as causes of inequality. One way of unravelling the complexity of the relationship is to understand the biological mechanisms through which social cir- cumstances influence health. The biology of deprivation Many studies report high levels of stress hormones in people at the lower end of the socioeconomic scale. A normal stress re- sponse allows us to respond to challenge. Sitting an exam, moving house, or an accident will produce a short-​lived elevation in stress hormones which will return to normal once the challenge is overcome. Many stressors are less easy to resolve. Unexpected bills, work conflicts, family crises are persistent and often difficult to manage. In poorer areas, problems may include noise, damp housing, discrimination, and living in a violent and dangerous neighbourhood. Greater resilience allows us to manage successfully everyday problems. This is particularly true for those who are more affluent and can afford to live in a safe environment. Chronically activated stress responses appear commoner among those in a poorer social position and this chronic stress burden may interfere with health in several ways which ultimately impair well-​being and long-​term survival (Fig. 2.14.1). Low socioeconomic status Adverse environments Poverty Lack of social support Chronic stress Biological damage Lack of control Poor health choices Poor health Fig. 2.14.1  Mechanisms by which low socioeconomic status may lead to poor health. Table 2.14.1  Life expectancy at birth by year of birth Year Men Women England Scotland Wales England Scotland Wales 07/​09 78.2 75.3 77.2 82.3 80.1 81.5 08/​10 78.5 75.8 77.6 82.5 80.3 81.8 09/​11 78.9 76.2 78.0 82.9 80.6 82.2 10/​12 79.2 76.5 78.2 83.0 80.8 82.2 11/​13 79.4 76.8 78.3 83.1 80.9 82.3 2.14  Deprivation and health 159 The mechanism by which complex social circumstances cause health inequalities might be a failure to create capacity to manage life’s challenges. Salutogenesis—​the creation of health The American sociologist, Aaron Antonovsky, studied the question ‘How do people manage stress and stay well?’. Some people main- tain health despite exposure to potentially overwhelming stressors. ‘Salutogenesis’ was the term he used to describe those factors that support human health and well-​being, rather than the factors that cause disease. Salutogenesis represents a counterbalance to patho- genesis with which medicine is more concerned. Antonovsky’s theory is only one of many concepts that might underpin the processes of health creation. Emotional intelligence, internal locus of control, empowerment are a few of many theories that have been advanced to explain why some individuals have an increased capacity for well-​being. Some attributes are common to several of these theories. Most agree that well-​being is created where individuals have: • An optimistic outlook • A sense that they are in control of their own lives • A sense of purpose and meaning in life • Confidence in their ability to deal with problems • A supportive network of friends • A nurturing family Antonovsky’s work is of special interest since he offers a link between the psychosocial drivers of inequality and their biological conse- quences. Central to Antonovsky’s theory is the concept that, early in life, we acquire a mental outlook which he described as ‘having a sense of coherence’. This he described as a set of psychological at- tributes which he identified as being associated with positive health outcomes. They include seeing the world as: • Comprehensible: having a belief that things happen in an orderly and predictable fashion, a sense that you can understand events in your life and predict what will happen in the future. • Manageable: confidence that you have the skills, ability, support, and resources necessary to manage events in your life. • Meaningful:  believing that things in life are interesting and a source of satisfaction, and that there is good reason or purpose to care about what happens. Failure to develop a sense of coherence in early childhood, Antonovsky suggested, would result in the individual being chronically stressed. His thinking hinted at an explanation for the physical consequences of psychosocial adversity. The early origins of well-​being The basis for a salutogenic outlook is laid down in early childhood and relates to the way the child learns to manage stress. A positive stress response might be caused by exposure to a new caregiver or an immunization but, so long as the child attaches to an adult as part of a safe, consistent, nurturing relationship, they learn that the world is a comprehensible and manageable place. However, when a child experiences frequent, or prolonged adversity—​such as physical or emotional abuse, or chronic neglect due to parental substance misuse or mental illness—​such prolonged activation of the stress response can be associated with altered brain development. Specifically, abnormalities of the prefrontal cortex, hippocampus, and amygdala make it harder to suppress inappro- priate behaviour, impair learning, and increase the risks of mental and physical health problems throughout the life course. Changes to the hippocampus also impair the individual’s capacity to manage stress. Across the life course, adverse childhood events increase the risk of poor health, failure in education, increased risk of offending, and a criminal record as well as poor health. Antonovsky did not claim that sense of coherence alone could produce health. He also argued that individuals needed access to ex- ternal supports that allowed them to be resilient in the face of severe challenge. These resources include assets such as money and social support. These are precisely the factors which those living in poverty and chaos are lacking. Fig. 2.14.2 outlines Antonovsky’s concept of health creation. Resources which enhance resilience Sense of coherence Events Stress Tension Resolution Well-being Seeing the world as: Structured Predictable Feeling that it is: Manageable Meaningful Wanting to engage Family Nurture Intelligence Work Material resource Identity Cultural stability Stable set of answers Optimism Fig. 2.14.2  Antonovsky’s concept of health creation. 160 section 2  Background to medicine How might health inequalities be improved? Healthy brain architecture depends on a foundation built by ap- propriate attachment formed between a child and stable, caring adults. If an adult’s responses to a child are unreliable, inappro- priate, or simply absent, the lifetime risk of failure is increased. Adverse childhood events can set in motion a lifetime of failure. Mental health problems in childhood can lead to school failure, poor educational record, and unemployment. One cohort study suggested that childhood abuse and neglect could have a signifi- cant impact on the likelihood of arrest for delinquency and vio- lence. By the age of 32, almost half of the victims of abuse and neglect were arrested for a non-​traffic offence. Criminal conviction enhances the likelihood of unemployment and poverty, and the individual becomes increasingly alienated from society. Domestic violence damages children and the cycle of alienation continues (Fig. 2.14.3). If society is serious about tackling health inequalities, action needs to happen across the life course. It needs to begin by ensuring that babies have the best possible start in life. Supporting parents who have not themselves experienced a nurturing childhood to nurture their children is a necessary first step. Children from families who have no expectation of success need encouragement and, when they fall into addiction or criminality, society needs to think about rehabilitation and repair rather than punishment. Inclusion into society and building empathy is the basis for nar- rowing inequality. FURTHER READING Center on the Developing Child. Key Concepts. http://​developingchild. harvard.edu/​ Kawachi I, Subramanian V, Almeida-​Filho N (2002). A glossary for health inequalities. J Epidemiol Community Health, 56, 647–​52. Lindstrom B, Eriksson M (2005). Salutogenesis. J Epidemiol Community Health, 59, 440–​2. Marmot M, et  al. (2010). Fair Society, Healthy Lives (The Marmot Review). http://​www.instituteofhealthequity.org/​resources-​reports/​ fair-​society-​healthy-​lives-​the-​marmot-​review Marmot MG (2015). The health gap: the challenge of an unequal world. Bloomsbury Press. ISBN 1632860783. McEwen BS, Morrison JH (2013). The brain on stress: vulnerability and plasticity of the prefrontal cortex over the life course. Neuron, 79, 16–​29. Initial event Unequal outcomes Alienation Consequences Loss of self efficacy, self esteem, sense of control Worklessness Poverty Failure in education offending health Mental health problems Chaotic early years Fig. 2.14.3  The cycle of alienation. 2.15 How much should rich countries’ governments s 2.15 How much should rich countries’ governments spend on healthcare? 161 ESSENTIALS The argument commonly made by politicians in richer countries is that a universal health system is unaffordable because of the enor- mous gains in life expectancy leading to an ageing population, and that market-​driven competition and choice is preferred because it in- creases efficiency. There is no evidence to support these claims, and evidence to the contrary on both counts. The ageing of the popula- tion contributes little to the overall growth in expenditure, which is largely attributable to increasing marketization of health systems and costs of medicines and technologies. Since 1948, the United Kingdom has had a universal integrated public health system free at the point of delivery and funded through central taxation. The UK NHS became the model for many countries’ health systems across the world, as having the lowest cost, most effi- cient and fairest system, and guaranteeing healthcare to all its citizens without fear of charges or denial of care. In 2012, after two decades of market incrementalism, the universal public model was abolished in England (although Scotland and Wales still retain it) in favour of a model which increasingly resembles the non​universal market form of the United States where risk selection and denial of care prevails. The United States is one of the richest countries in the world and has the most expensive healthcare, but in spite of that it denies more than one in five of its population access to healthcare. Overtreatment and denial of care, catastrophic costs, and spiralling health expend- iture, go hand in hand in the United States. Those countries that have adopted the US model of mixed funding and private provision have more marketization, the greatest inequalities in access, lack of coverage, and highest out-​of-​pocket payments because market models operate on the basis of risk selection and risk avoidance. We conclude that the decision to have a universal public healthcare system is political. Many countries have decided that universal healthcare is the hallmark of a civilized society and that it is both necessary and affordable for governments to legislate for its citizens to that end. The question of how much any country should spend is inextricably linked to the chosen model of funding and provision, the degree of marketization, and how much risk selec- tion and denial of care a government is prepared to tolerate in its health system. Introduction Among many high-​income countries, healthcare spending has been in relative decline since 2007, when the latest economic recession began. According to the Organisation for Economic Cooperation and Development (OECD), a grouping of the richest countries in the world,1 a third of developed world governments reported a ‘real term cut in overall health spending in 2013’. In Italy, Portugal, and Greece real reductions have been taking place for several years. At the same time, checks on future spending levels are being discussed throughout the European Union. A common claim is that advanced economies can no longer afford universal health systems, that is, sys- tems which guarantee a right of access to healthcare for everyone in need of it. What should higher income countries spend? The relative decline in health spending has been accompanied by a growing tendency to treat this question as a technical and non-​ political issue. According to the International Monetary Fund (IMF), reform is needed because public health spending is ‘an im- portant macro-​fiscal issue’: ‘Public spending on health care has been a key driver of aggregate increases in public spending over the past 40 years. [ . . . ] [S]‌pending is projected to continue rising as a share of gross domestic product (GDP) unless reforms are undertaken to help break these trends’. Both the European Commission (EC) and the IMF have advised or forced some of the countries worst af- fected by the recession and the aftershock of the banking crisis in 2007–​2008 to cut health spending, and Greece to abandon universal healthcare, in return for financial assistance. Throughout the developed world this technical rationale for what has become known as ‘austerity’ is bolstered by the hypothesis that rich countries are victims of their own success in improving the life expectancy of their citizens. A longer-​lived population will 2.15 How much should rich countries’ governments spend on healthcare? Allyson M. Pollock and David Price 1  The OECD consists of the following high-​income countries: Australia, Austria, Belgium, Canada, Chile, Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Iceland, Ireland, Israel, Italy, Japan, Luxembourg, Netherlands, New Zealand, Norway, Poland, Portugal, Republic of Korea, Slovakia, Slovenia, Spain, Sweden, Switzerland, United Kingdom of Great Britain and Northern Ireland, United States of America. 162 section 2  Background to medicine contain a higher proportion of elderly people than a shorter-​lived one and this, it is argued, has created public budgetary pressures that threaten economic well-​being. The proffered solution to an ‘ageing society’ is to cap or reduce public health budgets so that they take up a declining (or no greater than at present) share of national wealth. However, decisions about future levels of spending on healthcare are fundamentally political, or normative, just as are decisions on public expenditure on defence or international development. Today, most governments of the wealthiest countries spend between 7 and 10% of their national wealth on healthcare: the United States is an out- lier at almost 18% (Fig. 2.15.1), and judging how much they should spend in the future is now a pressing political question. To address it we examine, first, the political basis of austerity policy. Secondly, since all normative claims have testable elements or can be reframed so that they are testable, we examine the concepts and empirical evidence that can be used in debates about future levels of spending. To simplify mat- ters we look at the role of politics and austerity through four research lenses, namely, public health, law, finance, and economics and trade. The role of politics and austerity in determining healthcare expenditure Austerity policy is based on claims that are no less political than those that inform alternative spending policies. This is because the econometric models that underpin the policy of reducing public expenditure and public services involve implicit and con- testable value judgements about the benefits of free markets and of a diminishing role for government or relatively low public ex- penditure. Standard economic theory, according to Paul Krugman, prescribes increased public spending during a recession in order to stimulate demand, even at the expense of growing public deficits (in which governments spend more than they collect). However, the dominant international political policy position known as the ‘Washington consensus’ has been to support the adoption of pri- vate markets with relatively low government involvement in place of tax funded public services. Following the banking crisis in 2008, many governments pursued the policy of reducing health expenditure. Those European Union (EU) countries worst affected by austerity—​Italy, Greece, Spain, Portugal, and Ireland—​all experienced severe reductions in total and public and private per capita spending on healthcare. The EC and the IMF have required some countries to reduce healthcare spending substantially in exchange for financial assistance. Since 2008, the UK government has opted for substantial cuts in public ex- penditure, provoking criticism from some leading economists that this was less conventional economics than ideological commitment to the private sector and markets. Since 2000, alternative hypotheses about economic develop- ment have been used to generate testable questions about or rebut Health expenditure 2014 for selected OECD countries plus Argentina, Brazil, Costa Rica, Cuba, Dominican Republic, Peru and Venezuela Total health expenditure as a percentage of gross domestic product (public and private) by country 18.0 16.0 14.0 12.0 10.0 8.0 6.0 4.0 2.0 0.0 USA Sweden Switzerland France Germany Austria Cuba New Zealand Netherlands Denmark Belgium Canada Japan Norway Finland Portugal Australia Costa Rica Italy United Kingdom Spain Brazil Greece Chile South Korea Peru Venezuela Argentina Dominican Rep Private Public Fig. 2.15.1  Health expenditure for OECD countries plus some Latin American countries. Data from World Health Organization Global Health Expenditure Database. 2.15  How much should rich countries’ governments spend on healthcare? 163 claims that public spending on healthcare is an economic burden that impairs future rates of economic development. There is now a large body of research that shows that growing inequalities in in- come and health status themselves create economic burdens that retard growth, and that investing in healthcare can stimulate the economy. Several studies have examined the costs of high levels of inequality for developed economies. Wilkinson and Pickett’s The Spirit Level (2009) and Thomas Piketty’s Capital in the Twenty-​first Century (2014) challenge the hypothesis that health spending is an eco- nomic burden. Adding weight to the analysis is the World Health Organization’s (WHO) Macroeconomics and Health Commission final report (2003) and the EU’s Investing in Health (2013), the au- thors of which argued that investing in health is a desirable eco- nomic goal because it ‘contributes to the Europe 2020 objective of smart, sustainable and inclusive growth’. Contrary to the IMF, the EU maintains that health spending is now ‘recognised as growth-​ friendly expenditure’. In a further sign of a changing policy arena, the United Nations Conference on Trade and Development suggested in 2014 that a more flexible approach was needed to public spending and govern- ment institutions than allowed under the Washington consensus (UNCTAD, 2014). A second argument for the political basis of austerity policy arises from findings that economic forecasts have proved to be badly wrong. In April 2014, IMF research staff published an acknowledge- ment that their econometric models had proved to be wide of the mark after it was found that those countries that had implemented the deepest public spending cuts subsequently experienced the lowest levels of economic growth. Thirdly, while the ageing society hypothesis has taken on a mythical status among politicians, it has been rebutted in a series of studies around the world. Ageing, it turns out, is too gradual to explain anything more than quite marginal changes in total health spending. This is because most people’s healthcare costs are incurred in the last six months of life at whatever age they die. Additionally, there is a body of research which shows that not only do older people contribute more financially and in kind than they take out of the system, but they are living longer and healthier and productive lives (the compression of morbidity). In any event, the ageing hypothesis lacks plausibility when used to suggest that a finite population has infinite healthcare needs. In the United Kingdom, for example, older people have accounted for a relatively small proportion of the increase in spending on healthcare, and while overall spending on health services between 1965 and 1999 grew by 3.8% a year in real terms, demographic change (ageing) accounted for a real increase of just 0.5% a year in sharp contrast to the cost of pharmaceuticals and technology, which rise well above inflation. Background—​trends in health spending in richer and some middle-​income countries (OECD area plus some Latin American countries) Fig. 2.15.1 and Table 2.15.1 show total spending on healthcare as a proportion of GDP for 2013/2014. Poor-​quality data on private spending It is important to note that the quality of data on private spending in both rich and poor countries is limited by the fact that they are based on household surveys, are not comprehensive, and often lack comparability. Moreover, most data reflect spending by or on those who use services, not those who are excluded, and data on the poor and workers in informal economies might not be collected. The re- liability of data about unmet need and inequality remains a critical issue in all countries. Allowing for these limitations, the spending data show: 1. That there are political differences among all countries in the total health expenditure as a proportion of GDP, and in the rela- tive spending on public and private healthcare. The United States is an outlier among all countries, with the highest public and private expenditure. 2. That national (non-​insurance-​based) health systems (United Kingdom, Norway, Denmark, Italy, New Zealand, Sweden) have Table 2.15.1  Health expenditure 2014 for selected OECD countries plus Argentina, Brazil, Costa Rica, Cuba, Dominican Republic, Peru, and Venezuela as a percentage of gross domestic product Country Public Private Total United States 8.3 8.9 17.1 Sweden 10.0 1.9 11.9 Switzerland 7.7 4.0 11.7 France 9.0 2.5 11.5 Germany 8.7 2.6 11.3 Austria 8.7 2.5 11.2 Cuba 10.6 0.5 11.1 New Zealand 9.1 1.9 11.0 Netherlands 9.5 1.4 10.9 Denmark 9.2 1.6 10.8 Belgium 8.2 2.3 10.6 Canada 7.4 3.0 10.4 Japan 8.6 1.7 10.2 Norway 8.3 1.4 9.7 Finland 7.3 2.4 9.7 Portugal 6.2 3.3 9.5 Australia 6.3 3.1 9.4 Costa Rica 6.8 2.5 9.3 Italy 7.0 2.3 9.2 United Kingdom 7.6 1.5 9.1 Spain 6.4 2.6 9.0 Brazil 3.8 4.5 8.3 Greece 5.0 3.1 8.1 Chile 3.9 3.9 7.8 South Korea 4.0 3.4 7.4 Peru 3.3 2.2 5.5 Venezuela 1.5 3.7 5.3 Argentina 2.7 2.1 4.8 Dominican Rep 2.9 1.4 4.4 164 section 2  Background to medicine similar levels of total spending to insurance-​based countries, but relatively higher levels of public spending, expressed as a propor- tion of GDP, while private spending on health is relatively low in these countries. 3. Countries with high levels of private spending may have mul- tiple public and private insurance systems. 4. Latin American countries also show major differences in public and private expenditure which are related to the design of their health systems. For example, Costa Rica and Cuba both have high levels of public expenditure and universal tax-​funded public health systems, whereas Brazil, the Dominican Republic, and Chile have mixed funding and private insurance systems which are not universal. Spending levels The perspective of public health Public health sciences seek to understand, among other things, the best ways in which to maximize population health status and meet public health needs through fairness of funding and services. Within the international public health community, universal access to healthcare has been adopted as the preferred goal of rational health systems for the last 45 years (World Health Organization, 1978). This is the case because it is now accepted that although healthcare is only one of several factors contributing to population health, uni- versal access to it makes a measurably greater contribution than more limited access. In 2015, the two major international health agencies, the WHO and the World Bank Group, jointly announced that universal healthcare ‘is a critical component of the new Sustainable Development Goals (SDGs)’, which have replaced the Millennium Development Goals as the standard against which to measure policy progress in popula- tion health. Included among SDGs is a target to ‘Achieve [ . . . ] access to quality essential healthcare services and access to safe, effective, quality and affordable essential medicines and vaccines for all’ (World Health Organization and World Bank, 2015). The agencies declare: ‘Supporting the right to health and ending extreme poverty can both be pursued through universal health coverage.’ Goals such as the SDGs therefore provide a framework based on public health science for normative debate about health spending. From them it follows that governments committed to the public health goal of maximizing population health status should spend sufficient to ensure universal access to healthcare. The perspective of laws and conventions National laws, constitutions, and international conventions on human rights provide a second framework for debate. Legal analysis can be used to determine to what extent domestic laws may influ- ence governments’ healthcare budgets. In most developed coun- tries, governments are required by primary legislation to fund public healthcare of some description, and within reason to vote through sufficient money to discharge that duty. For example, the parliamen- tary act of 1946 which brought in the NHS throughout the United Kingdom, committed the UK government to funding and providing key listed services for everyone throughout the United Kingdom, a duty that was abolished by the Health and Social Care Act 2012. Similar laws can be found in other countries with universal sys- tems. The Danish Health Act (amended June 2010) requires the gov- ernment to meet, among other things, ‘the need for [ . . . ] free and equal access to health care’. The objective of Sweden’s health act (the Health and Medical Services Act 1982:763) is ‘care on equal terms for the entire population’. Italy’s national health service law requires that health benefits must be guaranteed to all citizens. The EU, which shares with member states responsibility for healthcare, has adopted ‘principles of equality of access and solidarity in funding ar- rangements, whether that is primarily through taxation or through regulated social insurance’. These principles are applied through the European Court of Justice, the judgements of which are superior to national courts. National laws can be amended and revoked, however, and simply describing them may provide insufficient guidance about duties. Spain abolished its national health service by royal decree in 2010. England’s NHS legislation was changed in 2012 so as to absolve the government of the day from the duty to meet specific service re- quirements for everyone throughout England. Legal analysis was crucial to show the scale and nature of the proposed changes and the ways in which the government can now reduce public expend- iture and withdraw entitlements to healthcare and restrict access to public services without fear of legal challenge. However, the implications of the repeal are still not widely appreciated, and for political reasons the government claims still to be committed to universality. Apart from duties and responsibilities set out in national legis- lation, high-​income countries are also subject to provisions of international conventions on human rights to which they are signa- tories. These, though not always enforceable in domestic courts, are relevant to normative debate. Among the most important human rights instruments are the Convention on the Rights of the Child, the Convention on the Elimination of All Forms of Discrimination Against Women, and the Covenant on Economic, Social, and Cultural Rights. All, according to the WHO, provide ‘a legal and nor- mative framework for the respect, protection and fulfilment of the right to health and other related rights of women, children and ado- lescents’. It is generally accepted that the instruments place national governments under a duty (although not necessarily enforceable) to ‘ensure that health facilities, goods, and services are of good quality, are available in sufficient quantity, and are physically accessible and affordable on the basis of non-​discrimination’. These various laws and conventions constitute a range of legal or quasi-​legal provisions that can guide decisions about future spending. The perspective of financing Financial analysis derives from both public health and legal ana- lyses, for if there is an obligation to fund universal healthcare, how should money be raised and distributed so that the duty can be discharged most successfully and at least cost? In general, this question bears on the extent to which there is equity in resource distribution across society, and particularly on the role of publicly financed health spending in redistributing resources from the rich to the poor and from the well to the sick. It resolves itself into three testable questions. The first is the extent to which low incomes are barriers to access, that is, the extent to which patients are required to pay at the point of use (so-​called out-​of-​pocket payments or user 2.15  How much should rich countries’ governments spend on healthcare? 165 charges) for the services that they need instead of paying in advance through taxes or insurance. The second is the extent to which ad- vanced payments (instead of user charges) are equitably calibrated and collected, either through insurance premiums or taxes (i.e. are they in line with earnings or income?). The third question concerns the extent to which healthcare funds are allocated according to healthcare needs. Solutions to the first two questions depend largely on the way in which governments raise money to pay for healthcare, that is, on which type of financing maximizes universality. Evidence shows that universality is more likely to be achieved when out-​of-​pocket payments are minimized, and personal incomes cease to be a bar- rier to access. In many countries, user charges, or private spending, entail catastrophic costs for the household and the individual be- cause of the way they redirect resources from other necessities of life and basic household expenses (absorbing more than 20% of mean household income). In the United Kingdom, most people who re- quire medicines are exempt prescription charges, while in Scotland and Wales prescription charges have been abolished. However, in most countries the cost of medicines is borne by individuals. This is often very expensive for the patient and household expenditure. For example, in poorer countries a single course of medicines for common conditions such as malaria or diabetes may cost several days’ wages. Medical costs including medicine costs are a common cause of personal bankruptcy in all countries, and in the United States they account for three-​fifths of all bankruptcies. Since poverty and income inequality are the main causes of ill health it would be paradoxical to raise health funds by means that deepened or failed to address either. Hence, if money is raised in advance rather than at point of use, it needs to be raised by methods that impinge the least (or at least proportionately) on the incomes of the poor. The most equitable funding mechanism is progressive direct tax. The main alternatives, social or private insurance, both involve user charges, sometimes substantial, at point of use and have more limited capacity to address inequality by spreading costs across the whole of the population (a practice known as ‘risk pooling’). In a public healthcare system, the optimal form of financing is progres- sive taxation with a maximum separation between the user and the caregiver to reduce any conflicts of interest. How should money be distributed around the system? Selection versus inclusion Raising money equitably is only half the battle; resources must be al- located according to need if universality is to be achieved. Clearly, if healthcare money is not allocated according to need, or is siphoned off for non-​healthcare purposes, healthcare budgets are likely to be underestimated for some parts of the population and overestimated for others. Whether markets or market-​like bodies are capable of needs-​ based allocation is a testable question. One key measure concerns the extent to which the institutions within a health system that are responsible for securing healthcare and the institutions that provide it are inclusive, that is, are responsible for everyone in a geographical area and are not able to exclude more expensive population groups such as older people and the chronically sick. Were selection of this type to occur the universal goal would be more expensive to achieve or might not be achieved at all. Systems based on private insurance or privately run provision are inherently selective. In these systems corporations manage their financial risks by excluding high cost pa- tients or by limiting the range and nature of services available. Risk selection and out-​of-​pocket costs: Integrated universal systems and market models Universal public systems must be designed so that they cannot ex- clude patients and citizens from care and are built on redistribution and solidarity and universal risk pooling. In contrast, commercial health insurers and providers select risks (cherry picking). They seek to exclude unprofitable patients such as the chronically ill and elderly, either directly by refusing coverage, or indirectly by char- ging very high premiums, often restrict coverage for pre-​existing medical conditions, and frequently leave patients with unaffordable out-​of-​pocket costs. These insurance practices explain why, in the United States, mainly unprofitable patients’ groups are covered by public programmes namely—​the poor by Medicaid and the elders by Medicare. Solidarity mechanisms which are integral to the design of public integrated systems cannot operate in commercial health insurance or among private providers because their objective is profitability and solidarity among the wealthy. Private insurers and private pro- viders tailor coverage to purchasing power, not to need, often leaving patients with the most expensive conditions uncovered. Premiums generally rise with age, and high copayments often restrict the use of medical services. Bureaucratic administrative procedures are neces- sary to select out patient who require high-​cost care and those who are at high risk. This can lead to patient and system delays. These problems have been observed not only in the United States, but also in middle-​income countries that have adopted market-​oriented health policies. A second key measure of the wisdom of utilizing market bodies in the pursuit of a population’s health provision is the extent to which commercial pressures to consume more may inflate total (public plus private) healthcare costs so that more is spent than required from a public health perspective. This brings us to economic analysis. The perspective of economics and trade All governments want their nation’s economies to grow faster, and health spending is often conceived as a factor in economic growth. But both increases and reductions in spending can be supported by different economic analyses. In 2014, for example, a measurable drop of 1.2% in the United States’ economic growth forecasts was at- tributed to steps taken by the Obama government to increase access to healthcare and reduce costs and waste, reductions that were hotly contested by the healthcare industry. Similarly, attempts by the UK government in 1957 to control pharmaceutical profits were successfully opposed by the com- panies affected. The industry body, the Association of the British Pharmaceutical Industry, argued—​and the government ultimately accepted—​that generous profits from proprietary medicines sold to the NHS subsidized standard drugs, funded export marketing, and supported research and development. That principle survived until 2008. Health spending can thus be conceived, economically, as a bene- ficial component of growth or as aid to industry. Alternatively, as we show earlier, under the austerity project it can be seen as an eco- nomic burden, at least so far as public spending is concerned, but as we show next, public and private spending are linked. 166 section 2  Background to medicine The cost of markets Economists frequently consider the determinants of spending levels in the form of the structural arrangements that lie behind prices and how some structures lead to higher prices than others. The evidence shows that publicly run systems based on geographic areas of administration are cheaper to run. By contrast, markets are relatively expensive. For example, before market elements were introduced, the United Kingdom’s NHS had administration costs of around 6% of total expenditure. These costs now exceed 15% of total expenditure, while in the United States, since 1974 one of the most heavily marketized health systems in the developed world, they exceed 30%. Critiques of the role that markets play in inflating costs have been a staple of American economists for some years. In 2012 the USA’s Institute of Medicine published a report in which it reproduced find- ings showing that in the United States, wasted expenditure (approxi- mately $765 billion) made up a large part its uniquely high level of spending on healthcare: ‘a substantial proportion of health care expenditures is wasted, leading to little improvement in health or in the quality of care. [Table 2.15.2] contains estimates of excess costs in six domains: unnecessary services, services inefficiently delivered, prices that are too high, excess admin- istrative costs, missed prevention opportunities, and medical fraud.’ (Institute of Medicine, 2012) From a public health perspective these excess costs are inefficient, for not only do they represent a huge loss of productive health cap- acity, but to the extent that the wastage is publicly funded they also waste scarce tax resources. In the United States, high levels of pri- vate spending on healthcare have generated public spending levels broadly comparable to those of European countries with universal health systems. However, the United States does not achieve uni- versality, or anything like it, because much of the public spending is devoted to the sort of market-​driven costs and wastage listed in Table 2.15.2 and because commercial considerations dominate in resource allocation. The United States has the most expensive system in the world, with around 60 million people uninsured and almost 120 million people underinsured at any one time. Overtreatment and overdiagnosis go hand in hand with the denial of care. Catastrophic health expend- iture due to out-​of-​pocket health charges is very high. Case study of cost of markets: Medicines One of the biggest drivers of catastrophic costs and healthcare costs in most countries is pharmaceuticals. On average 25% of a country’s total health expenditure is spent on medicines, with a wide range from 7.7% to 67.6%, but the source of spending varies enormously. In many low-​income countries medicines can absorb as much as 50% of the public healthcare budget annually, compared with 12% in the UK NHS. However, most expenditure on medicines is due to out-​of-​pocket costs and charges, with pharmaceutical costs ac- counting for 64% of all out-​of-​pocket payments on healthcare in some countries. Since 1995, the private share of total pharmaceutical expenditure has increased across the world. According to the WHO, across all income groups, private spending by households in 2000 was the principal source of pharmaceutical expenditure, at 57.8% in high in- come, 70.9% in middle-​income, and 71.6% in low-​income countries. Most of the money that the pharmaceutical industry makes is not spent on producing medicines. It is estimated that 60% of all pharma- ceutical revenues are diverted from healthcare to profits, administra- tion, and marketing costs. According to one analysis of the Forbes 500 drug companies in 2003, of all revenues, 30.8% was spent on marketing and administration, 17% on profits and 14% on research and development (R&D). In the United Kingdom, the Association of the British Pharmaceutical Industry estimated that R&D was 33% of sales compared with 3.7% in the manufacturing industry, and com- pany profits as a percentage of sales ranged from 20 to 43% in 2014. Economists will argue that ‘waste’ such as the diversion of funds to market bureaucracy, including marketing and billing and invoicing, is nevertheless beneficial in a broader sense because, being eco- nomic activity, it contributes to employment and economic growth. Table 2.15.2  Estimated sources of excess costs in healthcare in the United States, 2009 Category Sources Estimate of excess costs Unnecessary services • Overuse—​beyond evidence-​established levels • Discretionary use beyond benchmarks • Unnecessary choice of higher-​cost services $210 billion Inefficiently delivered services • Mistakes—​errors, preventable complications • Care fragmentation • Unnecessary use of higher-​cost providers • Operational inefficiencies at care delivery sites $130 billion Excess administrative costs • Insurance paperwork costs beyond benchmarks • Insurers’ administrative inefficiencies • Inefficiencies due to care documentation requirements $190 billion Prices that are too high • Service prices beyond competitive benchmarks • Produce prices beyond competitive benchmarks $105 billion Missed prevention opportunities • Primary prevention • Secondary prevention • Tertiary prevention $55 billion Fraud • All sources—​payers, clinicians, patients $75 billion Adapted from Institute of Medicine (2010). The Healthcare Imperative: Lowering Costs and Improving Outcomes: Workshop Series Summary. Washington, DC: The National Academies Press. 2.15  How much should rich countries’ governments spend on healthcare? 167 The cost of free trade agreements Free trade is another economic perspective which has a bearing on spending. Economists frequently point to the importance of international trade for economic growth, and to the role of healthcare, health insurance, and related goods, principally pharmaceuticals, in generating export-​led growth. However, the promotion of free trade (the international equivalent of free mar- kets) can have major implications for spending levels when it is associated with extending healthcare markets and diluting gov- ernment controls. For example, free trade treaties can prevent governments applying price controls, the logical approach to containing inflationary pres- sures on public healthcare budgets. Today, more rigorous pharma- ceutical price controls continue to be opposed by the industry and are under assault by governments of pharmaceutical producing countries through trade treaty negotiations, such as those involving the Trans-​Pacific Partnership (TPP) and the Transatlantic Trade and Investment Partnership (TTIP). Both these draft treaties could outlaw or restrict the use of pharmaceutical price controls in cen- trally administered health systems. Such measures threaten the use of global budgets where governments set annual limits to health spending. Rates of expenditure growth are lowest in countries that cap health budgets under central oversight rules, such as England, Italy, Japan, and Sweden, and do not allow the market, consumer de- mand, and provider price-​setting to determine global expenditure, such as in the United States. The TPP and TTIP are only the most recent of a series of trade treaties with implications for health spending. In the 1990s, a World Trade Organization (WTO) treaty known as the General Agreement on Tariffs and Trade was exploited by the US insur- ance industry and the US government as a means of opening up new markets for the then struggling, multibillion-​dollar busi- ness of health-​maintenance organizations. Profits had fallen after 1997 because of market saturation, government, and employer strategies to contain healthcare costs and high-​profile scandals. To restore profitability, the industry sought to capture new mar- kets abroad by acquiring publicly run facilities, converting public funds into profits, and repatriating the profits to the United States. In this, it received influential backing for its foreign acquisitions policy in Latin America from the US government, the World Bank, and multilateral financial institutions such as the Inter-​American Development Bank. The universal health systems of Europe are the latest target of healthcare corporations with international trade strategies. As can be seen, the aforementioned analyses lead to contradictory findings for health spending using a market model. The branch of economics that deals with supply and demand, microeconomics, is not relevant to our question if universal healthcare is adopted as a goal. This is because microeconomics is built on a principle of effi- ciency in which maximizing individual utility—​not access—​is the desideratum, and in which healthcare is defined as a private com- modity not an essential public service. The United States offers prac- tical lessons into the effects of the market model which relies on commercial insurers and providers. The inhumanity of that system is a warning to other rich nations and yet it is rapidly becoming the alternative model to the integrated public health system that the United Kingdom once had. Conclusion We conclude that the decision to have a universal public healthcare system is political. There is strong evidence that universal healthcare is sustainable. In contrast, the United States shows that the more marketized and privatized the systems, the higher the costs, the waste, and the greater the risks of being denied care. The question of how much any country should spend is inextricably linked to the chosen model of funding and provision, the degree of marketiza- tion, and how much risk selection and denial of care a government is prepared to tolerate in its health system. FURTHER READING Anderson GF, et al. (2003). It’s the prices, stupid: why the United States is so different from other countries. Health Affairs, 22(3), 89–​105. Cameron A, et  al. (2011). The World Medicines Situation 2011—​ Medicines Prices, Availability and Affordability. World Health Organization, Geneva. http://​www.who.int/​medicines/​areas/​policy/​ world_​medicines_​situation/​WMS_​ch6_​wPricing_​v6.pdf Clements B, Coady D, Gupta S (eds) (2012). The economics of public health care reform in advanced and emerging economies. International Monetary Fund, Washington DC. Institute of Medicine (2012). Best care at lower cost: the path to continu- ously learning health care in America. The National Academies Press, Washington DC. Krugman P (2012). End this depression now. Norton, New York. Marmot M (2016). The health gap: the challenge of an unequal world. Bloomsbury, London. Mossialos EP, Govin BR, Hervey TK (2010). Health systems govern- ance in Europe: the role of EU law and policy. In: Mossialos Elias P, Govin BR, Hervey TK (eds), Health systems governance in Europe, pp. 1–​83. Cambridge University Press, Cambridge. Piketty T (2014). Capital in the twenty-​first century. Harvard University Press, Harvard. Pollock A (2005). NHS plc: the privatisation of our health care. Verso, London. Price, D, Pollock AM, Shaoul J (1999). How the World Trade Organisation is shaping domestic policies in health care. Lancet, 354(9193), 1889–​92. Rosenbaum SJ (2009). Insurance discrimination on the basis of health status: an overview of discrimination practices, federal law, and fed- eral reform options. J Law Med Ethics, 37(2), 101–​20. Starfield B (2008). The future of primary care: refocusing the system. N Engl J Med, 359 (20), 2087–​91. UNCTAD (2014). Trade and development report. United Nations, New York. Webster C (1988). The health services since the war. Vol 1: Problems of health care: the National Health Service before 1957. HMSO, London. Wilkinson R, Pickett K (2010). The spirit level: why more equal societies almost always do better. Penguin, Harmondsworth. World Health Organization (1978). Declaration of Alma-​Ata, International Conference on Primary Health Care, Alma-​Ata, USSR, 6–​12 September 1978. http://​www.who.int/​publications/​almaata_​declaration_​en.pdf World Health Organization and World Bank (2015). Tracking uni- versal health coverage. World Health Organization, Geneva. World Health Organisation (2018). World Health Statistics 2018: moni- toring health for the SDGs. https://www.who.int/gho/publications/ world_health_statistics/2018/en/ 2.16 Financing healthcare in low- income developin 2.16 Financing healthcare in low- income developing countries: A challenge for equity in health 168 ESSENTIALS Low-​income developing countries remain challenged by the increasing demand for essential healthcare combined with gov- ernments’ inability to mobilize and allocate adequate financing to health systems. According to their country specific contexts, they need to improve the equity and efficiency of their health systems, in particular their public sector management, and increase their gov- ernmental health spending through enhanced domestic resource mobilization and more efficiency in public spending. They need to sustain external assistance and implement social protection policies that decrease out-​of-​pocket payments, thereby preventing people with catastrophic illness from becoming impoverished. Economic growth and increased health spending alone are not enough to scale-​up healthcare coverage and achieve better health outcomes. These must be combined with accountability of results, transparent management of public funds, and multisectoral efforts with community involvement at implementation level. We should appreciate the progress made in all low-​income developing coun- tries in reducing both under-​five mortality rate (U5MR) and ma- ternal mortality rate (MMR), even if most did not achieve the relevant Millennium Development Goals (MDGs). The successful cases of Bangladesh, Bhutan, Mongolia, and Nepal, who did achieve MDGs 4 (U5MR) and 5 (MMR) over the last 10 years, illustrate good practices that deserve more careful study. Public spending on health in low-​income developing countries has not had a strong effect on reducing U5MR and MMR. Much more evident is the effect of female education. In designing public policy to reduce excess mortality in low-​income developing coun- tries, beyond increasing gross national income per capita, education of women seems to be a crucial issue. There is no magic blueprint for health financing: each country should design its own strategy based on evidence provided by national health accounts. The central issues to be addressed are: Where does the money come from? Where does the money go to? What kind of services are performed? And what types of goods are purchased? These accounts should be monitored and updated annually on the basis of epidemiological and other relevant evi- dence, and the outcome of this process should generate the support for updates of national health financing strategy. Introduction People have aspirations to fulfil their health needs. Health systems of some nature have existed as long as communities have tried to treat diseases and protect their health, but healthcare has a cost and people cannot access it without creating the ability to pay for it. This chapter provides an overview of the essential health financing issues. It includes policies, concepts, approaches, practices, and experiences that could enhance the understanding of comprehensive health sys- tems reforms that would both provide sufficient financial resources for equitable access to quality health interventions and protect people from catastrophic health expenditures. Health financing is analysed from the perspective of governments, but the roles of other important stakeholders such as the wider society and private sector are also taken into account to assist in the design, implementation, and evaluation of effective health financing reforms. The focus of this chapter is on the group of low-​income developing countries1 considered according to 2014 International Monetary Fund classification as ‘countries that were designated Poverty Reduction and Growth Trust (PRGT) eligible and had a level of per 2.16 Financing healthcare in low-​income developing countries: A challenge for equity in health Luis G. Sambo, Jorge Simões, and Maria do Rosario O. Martins 1  Afghanistan, Bangladesh, Benin, Bhutan, Bolivia, Burkina Faso, Burundi, Cambodia, Cameroon, Central Africa Republic, Chad, Comoros, Côte d’Ivoire, Congo, Congo Democratic Republic, Djibouti, Eritrea, Ethiopia, Ghana, Guinea, Guinea-​Bissau, Gambia, Haiti, Honduras, Kenya, Kiribati, Kyrgyzstan, Lao People’s Democratic Republic, Lesotho, Liberia, Madagascar, Malawi, Mali, Mauritania, Moldova, Mozambique, Nicaragua, Nigeria, Myanmar, Mongolia, Nepal, Niger, Papua New Guinea, Rwanda, São Tomé and Príncipe, Senegal, Sierra Leone, Sudan, Solomon Islands, Somalia, South Sudan, Togo, United republic of Tanzania, Tajikistan, Uganda, Uzbekistan Yemen, Vietnam, Zambia, Zimbabwe. 2.16  Financing healthcare in low-income developing countries 169 capita GNI less than (US) $2390’ (IMF, 2014). Qualitative and quan- titative analysis is made for both controllable and uncontrolled vari- ables, beginning with the health situation in low-​income developing countries when the Millennium Declaration was adopted in the year 2000 by the United Nations General Assembly. Particular attention is paid to the way incidence and prevalence of diseases has evolved since then, and the way this has impacted on mortality rate of chil- dren under five years old (U5MR), maternal mortality ratio (MMR), and life expectancy at birth. The cases of the low-​income developing countries recording the best progress in reducing the U5MR and MMR—​Bangladesh, Bhutan, Mongolia, and Nepal—​are analysed in terms of the relationship between their health financing patterns and the attainment of Millennium Development Goal (MDG) 4 (re- duction of child mortality) and Goal 5 (improved maternal health), with particular emphasis on cases in which health targets have been reached with limited financial resources. After discussion of health systems, the chapter then analyses health spending, the financing pattern, and the relationship between the latter and health outcomes in terms of achievement of the MDGs. We caution that there are variances that decision-​makers can con- trol, such as funding, financial access, social protection, quality of healthcare, and efficiency, and aspects that cannot be controlled, such as demographic trends, new norms, and standards resulting from sci- entific breakthroughs, epidemics/​pandemics, and other public health threats, which can all affect health outcomes. In light of current sci- entific knowledge and best practices on health financing, the chapter generates possible outcomes, depending on choices made, that are relevant to the context of low-​income developing countries. Subsequently, the chapter discusses the levels and sources of health financing in low-​income developing countries and the way these could have influenced the scaling-​up of cost-​effective public health interventions towards health MDGs. It makes an analysis of health financing patterns and the way health economic indicators evolved from 1995 to 2013, comparing with what happened in high-​ income countries during the same period. Our last analysis refers to the health production function, which is calculated on the basis of a multivariate regression model using a mathematical equation that relates health outcomes to enabling health system functions. We are not prescriptive and do not generate a blueprint model or template for health financing in low-​income developing countries. Available evidence shows that low-​income developing countries that made some strides in improving key health status indicators applied different models that combined various funding sources such as taxation, social insurance, and direct pay- ments (among other variables) in different macroeconomic envir- onments. We pursue scientific evidence for trustful health financing approaches in the context of low-​income developing countries. We provide ideas on macroeconomic policy and fiscal space measures that could create budget room, and thereby allow governments to provide additional resources to the health sector. Epidemiology of diseases in low-​income developing countries by the year 2000 The overall level of development influences the quality of life and health of a population. Political, economic, social, cultural, and en- vironmental determinants play a role in both the access to care and the health status. We realize that health improvements in the devel- oped world over the last century were mainly due to enhancement of living and working conditions, notwithstanding that scientific breakthroughs and technology innovation played an important role. In most low-​income developing countries health profiles are still characterized by high birth rates, high mortality rates, low life ex- pectancy, and high disease burden, with particularly adverse impact on vulnerable populations such as children and pregnant women. Poverty and hunger, which span over all low-​ and lower-​middle-​ income countries, are central determinants of heath and were considered by the United Nations as the number one Millennium Development Goal (MDG1—​eradicate extreme poverty and hunger). Poverty is the root of malnutrition, drives the spread of disease, increases vulnerability to outbreaks, epidemics, and pan- demics, and deters indigents from seeking healthcare. Iniquities in access to health determinants generate uneven distribution of health outcomes between continents, countries, and within countries (be- tween people according to their socioeconomic condition, age, sex, and geographic area). Human rights movements aim to influence governments to provide the political, economic, and social environ- ments that enable the fulfilment of health as a right of each citizen. The incidence and prevalence of diseases In 2000 the biggest seven killers of the poor in Africa and Southeast Asia were HIV/​AIDS, diarrheal diseases, acute respiratory infec- tions, maternal and perinatal conditions, malaria, measles, and tuberculosis. Among children and young adults (0–​44 years old), more than 36 million people lived with HIV/​AIDS worldwide, 95% in developing countries and especially in sub-​Saharan Africa. The same year, about 8.8 million people developed active tuberculosis and 1.7 million died of the disease, with 99% of all tuberculosis (TB) cases living in developing countries, and malaria killed more than 1 million people (3000 deaths a day), mostly children under 5 years old and pregnant women in sub-​Saharan Africa. Maternal death The main causes of maternal death are severe haemorrhages, infec- tion, hypertension, prolonged obstetric labour, and complications of unsafe abortions. In low-​income developing countries in 2000, the MMR was 628.1 per 100 000 live births compared with 18 per 100 000 in high-​income countries. The under-​five mortality rate The U5MR was 132.2 per 1000 live births in low-​income developing countries, compared with 9.7 in high-​income countries, yet with the existing knowledge and available technologies it would have been possible to prevent and treat most of the diseases responsible and reduce suffering of many millions of people over the world. Life expectancy at birth In 2000, life expectancy at birth in low-​income developing countries was 55 years, compared with 76 in high-​income countries. This was strongly correlated with very high infant and child mortality and premature death. There were, however, some examples of success in the midst of poverty such as: (1) Cambodia, with the highest HIV/​AIDS preva- lence in Asia (4%), succeeded in slowing the spread of HIV infec- tion among sex workers under 20 years old, from 40 to 23% between 170 section 2  Background to medicine 1998 and 2000; (2); the Vietnam malaria programme that reduced the malaria death toll by 97% between 1992 and 1997; and (3) the successful tuberculosis programme in Peru that increased the direct observed treatment short-​course coverage from 70% in 1990 to 100% in 1998, with a cure rate of 90%. Examples such as this pro- vided evidence of the feasibility of reducing disease burden in low, lower-​middle, and upper-​middle-​income countries. Best practices and success stories encouraged world leaders to strive towards a global response to fight these three killer diseases, giving direct help to the most vulnerable people, removing significant obstacles that keep them in poverty, preventing their families from falling into poverty, and strengthening health services’ capacity and improving health status in general. Striving towards health Millennium Development Goals in low-​income developing countries All United Nations member states and about 23 international or- ganizations committed to achieve the following Millennium Development Goals (MDGs), established following the Millennium Summit of the United Nations in 2000, by 2015: • to eradicate extreme poverty and hunger; • to achieve universal primary education; • to promote gender equality; • to reduce child mortality; • to improve maternal health; • to combat HIV/​AIDS, tuberculosis, malaria, and other diseases; • to ensure environmental sustainability; • to develop a global partnership for development. In 2005, the G8 finance ministers agreed to provide enough funds to the World Bank, International Monetary Fund, and the African Development Bank to cancel $40–​55 billion in debt owed by members of the heavily indebted poor countries to allow them to redirect resources to programmes for improving health and education and for alleviation of poverty. Each economically ad- vanced country was expected to progressively increase its offi- cial development assistance to reach a minimum net amount of 0.7% of its gross national product at market prices by the middle of the decade. In 2009, the High-​Level Taskforce on Innovative International Financing for Health Systems estimated that on average $44 per capita—​rising to $60 in 2015—​would be needed to ensure coverage with a set of essential health services in 49 low-​ income countries. The following indicators were selected to assess the progress of health status in low-​income developing countries from 1990 up to 2014: to reduce by two-​thirds, between 1990 and 2015, the under-​ five mortality rate (MDG target 4); to reduce by three-​quarters the maternal mortality ratio (MDG5 target 5); to reduce the burden of HIV/​AIDS, tuberculosis, malaria, and non​communicable diseases (MDG target 6); and to increase life expectancy at birth. The following paragraphs assess the progress made in reaching these important indicators in the 60 countries under review. For analysis we computed, for each country and based on the World Health Organization (WHO) estimates, the way each indicator evolved from 1990 to 2013. A critical analysis of MDGs 4 (U5MR) and 5 (MMR) reveals four categories of countries (Table 2.16.1 and Fig. 2.16.1): 1. Those who are clearly outliers: Zimbabwe and Lesotho are inside the red ellipse in Fig. 2.16.1 because their very high burden of HIV/​AIDS has negatively impacted on child and maternal health. Table 2.16.1  Progress towards MDG4 (U5MR) and MDG5 (MMR) in low-​income developing countries Country % reduction in MDG4 % reduction in MDG5 Bhutan –​72.9% –​86.7% Malawi –​72.3% –​53.6% Nepal –​72.1% –​75.9% Bangladesh –​71.4% –​69.1% Liberia –​71.3% –​46.7% Mongolia –​70.5% –​65.6% Tanzania, United Republic of –​69.0% –​54.9% Ethiopia –​68.6% –​70.0% Niger –​68.2% –​37.0% Bolivia –​68.1% –​60.8% Cambodia –​67.7% –​85.8% Eritrea –​66.9% –​77.6% Rwanda –​65.7% –​77.1% Madagascar –​65.2% –​40.5% Nicaragua –​64.8% –​41.2% Mozambique –​63.2% –​63.1% Kyrgyzstan –​63.2% –​11.8% Uganda –​63.0% –​53.8% Honduras –​62.4% –​58.6% Senegal –​60.8% –​39.6% South Sudan –​60.8% –​59.4% Yemen, Republic of –​58.9% –​41.3% Guinea –​57.6% –​40.9% Gambia, The –​56.5% –​39.4% Lao People’s Democratic Republic –​55.9% –​80.0% Tajikistan –​55.9% –​35.3% Zambia –​54.6% –​51.7% São Tomé and Príncipe –​53.8% –​48.8% Myanmar –​53.5% –​65.5% Vietnam –​53.0% –​65.0% Benin –​52.5% –​43.3% Moldova –​52.3% –​32.0% Burkina Faso –​51.7% –​48.1% Mali –​51.7% –​50.0% Burundi –​51.5% –​43.1% Haiti –​49.7% –​43.3% Congo, Republic of –​46.7% –​38.8% Afghanistan –​45.7% –​66.7% Nigeria –​44.9% –​53.3% (continued) 2.16  Financing healthcare in low-income developing countries 171 2. A second group of countries (Somalia, Central African Republic, Kenya, and Cameroon) that made very small improvements (in- side the orange ellipse in Fig. 2.16.1) are countries that faced frequent humanitarian crises or/​and present high inequalities. 3. A third group of 14 countries (inside the blue rectangle in Fig. 2.16.1) with an achievement above the median (Bolivia, Cambodia, Eritrea, Ethiopia, Honduras, Lao, Malawi, Mozambique, Rwanda, São Tomé and Príncipe, South Sudan, Tanzania, Uganda, Zambia). 4. Four countries with very good progress towards these two goals: Mongolia, Bangladesh, Nepal, and Bhutan (inside the green ellipse in Fig. 2.16.1). In Mongolia and Bhutan primary healthcare services are fully state funded, ensuring free access to everyone, and social health insurance population coverage is high. This, however, is not so in Bangladesh and Nepal. Bangladesh invested significantly in strengthening the health sector. In Nepal specific programmes provided free delivery of care and financial incentives to pregnant women to access a variety of maternity services, including antenatal care, de- livery in a facility and postnatal care. Programmes targeting child health have been intentionally community-​based. In this group of countries, wealth inequalities (as measured by Gini Index) are low. Regarding MDG6, the HIV prevalence for adults dropped on average from 3.6% to 2.5% (from 2000 to 2013), while the average number of deaths due to tuberculosis (per 100 000 population) de- creased from 50 to 30. For the same period, the reported number of Country % reduction in MDG4 % reduction in MDG5 Guinea-​Bissau –​44.9% –​39.8% Togo –​42.1% –​31.8% Djibouti –​41.3% –​42.5% Uzbekistan –​40.5% –​45.5% Sudan –​40.2% –​50.0% Sierra Leone –​40.0% –​52.2% Kiribati –​39.0% –​48.0% Ghana –​38.8% –​50.0% Comoros –​37.9% –​44.4% Côte d’Ivoire –​34.0% –​2.7% Congo, Democratic Republic of –​32.7% –​27.0% Chad –​31.3% –​42.4% Papua New Guinea –​31.1% –​53.2% Cameroon –​30.6% –​18.1% Kenya –​28.4% –​18.4% Mauritania –​23.5% –​49.2% Solomon Islands –​22.2% –​59.4% Central African Republic –​21.3% –​26.7% Somalia –​19.0% –​34.6% Lesotho 13.6% –​31.9% Zimbabwe 18.6% –​9.6% –0.8 –0.7 –0.6 –0.5 –0.4 –0.3 –0.2 –0.1 0 0 –0.1 –0.2 –0.3 –0.4 –0.5 –0.6 –0.7 –0.8 –0.9 –1 % Reduction in MDG4 Weak Good progress % reduction in MDG5 0.1 0.2 Fig. 2.16.1  Achievement of MDG4 (U5MR) and MDG5 (MMR) (% of reduction between 1990 and 2013). Source: WHO, Millennium Development Goals Indicators. 2014. 172 section 2  Background to medicine malaria deaths decreased on average 65%. However, the pattern of progress was different across countries: for example, in Malawi and Zimbabwe, both with very high HIV prevalence, considerable pro- gress has been made in the last 15 years, but this has not happened in other countries, such as Mozambique. TB rates remain high in countries like Nigeria, Somalia, and Cameroon. Mean life expectancy at birth in low-​income developing countries has improved from 55.08 years (in 2000) to 61.57 (in 2012), com- pared with 76 (in 2000) to 79 (in 2012) in high-​income countries. Age-​standardized mortality rate due to communicable diseases, non​communicable diseases and injuries has dropped 35%, 7%, and 20%, respectively, from 2000 to 2012. In summary, the progress towards health MDGs in low-​income developing countries is heterogeneous, with some champions, a large group of good performers, a group of countries with slow im- provements, and two countries with no progress. In 2018 the WHO published a progress report on the health-related MDGs, which re- corded impressive progress in some countries. Bangladesh, Bhutan, Mongolia and Nepal did achieve MDGs 4 (U5MR) and 5 (MMR) over the last ten years (Tables 2.16.1 and 2.16.2), but most low-​ income developing countries did not and will need to address them in their national health development policies and related health system designs. Apart from HIV/​AIDS, tuberculosis, and malaria with serious impact on MDGs 4 (U5MR) and 5 (MMR), many other infectious diseases—​neglected tropical diseases and epidemic prone diseases—​ mostly hit poor people. These diseases place a substantial economic burden on health systems and contribute to lifelong disadvantage of already deprived people. For example, malaria hits the subtropical areas around the world—​the poverty belt—​and in high endemicity areas of sub-​Saharan Africa it accounts for 40% of public health ex- penditure, 30–​50% of inpatients admissions, and up to 50% of out- patient visits. There is currently a shift in global focus from communicable dis- eases to non​communicable diseases as the major cause of morbidity and mortality in low-​income developing countries, but most of these countries are facing the burden of both. More research is required to understand the associations between communicable diseases and non​communicable diseases. The 2002 emergence of severe acute respiratory syndrome (SARS) in Asia; the 2007 bird flu outbreak caused by highly pathogenic H5N1 avian influenza in Asia, Europe, and Africa; the 2009 swine flu pan- demic produced by H1N1 influenza in South-​America; the 2014–16 and 2018–19 (ongoing) haemorrhagic fever epidemics caused by Ebola virus in West Africa; and the Zika virus epidemic of 2015–16, which began in the Americas; have all vibrantly demonstrated the vul- nerability of communities to emerging and re-​emerging diseases, and the need for more resilient health systems at local and global levels. By identifying ways to strengthen health systems and improve access to essential healthcare, it is possible to reach people in poor resource settings, even in remote areas, and rescue them from the trap of disease–​poverty–​disease. Tackling diseases requires invest- ments in knowledge, technology, manpower, and other required capacities to strengthen the overall health infrastructure. However, bearing in mind that interrelationships between health–​disease and its determinants are complex and dynamic, and often taking place in turbulent and unforeseen environments, leaders, and health man- agers must apply a critical approach to designing and developing health systems. Governments, partners, communities, and other im- portant stakeholders require reliable and appropriate evidence at the right time to develop health policies, strategic plans, and operational budgets that respond to epidemiological patterns in terms of pro- vision of care, and reflect equity principles in terms of access to it. The challenge of financing healthcare in  low-​income developing countries General approach of health systems financing A health system can be defined as all organizations, people, and actions whose primary intent is to promote, restore, or maintain health. According to WHO a national health system has six core components: health workforce, service delivery, health technologies (medical products, vaccines, and technologies), financing, informa- tion, and leadership and governance. Health systems play a key role in improving the health status of populations and contribute to the improvement of living standards and economic and social development. In health terms, ‘ideally Table 2.16.2  Likely timescale for achieving MDGs 4 and 5 in low-​income developing countries MDG 4 –​ Reduce under 5 mortality MDG 5 –​ Improve maternal health Achieved by 2019 Bangladesh; Bhutan; Mongolia; Nepal Bangladesh; Bhutan; Mongolia; Nepal Achieve 2020–​2029 Benin; Bolivia; Cambodia; Comoros; Eritrea; Ethiopia; Ghana; Haiti; Honduras; Kiribati; Lao People’s Democratic Republic; Liberia; Madagascar; Malawi; Myanmar; Nicaragua; Niger; Rwanda; São Tomé and Príncipe; Senegal; Sierra Leone; Somalia; Tajikistan; Tanzania; Uganda; Yemen; Vietnam Central African Republic; Kiribati; Rwanda; Tajikistan; Vietnam Achieve 2030–​2039 Burundi; Djibouti; Gambia; Guinea; Mali; Mozambique Myanmar; Sudan; Uganda Achieve after 2040 Afghanistan; Burkina Faso; Cameroon; Central African Republic; Chad; Congo (DR); Congo; Côte d’Ivoire; Guinea-​Bissau; Kenya; Kyrgyzstan; Lesotho; Mauritania; Nigeria; Papua New Guinea; Solomon Islands; Sudan; Togo; Uzbekistan; Zambia; Zimbabwe Afghanistan; Benin; Bolivia; Burkina Faso; Burundi; Cambodia; Cameroon; Chad; Comoros; Congo (DR); Congo; Côte d’Ivoire; Djibouti; Eritrea; Ethiopia; Gambia; Ghana; Guinea; Guinea-​Bissau; Haiti; Honduras; Kenya; Kyrgyzstan; Lao People’s Democratic Republic; Lesotho; Liberia; Madagascar; Malawi; Mali; Mauritania; Mozambique; Nicaragua; Niger; Nigeria; Papua New Guinea; São Tomé and Príncipe; Senegal; Sierra Leone; Solomon Islands; Somalia; Tanzania; Togo; Uzbekistan; Yemen; Zambia; Zimbabwe 2.16  Financing healthcare in low-income developing countries 173 everyone should have a fair opportunity to attain their full health potential and, more pragmatically, none should be disadvantaged from achieving this potential if it can be avoided’ (Whitehead, 1990). In healthcare the principle of equity should lead to universal access of everyone to available care for equal need, equal utilization, and equal quality. Around the world, health systems face serious chal- lenges in terms of their financial sustainability and providing equity in health and universal access to quality healthcare. Although public funding—​through taxes or social insurance—​is dominant, out-​of-​ pocket payments from citizens represent an important share of total spending. In general, the financial flows in a health system run between three types of agents:  population, financing organization(s), and healthcare providers, as described in Fig. 2.16.2. The payment for healthcare provided to the population can come from two major sources: payments made by the population at the time of use and/​or payments made by third-​party payers. The latter source of payment is based on previous fund-​raising among the population, which finances the health system. Within this general description fall the various existing systems, such as financing by taxes, social insurance, or private insurance premiums. Consequently, the financing organization may be the state, sickness funds, or insurance companies. Adjusting the financial model to the socioeconomic context is the central problem for health financing in any country in the world. The diversity and complexity of outcomes and determinants of health leads to a difficult question: how can we evaluate the perform- ance of a health system? In this chapter we consider three out of the five indicators proposed in the World Health Report of 2000: the overall health status, the distribution of health in the population, and the distribution of the financial contribution. With the epidemio- logical transition from infectious diseases to non​communicable ­diseases, the measure of non​fatal consequences of diseases, espe- cially chronic diseases and external causes, proves to be relevant in both low-​ and high-​income countries. The distribution of health and health determinants, in turn, allows for the assessment of equity in the distribution of health outcomes in terms of MMR and U5MR, life expectancy, and disease burden in general. The application of these indicators to WHO member countries and the resulting classification of countries has given rise to an intense debate, either directed to the results or to methodological and scientific aspects of the assessment. The context in low-​income developing countries In low-​income developing countries, the first aspect to consider is that the state plays a central role, not only in funding but also in provision of services, through a model of national health service supported by social insurance, usually associated to formal em- ployers. A second issue is the amount of out-​of-​pocket payments by families to support healthcare. In the African region, where about 60% of low-​income developing countries are situated, the general government expenditure on health constituted 48.7% of total ex- penditure on health, and by the end of 2011 only 6 of the 45 coun- tries reporting on this factor had met the Abuja target of allocating at least 15% of total government expenditure to the health sector. Moreover, out-​of-​pocket payments form over 50% of total health spending, which exposes families to the risk of catastrophic health expenditure. Advance payment schemes are a possible way to over- come this problem. These can be arranged through general taxation and/​or compulsory contributions to health insurance. A third aspect to consider is the need to promote efficiency in national health services to optimize resource allocation and maximize results. The 2010 WHO report on health systems financing estimated that 20–​ 40% of resources dedicated to health are wasted. In recent years there has been an increase in the number of African countries seeking universal health coverage based on man- datory prepayment schemes. The literature identifies a wide range of factors that would allow these countries to progress towards uni- versal health coverage: revenue growth, which increases the finan- cial capacity of companies and citizens; improving tax collection systems, which increases the volume of contributions; the devel- opment of communication infrastructures, improving geographic access to healthcare; increase in the health system workforce; pro- moting social solidarity, facilitating financial transfers from the rich to the poor and from the healthy to the sick; and strong leadership by governments in respect to all these issues. However, it is unlikely that universal health coverage can be achieved solely through tax funding in the present circumstances. The ability for low-​income countries to generate more tax revenue is limited by structural factors: per capita income is relatively low and gross domestic product (GDP) growth is modest; there are large informal economy sectors (including agri- culture) that make it difficult to collect taxes; there are low volumes of taxable imports; and due to institutional factors related to political instability there is weak institutional capacity, weak accountability, and inefficient tax administration systems. All these are exacerbated by the unpredictability and non​alignment of myriad partners with national health priorities. For example, an important share of health financing, especially in Africa, comes from international funding, with a particular increase in funding for specific diseases such as that provided by the Global Alliance for Vaccines and Immunization and the Global Fund to Fight AIDS, Tuberculosis and Malaria. Payments to providers Collection of funds for health care Out-of-pocket payments Financing organizations Population Healthcare providers Delivery of healthcare Fig. 2.16.2  Financial flows in health systems. 174 section 2  Background to medicine While universal health coverage is an ideal that no country can fully achieve, many countries are now getting closer to universal coverage and others are making good progress using different ap- proaches. Interpreting universal health coverage in the context of each country involves disaggregating the concept in terms of ob- jectives and emphasizing the progress in improving equity in access to health services, in improving service quality, and in providing fi- nancial protection, while considering the access to other key health determinants. What do countries need to do to make progress towards health MDGs? Research has shown, for example, that in countries with good governance, additional government health spending does re- duce child mortality. World Bank studies also concluded, however, that development assistance has a stronger effect in countries with solid policies and institutions than in countries with only average-​ quality policies and institutions, and an insignificant effect in coun- tries where policies and institutions are weak. The crucial question remains how to finance the extra spending that is required for most countries to attain the MDGs. Health spending can be divided into three categories:  private (out-​of-​ pocket expenditures and private insurance); public (financing from general revenues and social insurance contributions); and external sources (development assistance). There is evidence that where out-​ of-​pocket spending is less than 20% of total health expenditure, the risk of catastrophic health expenditure is negligible. Two other im- portant indicators in this context are the share of general revenues allocated to the health sector (the 2001 Abuja target, only for African countries, pledged to increase government funding for health to at least 15%) and the per capita total expenditure on health. The High-​ Level Taskforce on Innovative International Financing for Health Systems estimated that on average $60 per capita in 2015 would be needed to ensure coverage with a set of essential health services in low-​income countries. Health spending As illustrated in Table 2.16.3, the median expenditure on health as a percentage of GDP has grown from 4.5% in 1995 to 5.9% in 2013, largely attributable to increase in median government expenditure from 1.9% to 3% of GDP over this period. Government expenditure on health as a percentage of total government expenditure rose from a median value of 8.5% in 1995 to 10.5% in 2013. However, this pat- tern is very heterogeneous, and only 8 of the 57 countries reporting on this factor had met the Abuja target of 15%. In 2013, median per capita total expenditure on health was $137.6 compared with $55.2 in 1995 (at constant 2011 prices). It was less than $44 in only four countries, $44 to $60 in two countries and over $60 in 51 (90%) of the countries. Health financing pattern Concerning health-​spending categories, the median out-​of-​pocket payments as a share of total health expenditure decreased from 48% in 1995 to 40.5% in 2013. This indicator is still above 20% in 49 countries, meaning that many of the population in most of the countries (86%) remain exposed to the risk of financial catastrophe and impoverishment. Median government expenditure on health constituted 40.7% of total expenditure on health in 1995 and 49.2% in 2013. The external resources for health made up less than 20% of total health expenditure in half of the countries, 20–​50% in 21 countries, and over 50% in six countries. These figures contrast with those obtained by high-​level income countries, as can be seen in Table 2.16.3. The relationship between the pattern of health financing and the attainment of the MDG Key health spending and health financing indicators for coun- tries performing best in terms of both MDG4 (U5MR) and MDG5 (MMR) are shown in Figs. 2.16.3–​2.16.6. The pattern of the four countries is substantially different: whereas Bhutan and Mongolia achieved good performance with relatively high and considerable increasing per capita expenditures on health, Bangladesh attained MDGs achievement with relatively low per capita health expend- itures. The pattern of Nepal health economic indicators is similar to the median of the countries (Fig. 2.16.3). There is also a surprising amount of variation across countries in the share of GDP allocated to government health programmes and its trend over time. Countries achieving the two MDGs spend quite different amounts: Bangladesh with relatively low and unchanged share over time; Mongolia had relatively high and increasing trend (Fig. 2.16.4). Bangladesh, Bhutan, and Mongolia decreased gov- ernmental health expenditures as a percentage of total government expenditures between 1995 and 2013, while in Nepal this indicator increased over time, presenting the same pattern as the median of the 57 countries (Fig. 2.16.5). The share of the three health-​spending categories composition is also different across countries. In Bangladesh and Nepal, the share of out-​of-​pocket expenditure in total health spending is persistently Table 2.16.3  Health economic indicators for low-​income developing countries (LIDC) and high-​level income countries (HIC) LIDC HIC Indicator/​Year 1995 2013 2013 Per capita total expenditure on health (PPP, Int US$, and constant prices 2011) 55.2 137.6 4516 Total expenditure on health as a percentage of GDP 4.5 5.9 11.6 Government expenditure on health as a percentage of GDP 1.9 3.0 7.3 General government expenditure on health as a percentage of total government expenditure 8.5 10.5 16.8 Out-​of-​pocket expenditure as a percentage of total expenditure on health 47.9 40.5 15 Government expenditure on health as % of total health expenditure 40.7 49.2 60.6 Source: World Bank Statistics. 2.16  Financing healthcare in low-income developing countries 175 higher than in the other two countries (Fig. 2.16.6). This clearly shows that distinct health financing patterns can lead to the same result, namely substantial improvement in MDG4 (U5MR) and MDG5 (MMR). The health production function Health economists define the health production function as a math- ematical equation that relates health outcome to a set of factors, namely per capita income, share of government health spending in GDP, education attainment, and access to health services. Using women years of education as a proxy for educational attainment and the percentage of urban population as a proxy for access, we have estimated the impact of each variable on achievement of MDGs, adjusting for possible confounders. The multivariate regression for 2009 data shows that there are two relevant indicators that affect U5MR (MDG4): GNI per capita and female education. Adjusting for other factors, the under-​five mortality rate decreased by 0.327% when GNI per capita increased by 1%. Relating to female education, the U5MR was roughly 7% lower per additional year of female edu- cation. MDG5 (MMR), after adjusting for other factors, decreased by 0.58% when GNI per capita increased by 1%, and there was a 15% reduction in maternal mortality rate per additional year of female education. Conclusions Low-​income developing countries remain challenged by the increasing demand for essential healthcare combined with gov- ernments’ inability to mobilize and allocate adequate financing to health systems. In spite of people’s motivation to pay for better access to quality healthcare, the poor health infrastructure, low salaries in the health sector, and limited access to essential health techno­ logies all frustrate implementation in the field of equitable health policies and the aspiration towards universal health coverage. Economic growth and increased health spending alone are not enough to scale-​up healthcare coverage and achieve better health outcomes. These must be combined with accountability of results, transparent management of public funds, and multisectoral ef- forts with community involvement at implementation level. We should appreciate the progress made in all low-​income developing countries in reducing both U5MR and MMR, even if most did not achieve the relevant MDGs. The successful cases of Bangladesh, Bhutan, Mongolia, and Nepal, who did achieve MDGs 4 (U5MR) and 5 (MMR) over the last ten years, illustrate good practices that deserve more careful study. While we appreciate the rise in median government expend- iture for health in low-​income developing countries from 40.7% in 1995 to 49.2% in 2013, it is very important to recognize that the median out-​of-​pocket payments as a share of total health expend- itures is still above 20%, which means that most people in these countries are seriously exposed to catastrophic illness and im- poverishment, and this situation has not significantly improved. Poverty increases the need for healthcare while dwindling the capacity to pay for it. 0.00 100.00 200.00 300.00 400.00 500.00 600.00 Bangladesh Bhutan Mongolia Nepal Median 1995 2013 Fig. 2.16.3  Per capita total expenditure on health in 1995 and 2013 (PPP, Int$, constant prices 2011). 0.00 0.50 1.00 1.50 2.00 2.50 3.00 3.50 4.00 Bangladesh Bhutan Mongolia Nepal Median 1995 2013 Fig. 2.16.4  Government expenditure on health as a percentage of GDP in 1995 and in 2013. Source: World Bank statistics. 0.00 2.00 4.00 6.00 8.00 10.00 12.00 14.00 Bangladesh Bhutan Mongolia Nepal Median 1995 2013 Fig. 2.16.5  General government expenditure on health as a percentage of total government expenditure in 1995 and in 2013. Source: World Bank statistics. 176 section 2  Background to medicine Finally, it is important to highlight that health financing should be fit for its purpose as a key input to national health systems. It should be consistent with national health policies and budgeting models that enable the health system to perform its functions and ensure social protection. There is no magic formula, no blueprint model to be pro- scribed. Communities, governments, and international development partners should keep striving for improved local, national, and global health, which presupposes improved health financing at all levels. FURTHER READING IMF (2014). Poverty reduction and growth trust. International Monetary Fund, Washington DC. Sambo LG, Kirigia JM (2014). Investing in health systems for universal health coverage in Africa. BMC Int Health Hum Rights, 14, 28. Wagstaff A, et  al. (2006). Millennium development goals for health: what will it take to accelerate progress?. In: Jamison DT, et al. (eds) Diseases control priorities in developing countries, 2nd edition, Chapter 9. World Bank, Washington DC. Whitehead M (1990). Concepts and principles of equity and health. Copenhagen, World Health Organization Regional Office for Europe. WHO (2000). World health report. World Health Organization, Geneva. WHO (2012). Malaria in Africa. World Health Organization, Geneva. WHO, et al. (2002). Scaling up the response to infectious diseases. World Health Organization, Geneva. WHO (2018). Millenium Development Goals (MDGs). World Health Organization, Geneva. https://www.who.int/news-room/ fact-sheets/detail/millennium-development-goals-(mdgs) 61.30 30.79 11.60 69.56 47.90 36.24 69.21 82.50 26.48 40.70 2.45 5.90 3.96 11.40 0.00 10.00 20.00 30.00 40.00 50.00 60.00 70.00 80.00 90.00 100.00 60.23 25.41 37.01 46.15 40.50 35.26 73.84 60.25 43.29 49.20 4.51 2.74 10.56 10.30 0.00 10.00 20.00 30.00 40.00 50.00 60.00 70.00 80.00 90.00 100.00 Bangladesh Bhutan Mongolia Nepal Median Bangladesh Bhutan Mongolia Nepal Median Fig. 2.16.6  Share of out-​of-​pocket (in blue), government (red), and other health expenses (green) (as a % of the total health expenditure) in 1995 (left) and 2013 (right). Source: World Bank statistics. 2.17 Research in the developed world 177 2.17 Research in the developed world 177 ESSENTIALS The entire health system seeks to prevent disease, or diagnose and intervene effectively to limit its impact. The goal of most funders of medically related research in the developed world is to support re- search that is upstream of that which is relevant and attractive to health providers or pharmaceutical companies, and to work syner- gistically in the health research ecosystem with these and other com- plementary organizations when appropriate. It is often impossible to predict which advances in basic science will lead to breakthroughs, but if this upstream work is not done, then in the mid and long term, the flow of medical advances will dry up. The central problem for the funder of medical research is how to choose the best use for the next Dollar, Pound, or Euro. This is essentially a problem in prediction, but as Yogi Berra said: ‘It’s tough to make predictions, especially about the future.’ Introduction ‘And over them triumphant Death his dart Shook, but delay’d to strike, though oft invok’d.’ (Milton, J., Paradise Lost) A recent estimate is that current annual global spending on research and development is in the region of (US) $1.5 trillion (c.2012), of which roughly $270 billion (or about 18%) goes to medically re- lated research. Thus, in the 19 years since satirical online publication The Onion reported that the death rate remained at 100% despite huge efforts from health workers and doctors worldwide, roughly $5 trillion has been spent research work with the objective of health benefit. Yet the death rate has not been affected. Furthermore, over the next five years, the Wellcome Trust intends to spend another $7.5 billion with the same goal! So why do we do it? How do we attempt to do it well? Because, while Death cannot be denied, life can be improved through better health and Death itself can at least be delayed and premature Death prevented. Health economists might state this as an effort to maxi- mize the global number of Disability Adjusted Life Years. While the entire health system seeks to prevent, diagnose, and intervene effectively, the goal of health research is upstream of that, to deepen our understanding of the biological and social determinants that underlie health and the processes that undermine it to develop better interventions for prevention, cures, and palliative interventions. Time scales The goal at the Wellcome Trust, alongside most other funders of medically related research in the developed world, is to support research that might be upstream of that which is relevant and at- tractive to health providers (such as the National Health Service in the United Kingdom) or to pharmaceutical companies, and also to work synergistically in the health research ecosystem with these and other complementary organizations when appropriate. We seek to support the best work throughout the research-​to-​ implementation cycle mostly in academic institutions (universities and institutes), some of which will be too far from application or of too great a risk of failure to be justifiable in the budget of a pro- vider or a for-​profit company. However, we are confident that if this upstream work is not done, then in the mid and long term the flow of medical advances will dry up. The Wellcome Trust also offers extensive funding to encourage public engagement and involve- ment with science and to research work in the humanities, ethics, and public policy. We believe that this approach ensures that the best science remains integrated within the context of the societies in which it operates and has the greatest opportunity to impact on human health whether that impact is to be in the short, medium, or long term. At the Wellcome Trust, we choose to target our funds to a range of time scales. Some investments are made in work that we estimate to be very far upstream, such as our funding of parts of the human genome-​sequencing project (with the US National Institutes of Health (NIH) and others) in the 1990s. That work was clearly then of little immediate utility, but now accelerates much more targeted work in genetic testing, gene therapy, and the identification of gene products that may be useful targets for drug development. Genomics is now becoming widely useful, with larger scale genome projects in both humans and pathogens, and the beginnings of attempts to link sequence data to medical records. 2.17 Research in the developed world (a view from the Wellcome Trust) Jeremy Farrar 178 section 2  Background to medicine Some of our funds are targeted even further upstream. A signifi- cant example is our funding (with the Gatsby Foundation) of the Sainsbury-​Wellcome Centre for Neural Circuits and Behaviour. This Institute is focused on the study of basic neuroscience, by the study of the behaviour and activity of neurons and their circuits in the brains of living animals (rodents and fish) while they engage in solving mazes and similar tasks. A dozen years ago techniques were developed to target genes to neural cells that express proteins which can reveal the activity of those cells by emitting specific colours of light: almost literally putting light-​bulbs in the head of an animal as it thinks about its work. At roughly the same time light-​activated channels were adapted from bacteria for expression in animals, so that now specific neurons can be switched on, or off, by the appli- cation of light. Taken together these new technologies are broadly known as ‘optogenetics’. And at the same time people have devel- oped better methods for mapping the synaptic connections of neural cells using light and electron microscopy, to elucidate the precise wiring diagrams of neural circuits. So that now, scientists can map the neural networks, see the ac- tivity and control specific cells, all while the animal is behaving. In an analogy to the London Underground: suddenly we can see the tracks, the trains as they move along them and even through some switches to see what happens to the paths of the trains. The clock- work of the mouse or fly mind is being revealed. One hopes that in a few decades a similar understanding of the human brain could be developed (a much bigger task). But now the science is jumping ahead again, with the discovery and engineering of proteins to control nerve cells not by light, but by small molecules (designer receptors exclusively activated by de- signer drugs, DREADDs) or by ultrasound. Work that seemed to be decades away from the clinic is now possibly only a few years away and we stand at the brink of major progress in the understanding and treatment of diseases of the brain. Perhaps conditions like epi- lepsy will be the first that are amendable to this sort of intervention, but we are increasingly confident that these new technologies will be applicable to highly complex conditions, including mental illnesses and dementia in the medium to long term. It is often impossible to predict which advances in basic science will lead to such breakthroughs. For example, study of channel rhod- opsin in bacteria and fluorescent proteins in jellyfish leading to the understanding of and then control of the brain, or the work in fun- damental discovery science of over 50 years ago which had led to today’s opportunities for treatment with stem cells and monoclonal antibodies. Such cases are the justification for funding research all along the path from the very fundamental to the very applied. But which or- ganizations should work to support which stages along the path? Health services and pharmaceutical and medical engineering com- panies are motivated towards research that is close to clinical utility or medical products. But there are gaps at this end of the spectrum for diseases where there is ‘market failure’. Bluntly put; situations for which there are no financial incentives to develop interventions, dis- eases which predominantly affect the poor or diseases of poor coun- tries. Funders in wealthy countries often support work on neglected tropical diseases, for example, through genuine humanitarianism. This is sometimes augmented by enlightened self-​interest: most ob- viously in the case of emerging infections that become epidemic in poor countries (such as Ebola virus). Developing vaccines or cures for these illnesses is clearly in the interest of all, as such epidemics can have a devastating and destabilizing impact on the whole of a society, country, or region and many outbreaks are only the length of a jetliner’s flight from the rest of the world. Another reason why organizations in high-​income countries might fund research in low-​ and middle-​income ones is simply the power of studying a disease where it is common. One clear example of this is HIV disease and tuberculosis in Southern Africa. With the prevalence of HIV in parts of South Africa at 60%, there are imme- diate benefits to all for research to be conducted where the need is greatest. Governments fund research all along the path from basic science to applied work, but there is increasing pressure to spend tax-​payers’ money on research that will bring health or economic benefit in the short term. We can see this in the pressure for demonstrable ‘im- pact’ in the United Kingdom and in the United States, in the reduced budget for basic research in Australia, and the level-​funding of the NIH in the United States, while the National Science Foundation (NSF) is actually in decline. Types of funders Government funding mechanisms in developed countries are di- verse. The largest single funder in the world is the NIH’s USA extramural programme. In total the NIH funds over $30 billion a year, mostly in the United States and mostly through universities, although a significant fraction (about 17%) is spent at the intra- mural programmes (mostly at the various Institutes of the National Institutes of Health in Bethesda Maryland). The US National Science Foundation covers a broader range of science, but spends only about a quarter of the sum that the NIH does. The NSF runs no directly funded institutes. Other US federal agencies also contribute to the broad biomedical research agenda including the Department of Energy, Federal Drug Administration, and many others. As a result, the total government funding available in the United States for bio- medical research amounts to over $50 billion a year. The European Union’s European Research Council (ERC) funds roughly €13 billion per year. In addition, individual European coun- tries fund science themselves. Germany funds biomedical research through both the Deutsche Forshungsgemeinschaft (DFG, or German Research Foundation), which funds mostly through universities, and the Max Planck Society, which works by funding its own insti- tutes. In the United Kingdom the Medical Research Council (MRC) runs a mixed model, with funding both through grants to academic research leaders in universities as well as its own units and institutes. The largest MRC sole-​funded institute now is the Laboratory of Molecular Biology (the ‘LMB’) in Cambridge, although the MRC is also a founding partner in the Francis Crick Institute, which opened in 2016 and will host 1200 researchers. Some non​government funders have been set up as charities that specifically fund work on specific diseases or areas, such as the American Cancer Society, the British Heart Foundation, the Fragile-​X Association, and many others. These organizations gen- erally raise funds from donors in the community and with specific disease interests, and many do not accumulate large endowments. Traditionally it has not been attractive for disease-​motivated donors to fund fundamental research with a long-​term perspective because 2.17  Research in the developed world 179 they are mostly focused on short term progress in the disease areas of interest. These pressures often turn these charities away from basic research. However, this is changing and increasingly some of these philanthropic organizations including, for instance, Cancer Research UK and Paul Allen Foundation, are supporting extensive programmes in discovery science and taking a much longer-​term perspective. Other private charities are funded by one (or a very few) living donors. The largest of these now is the Bill and Melinda Gates Foundation, which is focused on the interests of the donors:  for impact on global health during their lifetimes. Another example is the Gatsby Foundation, in the United Kingdom, which is funded by Lord Sainsbury of Turville and is focused on research into food sus- tainability and neuroscience areas of interest to the donor. Some of these personal foundations are directed to spend out their funds during the lives of the donors, whereas others are organized on large endowments left by the original donors or corporations to be- come perpetual foundations or charities. These organizations invest the endowment and spend a defined fraction of the value towards the charitable activities, mostly research. The two largest of these are the Wellcome Trust, in the United Kingdom, with an endowment of over £18 billion at the last financial report, and the Howard Hughes Medical Institute (HHMI), with an endowment of roughly $18 bil- lion. Neither the Wellcome Trust nor the HHMI focus on specific diseases, but fund research broadly across the biomedical science and health area. What is the ‘unit of funding’ Most non​commercial biomedical research funding in developed countries goes towards work done by academics in universities. This is for historical reasons (that is where most of the basic and clinical scientists are), but also because universities have evolved to provide a comprehensive academic infrastructure, based on scholarship and teaching, that includes key resources (buildings, libraries, vivaria, and more). Most important is the support of the academics them- selves, through salaries and a career structure. In serious research intense universities, academic freedom is assured, at least at the level of the independent group leader (often but not always some variant of ‘professor’). Thus independent research group leaders are free to pursue their interests as far as they can find funding to do so. While they may (or usually these days do not) have ‘tenure’ (permanent employment), today they very rarely have long-​term institutional re- search funding and must therefore secure grants from government or private funders. These academics are overwhelmingly dedicated, imaginative, creative, hard-​working, and competitive. Because they must continuously run a permanent marathon cycle of research, publication, and grant-​winning, the benefits of this dedication come to us at far too cheap a price (academic salaries do not impress bankers). While there are very great advantages to funding into univer- sities, there are also drawbacks. The academic competitive structure is based (usually) on the ‘brand identity’ of the individual group leader/​professor. This is most often scored as senior-​authored papers. This culture can be a major inhibitor of collaboration between research groups. As science moves forwards it is often crucial for fields to meet, for new technologies to be developed and adopted and for ideas and theories to cross boundaries. The very reward-​culture system of the academic university thus incen- tivizes individual achievement and success, but at the same time dis-​incentivizes interdisciplinary collaboration. While there are some university structures that are also barriers (departments and schools), the real barriers are inherent to the system of credit-​by-​ authorship, the competitive cycle of short-​term funding and the increasing pressure for ‘impact’. In addition to the independent group leader/​professor, it is also critical to support the training and career-​progression of more junior researchers, through doctoral programmes and various forms of fellowships designed for basic scientists as well as clinicians. In most disruptive and creative industries, it is these individuals who have provided most of the breakthrough innovations that have led to major changes in the way we live and work. We, and our com- munity are too conservative in the freedom and support we give to this group. In our current system, at least in the United Kingdom and the United States, there is a very great excess of the number of junior people entering training over the number of professorial jobs that might await them. In the United Kingdom a study by the Royal Society in 2010 showed that only 0.45% of those entering PhD stu- dents are likely to end up as professors. While there are many other useful and productive paths for many people other than research independence, this ratio is clearly too great and is disheartening to young people. Many undergraduates sample research through pro- jects of vacation scholarships and are driven away to other fields when they see the odds for themselves. We are losing a great many talented people. The main alternative to funding research in universities is to sup- port science in institutes. These range from small ‘centres’ or ‘units’ within universities, sometimes in their own buildings, to larger or free-​standing ‘institutes’, usually on their own land, such as the NIH campus, HHMI’s Janelia Farm, or the Wellcome Trust’s Genome Campus at Hinxton. In such places it is possible to rewrite some or all of the normal ‘rules’ of academic research in an attempt to over- come some of the barriers. Commonly there is no tenure. Research focus can be required. Often, resources are controlled centrally by a director. Thus, the diffuse power of the university can be concen- trated to some specific goal or purpose. Some have even attempted to abolish publication as an outcome metric. The risk is that academic freedom maybe impinged upon and creative energy suppressed. At their best, core-​funded institutes bring together dedicated commu- nities that work synergistically around particular problems or tech- nologies, are liberated from the constraints of the short-​term grant cycle to take on big long-​term questions and take risks. Historical examples include, in their best years, the MRC’s LMB, the Institut Pasteur in Paris, or the Institute for Advanced Studies at Princeton. At their worst, core-​funded institutes can become ossified, bureau- cratic, and stultifying. How to choose Whether funding is to be devoted to training, to individual fellows or professors, or larger centres or institutes, a central problem for the funder is how to choose the best use for the next Dollar, Pound, or Euro. This is essentially a problem in prediction. As Yogi Berra said: ‘It’s tough to make predictions, especially about the future.’ 180 section 2  Background to medicine Another way to consider this is to distinguish orthogonal dimen- sions. One dimension is the fit to the strategic goals of the funder. For example, for the Wellcome Trust a grant proposal on astronomy is outside the remit and is a non​starter. On the other hand, spe- cific goals are often developed (such as the President’s BRAIN ini- tiative in the United States) and then proposals that align to these are favoured. The strategy may be politically driven, by the focus areas of the funder or by objective realities such as the burden of disease. Getting the balance between supporting the top-​down de- fined priority areas of the funder and the bottom-​up ideas from the community is a constant challenge and a point of discussion among all funders. In the end, no single funder can support everything it is asked to support and a mixed economy of top-​down and bottom-​up funding approaches are how most organizations work. A common problem is that just wishing to make progress in a disease area and defining a goal does not make the research tractable. One example would be Nixon’s National Cancer Act 1971—​the technologies re- quired were just not yet available when he made the statement and the Act was signed into law with a view to eradicate cancer Likewise, despite many years of effort, a vaccine for HIV has not yet been de- veloped despite being critically needed and despite the very best ef- forts of many people. The overriding dimension is clearly scientific quality. If one could somehow perfectly rank all available proposals across all the areas of interest at any one time from best to worst, then the funder could just support down the list until the money ran out. Of course, there are problems with this as well. Aside from Yogi Berra’s problem, it can be very difficult to quantify quality. As Ottoline Leyser recently pointed out ‘Quality is qualitative’. This does not stop many, many attempts to derive numerical metrics of research quality and im- pact. Some approach objective truth, such as citation analysis: at least it is apparent that some people are interested in a paper. But unfortunately it usually takes many years before the relative import- ance of a discovery becomes clear; much too late for any funding decision, and citation analysis can just reflect the inherent conser- vatism of the modern science environment. People work, are cited, get funded, and promoted by working on projects which others also work on. At present, almost all funders rely heavily on the opinions of sci- entific advisors. This can be by written peer-​review, discussion in committees, or by direct face-​to-​face interviews of applicants before committees. All of these are attempts to rank applications or appli- cants to allow for funding decisions to be made. All of these tech- niques contain possible artefacts and are rarely truly quantitative. It is also difficult to show that these processes are actually accurate in predicting success. The deepest problem for funders is to judge quality and oppor- tunity across different fields. Most avoid the problem by preallocating funds by area. Some (HHMI and the Wellcome Trust) do rank appli- cations and applicants across widely different fields. This is done by a series of screens: first of many applications for technical feasibility and broad promise. Thus field-​experts can remove proposals that are technically totally flawed, but these are increasingly rare. But the later stages converge on a smaller number of (mostly by this stage exceptional) candidates interviewed by a group of scientific advisors drawn from across many disciplines. Inevitably it can be difficult to ensure all disciplines are equally represented and all equal in how they judge their own discipline and other disciplines. There can also be the challenge of decision by committee which can lead to conser- vative judgements, or a single negative view ending the prospects of an application. In the end, judgements must be made. Lastly, an important principle has become acknowledged by most funders of research: that a grant is not a contract. In other words, if a scientist proposes a project, that project should be considered as an intellectual exercise: can the candidate state a clear and important question, can they think through the problems, can they work out how many people and years and dollars will be needed. But once they have succeeded and been awarded the funds, then they must be free to follow their path freely towards the best results. In addition, these may have very little to do with the original proposal. If the scientist discovers something important, or solves a big problem, or makes a breakthrough, no one will ask if it was in the proposal from five years before. If they are wise, neither will they care too much which journal it was published in. FURTHER READING Milton J (1667). Paradise lost. Book xi. The Onion (1997). World Death Rate Holding Steady at 100 Percent. https://​www.theonion.com/​world-​death-​rate-​holding-​steady- ​at-​100-​percent-​1819564171 2.18 Fostering medical and health research in reso 2.18 Fostering medical and health research in resource- constrained countries 181 ESSENTIALS Access to quality healthcare and education for all are essential elem- ents underpinning national development as well as prerequisites for personal well-​being. This makes it important to grasp the inequalities that still prevail in human and institutional capacity, including access to and use of research resources. The goals of fostering medical re- search in resource-​constrained countries are: to bring outstanding science to bear on their medical and health problems; invest in building the requisite medical and health research systems; develop an evidence base to inform policies and programmes, resource al- location, and health practice; enhance scientific processes, findings, explanation, and ‘discovery’ by drawing on local know-​how; and gain sustained engagement with policymakers, senior managers, and community leaders. Context is all-​important: every country, no matter how small or resource-​constrained, should have the capacity to conduct essential national health research to identify, prioritize, and respond to its own health problems. Critical to fostering medical research1 in resource-​ constrained countries is the need for balanced, equitable collabor- ations that emphasize longer-​term building of human and institutional capacity. This speaks to the true opportunity offered by the promise and practice of research in resource-​constrained countries: to con- tribute widely to both national and global health development and, in so doing, to major and equitable progress in the human condition. Introduction Medical and health research today take place within a new era of ‘global health’, south-​north-​south collaboration, and 21st century interdisciplinary knowledge convergence. These provide promise, challenges, and opportunities, particularly for fostering research in resource-​constrained countries. Globalization today is premised on speed of communication and interconnectedness which serve to ‘level’ peoples’ aspirations and expectations across all settings; similarly, the questions, focus, and methods of science are converging more than ever before despite the variation in socioeconomic contexts. Anticipating this, a multipolar Global HIV Vaccine Enterprise was proposed in 2003. This cata- lysed an upsurge in research and institutional development towards a still-​elusive vaccine against HIV/​AIDS. Most visible of 21st century innovations is the internet, which is re- shaping all forms of communication from the social to the scientific. Less visible, though in time as pervasive, is the biological revolution which has already ensured a new scientific ‘universe’ reflected in un- ravelling of the human genome along with the tools to more deeply understand and manipulate our genomic complement. The full ram- ifications and impact of the ‘biological revolution’ on science, society, and well-​being are yet to be fully understood and appreciated. Despite the sometimes intolerable hardship experienced by mi- grant or refugee populations—​ever-​present in today’s globalized ‘in- stant media’—​the moral standards the world uses to judge its actions are today more apparent and widely held. This is reflected in global debates on climate and the environment, sustainable energy use and acceptable standards of political governance that have given rise to equity-​oriented research initiatives such as Future Earth and Urban Health and Wellbeing, which seek to harness science across discip- lines, sectors, and geographies. Contemporary UN Declarations as well as national constitutions—​that of South Africa, our own country, for example—​enshrine human dignity, the achievement of equality, and advancement of human rights and freedoms as supreme qualities. Access to quality healthcare and education (for all) are recognized as essential elements underpinning national development as well as prerequisites for personal well-​being. This makes it important to grasp the inequalities that still prevail in human and institutional capacity, including access to and use of research resources. For example, national research and development (R&D) expenditure among OECD countries was estimated at 2.4% of gross domestic product (GDP) in 2012, with the R&D expenditure in the United States comprising 2.79% of GDP, and Korea 4.36%. While China’s R&D expenditure is growing (cur- rently estimated at 1.98% of GDP), R&D expenditure in South Africa, India, and the Russian Federation fall below this average (most recent estimates amounting to 0.76%, 0.81%, and 1.12% of GDP, respectively). 2.18 Fostering medical and health research in resource-​constrained countries Malegapuru W. Makgoba and Stephen M. Tollman 1  The term ‘health research’ encompasses a breadth of disciplines to investigate the determinants of health and illness and the interventions, system changes, and policies needed to address these. 182 section 2  Background to medicine A further indicator of this disparity is the level of researchers per thousand employed people in these different regions. The OECD average is 7.7 people per thousand employed in 2011, with Finland reporting the highest ratio of 16.06. South Africa and China report relatively low estimates of 1.43 and 1.83 people per thousand em- ployed in research. In the health sector, OECD countries average 3.16 physicians to 1000 inhabitants with the United Kingdom re- porting a ratio of 2.8 and the United States a ratio of 2.5 in 2011. In contrast, India and South Africa most recently estimate 0.7 and 0.8 physicians to 1000 people, respectively. It follows that across low-​ and middle-​income countries (LMICs) there are different constraints with little uniformity in the infrastructure available for research, reflecting variable investment and the uneven status of research in national development. Pockets of excellence exist in middle-​income settings that range from Brazil to China to India and South Africa—​but widely inadequate research and infrastructure devel- opment characterize most lower-​income countries thereby denying the fruits of research and scientific advance to many of the world’s poorest and marginalized communities and, indeed, to future generations. Critical, unresolved research questions confront the scientific, policy and citizen leadership of LMICs—​on the rapidly evolving burden of disease and risk and their social and biological deter- minants in contrasting environments, coupled with understanding and improving the health and social systems that can address these effectively. Such issues pose a profound challenge to scientists and practitioners within and across disciplines. They shine a light on the sociopolitical context in which such issues are being addressed. Whereas scientific and social progress hold the promise of major advances in personal and population health, persisting disparities can exacerbate inequalities and render ‘catch-​up’ all but out of reach. Fostering medical research in resource-​constrained countries is not only essential but also requires institutional and individual sci- entific development—​with attention to career paths, research lead- ership, sound ethics, the sustaining of versatile research platforms (molecular, clinical, service and population-​based), and institu- tional support. Broadly put, the goal is to: •​ Bring outstanding science to bear on the medical and health prob- lems of resource-​constrained countries and communities. •​ Invest in building the requisite medical and health research sys- tems based on sound governance and long-​term institutional cap- acity strengthening. •​ Build an evidence base to inform policies and programmes, re- source allocation, and health practice from primary care to ad- vanced clinical settings; these should focus on national priorities and tackle inequities. •​ Enhance scientific processes, findings, explanation, and ‘dis- covery’ by drawing on local know-​how and indigenous know- ledge and expertise. •​ Gain sustained engagement with policymakers and senior man- agers as well as community leadership, to foster the uses of evi- dence and encourage enhanced investment in health research and healthcare. The work of the Commission on Health Research for Develop­ment was a landmark which, in 1990, published ‘Health Research: Essential Link to Equity in Development’. This far-​reaching report envisaged that every country, no matter how small or resource-​constrained, should have the capacity to conduct essential national health re- search in order to identify, prioritize, and respond to its own health problems. In addition, that all countries, no matter how limited their resource base, should contribute to the global health research enter- prise. These aspirations—​widely shared and as timely today—​clearly align with aspirations for the new ‘global health’. They highlight the national and scientific competencies needed to ensure deep and sus- tained progress towards the Millennium Development Goals and their recent successor, the Sustainable Development Goals. Changing profiles of health, disease, and well-​being: Setting priorities for health research Health problems facing resource-​constrained countries—​where poverty is widespread amid income and social inequalities—​can occur with un- expected intensity, evidenced by the recent Ebola epidemic in the West African countries of Guinea, Liberia, and Sierra Leone. While the infec- tion itself carried an overwhelming mortality rate, the knock-​on effects were similarly devastating: primary care services were hit hard, affecting immunizations and maternal and child care, while household liveli- hoods were severely undermined contributing to vulnerability of the do- mestic economies. The emergency reaction of Médecins Sans Frontières was critical, but a generally sluggish global response prompted soul-​ searching on the future role of high-​income countries like the United Kingdom, agencies such as the World Health Organization (WHO), and international strategies for vaccine and drugs development. A resurgent epidemic in the Democratic Republic of Congo (DRC), aggravated by community mistrust of measures introduced to limit the outbreak, high- lights the complexity of epidemic containment. Reflected in recent assessments of the global burden of disease, complex health transitions are underway in LMICs. Rapid changes in national health profiles are being fuelled by the socioeconomic forces driving (for example) labour migration within and across national boundaries, and new forms of transport and communication that link people and places at unprecedented speed and scale. Together, these act to juxtapose infectious conditions (malaria, TB, HIV/​AIDS, neglected tropical diseases) with unfolding non-​communicable dis- eases—​notably cardiovascular disease, stroke, and diabetes, respira- tory conditions, cancers, and mental illness. The natural history of these diseases and their attendant risks pose challenges to personal development along the life course—from conception, birth and child- hood to puberty, and adolescence to adulthood—​in a world where populations are ageing while national incomes remain uncertain. This ‘dual burden’ of disease—​graphically described as a colli- sion of epidemics—​amounts to a ‘quadruple burden’ when the un- finished agenda of maternal, neonatal and child health is included along with persistently high levels of injury (violent and uninten- tional) and rising accident rates (from motor vehicles especially). Such a rapidly changing disease burden must profoundly impact on the framing of national and global research priorities—​and the resources and capabilities required to enable effective research. Context is all-​important Our long-​standing African experience brings home the overriding importance of social and political context in fostering, conducting, 2.18  Fostering medical and health research in resource-constrained countries 183 interpreting, and then ensuring the applications of research. Indeed, given many countries’ history and experience with the deliberate marginalizing of poorer communities (now in the voting majority post-​independence), research in LMICs cannot be separated from broader public concerns with national and social development. This manifests in several ways. Contestation between science and politics This translates to a pervasive culture of resorting to political au- thority to resolve issues that elsewhere would be addressed by public sector executives. Research findings, whatever the evidence, and particularly if results conflict with prevailing social norms or pol- itical ideologies, may be actively contested with appeals to popular leadership for guidance and direction. Leading politicians in LMICs face high, at times unrealistic, pressure from their constituencies to resolve deeply entrenched social and development concerns—​hence politicians’ and policymakers’ preoccupation with answers in con- trast to researchers’ emphasis on understanding. Examples from our recent history illustrate vividly the interplay between politics and bio-​sciences: Adolf Hitler was a dominant pol- itical figure who, on rising to power in Germany, used science to justify the political ideology on which the Nazi state was based, and which justified barbarous treatment of ‘non-​Aryans’. Trofim Lysenko was an influential Soviet agricultural scientist, with a powerful ally in Joseph Stalin, who for over 40 years undermined, then destroyed, biological science (notably genetics) teaching and research in the Soviet Union. Thabo Mbeki and Manto Tshabalala-​Msimang, for- merly president and minister of health in South Africa, confused a nation and led it into denial about HIV/​AIDS causation for a decade, with devastating consequences for human lives (see next). Deep respect for human and community rights (reflected in the UN charter as well as national constitutions) Disparities in wealth and education—​as much within as between countries—​raise concerns about the rights and protections of vulner- able persons (such as children, the disabled, or unemployed) and com- munities in all spheres, and notably in research. Concern for the dignity of research participants, confidentiality regarding personal informa- tion, and equitable access to findings and resultant benefits (e.g. new diagnostic tests, therapeutics, equipment) weigh heavily on public rep- resentatives who increasingly are held to account by an activist citizenry. Sensitivity to purpose and relevance Keen interest in the focus of research—​what research is conducted, how, and for what purpose—​lies behind the strenuous debates on research priorities in LMICs and presents great opportunity to work at the frontiers of socially responsive science. Such research environments support interdisciplinary efforts that engage with community perspectives; and tend to a more holistic approach to both the conduct and translation of research including health systems development and all aspects of professional practice. It follows that while concern for the impacts of research is plainly evident in industrialized countries, in LMICs it is becoming an im- perative. While working relationships with public leadership bear on the applications of research to policy development, as important are scientists’ links with public representatives for drawing attention to the social importance of the research enterprise. This will help counter the tendency for findings to play but little part in public sector investment decisions—​whereas findings and understanding should figure prominently in development priorities and resource allocation. An unlikely alliance: Science and politics At first glance, the apparently neutral terrain of research, scien- tists, and the public good seems a world away from the contested, at times cut-​throat world of national politics. Yet history is replete with examples—​from Stalinist Russia (and the Lysenko era of eu- genics, discussed earlier) to Hitler’s Germany (that posited Aryan races as the apex of human development and condoned human experimentation on those judged ‘lesser’ persons)—​where science served the goals of a political master plan. In response, ethical codes and principles governing the conduct of research—​most re- cently, the ninth revision of the Helsinki Declaration—​are upheld across the globe. Nevertheless, in all settings there are examples where political goals have undermined ethical practice. In South Africa at the close of the 20th century, the emergence of HIV/​AIDS confronted politi- cians with an intolerable dilemma: face up to a profound health and social challenge evolving in concert with national liberation, and sup- port a talented scientific community determined to address this; or buy the argument of the AIDS ‘denialists’ who rejected the existence of a human immunodeficiency virus and attributed AIDS solely to widespread deprivation and related nutritional and immunological compromise. President Thabo Mbeki’s ‘flirtation’ with HIV/​AIDS denialists, notably Peter Duesberg, ushered in a decade of national confu- sion and denialism reflected in a crisis in the governance and independence of medical research; a health policy denial of anti- retroviral therapy to HIV-​positive pregnant mothers and patients, with consequences estimated at 330 000 deaths; and society’s pro- foundly compromised ability to impact on interpersonal and peer or community behaviour and halt the spread of infection. Thus, though difficult to accept, the consequence of the delayed rollout of antiretroviral medication was the loss of hundreds of thousands of lives. Today, some 20 years later, South Africa and its Department of Health can take credit for the widespread and sustained take-​up of the largest antiretroviral therapy programme worldwide. This is saving countless lives, rapidly improving life expectancy, and preserving the livelihoods of many of the country’s poorest citi- zens. Indeed, this recent but hardly isolated example illustrates how powerful political figures influencing health research can determine the future direction and success (or not) of nations. Fig. 2.18.1 provides a generic framework for the dynamic inter- actions involving science and research in national contexts. The political environment, prevailing culture and social norms, and the pursuit of human rights all act on the conduct of research and use of the findings/​evidence that result. Acting together, these influences can ensure that the applications of research serve as a tremendous power for social good and human development—​but equally, they hold the potential for political ma- nipulation of scientists who may be used (even coerced) to legitimate state actions in support of ends that fly in the face of obligations to advance population health and well-​being. 184 section 2  Background to medicine Conclusion While much in our past experience emphasizes ‘divides’—​between rich and poor, north and south or east and west, stark disciplinary divisions, and a divide between the social and biological determin- ants of health—​we have entered a world of ‘convergences’ that is fostered and reinforced by a global health movement that empha- sizes our common future. Work that is ‘convergent’ lies at the heart of global improvements in child health and the great reductions in mortality from HIV/​AIDS and malaria. Critical to fostering medical research in resource-​constrained coun- tries is the need for balanced, equitable collaborations that emphasize longer-​term building of human and institutional capacity. This vision needs to be tempered by the reality that the gap in resources—​including statistical capability, computational and technical expertise—​remains profound and is paralleled by a limited clinical-​physiological-​molecular evidence base. Addressing this as a common, collaborative enterprise would bring returns to research and practice in all settings and fields. Beyond this, positive recognition is justified for the sophisticated, lo- gistically complex field-​based research that increasingly extends from molecular to population levels, and similarly for generating the ‘public good’ data being made widely accessible. In summary, we are at the cusp of a new paradigm of research collaboration that fulfils the value-​based aspirations of ‘global health’, bringing the best science to bear on the complex challenges facing vulnerable communities in resource-​constrained countries, while recognizing the pressures to ‘deliver’ that are faced daily by scientists, policymakers, and politicians. This speaks to the true opportunity offered by the promise and practice of research in resource-​constrained countries:  to contribute widely to both na- tional and global health development and, in so doing, to major and equitable progress in the human condition. Acknowledgements We thank Carren Ginsburg for sourcing comparative data to reflect national differences. FURTHER READING Abubakar I, et al. (2018). The UCL-Lancet Commission on Migration and Health: the health of a world on the move. Lancet, 392, 2606–54. Chigwedere P, et al. (2008). Estimating the lost benefits of antiretro- viral drug use in South Africa. J Acquired Immune Deficiency Syndrome, 49(4), 410–​15. Commission for Africa (2005). Leaving no-​one out:  investing in people: our common interest. Penguin, London. Commission on Health Research for Development (1990). Health research: essential link to equity in development. Oxford University Press, New York. Future Earth (2015). Urban Health and Wellbeing: research for global sustainability. http://​www.futurearth.org/​ Jamison DT, et al. (eds) (2006). Disease and mortality in sub-​Saharan Africa. The World Bank, Washington DC, pp. 1–​387. Jamison D, et al. (2017). Universal Health Coverage and Intersectoral Action for Health: Key messages from Disease Control Priorities, 3rd edition. Lancet. Klausner RD, et al. (2003). The need for a global HIV vaccine enter- prise. Science, 300, 2036–​9. Koplan JP, et al. (2009). Towards a common definition of global health. Lancet, 373, 1993–​5. Makgoba MW (1999). The South African Medical Research Council: Africanizing health research. Nat Med, 5(4), 367–​70. Makgoba MW (2000). HIV/​AIDS: the perils of pseudoscience. Science, 288, 1171. Makgoba MW (2002). Politics, the media and science in HIV/​AIDS: the peril of pseudoscience. Vaccine, 20, 1899–​904. Nattrass N (2008). AIDS and the scientific governance of medicine in post-​apartheid South Africa. African Affairs, 107, 157–​76. OECD (2014). Factbook 2014: Economic, Environmental and Social Statistics. OECD Publishing 2014. http://​dx.doi.org/​10.1787/​factbook-​2014-​en Robertson LJ, Makgoba MW (2019). Mortality analysis of people with severe mental illness transferred from long-stay hospital to alter- native care in the life Esidimeni tragedy. SAMJ, 108, 813–17. Streatfield PK, et al. (2014). Cause-​specific mortality in Africa and Asia: evidence from INDEPTH health and demographic surveil- lance system sites. Glob Health Action, 7, 25362. The Lancet (2015). 1 year on—​lessons from the Ebola outbreak for WHO. Lancet, 385, 1152. The Lancet (2015). GBD 2013 mortality and causes of death collaborators. Global, regional, and national age-​specific all-​cause and cause-​specific mortality for 240 causes of death, 1990–​2013: a systematic analysis for the Global Burden of Disease Study 2013. Lancet, 385, 117–​1. Tollman SM, et al. (2008). Implications of mortality transition for pri- mary health care in rural South Africa: a population-​based surveil- lance study. Lancet, 372, 893–​901. World Bank (2015). World Bank Data:  Research and Development Expenditure. World Bank, Washington DC. http://​data.worldbank. org/​indicator/​GB.XPD.RSDV.GD.ZS World Bank (2015). World Bank Data: Physicians (per 1,000 people). World Bank, Washington DC. http://​data.worldbank.org/​indicator/​ SH.MED.PHYS.ZS World Medical Association (2013). World Medical Association Declaration of Helsinki:  ethical principles for medical research involving human subjects. JAMA, 310 (20) 2191–​4. Human rights Politics and the political environment Culture and social norms Research Knowledge/Evidence on health and well-being Fig. 2.18.1  Framework highlighting social drivers and interactions that influence the conduct of research and use of evidence to advance human health and well-​being. 2.19 Regulation versus innovation in medicine 185 2.19 Regulation versus innovation in medicine 185 ESSENTIALS Two tragedies created drug regulatory authorities in the form they exist today. The marketing in 1937 of an elixir of sulphanilamide using diethyl alcohol as the solvent led to over 100 children dying from acute kidney injury. The use from 1957 of thalidomide as a treatment for morning sickness led to the development of phocomelia (and other abnormalities) in the fetuses of about 10 000 mothers who had been given it during their pregnancies. Drug regulatory authorities initially tended to be risk averse, but now recognize concern that the process of drug regulation might inhibit the development and licensing of novel products. They now pursue a more risk-​based approach to regulation and seek to support safe innovation, with arrangements including scientific advice pro- grammes, orphan drug approval processes, and expedited approval processes, although these are not without their critics. Regulation of medicines The regulation of medicines started, in a modest way, during the 18th and 19th centuries with the publication of pharmacopoeias intended to ensure the quality of medicinal products sold to the public. These did not specify the purposes for which products should be used, and they were advisory rather than compulsory, although they probably had some impact. The 20th century, how- ever, saw substantial changes arising from a combination of scandal and tragedy. In 1906 the British Medical Association published a slim volume entitled Secret Remedies: What they Cost and What they Contain, which was an exposé of the patent medicines industry. Beechams Pills—​containing only aloes, powdered ginger, and powdered soap—​were promoted for numerous purposes that ranged from ‘constipation’ to ‘maladies of indiscretion’ and even ‘menstrual de- rangements’, which was interpreted as a covert claim as an abortifa- cient. The public in both the United States and Britain were horrified. In America, where the US Congress was discussing a ‘Pure Foods Act’, the legislation was extended to medicines, thus creating the Food and Drugs Administration (FDA). In the United Kingdom things moved rather more slowly. After the publication, in 1909, of a sequel entitled More Secret Remedies, Parliament established a Select Committee to advise on what measures should be put in place to protect the public. Its report recommended that legislation be en- acted to regulate the supply of medicines to the public. Sadly, the Committee published its findings the day before the start of the First World War and it was largely forgotten. Tragedies that shaped drug regulation Two tragedies created drug regulatory authorities in the form they exist today. In 1937 an American pharmaceutical company, Massengill, began marketing an elixir of sulphanilamide using di- ethyl alcohol as the solvent. Over 100 children died from acute kidney injury a result of the solvent’s nephrotoxicity. As a conse- quence, the FDA’s powers were enhanced, but virtually nothing was done in Europe. In 1957 the German drug company Grünenthal placed thalidomide on the market as a treatment for morning sickness without under- taking any reproductive toxicology tests. It became apparent, after a delay of almost three years, that thalidomide caused phocomelia ­(malformations of the arms and/or legs), as well as other abnormal- ities, in the fetuses of mothers who had been given it during their preg- nancies. It is estimated that, worldwide, 10 000 babies were afflicted, although the thalidomide disaster was largely avoided in America as a result of the legislative response to the sulphanilamide tragedy. In response to these disasters—​particularly the thalidomide tragedy—​national drug regulatory authorities were established in almost all countries. They all have legal powers—​within their jurisdictions—​to ensure that medicines placed on their markets are of good quality, are effective in the uses promoted by their manu- facturers, and that they are safe in relation to their efficacy. Within the European Union almost all new drugs are now authorized by the so-​called ‘centralized procedure’ which means that they become licenced, simultaneously, across all 28 member states. The European Medicines Agency (EMA) ​undertakes a detailed scientific scrutiny of the evidence through its Scientific Committees and makes a ­recommendation whether or not new products should be granted so-​called ‘marketing authorization’. Because of their historical origins, drug regulatory authorities initially tended to be risk averse. Although they were not opposed 2.19 Regulation versus innovation in medicine Michael Rawlins 186 section 2  Background to medicine to pharmaceutical innovation, there had been some concern that the process of drug regulation might inhibit the development and licensing of novel products. However, regulatory authorities have increasingly recognized that they have an important role in not only protecting public health, but supporting innovation. They have therefore taken considerable steps over recent years to ensure a more risk-​based approach to regulation, reducing regulatory bur- dens where possible, and putting in place measures to support safe innovation. Support for innovation Provision of advice Regulatory agencies have, for many years, offered the sponsors of new products the opportunity to seek scientific advice on the de- sign of their development programmes. At face-​to-​face meetings, sponsor companies meet with the scientific staff of drug regu- latory agencies to discuss their proposed development plans and protocols, including the outcome measures they plan to adopt in demonstrating efficacy. More recently the FDA and EMA have in- stituted arrangements for the two agencies to provide joint scien- tific advice. The scientific advice programme is entirely voluntary, with agen- cies covering their costs by charging fees for the services they offer. The scientific advice that is provided is ‘non​binding’ on either party, but nevertheless these scientific advice services are widely used by the sponsors of new products, and few companies would now complete a drug development programme without having sought regulatory advice from the FDA and the EMA or National Authorities in Europe. In the United Kingdom the Medicines and Healthcare Products Regulatory Agency (MHRA) has, in addition to its scientific advice provisions, recently established an Innovation Office. This offers help to all those developing particularly innovative products or tech- niques/​processes, including small and medium-​sized enterprises as well as academics who have developed a novel medicine or device, or a novel approach to the development or manufacture of a product. Examples of such innovative products include Advanced Therapy Medicinal Products (cell-​based therapies), nanotechnologies, stratified medicines, novel drug/​device combinations, and advanced manufacturing techniques. Orphan drugs In the early 1980s it was recognized, initially in the United States, that manufacturers were being deterred from developing drugs for rare diseases. The development and ongoing costs of such products, to meet the demands of regulation to ensure appropriate standards of quality safety and efficacy, could never be recouped from their sales. The US Congress, in 1983, passed legislation that allowed the US FDA to offer various incentives to sponsors seeking to develop prod- ucts for rare (so-​called ‘orphan’) diseases. Comparable measures were subsequently introduced in other jurisdictions including the EU, Japan, Singapore, and Australia. In the United States an orphan disease is defined as a condition with less than 200 000 affected per- sons: in the EU it is defined as a condition with a prevalence of not more than 5 per 10 000 population. To further reduce the burden on manufacturers applying for orphan drug status, the FDA and EMA now use a common application process for both agencies, although each maintains separate approval processes. The orphan drug programme has been an undoubted success. In the EU, for example, more than 100 orphan drugs have been granted marketing authorization since the programme began in 2000. Expedited approval programmes In the mid-​2000s, the FDA began to introduce arrangements that would permit promising new products to become more rapidly available to patients. To meet the FDA’s requirements for expedited approval, the product must fulfil three criteria. First, the product should be intended to treat a ‘serious’ condition that has substantial impact on patients’ day-​to-​day functioning. Secondly, the product should expect to be an improvement any ‘available therapies’ that are approved for the same indication as the new drug. Thirdly, the product should meet ‘unmet clinical need’, defined as a condi- tion whose treatment is not adequately met by currently available treatments. In the United Kingdom, the MHRA have recently introduced analogous arrangements. A promising innovative medicine desig- nation gives sponsors an indication that a product may be eligible for the Early Access to Medicines Scheme (EAMS) based on early clinical data. Under the scheme, the MHRA provides an opinion on the benefit/​risk balance of the medicine based on the data available when the EAMS submission is made. If successful, the MHRA gives a positive opinion that supports prescribers in deciding whether to use the product in patients with life threatening or seriously debilitating conditions before marketing authorization is granted. Discussions are currently taking place with a view to extending the principle of early designation of a promising medicine in Europe. The EMA is currently undertaking a pilot study of the use of ‘adap- tive pathways’ to accelerate the availability of novel products for unmet clinical need. In this scheme a sponsor first targets a narrow indication in an area of high unmet need and subsequently develops the product for broader indications. During the development pro- cess input is obtained from other stakeholders such as health tech- nology assessment (HTA) bodies. The scheme is within the existing legislative framework and will likely use provisions such as con- ditional approval. Once the first approval is granted, the sponsor will then be required to collect additional information about the product’s performance in clinical use so that—​provided the early promise is fulfilled—​full marketing authorization is granted or the indication broadened. Are these mechanisms to support innovation useful? National drug regulatory authorities are clearly making serious ef- forts to promote the development of safe and effective innovative medicines. In doing so they must continue to balance a product’s benefits against its risks, but the approaches that authorities are taking is not without controversy. Some commentators have criticized the arrangements for providing scientific and regulatory advice to companies, whether they be small or large. They believe that doing so will (or has) lead to ‘regulatory 2.19  Regulation versus innovation in medicine 187 capture’, whereby the agendas of regulatory authorities are manipu- lated to the commercial advantage of pharmaceutical companies. There is no evidence that this has occurred in either Europe or North America, but the fact that such accusations have been made can leave regulatory authorities feeling—​at best—​uneasy. The converse is that, without appropriate advice, those developing novel products run the risk of failing to understand what is required of them, undertaking inappropriate or unnecessary studies, and wasting substantial sums of money. Other commentators of a utilitarian persuasion have suggested that (at least in the EU) the orphan drugs legislation should be re- pealed. They argue that their development costs are still very high, even with incentives, and as a consequence the costs of providing pa- tients with orphan products is likely to deprive those with common conditions of cost-​effective care. This attitude is countered by egali- tarians who claim that society has a duty—​within reason—​to help those with any condition achieve the quality of life to which they are entitled. There is no substantial appetite at present in the United Kingdom, the European Union, or North America to repeal the orphan drugs arrangement. The arrangements for expedited approval in the United States have also been criticized on two grounds. First, it has been claimed that the number of safety warnings issued after the launch of prod- ucts undergoing expedited approval have been greater than those approved under conventional approaches. This may well be correct because the data on which expedited approvals are made are likely to be less. Nevertheless, if patients with unmet clinical need are to have access to products that may have a profoundly beneficial effect on their lives, a reduced safety database would seem to be a price worth paying. The second criticism is that the benefits have been limited, at least for some products that have gone through an exped- ited approval. This has been particularly alleged in respect of some anticancer agents, when approvals for products used in later stages of the disease have been based on intermediate endpoints (such as reductions in tumour size), with only a modest increase in longevity. Despite these criticisms regulatory authorities are continuing to develop arrangements that would allow patients earlier access to promising new products. It is in the best interest of patients and the public that they succeed. FURTHER READING Carpenter D (2010). Reputation and power: organizational image and pharmaceutical regulation at the FDA. Princeton University Press, Princeton. Emanuel M, et  al. (2012). Thalidomide and its sequelae. Lancet, 380(9844), 781–​3. Stephens MDB (2010). The dawn of drug safety. George Mann Publications, Easton, Winchester. 2.2 Evolution Medicine’s most basic science 39 2.2 Evolution: Medicine’s most basic science 39 ESSENTIALS The role of evolutionary biology as a basic science for medicine is expanding rapidly. Some evolutionary methods are already widely applied in medicine, such as population genetics and methods for analysing phylogenetic trees. Newer applications come from seeking evolutionary as well as proximate explanations for disease. Traditional medical research is restricted to proximate studies of the body’s mechanism, but separate evolutionary explanations are needed for why natural selection has left many aspects of the body vulnerable to disease. There are six main possibilities: mismatch, infection, constraints, trade-​offs, reproduction at the cost of health, and adaptive defences. Like other basic sciences, evolutionary biology has limited direct clinical implications, but it provides essential re- search methods, encourages asking new questions that foster a deeper understanding of disease, and provides a framework that or- ganizes the facts of medicine. Physicians who understand evolution recognize that bodies are not designed machines but jury-​rigged products of millions of years of natural selection that work remarkably well, given that no trait can be perfect, and that selection maximizes reproduction, not health. Introduction This medical textbook is, as far as we know, the first to offer a chapter on evolutionary biology. The occasion of the 160th anniversary of the publication of The Origin of Species makes it fitting, albeit some- what delayed. Medical students are taught how the human body is (anatomy), and how it works (physiology), but seldom are they taught why it works (natural selection) or whence it comes (evo- lution). It is as though car mechanics were taught how a car works, and how to fix breakdowns, but never where it came from (factories and designers’ drawing boards) nor the purpose for which it was de- signed (transport along roads). Things are beginning to improve. The past 15 years have seen a series of books, articles, and meetings that report new applications of evolutionary biology to medicine. Evolution is as fundamental to medicine as physics or chemistry. This chapter cannot review its whole scope. We can only illustrate a few core principles in hopes of encouraging further reading. Core evolutionary principles for medicine Natural selection and adaptation When individuals in a population vary in ways that influence their genetic contribution to future populations, the average character- istics of the population will change. This is not a theory; it is neces- sarily true. Natural selection involves no design, no planning, and no goal. The word ‘evolution’ refers more generally to any changes over time in a population, whether from selection, mutation, genetic drift, or migration. Notwithstanding his most famous title, Darwin’s greatest contri- bution was not his explanation of speciation, but his explanation of adaptation. Recent research on the Galapagos finches known as ‘Darwin’s finches’ illustrates the point. During drought, only larger seeds are available, so individuals with larger beaks get more food and have more offspring. In just a few generations, the average beak in the population became significantly larger after a drought. When the rains came, and small seeds again became plentiful, selection switched to favouring smaller beaks. No trait is adaptive except in relation to a specific environment. Levels of selection Non​specialists often assume that natural selection should shape traits to benefit groups. After all, if a species goes extinct, all the individuals and their genes are lost. This ‘group selection’ fallacy was unmasked over 40 years ago, but it continues to cause confusion in medicine. For instance, one might expect pathogens to evolve low viru- lence: killing off the host is surely not good for the group! However, even long association of a host and pathogen does not necessarily decrease virulence. People who are out of bed transmit a rhinovirus 2.2 Evolution: Medicine’s most basic science Randolph M. Nesse and Richard Dawkins Acknowledgement: Thanks to the Berlin Institute for Advanced Study for pro- viding a fellowship to RMN that made preparation of this chapter possible. 40 SECTION 2  Background to medicine faster; this selects for low virulence. The story is very different for insect-​borne diseases. Plasmodium is transmitted faster from pa- tients who are too sick to slap mosquitoes, so virulence is high for malaria in humans (infected mosquitoes feel just fine). Ageing can be similarly misunderstood. One might think that senescence could speed the evolution of the species by making room for new individuals. The species, however, is not the level at which selection acts. Consider a lethal or deleterious gene that is expressed only late in life. Many carriers will have passed on the gene before it kills them. The same gene would be quickly selected out if it killed individuals before they reproduced. We are all descended from indi- viduals who died after having children. Not one of our direct ances- tors ever died in childhood! Moreover, a pleiotropic gene that gives a benefit early in life may be favoured, even if it causes deleterious effects later, when selection is weaker. This evolutionary explan- ation for senescence is now confronting remarkable new evidence that single-​gene effects in the insulin signalling pathways can have huge effects. The reasons why selection has not incorporated such changes will prove most interesting. Established applications Some methods from evolutionary biology have long been applied to medicine. Population genetics describes how natural selection, mutation, migration, and drift account for shifting gene frequencies. This body of knowledge has been a foundation for medicine since the middle of the twentieth century, so we will only note a few new applications. It is now clear that the ability to digest lactose as an adult is the exception, rather than the rule. In our ancestors, milk was a food for babies only. New analyses show that the ability to digest lac- tose as an adult has emerged on at least three separate occasions in human prehistory, always in dairying cultures. Remarkably, the selective advantage in these cultures has been huge, of the order of 5 to 15%. The exact benefits remain to be fully understood but calcium and vitamin D may be important, as well as getting more calories. Another example is the prevalence of mutations influencing the acetaldehyde dehydrogenase genes in some populations (especially in southeast Asia). Carriers get sick when they drink alcohol. Is the prevalence of this mutation a result of random genetic drift, or does it give some advantage, perhaps by decreasing the risk of alco- holism? New data show that it does protect against alcoholism and that strong selection has acted at this locus; it is at the centre of one of the largest haplotypes in some populations. This supports the role of alcohol, but the geographical distribution suggests that diet or other cultural variations may be responsible. Genetic methods for tracing phylogenies of pathogens have long been available. Influenza strains are tracked so assiduously that it is possible now to predict some characteristics of likely future epidemic strains—​invaluable information for vaccine design. Epidemics of pathogen-​contaminated food are now routinely traced back to the source using genetic data. It has even been possible to trace specific cases of HIV back to a specific source, because rapid mutations leave a clear trail. Evolutionary methods also can also be applied to somatic cell lines within a body, for instance, to determine if the cells in a tumour are all identical or if subclones are competing in the tumour. The im- plications for customizing chemotherapy are substantial. Evolutionary aetiology Most medical research provides proximate explanations based on the anatomical and chemical details of the body’s mechanisms. However, even knowing every detail about a trait offers only one half of a complete biological explanation. The other half is provided by an evolutionary explanation of how that trait came to exist in the first place. There are two kinds of evolutionary explanations: the first is a phylogenetic explanation based on the sequence of prior traits across evolutionary history; the other is an explanation of what evo- lutionary forces account for the changes across time. Most often, this requires an understanding of how the trait gives a selective advantage. Explain vulnerabilities, not diseases Evolution can explain why aspects of the body have been left vul- nerable to disease. Why do we have wisdom teeth, and a small birth canal? Why do we so often develop lower back pain and hip problems? Why hasn’t selection shaped our immune systems to better eliminate pathogens and cancer cells? Answering such questions in an evolu- tionary way is often challenging. A framework can help to organize the effort. There are six main reasons why bodies have vulnerabilities to disease despite the actions of natural selection (Box 2.2.1). Mismatch Chronic ‘diseases of civilization’ such as obesity, hypertension, and diabetes are now pandemic. The motivations that make us eat too much and exercise too little were shaped for an environment where sweet, fatty, or salty foods were good for us, and excess exercise could be fatal. Recognizing the origins of our unhealthy preferences does not change them, but it illuminates the source of the problem and possible solutions. Similarly, allergies and autoimmune disorders are more common in developed societies. Our immune systems evolved when people were routinely exposed to intestinal parasites and pathogens. In their absence, inhibitory immune cells are not stimulated, leaving the system overactive and responsive to self. An attempt to recreate the original intestinal environment by administering whipworm ova has proved remarkably effective as a treatment for Crohn’s disease. Coevolution We remain vulnerable to infections because pathogens evolve faster than us. Just how fast is demonstrated by the rapid rise of resistance Box 2.2.1  Six kinds of evolutionary explanations for vulnerability • Mismatch between aspects of our bodies and novel environments • Pathogens that evolve faster than we do, and resulting costly defences that cause harm themselves • Constraints on what natural selection can do • Trade-​offs that keep any trait from being truly perfect • Traits that increase reproduction at the cost of health • Protective defences such as pain and fever 2.2  Evolution: Medicine’s most basic science 41 to every antibiotic. Evolutionary analysis of the phenomenon shows that initial intuitions may not be right. For instance, rotating the first-​choice antibiotic in a hospital every few months does little to decrease multidrug resistance, and taking all of an antibiotic pre- scription may not prevent resistance. Most of our antibiotics are products of natural selection sifting through a vast range of mol- ecules during a billion years of competition between microbes. Pathogens also have strong selection effects on hosts, particularly in shaping defences such as fever, vomiting, diarrhoea, cough, and the many manifestations of inflammation. These adaptive responses often have harmful effects because they are products of an evolu- tionary arms race. Every defence creates selection for ways to es- cape it, and this shapes yet more expensive and dangerous defences. At equilibrium, we would expect the defences to become nearly as dangerous as the pathogens (natural selection would be expected to amplify them until they approach the danger level), a principle that should inform studies of anti-​inflammatory agents in infection. Constraints Many of the body’s limitations reflect the limits on what natural selection can do. It cannot maintain an information code without errors, nor can it start afresh to correct a poor ‘design’. For instance, the eye’s nerves and vessels are between the light and the retina, and their exit causes a blind spot. Such constraints can never be fixed, because intermediate stages do not work. Human engineers can, literally, go back to the drawing board, evolution cannot (imagine if the jet engine had had to ‘evolve’ from the propeller engine, step by step). Trade-​offs Not only does selection result in many suboptimal ‘designs’, but it cannot make any trait perfect. All traits involve trade-​offs. Thicker wrist bones would break less easily, but they would inhibit free wrist rotation. Muscles fatigue, but careless use of a new drug that blocks fatigue may reveal just what damage fatigue prevents. Bilirubin is, according to some medical teaching, a waste product from haem metabolism. However, an intermediate molecule, bili- verdin, is relatively water soluble. Why not excrete biliverdin? Because bilirubin is an effective antioxidant. If there are no such specific trade-​offs to be seen, economics al- ways furnishes an ultimate trade-​off. Individuals could be built with thickened bones that never break, but they would spend extra en- ergy moving those big bones while individuals with thinner bones would have more offspring because they divert the economic goods saved (e.g. calcium and energy) elsewhere in the economy of the body (e.g. milk) where they can do more good. Engineers know this as the principle of ‘overdesign’, in which risks of failure are min- imized within available budgets. But whereas engineering budgets are arbitrary—​civilian aviation standards are more risk averse than military, for example—​evolutionary budgets are set by the compe- tition. Individuals whose bones are ‘too good’ will end up having fewer children than rivals whose ‘spending policy’ accepts the in- creased risk of breakage. Reproduction at the expense of health A related point explains the differences in mortality between the sexes. A trait that increases reproduction will tend to spread, even if it harms health. Investments in competitive ability give greater reproductive pay-​offs for males than for females, so men have been shaped to take more risks and to invest less in bodily repair. Data from developed societies shows that mortality rates for men at the age of sexual maturity are about three times higher than that for women. Defences The final explanation is not really a reason for vulnerability, but it is on the list because defences against disease are so often inad- equately distinguished from direct manifestations of disease. Pain, fever, nausea, and vomiting are adaptations useful in certain situ- ations. Unfortunately, they are often expressed as ‘false alarms’ when they are not essential. From a physician’s point of view, it seems that selection has done a poor job. After all, much of general medicine involves of blocking normal defence reactions such as pain, fever, vomiting, and anxiety, and few patients expire as a result. However, selection has not made a mistake. The costs of not ex- pressing a response when it is needed are so huge relative to the costs of false alarms that the optimal threshold allows for many false alarms. This ‘smoke detector principle’ explains why blocking a de- fence is usually safe: the doctor can judge if the response is necessary. Nonetheless, we should expect that defences have been shaped to be expressed when they were needed on the average, in the long run. Utility In the clinic Upon hearing about new evolutionary approaches to medicine, most journalists and many doctors ask how it can improve treat- ment in the clinic today. This is the wrong question. There are some direct clinical applications, such as hesitating before blocking a defensive response such as a raised temperature or vomiting. However, theory should not change practice directly. Instead, evolution offers established methods such as population genetics, new questions about why the body is vulnerable, strategies for an- swering them, and a scientific foundation for an integrative under- standing of the body. Research implications Revisions and extensions of evolutionary methods will make them even more valuable. As extensions of the Human Genome Project move us towards individualized genetic medicine, an evolutionary view of genetic variations can get us beyond simply labelling some ‘defective’ and others ‘normal’. There is, after all, no normal genome. There are just genes that construct phenotypes that result in more or fewer offspring in a given environment. As outlined above, an evolutionary approach also suggests a new class of questions about the aetiology of disease. Research to answer these questions should eventually allow a book like this to provide an additional evolutionary section for each disease. The chapter on gout will describe comparative data which tests the hypothesis that uric acid’s benefits as an antioxidant in a long-​lived species justify its raised levels, despite the pain to some individuals. The chapter on jaundice will mention the costs, benefits, and evolution of bilirubin. The chapter on infectious disease will describe the arms races that shape pathogens and defences, and the costs and benefits of blocking defensive responses. The chapter on anxiety and depression will not 42 SECTION 2  Background to medicine treat them simply as pathological states, but as potentially useful responses, prone to dysregulation. So far, however, the benefits of seeking the evolutionary aetiology for every disease is only begin- ning to be recognized. Teaching implications There is more to teach than can be taught, so medical educators try to provide students with core facts, general understanding, and crit- ical skills that allow them to learn more. Evolutionary knowledge is invaluable not only for itself, but because it offers a framework that can organize and relate the thousands of facts. It helps students realize why bodies fail, and therefore what disease really is. Evolution also offers opportunities for designing courses that provide deeper understanding. For example, a biochemistry course could emphasize the origins of certain pathways, and how adaptation is constrained by the limits of natural selection. Students in physiology would learn the evolutionary reason why the respiratory system relies on carbon dioxide, not oxygen, to regulate respiration. A deeper understanding of the body Physicians are increasingly being educated as if they are technicians, identifying problems and applying officially approved solutions. This makes very poor use of medicine’s most valuable resource. We select medical students carefully because we want—​or should want—​doctors who think. Providing them with a deep evolutionary understanding of the body will foster clear thinking. Instead of viewing the body as a designed machine, they will see it as a product of natural selection with traits more exquisite than in any machine, some of which nonetheless leave us vulnerable to diseases. Doctors who understand the body in evolutionary terms will make better de- cisions for their patients because they will have a better sense of what it is that they are actually doing. Opportunities Interested physicians will want to be in touch with The International Society for Evolution, Medicine & Public Health (https://​isemph. org/​). It sponsors an annual meeting, an Oxford University Press Journal (Evolution, Medicine, & Public Health), The Evolution and Medicine Review, and an email list and network to connect clinicians and researchers interested in evolutionary approaches to medicine. FURTHER READING Evolution, Medicine, and Public Health. https://academic.oup.com/emph Gluckman D, et al. (2011). How evolutionary principles improve the understanding of human health and disease. Evol Appl, 4, 249–​63. International Society for Evolution, Medicine, and Public Health. http://isemph.org Nesse RM (2011). Ten questions for evolutionary studies of disease vulnerability. Evol Appl, 4, 264–​77. Nesse RM, Stearns SC (2008). The great opportunity: evolutionary applications to medicine and public health. Evol Appl, 1, 28–​48. Nesse RM, Williams GC (1994). Why we get sick: the new science of Darwinian medicine. Vintage Books, New York. Perlman, R (2013). Evolution and medicine. Oxford University Press, Oxford. Stearns SC, Medzhitov R (2016). Evolutionary medicine. Sinauer Associates, Sunderland, MA. 2.20 Human disasters 188 2.20 Human disasters 188 ESSENTIALS Human disasters, as massive misfortunes long recorded over history, have great importance for medicine, rightly prompting the call for prevention, relief, and practical intervention by medical personnel. But why do human disasters happen? A sharp distinction is some- times drawn between natural disasters (e.g. earthquakes), and social disasters (e.g. wars), but detailed knowledge often shows that this contrast is not always clear: many disasters have mixed causes. Taking the example of famines. These are popularly understood in terms of food output decline, yet many famines have occurred without any decline in food production. Such misunderstanding has been responsible for the loss of millions of lives, mainly by under- mining the role of social intervention. Starvation is a characteristic of some people not having enough food to eat, not of there being not enough food in the economy. Even when nature plays a part in human disasters, society can make a huge difference. The death toll from so-​called natural disasters is much less in higher than lower-​income countries, even when they do not have any fewer disastrous natural events. The type of political rule can also be a factor of importance: authoritarian governments and military dictatorships are generally insensitive to vulnerable people. Interventions that can be effective include envir- onmental policies, those that mitigate the effects of disasters on the economy, and healthcare actions. Introduction ‘A sudden or great misfortune’ is the way the New Shorter Oxford English Dictionary defines a disaster. That is not a bad starting point in trying to understand the idea of ‘human disasters’. Disaster is a common enough term in public discussion. The addition of the ad- jective ‘human’ to the word points specifically to the misfortunes of human beings, rather than those of animals or of the perils of our natural surroundings (except to the extent that these non​human ­adversities yield human mishaps). It also guides us towards focusing particularly on the predicaments of groups of people with shared and interlinked predicament, rather than on the personal tragedy or sadness of particular individuals, seen on their own. The term could be used differently of course. There is nothing in- herently odd in talking, for example, about the human disaster that befell King Lear, and indeed Lear’s woeful elderly life can, of course, be of interest for a textbook of medicine. It would belong, however, to some other section of this book linked, for example, with psycho- logical adversity, the trauma of dejection and betrayal, or perhaps geriatric deprivation, with their own clinical challenges. In contrast, what are standardly called ‘human disasters’ have come to be seen as sudden developments of huge misfortunes, in the form of death or debilitation or displacement or impoverishment, of substantial groups of people simultaneously affected by such events as earthquakes, floods, epidemics, wars, or famines, which call for imperative social action for prevention and relief. That is the sense in which the idea of human disasters will be addressed in this chapter. It is, however, important to mention that the disastrous effects of terrorism and targeted killing are increasingly becoming another source of what can be called human disaster. Such specialized vio- lence in the contemporary world demands attention as well, and they take us in the direction of examining the influence of exploitation—​ and sometimes deliberate ‘planting’—​of a sharp and confrontational human identity in a highly belligerent form, used for exciting violent feelings towards other groups with different identities—​other reli- gions, other nationalities, other races, or other ethnicities. Studies are needed not only about the way identity politics can be made to serve the cause of human violence, but also about how organized in- flammation can become a source of danger for human security and an ingredient of a form of human disaster. Problems of a natural-​social dichotomy In the vast literature on the subject of human disasters, a sharp distinction is sometimes made between natural disasters, like earthquakes, floods or droughts, and social disasters, like wars or genocide. There is some rudimentary logic in that distinction: polit- ical sagacity can obviously have more success in averting genocide or wars than in, say, preventing earthquakes. Yet the contrast is not so 2.20 Human disasters Amartya Sen 2.20  Human disasters 189 total, since in the transformation of a natural calamity into a human disaster, social factors play a big role. For example, though floods are natural phenomena in an obvious sense, their effects—​and indeed even their incidence—​are also in- fluenced by social and economic events, such as the making of canals or drainage systems, not to mention the climatic effects of global warming that may be significantly influenced by preventable human activities. Even though in understanding the immediate parameters of such events as floods or droughts or storms, we can begin well enough by ‘rounding up the usual suspects’ in the natural world (such as rainfall, temperatures, gales), it is important that our ana- lysis should not end there. Some other examples of human disaster are quintessentially mixed bags, such as epidemics, which depend on the working of nature (e.g. on the properties of viruses, bacteria, and other conta- gions), and yet are massively influenced by human behaviour (like contacts and exposures) and social arrangements (like immuniza- tion and medical care). In such cases it would be very difficult even to begin the analysis of disasters of this mixed type—​epidemics and others—​only as phenomena of nature, since the social-​natural en- tanglement is rampant in every aspect of this type of human disaster. In some cases, the inescapably mixed causation of disaster has tended to escape attention because of the popularity of apparently ‘obvious’ causal explanations that fail to look far enough and are easily satisfied with finding a natural correlate. A good example of this kind of neglect can be seen in the popular understanding of the causation of famines in terms of food output decline, with much-​ repeated explanations that do not go beyond the proximate fea- tures of droughts, floods, or storms. This causal confusion, which is contradicted by the great many cases of famines that have occurred without any decline in food production, has been responsible for the loss of millions of lives, mainly through undermining the necessity and urgency of social intervention. The subject demands some discussion here, and to that I turn next. Indeed, by concentrating on famines, which have been much more extensively studied than other kinds of disasters, it is possible to illustrate some general points about the relationship between social and natural aspects of human disasters. Famines: Causes and prevention Famines are gigantic events of carnage in which millions of people die from starvation and from diseases linked with human debili- tation, social disruption, and movements of destitutes in search of something to eat, and the spread of communicable ailments asso- ciated with these phenomena. Even though a great many people die from a famine, the misfortune that a population experiences in a famine can go well beyond the mortality of those who succumb to the famine, since famines can leave an inheritance of huge disrup- tion for many years to come. The crude logic that if people are dying of starvation, there must be a shortage of food, has had much influence on traditional thinking across the world. That theory has had a particularly damaging effect in persuading governments to do nothing even when many people lose their livelihood (for one reason or another), which makes them unable to buy food. This hypothesis was relevant to the British Raj’s strange inactivity as the Bengal famine of 1943 slowly gathered momentum. It would eventually kill close to 3 million people. The of- ficial statistics of food output in Bengal indicated that there had been no significant change in the availability of food there, and sticking to their theory, the government did little to relieve the relentless emer- gence of a large famine. Instead, it tried to deny the existence of the unfolding famine, sustained by the censorship of the ‘native’ press, and the complicit silence of the British-​owned English papers. That gigantic cover-​up ended when the leading English paper of India, The Statesman (also British owned), broke rank with the decision of the highly agonized editor (Ian Stephens) that he could no longer be a party to the concealment of a huge human disaster. Stephens supplemented his factual reports of starvation by stinging editorials on official policy. Once the news went into the public do- main it got large attention in the British press and in Parliament in London, making the government admit the existence of a famine, which also led to the starting of much-​delayed relief work. However, ever loyal to the ‘theory’ that famines are caused only by food avail- ability decline, the same imperial government would later revise the statistics downward to bring the revised ‘facts’ into line with their deluded theory, even though the previous food statistics had been broadly right. What went wrong, of course, was the inability to understand that starvation is a characteristic of some people not having enough food to eat: it is not a characteristic of there being not enough food in the economy. The critical connection of the command over food—​what is sometimes called the ‘entitlement’ to food—​is with the purchasing power to buy food (in a market economy), linked to jobs and in- comes and relative prices of what people have to sell (their labour and the commodities they can make, such as services and crafts) to buy staple food. A famine is, in this sense, an economic phenom- enon, not just one of agricultural production. However, going further, famines are not only manifestations of an economic phenomenon, they also have political aspects and medical correlates as well. Famines are easy to prevent since only a relatively tiny proportion of the population is affected, and it is easy to regenerate the purchasing power of those who are destitute through emergency employment. This can provide incomes in a re- gion in which many people have lost their purchasing power be- cause of job loss or price rise. Indeed, the fact that independent India, despite considerable regular undernutrition at a chronic level, has not had a famine since the end of the Raj in 1947, is only partly linked with the progress of food production and agriculture in post-​independence years; it is also critically connected with the state’s immediate willingness to start relief programmes, mainly in the form of emergency projects, giving jobs and incomes to the af- flicted population. The urgency of preventive intervention is a political imperative for the government in office in a functioning democracy. If a famine is allowed to develop then no ruling party (or coalition) has much chance of winning a democratic election (or even to survive blis- tering criticism in the media and in parliaments or assemblies). Indeed, famines in the recent years, as in the past, have continued to occur only in countries without a functioning democracy (e.g. military dictatorships, one-​party states, or countries that do have elections but do not have other necessary features of a democracy such as a free media and room for public discussion and social criti- cism). Famines are thus manifestations of political as well as eco- nomic phenomena. 190 section 2  Background to medicine Famines also belong to the domain of medicine and public healthcare. This is because most people who die from famines die from diseases of the region, some of which are locally endemic. While starvation is the prime mover of famine mortality, actual deaths very often come from people getting ill and succumbing to their illness. Debilitation due to severe undernourishment plays a part in this in making people vulnerable to fall ill and sometimes to die of it. But no less importantly, hunger precipitates illness and death because of (1) the propensity of hungry people to eat what- ever scraps they can pick up from anywhere; (2) vast population movements induced by search for jobs and food that spread con- tagions with great speed; and (3) breakdown of essential services including healthcare and medical attention. Famines typically kill not just directly as a result of starvation but through intensifying and aggravating the forces of illness-​based mortality common to the region. Table 2.20.1 gives a breakdown of the causes of death in the ex- cess mortality in the Bengal famine of 1943, as presented in my 1981 book Poverty and Famines, for the period 1943–​1946, taking the prefamine 1941–​1942 average as the standard for comparison. More than four-​fifths of the death toll resulting from the Bengal famine, in this estimate, was directly connected with diseases common to the region, with pure starvation death accounting for no more than only a fifth of the total. A similar picture emerges from most other famines. This aspect of famines is the centre of attention in Alex de Waal’s important study of famines in Darfur in Sudan. This is certainly a rich and policy relevant perspective on famines, particularly be- cause even with the failure of food entitlements, the magnitude of deaths can be very substantially reduced through health inter- ventions. If the emergence and survival of a famine is largely an economic and political issue, the mortality it generates is also very significantly a matter of healthcare and medical attention (including timely prevention, particularly through immuniza- tion). The allegedly ‘natural’ phenomenon of famines tends to be intensely ‘social’ at many different levels—​economic, political, and medical. Before I move on from the subject of famines to other types of human disasters, let me briefly note the variety of circumstances in which famines have occurred and have killed with abandon across the world. It would be good to associate firm mortality es- timates with each famine, but this would be a difficult thing to do in many cases. Even though death tolls have been thoroughly studied for some famines, using reasonably good statistics, there is always an element of uncertainty in any such estimate, no matter how carefully it may have been prepared. Part of the difficulty lies in the underlying conceptual issues. The death toll of a famine has to be calculated by contrasting the actual number of deaths in that period with the number of deaths that would have taken place in the absence of that famine. That is, famine mortality is the difference between actual mortality in the famine period and the estimated number of people who would have died anyway, even if the famine had not occurred at all. Thus the uncertainties involved relate only partly to problems in getting good statistics of actual deaths, which is not an easy task as it is, especially at a time when many of the normal functioning of social institutions, including registration of deaths, are disrupted (sometimes because of death or migration of the staff involved in the relevant social institutions). They also relate to the uncertainties inherent in the ways and means of estimating—​ as a ‘counterfactual’—​what would have happened had the famine not occurred at all. One can nevertheless get some general impression of the size and reach of a famine by looking at the estimates of death tolls that we have, along with reports of other events and predicaments con- nected with the respective famines. Table 2.20.2 relates to many ‘recent’ famines that have occurred over the last couple of hundred years, beginning with the Irish famines of 1845–​1851 and drawing on a large number of sources. The association of famines with political authoritarianism and civil disruption is well illustrated by the incidence of such events in countries with military governments (Ethiopia, Sahel countries), or one-​party rules (Soviet Union, China, Cambodia, North Korea), or alien governance (preindependence India and politically dom- inated Ireland), or civil disruption (Nigeria, and to some extent Bangladesh, even though the death toll was kept in check, particu- larly in the latter, through organized famine relief). Disasters and policy intervention Human disasters can be of many types, varying from floods, droughts, wind storms, extreme temperatures, and earthquakes, where natural factors have some clear role, to wars and indus- trial accidents (like the Union Carbide disaster in Bhopal in 1984) where the story is almost entirely social in the broad sense. But the important point in the context of policy intervention is to recognize that, even when nature plays a part, society can make a huge difference. Consider natural calamities such as floods or droughts, which have been associated with processes that have led to great many disaster deaths. What can intervention achieve? There are at least three different ways of prevention or amelioration. • First, environmental policies can make floods, droughts, and other natural adversities less common. This is a subject that is being energetically discussed today—​at long last—​moved largely by the evidence of massive global warming and the related long-​ run environmental hazards. The recent ‘COP21 summit’ in Paris in December 2015 is a good example of attempting disaster pre- vention through concerted international efforts. There was, Table 2.20.1  Proportionate breakdown of excess mortality in the Bengal famine: 1943–​1946 over 1941–​1942 Ailment types Percentage contribution Malaria 36.7 Cholera 7.1 Smallpox 5.0 Dysentery, diarrhoea, and enteric group of fevers 5.0 Other fevers (often undiagnosed) 27.4 Respiratory diseases 0.4 Percentage share of ailment-​related deaths 81.6 Source: Sen A (1981). Poverty and Famines: An Essay on Entitlement and Deprivation. Oxford University Press, Table D.3, p. 204. 2.20  Human disasters 191 contrary to earlier fears, some limited success, and yet much was left unresolved in Paris—​what George Monbiot described in The Guardian as ‘by comparison to what it could have been, it’s a mir- acle’, but ‘by comparison to what it should have been, it’s a disaster’ (12 December 2015). There will no doubt be many efforts in future years to avert the climatic catastrophe-​in-​making. • Second, any human disaster that kills directly will also tend to disrupt the economy through its effects on jobs and incomes and prices, thereby affecting the entitlement to food and other es- sentials that people need. The discussion on famines above has already illustrated this general connection. That analysis can be extended. For example, an earthquake may not lead to a famine, but can leave an economy unsettled, making lives precarious for those who are not killed in the physical calamity itself. These ef- fects can be prevented or at least reduced in their impact through careful social intervention. The type of housing and the arrange- ments of city planning can also reduce the effects of earthquakes, floods, storms, and other hazards. Third, the morbidity and mortality associated with a human disaster can be large and even extend far beyond the immediately affected population who are directly hit by the physical events. However, taking note of the likely dangers from local conta- gions, the scale of both illness and death can be radically altered through a variety of preventive measures, such as immunization, ensuring safe water, influencing the routes of contact and the spread of infection, and also through health education and med- ical preparedness. The divisiveness of disasters The death toll from so-​called natural disasters reduces sharply as we move to higher and higher income countries, even when they do not have any fewer natural events of that type. The contrast can be very sharp indeed, as Kofi Annan, the former Secretary General of the United Nations, has noted (International Herald Tribune, 19 September 1999): ‘Ninety per cent of the disaster victims worldwide live in developing countries where poverty and population pressures force growing numbers of poor people to live in harm’s way on flood plains, in earthquake prone zones and on unstable hillsides. Unsafe buildings compound the risks. The vulnerability of those living in risk prone areas is perhaps the single most important cause of disaster casualty and damage.’ A similar remark can be made about the role of medical services and epidemiology, which are often rudimentary in the poorer countries. If low income is a predisposing condition for the penalty of disasters, the type of political rule can also be a factor of importance. The impact of the insensitivity of ruling governments in authori- tarian countries on the interests of vulnerable people was illustrated earlier in the specific context of famine prevention, but the problem of authoritarian insensitivity is a widespread phenomenon and goes well beyond the terrible record of authoritarian rules with famines. The neglect of medical services and healthcare that can be seen in many military dictatorships, for example in sub-​Saharan Africa, is Table 2.20.2  Some recent famines 1845–​1951, Ireland A series of famines linked with a blight in potato farming, with loss of jobs, incomes, and staple food; famine mortality around one million deaths (the largest proportionate mortality among all ‘recent’ famines). 1928–​1929, China Famines, particularly affecting farming population, especially in Shensi, Honan, and Kansu provinces, with mortality estimates that are in excess of 5 million deaths. 1932–​1934, Soviet Union Connected with agricultural turmoil, at least partly linked with collectivization, especially severe in Ukraine, with mortality estimates in excess of 5 million deaths. 1943, India In the province of Bengal in British India, with a fairly normal food availability; mortality numbers are now placed between 2 and 3 million. 1958–​1961, China Linked with the disastrous failure of the so-​called Great Leap Forward, there was a sharp decline in food production, disruption of normal economic processes, and huge chaos; the famine was well hidden from those not directly exposed to it. The death toll has been later estimated to be at least around 30 million (making it the largest absolute size of famine mortality in the recorded history of famines in the world). 1967–​1968, Nigeria Connected with civil war and the blockade of Biafra; no reliable estimate of deaths has been made. 1972–​1974, Sahel region Affecting several African countries in the Sahel belt (Burkina Faso, Chad, Mali, Mauritania, Niger, Senegal), with some reduction in agricultural production, but also affecting pastoralists both through animal death and through a fall in the relative prices of animal products against staple food from agriculture; no reliable mortality estimates can be made. 1973, Ethiopia The drought in the Wollo province reduced the purchasing power of the poor, and even though there was no significant reduction in food output at all for Ethiopia as a whole (the reduction was confined mainly to Wollo), food did not move into Wollo, and some moved out of it, because of the relatively larger purchasing power of the rest of Ethiopia; no reliable mortality estimates can be made. 1974, Bangladesh This famine, which followed shortly after the disruption of civil strife that led to the break-​up of Pakistan, occurred in a year of peak food availability. Floods that would reduce food output the following year immediately affected employment and incomes of rural labourers, and also an exaggerated anticipation of a coming food crisis made food prices rises very steeply (followed by a price decline later on, after the famine). Mortality estimates vary but it was kept very much in check by a huge food relief programme arranged by the government through which 4.35 million people (about 6 per cent of the total population) were fed. 1975–​1979, Cambodia Linked with the Khmer Rouge ravages of the rural economy, deportation of parts of the urban population, and systematic slaughter of unfavoured people; the death toll is estimated to be around one million. Since early 1990s, North Korea Intermittent famines with economic disruption by an authoritarian regime with a strong ideological agenda; death estimates are often placed around one million—​so far. 192 section 2  Background to medicine itself a problem for disaster management. There are political param- eters related to the medical, social, and economic interventions that can make disasters less likely, and less devastating when they do occur. The effects of similar natural phenomena can be widely different, depending on income levels, political reactions, medical arrangements, and other variable conditions. Human disasters are very divisive events, not only between coun- tries but also within them. Even in the same country, and sometimes even within the same region of a country, a disaster can ruin some groups of people while leaving others almost completely untouched. Famines, for example, tend to affect those who are already vulner- able because of poverty and the absence of safety nets on which they can rely, private or social. Policy intervention has to take note of the divisions along the lines of class and occupation groups. There are also differences related to age and gender. Children may be particularly vulnerable in some disasters, including in epidemics of some diseases. The need for immunization of children and arran- ging medical attention for children are important requirements in disaster management, and so is the institution of economic safety nets through state provision and other social arrangements for child support and income supplement at moments of dire need. In some disasters (for example in the present phase of AIDS epi- demic), women are increasingly more affected. It is usually the case that the opposite is true in wars, which typically kill more men than women or children, although women may be particularly vulner- able when there is a civil war targeted against a specific community where women are systematically raped and injured, as has happened for example in Darfur in Sudan. Even though the mortality rate may be higher among men, leaving a lower ratio of men in the surviving population, the huge incidence of rapes amounts to a gigantic vul- nerability for women in particular. There are other kinds of special vulnerabilities in disasters, like breakdown of families in famines, from which women tend especially to suffer. There is also a more subtle point that tends to be neglected in journalistic reports. The life expectancy of women is typically longer than that of men, unless this is reversed through gender bias in social care. There is some evidence that the life expectancy of women is comparatively more reduced by many types of so-​called natural disasters than that of men. This greater vulnerability of women is sometimes hidden by the fact that even after the larger reduction, women may still have a higher life expectancy than men, but—​as was discussed in the context of famine analysis—​the right comparison is between what actually happens with a disaster and what could have been expected to have happened in its absence. That distinction is as relevant for medical practitioners and public health experts as it is for social and economic investigators, as is the recognition that human disasters are typically highly divisive events. FURTHER READING Bose S (2009). Pondering poverty, fighting famines: towards a new history of economic ideas. In: Basu K, Kanbur R (eds) Arguments for a better world, Vol. II. Oxford University Press, Oxford. Depoortere E, et al. (2004). Violence and mortality in West Darfur, Sudan (2003–​2004): epidemiological evidence from four surveys. Lancet, 364, 1315–​20. Dreze J, Sen A (1989). Hunger and public action. Oxford University Press, Oxford. Guha-​Sapir D, Hargitt D, Hoyois P (2004). Thirty years of natural disasters 1974–​2003:  the numbers. Centre for Research on the Epidemiology of Disasters, Presses Universitaires de Louvain, Louvain-​la-​Neuve, Belgium. Kahn ME (2005). The death toll from natural disasters: the role of income, geography, and institutions. Review of Economics and Statistics, 87, 271–​84. Malik K (2014). Multiculturalism and its discontents: rethinking diver- sity after 9/​11. Seagull Books, Kolkata, India. Neumayer E, Plumper T (2007). The gendered nature of natural disasters: the impact of catastrophe events on the gender gap in life expectancy, 1981–​2002. Ann Assoc Am Geogr, 87, 551–​66. Ravallion M (1987). Markets and famines. Oxford University Press, Oxford. Sen A (1981). Poverty and famine: an essay on entitlement and depriva- tion. Oxford University Press, Oxford. Sen A (2006). Identity and violence:  the illusion of destiny. Norton, New York and Penguin, London. Stephens I (1966). Monsoon morning. Ernest Benn, London. Stern N (2007). The economics of climate change. Cambridge University Press, Cambridge. Vaughan M (1987, 2007). The story of an African famine. Cambridge University Press, Cambridge. de Waal A (2005). Famine that kills: Darfur, Sudan. Oxford University Press, Oxford. World Disaster Report 2005 (2006). International Federation of Red Cross and Red Crescent Societies, Geneva, Switzerland. 2.21 Humanitarian medicine 193 2.21 Humanitarian medicine 193 ESSENTIALS Humanitarian medicine addresses the human consequence of crises such as conflict, disaster, or displacement, and serves to assist those whose lives and health are impacted by such events. It is practised in challenging settings where resources are limited and environments unstable, and requires a clinical skillset which is both near limitlessly broad and context specific. The humanitarian sector has expanded significantly during the last two decades as a result of climate-​related crises and increasing complex humanitarian emergencies due to protracted and multi- faceted conflicts. The picture today is evolving rapidly to ad- dress expanding humanitarian demands and a changing global sociopolitical reality. Introduction Humanitarian medicine is a discipline in which the overarching aim is to provide relief to those whose lives and health are harmed by crises. This work strives to uphold the core humanitarian prin- ciples of humanity, neutrality, impartiality, and independence, even when the reality on the ground is dangerous and chaotic. Unlike most clinical disciplines, humanitarian medicine is not defined by technique but rather by the setting in which it oc- curs:  complicated environments, such as conflict, disaster, or displacement, that are inherently resource-​limited and insecure. Within this context, however, its scope is broader than clinical medicine and requires a coordinated effort between multiple dis- ciplines. Moreover, the practice of humanitarian medicine in- volves adherence to principles which go beyond the precepts of the Hippocratic Oath. Humanitarian Medicine has been proposed as a practice that ‘goes beyond the usual therapeutic act and promotes, provides, teaches, supports, and delivers people’s health as a human right, in conformity with the ethics of Hippocratic teaching, the principles of the World Health Organization, the Charter of the United Nations, the Universal Declaration of Human Rights, the Red Cross Conventions and other covenants and practices that ensure the most humane and best possible level of care, without any discrimination or considera- tion of material gain’.1 Humanitarian providers practise medicine in one of its truest forms. Humanitarian medicine occurs in settings with low re- sources, where providers need to rely on clinical skills rather than extensive ancillary tests and imaging. It would be hard to find a practice of clinical medicine that more closely demonstrates Hippocrates’ assessment that ‘the art (of medicine) consists of three elements: the disease, the patient and the doctor’. At a period when Western-​trained physicians are becoming increasingly specialized, humanitarian providers often need to diagnose and treat patients that cross all specialties, including those with rare or atypical pres- entations, and they often conduct their practice without the safety net of referral. Although not a new discipline, the sphere of humanitarian medi- cine has expanded over the last 20 years, primarily in response to an increase in the scale and frequency of humanitarian emergencies. Climate change and natural disasters have become more frequent, violent conflicts are less defined and more protracted, and forced displacement has reached critical levels. With this reality comes the human consequence and the humanitarian needs of those affected. The United Nations Office for the Coordination of Humanitarian Affairs (OCHA) estimates that at least 141  million people in 37 countries were in need humanitarian assistance in 2017—​the highest level in history (Fig. 2.21.1). The totality of need is staggering, and humanitarian medicine is a practice needed now more than ever. Origins and evolution of the humanitarian sector History of humanitarian response Humanitarian medicine is increasing in prominence (as demon- strated by its new inclusion in this established textbook!). This is related to many factors including increasing global humanitarian 2.21 Humanitarian medicine Amy S. Kravitz 1  Masellis M, Gunn S (2017). International Association for Humanitarian Medicine (IAHM) Brock Chisholm. Iahm.org. http://​www.iahm.org/​eng/​home_​ medicinaumanitaria.htm 194 section 2  Background to medicine Democratic Republic of the Congo (DRC) Central African Republic (CAR) Chad Libya Cameroon Nigeria Burkina Faso Senegal Mauritania Haiti Mali Niger Nigeria regional refugee response plan Burundi regional refugee response plan DRC Uganda Chad Cameroon Niger Rwanda Burundi Nigeria Tanzania Sudan Myanmar Afghanistan Occupied Palestinian territory (oPt) Syrian Arab Republic Iraq South Sudan Burundi Somalia Djibouti Yemen Ethiopia Ukraine Syria regional refugee and resilience plan South Sudan regional refugee response plan Sudan Ethiopia Egypt Turkey Iraq Lebanon Jordan Kenya Uganda South Sudan Syrian Arab Republic Countries with humanitarian response plans (HRPS) or other appeals Countries included in regional refugee response plans Fig. 2.21.1  Response plans around the world; Global Humanitarian Review 2017. Reproduced from Global Humanitarian Review 2017, United Nations Office for the Coordination of Humanitarian Affairs (OCHA). 2.21  Humanitarian medicine 195 needs, the expanding size of the overall humanitarian sector, and the role that the mass media have played in raising awareness of global crises and increasing the profile of those who are engaged in the emergent responses. However, the humanitarian concept is far from a new phenomenon: as the Greek historian, Thucydides, wrote in his classical account of the war between Sparta and Athens in the 5th century BCE: ‘If it had not been for the pernicious power of envy, men would not so have exalted vengeance above innocence and profit above justice . . . in these acts of revenge on others, men take it upon themselves to begin the process of repealing those general laws of humanity which are there to give a hope of salvation to all who are in distress.’ (Thucydides, History of the Peloponnesian War) This understanding of human nature and behaviour in crises nearly two and a half millennia ago is one of the earliest discourses on hu- manitarian action. Humanitarian principles have been enshrined in many religious teachings, and limits to warfare, including the acceptable conduct of wars, have been articulated in texts from the time of Thucydides: examples include The Art of War by Sun Tzu. But despite its underlying presence, for millennia the conduct of humanitarian medicine lacked codification and the practical frameworks that it now enjoys. The origins of contemporary humanitarianism are traditionally considered to be rooted in the actions of Henri Dunant, a Geneva-​ born businessman, who in June 1859 travelled to northern Italy to see Emperor Napoleon III, arriving in time to witness the imme- diate aftermath of one of the bloodiest battles of the century, the Battle of Solferino in the Second Italian War of Independence. With over 5000 soldiers killed and estimates of over 23 000 left wounded on the battlefield, Dunant’s experiences led to the publication of a small book entitled Un Souvenir de Solférino (A Memory of Solferino) in 1862. This book focused attention on the value of having a pre- pared cadre of responders to provide support in emergencies and proposed a plan for the formation of government sponsored inter- national relief societies to care for those wounded in wartime. He put forth the idea that aid workers should be permitted to assist the wounded on the battlefield, unencumbered and unharmed, pro- vided they remained neutral to the conflict. In 1863, Dunant helped to establish The International Committee for Relief to the Wounded, providing the framework for the even- tual establishment of the Red Cross. The following year, on 22 August 1864, 12 nations agreed to the principles of protection for the wounded and the neutrality and independence of medical per- sonnel on the battlefield. By signing the Geneva Convention for the Amelioration of the Condition of the Wounded in Armies in the Field, now commonly known as the Geneva Conventions, the concepts of rules in war and protection for humanitarian assistance were en- shrined in law. For his efforts, Dunant was awarded the first ever Nobel Peace prize in 1901. From this origin, formal international humanitarian law has been designed to specifically cover the duties of those engaged in armed conflicts as well as the rights of humanitarian organizations involved in relief operations. The laws serve as a careful equipoise between respect for state sovereignty with the recognized rights of inde- pendent organizations to provide aid to those in need. In total, the four Geneva Conventions of 1949, the two 1977 Additional Protocols, and a final additional Protocol in 2005, form the funda- mental basis for international humanitarian law. Derived directly from these laws are the fundamental principles of humanitarian action: humanity, neutrality, impartiality, and inde- pendence. The OCHA summarizes these key principles as follows: • Humanity:  Human suffering must be addressed wherever it is found. The purpose of humanitarian action is to protect life and health and ensure respect for human beings. • Neutrality: Humanitarian actors must not take sides in hostilities or engage in controversies of a political, racial, religious, or ideo- logical nature. • Impartiality:  Humanitarian action must be carried out on the basis of need alone, giving priority to the most urgent cases of dis- tress and making no distinctions on the basis of nationality, race, gender, religious belief, class, or political opinions. • Independence:  Humanitarian action must be autonomous from the political, economic, military, or other objectives that any actor may hold with regard to areas where humanitarian action is being implemented. International humanitarian law, combined with medical ethics and other legal frameworks for healthcare providers, was intended to de- fend the autonomy of medical personnel, to protect them against interference by security or military personnel, and to ensure that healthcare is provided on the basis of need, and neither refused nor used against groups. Aid architecture The formation of the Red Cross, and the principles upon which it was based, set the standard for the development of subsequent non​governmental organizations (NGOs). The League of Nations High Commission for Refugees, under the newly formed League of Nations, was founded in 1921 and served as a forerunner of the United Nations High Commissioner for Refugees (UNHCR). This was the first international agency that focused upon legal and polit- ical protections and assistance to refugees. Other humanitarian or- ganizations followed, and by the 1980s the key emergency offices within the United Nations had been formed, followed by the prolif- eration of NGOs. Today, humanitarian medicine is practised within a framework of a larger aid system, with an UN-​led coordination mechanism established in 1991 through General Assembly Resolution. Key pillars of the UN-​led humanitarian system included the forma- tion of the Inter-​Agency Standing Committee (IASC) as the main policy and decision-​making body guiding international humani- tarian response, and the establishment of the OCHA, intended to increase coordination among the multitude of actors now en- gaged in humanitarian relief. No individual organization has suf- ficient capacity to address the comprehensive needs within even a single humanitarian crises, let alone the multitude simultaneously ongoing, and efforts to improve coordination between agencies, as well as with the host authorities, and the affected communities remains an ongoing priority. As part of that effort, the UN intro- duced the cluster system in 2005 to increase accountability, or- ganization, and professionalism within the global humanitarian 196 section 2  Background to medicine community. With health serving as one of the 11 established clus- ters, the purpose of this system was to ensure ‘that international responses to humanitarian emergencies are predictable and account- able and have clear leadership by making clearer the division of la- bour between organizations, and their roles and responsibilities in different areas’. (Fig. 2.21.2). The practice of humanitarian medicine Precipitants and preparations All crises, including violent conflict and natural disaster, can af- fect the health of a population, ranging from injuries and trauma to the exacerbation of underlying medical conditions. These events can weaken already poor infrastructure and affect access to basic services such as healthcare, clean water, and sanitation. Population displacement can result, often precipitating the spread of infectious diseases that are exacerbated by overcrowding. The diseases with the highest epidemic potential in such settings—​cholera, dysen- tery, measles, and meningitis—​are uncommonly seen in most clin- ical practices today, but where there is displacement they can have devastating individual and population level effects. Humanitarian assistance is essential in any crisis situations where the host government is unable to meet the demands within its own country or is unwilling to provide for the needs of its popu- lation. Before the establishment of a humanitarian space, how- ever, even in emergent situations, assessments, and planning must occur. Most experienced humanitarian organizations have teams of medical, logistics, security, and epidemiological experts on standby to deploy to any affected area to immediately carry out rapid assessments. The inclusion of local partners who can trans- late and provide an understanding of local context is essential, and mapping platforms such as geographic information systems can serve as tools which allow a better understanding of the locations, relationships, and patterns. Logistical considerations can be complex, particularly in un- stable environments, and often require the development of a supply chain with more agility than that required for typical commercial endeavours. Most agencies initially operate by a ‘push’ system, using prepacked and stockpiled kits ready for rapid deployment to an emergency, and switch to a ‘pull’ system, when supplies are requested as per need, within about 2–​3 weeks of operations. Humanitarian doctors who join these relief efforts, either in the ini- tial stages or once the programme has become established, should Humanitarian & Emergency Relief Coordinator Early recovery UNDP Education UNICEF & Save the Children Emergency telecom- munications WFP Health WHO Food security WFP & FAO Logistics WFP Nutrition UNICEF Camp coordination and camp management IOM/UNHCR Shelter IFRC/ UNHCR Protection UNHCR Water, sanitation, and hygiene UNICEF Pre ve nt io n Mit ig at io n Pre pa re d n e s s D is as ter Res po ns e Reco ve ry Re co ns tr uc tio n Fig. 2.21.2  UN cluster system. Reproduced from Reference Module for Cluster Coordination at Country Level (revised July 2015), Inter-​Agency Standing Committee. 2.21  Humanitarian medicine 197 gather information about the context and strive to form a basic framework of the background clinical and epidemiological picture. The setting they will enter may be chaotic and tense, and prepar- ations will serve to align expectations and streamline focus onto the clinical challenges which lie ahead. Complex realities Simply put, providing clinical care in humanitarian settings is a challenge; resources are limited, advanced diagnostics are not readily available, and the environment is unfamiliar and chaotic. Colleagues may have different backgrounds, cultural norms, and professional practices. Language barriers often require the use of translators. Despite these realities, the breadth of possible caseloads is nearly limitless. A humanitarian provider may need to manage acute emergencies, respond to victims of sexual violence, treat non-​ communicable diseases, or support war surgery. A caseload could include ophthalmological, renal, and orthopaedic presentations, while concurrently working to ensure the provision of clean water and support for hospital management. Without extensive adminis- trative or axillary staff, humanitarian providers may have a broad range of duties that stretch their capacity and test their skills to the limit. Humanitarian medicine must be practised with a nuanced under- standing of the specific context in which it occurs; mastery in one setting does not imply universal expertise in another. The skills re- quired for effective working (for example) in the Central African Republic, where there is context of protracted conflict, include an understanding of malnutrition, epidemic control, and of sexual vio- lence. In contrast, providing assistance to Iraqi refugees involves supporting emergency health needs and a shifting focus from war injuries to long-​term rehabilitation. In any setting these clinical scenarios would pose a challenge, but working in unstable environ- ments adds further complexity, since before even starting it may be necessary to negotiate humanitarian space with armed combatants, coordinate across agencies, and gain the trust of a broad community. Rather than a ‘simple’ mastery of clinical medicine, effective practice in humanitarian settings demands of the provider that they under- stand disciplines such as security, law, epidemiology, and logistics, and can operate practically in highly politicized domains Despite the innumerable complexities that present themselves, there is as yet no overarching licensing body or specific clinical review board to instruct and regulate this emerging discipline. There have been coordinated efforts to standardize best practices in humanitarian aid, such as with the Sphere Standards, initiated in 1997, and the Core Humanitarian Standards on Quality and Accountability (CHS), launched in 2014 (Box 2.21.1), but there are clear limitations to the use of universal standards in diversely varied settings. The lack of a definitive training programme to pre- pare the physician for the enormous variability of contexts and scenarios means that humanitarian providers must depend upon their medical education, clinical experiences, and preparations, and be willing to listen and learn from others. National or ‘local’ col- leagues are often the source of indispensable information, and the best humanitarian doctors are those who have the capacity to ac- knowledge their own shortcomings and learn from others. Having ‘gold standards’ might remain the goal, but the reality of circum- stances ensures that these remain at best highly challenging or, in effect, impossible. The humanitarian system Size and economics In response to a changing global landscape there has been an unpre- cedented expansion in the humanitarian sector over the last decade. Between 2007 and 2010, the fieldworker population rose by 4–​6% annually, with financial contributions to the international emer- gency response efforts rising nearly threefold. By 2010, there were an estimated 274 000 humanitarian providers who collectively re- sponded to 103 natural disasters and 43 complex emergencies. By 2014, this number had grown to 450 000 humanitarian providers, working across more than 4480 operational aid organizations. Box 2.21.1  Humanitarian response review and sector strengthening Throughout the history of humanitarian action, nearly every major international intervention had raised questions about how to measure the impact and effectiveness of the response and how to make the sector more coordinated, efficient, and effective. When the contextual variability of humanitarian interventions can include sociopolitical con- straints and security limitations, developing a uniform approach to improvement can be difficult to achieve. Despite the challenges, over the past two decades, great strides have been made to further the ef- fectiveness of humanitarian action, and the evaluation of humanitarian responses, such as that to the 2010 earthquake in Haiti, remain an im- portant component driving this effort. After the devastating 2010 earthquake in Haiti, which killed more than 220 000 people and affected nearly a third of the Haitian population, almost $9 billion in aid was donated in what amounted to one of the largest emergency relief responses ever mounted. Countless lives were saved in the immediate search-​and-​rescue efforts and medical response, and in subsequent relief efforts which reduced food insecurity and con- trolled an ensuing cholera epidemic. Despite such a large financial com- mitment and global response, by 2013, acute vulnerabilities persisted with over 10% of the population remaining displaced in camps and 30% continuing to have chronic and acute health needs. A review of the international response to the earthquake showed that, significantly, the Government of Haiti, was estimated to have received only 1% of the immediate humanitarian aid, and between 15 and 21% of the longer term recovery funding for reconstruction and develop- ment. NGOs and private contractors were the primary recipients of the aid in Haiti, and despite building extensive infrastructure with the use of public funds, had limited accountability to the national health authorities and lacked transparency as to quality and services offered. Further, with limited infusion of funds into the existing public health system, Haitian clinics and hospitals continued to lack the necessary resources to func- tion. With this in context, the 2014 Haiti Humanitarian Action Plan, developed through OCHA, sought to address the ongoing critical hu- manitarian needs but also focused at strengthening national capacities. The review of the response also demonstrated inconsistent capaci- ties and performance between agencies, and, (as concluded by Sir John Holmes, former Under-Secretary General for Humanitarian Affairs and Emergency Relief Coordinator) ‘the influx of many hundreds of hu- manitarian organizations, many of whom, while well-​meaning, were not necessarily professional and well-​informed in their approach, posed a huge challenge to coherence’. In addition to reforms aimed at coord- ination, the review highlighted the need ‘to develop principles, cri- teria and standards for medical teams that respond to emergencies and disasters’. This led to the development of the Classification and Minimum Standards for Emergency Medical Teams in sudden-​onset disasters, and fed into the formation, in 2014, of the Core Humanitarian Standards on Quality and Accountability. 198 section 2  Background to medicine Although international actors receive the most media attention, local and national agencies and authorities actually deliver the greater assistance in conflict and disaster situations, particularly in the critical initial phase. Over 80% of operational aid organizations are local NGOs that work only within their home country. These na- tional providers are the ‘first in’ and ‘last out’ and form the backbone of the aid delivery system, but they are often underrecognized and excluded from much of the available funding. Despite a consensus on the essential role that national and local NGOs play within the humanitarian sector, recent data suggest that these sectors receive only a meagre share of global funding—​0.3% in 2017. Funding of the sector has risen in absolute amount, but the growth has not kept pace with need, indeed the increase in need has stretched the capacity of the sector to its apparent limit. In 1992, the first interagency humanitarian appeals by OCHA requested (US) $2.7 billion, a figure which, by 2007, had only risen to $5.5 billion. The International Red Cross and Red Crescent Movement (RCRC) set out its requirements separately, maintaining independ- ence from the UN-coordinated appeals. By 2016, the UN joint ap- peal was for $20.5 billion yet yielded only $12.4 billion in committed funding, a 40% shortfall, and—irrespective of any shortfalls—needs have continued to increase (Fig. 2.21.3). As of June 2017, the UN joint appeal estimated a total requirement of $23.5 billion, to target 101.2 million people across 37 countries. Changing humanitarian picture The dizzying increase in global humanitarian needs show no signs of abating. Today, the primary cause of human suffering and the re- sultant humanitarian response is intricately linked to conflict, par- ticularly those which are increasingly complex and protracted. With comparison to the historic precedence, today’s crises are increas- ingly referred to ‘complex humanitarian emergencies’, defined by the World Health Organization as ‘situations of disrupted livelihoods and threats to life produced by warfare, civil disturbance and large-​ scale movements of people, in which any emergency response has to be conducted in a difficult political and security environment’. Modern conflicts increasingly occur in urban rather than rural ­settings, and now affect middle income countries as often as poorer ones. The impact of such conflicts includes a massive surge in displace- ment, a regression on development, and a general increase of human suffering. The humanitarian consequences are massive and persistent. The International Committee of the Red Cross (ICRC) spends most of its budget on protracted conflicts, and in 2013, two thirds of the hu- manitarian aid from the Development Assistance Committee (DAC) donors was spent in countries which had been recipients of substantial humanitarian support for at least the previous 8 years. Protracted conflicts and other man-​made crises have already con- tributed to unparalleled levels of displacement and 2016 marked an unprecedented challenge for the humanitarian community. UNHCR reported 65.6 million people forcibly displaced due to persecution, conflict, violence, or human rights violations, exceeding even the levels seen after the Second World War (Fig. 2.21.4) This equates to nearly 1% of the global population and the rate of new displace- ments is calculated to be 20 people every minute. Most refugees ori- ginate from the three zones of continued violence, Syria (5.5 million), In billion US$ 20 15 10 5 0 2004 2016 2014 Funding Gap Requested 2012 2010 2008 2006 Fig. 2.21.3  Funding trends 2004-​2016. 97 0 10 20 30 40 50 60 70 Displaced population (millions) Proportion displaced (number displaced per 1000 world population) 98 99 00 01 02 03 04 05 06 07 08 09 10 11 12 13 14 15 16 0 1 2 3 4 5 6 7 8 9 10 Refugees and asylum-seekers Internally displaced persons Proportion displaced Fig. 2.21.4  Trends in global displacement and proportion displaced (1997–​2016). From Global Trends: Forced Displacement in 2016, © 2017 United Nations High Commissioner for Refugees. 2.21  Humanitarian medicine 199 Afghanistan (2.5 million) and South Sudan (1.4 million). Despite the predominance of news coverage on the European refugee influxes, the reality is that the least wealthy countries are actually hosting the vast majority of those displaced. In 2016, 84% of the refugees were hosted in developing countries, with Turkey assuming the lar- gest number worldwide for the third consecutive year (2.9 million). Lebanon, meanwhile, hosted the largest number of refugees relative to population size, and current estimates are that one out of every six people in that country is a refugee. Pakistan, Iran, Uganda, and Ethiopia were the other primary countries of asylum for refugees. The challenges of displacement for the humanitarian community relate not simply to the sheer magnitude of the issue but also to the changing contextual picture. Displacement today no longer adheres to traditional precedents that were set in rural areas or in camp set- tings. Today’s response requires evolving practices, approaches, and interventions, in particular those that recognize and are sensitive to often dire historical contexts. Current crises in settings such as Yemen and Syria, where prewar health systems were functional, means that humanitarian providers must adapt traditional responses and con- sider the voids left by previously-​existing tertiary level or specialist care. The expectations placed on humanitarian providers in such set- tings can be untenably high. Requests to support dialysis units or ter- tiary specialist care are totally different propositions from traditional refugee responses, factors which have substantially increased the complexity of humanitarian programming in recent years. Further, working in dense urban areas poses different challenges with regard to shelter and housing, water and sanitation, engagement with es- tablished municipal and governmental systems, and even locating marginalized or displaced populations. Despite years of operational experience, humanitarian practices are currently, and rapidly, being adapted to today’s picture. But this takes time, and the humanitarian community may not have the desired resources for this investment as the needs of millions are generally urgent and pressing. Instability and insecurity in humanitarian space Addressing the humanitarian consequences of today means engaging in a sector that is more complex and hazardous than ever before. With increased involvement of non​state actors, conflicts can often no longer be considered to be between two recognizable sides. Fragmented structures can mean it is often difficult to know who is in charge, and the laws of war and the associated humanitarian pro- tections are often forgotten. Perhaps the most fundamental issues facing humanitarian pro- viders today is a decrease in respect for and enforcement of humani- tarian principles. Violations against humanitarian law, including a lack of respect for the protections of civilians, humanitarian, and medical workers, has risen to unprecedented levels and become a threat to the foundations of the sector. Analysis of data by the World Health Organization (WHO) showed that within the 2 year period of 2014 and 2015, there were 594 reported attacks on health care ­facilities or personnel across 19 different countries with emer- gencies, resulting in 959 deaths and 1561 injuries. Médecins Sans Frontière (MSF), meanwhile, reported attacks on 63 of their facil- ities in Syria in 2015, resulting in the total destruction of 12 facilities and the injury or death of more than 80 MSF-​supported medical staff in that year. There has been an appalling lack of accountability against the perpetrators of such crimes, allowing violations of inter- national law to occur with seeming impunity. The campaign #NotATarget was launched by MSF in 2015 to stand in solidarity with those targeted and to demand the ‘right of humani- tarian workers to provide humanitarian care to people in need, par- ticularly in conflict zones’. The laws to provide these protections are already in place, but without enforcement they hold little power. When nations and warring parties disregard the international ob- ligations and the protections in place for the impartial provision of healthcare, humanitarian action itself is threatened. Now more than ever, efforts must be made to strengthen and protect this system, and those who work within in it. In May 2016, UN Security Council Resolution 2286 passed, which condemned the attacks on medical facilities and demanded compliance with international law. At pre- sent, however, global leaders have not provided much substantive action to support the rhetoric. Political change and forward uncertainty Humanitarian Medicine has grown in prominence because it is im- portant, indeed essential. Those who practise humanitarian medi- cine have the fundamental ability to reaffirm humanity and help relieve human suffering. Yet the humanitarian sector now appears to be at a crossroads, with capacity stretched to a limit and insufficient funding, and today’s geopolitical context portends uncertainty. Throughout modern history, humanitarian principles have played a critical role in the geopolitical sphere, from Britain’s commitment to end the slave trade in early 1800s to the international response for those affected by the Northern Chinese Famine of 1876-​1879. In 1904, United States President Theodore Roosevelt used the op- portunity of his Annual Message to Congress to reaffirm a place in politics for humanity, stating that: ‘ . . . there are occasional crimes committed on so vast a scale and of such peculiar horror as to make us doubt whether it is not our manifest duty to endeavor at least to show our disapproval of the deed and our sympathy with those who have suffered by it . . . (and) . . . in extreme cases action may be justifiable and proper’. After the Second World War, these commitments were repeated and appeared firmly rooted in the foundations of liberal govern- ance, but these sentiments appear disconnected with the realities of the present day, and today’s global world order no longer conveys any semblance of solidarity. The predominant views from leaders is often decidedly more inward-​looking and self-​centred. Decisions made in response to the refugee influx in Western countries include the tightening of entry of refugees to the point of near stranglehold. The EU-​Turkey deal, signed by 28 European Union Heads of State in March 2016, agreed to the return to Turkey of ‘all new irregular migrants’ arriving on Greek shores, despite concerns about poten- tial violations of both EU and International Humanitarian Law. The traditions of humanitarian principles, which once ran deep in world politics, now appear reduced to strategic interest. The moral respon- sibilities of today’s world leaders can no longer be assumed. At this profoundly transformative time, those practising hu- manitarian medicine continue to uphold moral imperatives. They are demonstrating a commitment to medical excellence, with a broadening and strengthening of their capacities to meet the de- mands of those in need. But it is crucial that these and other active groups realize that the practice of a humanitarian provider should 200 section 2  Background to medicine go well beyond the therapeutic acts of relieving suffering and saving lives, as it serves to exert the general principles of humanity. The mere presence of humanitarian providers indicates a solidarity with those in need and helps to restore dignity for those impacted by crises. By bearing witness and demanding accountability, they re- ject the placid acceptance of suffering. Through their actions the capacity of compassion and the potential of humanity displayed, which at this particular time in history, perhaps more than ever be- fore, is imperative. FURTHER READING Bradol J-​H, Vidal C (eds) (2011). Medical interventions in humani- tarian situations: The work of Médecins Sans Frontières. Translated by Christopher Brasher, Nina Friedman, Philippa Bowe Smith, Karen Stokes, and Karen Tucker. Médecins Sans Frontières (MSF), Geneva, Switzerland. Kravitz A (2019). Oxford handbook of humanitarian medicine. Oxford University Press, Oxford. 2.22 Complementary and alternative medicine 201 2.22 Complementary and alternative medicine 201 ESSENTIALS Complementary and alternative medicine can be defined as diagnosis, treatment, and/​or prevention which complements mainstream medi- cine by contributing to a common whole, by satisfying a demand not met by orthodoxy, or by diversifying the conceptual frameworks of medicine. It is popular; hence doctors should know about it. Why is complementary and alternative medicine popular? The following motivations may be important: (1) to leave no thera- peutic option untried; (2) to take control over one’s own health; (3) to accord one’s healthcare with one’s global outlook; (4) to benefit from natural and, by implication, safe treatments; (5) to be given time, understanding, and empathy by a practitioner; (6) disenchantment with conventional medicine/​science. Types of complementary and alternative medicine The term covers a vast array of treatments and diagnostic techniques which have little in common except that they are not part of main- stream medicine. The most important modalities are (1) acupuncture—​ probably effective for some painful conditions and for nausea/​ vomiting; rarely causes severe adverse events. (2)  Phytotherapy—​ treatment with herbal extracts; can be evaluated by assessing each of the many remedies separately; some phytomedicines are supported by sound evidence. (3) Homeopathy—​based on irrational concepts of ‘like cures like’ and ‘potentizing’ (shaking and stepwise dilution of drugs); trial data fail to show efficacy for any condition. (4)  Spinal manipulation—​may be mildly effective for back pain as practised by chiropractors, osteopaths, physiotherapists, and other healthcare pro- fessionals; claims that it also works for many other conditions are not supported by sound evidence; can cause significant side effects (e.g. manipulation of the cervical spine causes transient adverse events in about half of all patients and has been associated with serious compli- cations such as dissection of the vertebral artery). Definition Most healthcare professionals feel they know intuitively what is meant by complementary and alternative medicine (CAM), yet an adequate definition is hard to find. Often CAM is described by char- acteristics that exclude it from mainstream medicine, for example: • not taught in medical school • not scientifically proven • not based on a scientific rationale • not used in routine healthcare CAM can be positively defined as ‘diagnosis, treatment, and/​or pre- vention which complements mainstream medicine by contributing to a common whole, by satisfying a demand not met by orthodoxy or by diversifying the conceptual frameworks of medicine’. CAM encompasses a large variety of techniques which have little in common except that they are not part of mainstream medicine, claim to offer help for most conditions, and pride themselves on a holistic approach to patient care (Table 2.22.1). Some relate to thera- peutic modalities (e.g. herbalism), some to diagnostic techniques (e.g. iridology), and some include both diagnostic and therapeutic modalities (e.g. acupuncture). There are considerable local differences in what are regarded as CAM or mainstream medicine. In Germany, for instance, massage therapy and herbalism are orthodox, whereas in English-​speaking countries they are usually regarded as CAM. Acupuncture is CAM in the West, while in China it is a widespread, traditional, and gener- ally accepted form of treatment. Since most CAM therapies are used as adjuncts to conventional treatments, ‘complementary’ is a more appropriate term than ‘al- ternative’. When used as a true alternative to mainstream medicine, CAM can become a hazard to patients even if the treatment itself is without risks. In many countries, including the United Kingdom, CAM is practised mostly by healthcare professionals who are not medically trained, often in the absence of stringent regulation, leading many to be concerned that vulnerable patients may be exploited. Prevalence The 1-​year prevalence of CAM usage by the general population ranges from 10% in the United Kingdom to 62% in Germany. In patient populations, these figures can be considerably higher. 2.22 Complementary and alternative medicine Edzard Ernst 202 section 2  Background to medicine For instance, most cancer patients try one form of CAM or another. The annual expenditure for CAM exceeds (US) $20 billion in the United States of America and £1.6 billion in the United Kingdom. In industrialized countries, typical users of CAM are middle-​ aged, female, well-​educated members of a high socioeconomic class. Indications for CAM range from chronic benign conditions where mainstream medicine does not offer a cure (e.g. back pain) to life-​ threatening diseases like cancer and AIDS. Most patients try CAM in parallel with conventional treatment yet 30 to 50% do not tell their conventional healthcare providers that they do so. A medical history should include specific questions about CAM. Reasons for CAM’s popularity There is not one but a complex mix of reasons; the following motiv- ations may be important: • to leave no therapeutic option untried • to take control over one’s own health • to align one’s healthcare with one’s global outlook • to benefit from natural and, by implication, safe treatments • to be given time, understanding, and empathy by a practitioner • disenchantment with conventional medicine/​science Table 2.22.1  Various other forms of therapeutic and diagnostic methods Name Principle Main indications/​ reasons for use Efficacy Risks Alexander technique Training process of ideal body posture and movement; developed by F M Alexander Musculoskeletal problems Very few clinical trials No serious adverse effects on record Applied kinesiology Diagnostic technique using muscle strength as an indicator; developed by G. Goodheart n.a. Repeatedly shown to be not valid Can delay reliable diagnoses Aromatherapy Application of essential oils usually through gentle massage techniques; developed by R.M. Gattefossé Relaxation improvement of well-​being Systematic reviews are mostly inconclusive Allergic reactions to oils Autogenic training Form of self-​hypnosis for relaxation and stress reduction; developed by J. Schultz Stress management Some evidence for effectiveness No serious adverse effects on record Chelation therapy Intravenous infusion of EDTA used in CAM for ‘deblocking’ arteries from arteriosclerotic lesions Circulatory disorders Shown to be ineffective Serious adverse effects, even deaths, reported Chiropractic Popular manual therapy based on the assumption that most health problems are due to misalignment of the spine and treatable through spinal manipulation; developed by D Palmer; seen as mainstream by many proponents Back pain, neck pain, and many others Cochrane reviews of chiropractic for back pain show it is not superior to standard therapies, no good evidence for other indications Serious adverse effects have been reported, their exact incidence is not known Colonic irrigation (or colon therapy) Cleansing of the colon through enemas with water or coffee for ‘detoxication’ Various No sound evidence for effectiveness Serious adverse effects reported Hypnotherapy Induction of trance-​like state to influence the unconscious mind Various Some evidence for effectiveness Serious adverse effects probably infrequent Iridology Diagnostic technique using signs and impurities on the iris n.a. Repeatedly shown to be not valid Can delay reliable diagnoses Macrobiotic diet Diet based on the yin/​yang principle using whole grains and vegetables Disease prevention Positive effects on cardiovascular risk factors Serious adverse effects reported Massage Various techniques of manual stimulation of cutaneous, subcutaneous, or muscular structures (deemed mainstream on the European continent) Musculoskeletal problems, anxiety, and many others Some evidence for effectiveness in musculoskeletal and psychological problems No serious adverse effects on record Osteopathy Health problems are thought to be due to misalignment of the spine and corrected through spinal mobilization; developed by T Still; seen as mainstream by many proponents Back pain, neck pain, and many others Systematic reviews of osteopathy for back pain are inconclusive Adverse effects less than with chiropractic Reflexology Internal organs correspond to areas on the sole of the feet and can be influenced through massaging these Relaxation Systematic review was inconclusive No serious adverse effects on record Spiritual healing Channelling of ‘healing energy’ through a healer into a patient Re-​establishing a wholesome balance Best evidence fails to show effectiveness No serious adverse effects on record Yoga Meditative, postural, and breathing techniques from ancient India Various Some evidence for effectiveness in asthma, or cardiovascalur risk factors, for instance No serious adverse effects on record n.a., not applicable. 2.22  Complementary and alternative medicine 203 Examples of CAM methods Acupuncture Description Traditionally, the Chinese believed that the life energy (Qi or Chi) flowing in particular channels (meridians) governs human health. The energy is a balance of opposite characteristics: yin and yang. Illness is understood as an expression of an imbalance between yin and yang. One way of re-​establishing the proper equilibrium would be to insert needles in acupuncture points located along the merid- ians. Instead of or in addition to needles, acupuncturists also use pressure (acupressure), laser light (laser acupuncture), electrical currents (electroacupuncture), heat (moxibustion), or other stimuli to stimulate acupuncture points. Neither the meridians nor the acu- puncture points have a morphological basis, and the theory of yin and yang is not supported by facts. Mode of action Neurophysiological research has created a (hypothetical) rationale for acupuncture: activation of brainstem nuclei, and the release of neural transmitters and endorphins in the brain and descending in- hibitory control systems. There are considerable differences between traditional Chinese and Western acupuncture. In traditional Chinese medicine, con- ventional diagnoses are not normally sought, treatment is highly individualized according to each patient’s particular yin/​yang im- balance, and acupuncture is employed as a ‘cure all’. In contrast, Western acupuncturists tailor their treatment to the conventional diagnosis established beforehand and use acupuncture for selected indications for which it might be efficacious. Efficacy Many trials of acupuncture exist but are fraught with methodological problems, such as placebo effect and blinding of patients or therap- ists. Over 200 systematic reviews and meta-​analyses of acupuncture trials for various conditions have been published. As they are often based on biased primary data, their conclusions are not always reli- able. According to such reviews, the most convincing evidence exists for the following conditions: • chronic back pain • dental pain • gastrointestinal endoscopy • idiopathic headache • osteoarthritis of knee • postoperative nausea and vomiting For other indications, the data remain inconclusive because of highly contradictory findings, poor quality research, insufficient quantity of primary data, or a total absence of reliable studies. Recently, new non​penetrating acupuncture devices have become available which best control for placebo effects in clinical trials. The results of clinical trials employing such devices tend to sug- gest that most of the therapeutic response to acupuncture relies on ­placebo effects. Hardly any acupuncture studies are conducted with masking the therapist according to group allocations of patients. Safety Serious complications of acupuncture include: • trauma (e.g. cardiac tamponade, pneumothorax) • infections (e.g. viral hepatitis) With well-​trained therapists, such complications are rare. However, mild adverse effects (e.g. pain or bleeding at the site of needling) occur in about 10% of all patients. In addition, there are indirect risks. For instance, some acupuncturists advise their patients about prescription drugs without having the medical competence to do so. Phytotherapy Description Medical herbalism (phytotherapy) is treatment with whole plants, parts of plants, or plant extracts. The term does not cover treatment with single active constituents such as acetylsalicylic acid, origin- ally derived from willow bark. Since all plants contain a multitude of chemicals, phytotherapy involves treatment with a mixture of poten- tially active compounds. In many cases there is uncertainty about the most important active ingredients and their pharmacological ac- tions. The claim of herbalists that the whole plant (extract) will yield more beneficial effects than any single isolated ingredient (synergy) is largely unproven. Most medical cultures have their version of traditional herb- alism. Traditional Chinese medicine has a long history of employ­ ing mixtures of herbs to prevent and treat disease. This tradition was modified by the Japanese and resulted in Kampo medicine. The Indian tradition has generated Ayurvedic medicine which relies heavily on plant-​based remedies. Likewise, European herb- alism has a tradition which is as old as European medicine itself. The scientific investigation of medicinal herbs is, however, a rela- tively recent innovation. Mode of action There are few differences in principle between pharmacotherapy and phytotherapy except that herbal remedies are multicomponent systems which render them pharmacologically more complex. There is no reason why the rules of pharmacokinetics and pharma- codynamics should not apply. Discernible modes of action exist for every plant-​based medicine. In some cases, these have been eluci- dated; in many other cases, they remain hypothetical. Efficacy Based on authoritative systematic reviews and meta-​analysis, good or at least encouraging evidence exists for the efficacy of the fol- lowing herbal remedies: • Andographis paniculata for upper respiratory tract infections • black cohosh (Actaea racemosa) for alleviating menopausal symptoms • cranberry (Vaccinium macrocarpon) for prevention of urinary tract infections • devil’s claw (Harpagophytum procumbens) for treating musculo- skeletal pain • garlic (Allium sativum) for hypercholesterolaemia • Ginkgo biloba for intermittent claudication 204 section 2  Background to medicine • green tea (Camellia sinensis) for prevention of cancer and cardio- vascular disease • hawthorn (Crataegus spp.) for treatment of chronic heart failure • horse chestnut (Aesculus hippocastanum) seed extract for primary venous insufficiency • kava (Piper methysticum) as an anxiolytic drug • nettle (Urtica dioica) for benign prostate hyperplasia • red clover (Trifolium pratense) for hot flushes during menopause • saw palmetto (Serenoa repens) for benign prostatic hyperplasia • St John’s wort (Hypericum perforatum) for mild to moderate depression • valerian (Valeriana officinalis) for insomnia • willow (Salix spp.) bark for pain For many other popular medicinal herbs, too few clinical trials have been carried out, the studies are methodologically flawed, or their results are contradictory. No good evidence exists for the efficacy of traditional approaches to herbal medicine where plant mixtures with a multitude of ingredients are used depending, not on a conventional diagnosis, but on the individual patient’s set of symptoms, consti­ tution, or other circumstances. Yet these traditional approaches are the ones likely to be applied if a patient consults a medical herbalist. Safety Many medicinal herbs have been associated with serious adverse ef- fects (see Chapter 10.4.3), for example: • aconite (Aconitum) cardiotoxic • Aristolochia nephrotoxic • black cohosh (Actaea racemosa) hepatotoxic • broom (Cytisus scoparius) cardiotoxic • chaparrall (Larrea tridentate) nephrotoxic • comfrey (Symphytum officinale) hepatotoxic • kava (Piper methysticum) hepatotoxic • liquorice root (Glycyrrhiza glaba) induces hypokalaemia • pennyroyal (Mentha pulegium) hepatotoxic • skullcap (Scutellaria lateriflora) hepatotoxic Herbal remedies can interact powerfully with synthetic drugs (Table 2.22.2), and Asian herbal medicines have been shown re- peatedly to be adulterated with synthetic drugs or contaminated with heavy metals. In many countries (e.g. United Kingdom and the United States of America) herbal medicines are marketed as food (or dietary) supplements in the absence of stringent quality control. Homeopathy Description Samuel Hahnemann, a German physician, believed in two major principles which formed the basis of an entirely new school of medi- cine: homeopathy. The ‘like cures like’ principle postulates that, if a given drug induces symptoms (e.g. a headache) in healthy individ- uals, it can be employed to treat headaches in patients who suffer from it. The second principle holds that ‘potentizing’ (i.e. shaking and stepwise diluting) drugs makes them more potent for the treatment of illness. Homeopathic dilutions prepared in this way are believed to be clinically effective even if not a single molecule of the original medicine is contained in the potentized remedy. For 200 years, scientists have pointed out that these principles fly in the face of science and that therefore homeopathy cannot possibly work beyond a placebo effect. Homeopaths, however, insist that their remedies act via ‘energy’ transfer from the original substance to the diluent (the theory of a ‘memory of water’). Homeopaths do not treat diseases but claim to treat the whole in- dividual. They take a detailed history with the aim to match the to- tality of the symptoms and characteristics of that patient with a ‘drug picture’ (the ‘like cures like’ principle). This homeopathic remedy, given in the correct potency, should then be the optimal treatment for that patient. Clinical improvement may, however, take weeks or months, and, in about 20% of all cases, symptoms may deteriorate before they become better, a phenomenon termed ‘homeopathic aggravation’. At the time of Hahnemann there were very few effective treat- ments and many that were overtly harmful. Homeopathic remedies had virtually no adverse effects. Hahnemann can therefore be cred- ited with clinically exploiting the placebo effect to the best benefit of his patients. It is hardly surprising then that homeopathy con- quered many countries (e.g. France, the United States, India, South America) by storm. The advent of effective synthetic drugs and other efficacious treatments lead to the sharp decline of homeopathy; the recent boom of CAM, however, has brought about a revival. Mode of action Several hypotheses have been developed to explain the transfer of ‘energy’ from the mother tincture to the diluent. However, none has so far withstood the scrutiny of independent assessment. Neither has the ‘energy’ ever been defined in physical terms, nor are there rational explanations as to how this ‘energy’ (if it exists) might af- fect human health. Therefore, homeopathy remains among the least plausible forms of CAM. Efficacy A meta-​analysis of all 89 randomized and/​or placebo-​controlled clinical trials published by 1995 calculated an overall odds ratio of 2.45 in favour of homeopathy. When only the 26 most rigorous studies were meta-​analysed, the odds ratio fell to 1.66 but remained statistically significant. This publication was criticized (e.g. for pooling data for all medical conditions and all homeopathic rem- edies, and for including trials that were not randomized or placebo-​ controlled and studies of material (low dilution) remedies for which efficacy is not disputed). The results of about a dozen subsequent systematic reviews generally fail to demonstrate effects beyond pla- cebo. Therefore, the best evidence available to date fails to suggest efficacy. Safety Highly diluted homeopathic remedies cannot cause pharmacological adverse effects. Homeopaths claim that ‘homeopathic aggravations’ (an exacerbation of presenting symptoms after administration of the optimal remedy) occur in about 20% of cases; if that were true, they might represent a safety issue. ‘Indirect’ safety problems include the substitution of effective interventions by homeopathy. For instance, non-​medically qualified homeopaths tend to advise their clients against immunization and advocate homeopathic remedies instead. If this happens on a large scale, it jeopardizes herd immunity against serious infectious diseases. 2.22  Complementary and alternative medicine 205 Table 2.22.2  Possible interactions between some popular herbal remedies and synthetic drugs Herbal remedya Usage or pharmacological effectb Possible interaction Aloe (Aloe spp.) Various With chronic use, potentiation of cardiac glycosides or antiarrhythmic drugs due to loss of potassium Black cohosh (Actaea racemosa) Oestrogenic Increased effects of antihypertensives Borage (Borago officinalis) Anti-​inflammatory Interaction with antiepileptics, may increase risk of seizure Broom (Cytisus scoparius) Antiarrhythmic, diuretic Increased effects of antidepressants, β-​blockers, and cardiac glycosides Cascara (Rhamus purshiana) Laxative, cathartic Loss of potassium with chronic use, potentiation of cardiac glycosides, or antiarrhythmic drugs Chamomile (Matricaria recutita) Spasmolytic, anti-​inflammatory May potentiate effects of anticoagulants through its coumarin content Chasteberry (Vitex agnus castus) Hormonal effects Increased effects of other hormonal drugs Cranberry (Vaccinium macrocarpon) Urinary tract infections May enhance elimination of drugs normally excreted in urine Ephedra (Ephedra sinica) Central nervous system (CNS) stimulant, sympathomimetic Cardiac glycosides/​halothane: arrhythmias, guanethidine: enhanced sympathomimetic effect, monoamine oxidase enzyme (MAO) inhibitors: enhanced sympathomimetic effect, secale alkaloids/​oxytocin: hypertension Garlic (Allium sativum) Hypocholesterolaemic Increased effects of anticoagulants and antiplatelet drugs Ginger (Zingiber officinale) Antiemetic Increased effects of anticoagulants Ginseng (Panax ginseng) Various Interaction with MAO inhibitors, interaction with stimulants and phenelzine, increased effect of hypoglycaemics Hawthorne (Crataegus spp.) Digitalis-​like Can increase hypotensive effects of nitrates, antihypotensives, cardiac glycosides, and CNS depressants Hops (Humulus lupulus) Hypnotic Antagonism with antidepressants, can increase effects of CNS depressants and hypnotics, interference with hormonal drugs Horse chestnut (Aesculus hippocastanum) Anti-​inflammatory Increased effects of anticoagulants Kava (Piper methysticum) Anxiolytic Potentiation with other anxiolytics, can increase parkinsonian symptoms with levodopa Lavender (Lavandula angustifolia) Sedative Increased effects of CNS depressants Liquorice (Glycyrrhiza glaba) Corticosteroid activity for gastric irritation Potassium loss (e.g. with thiazide diuretics, water, and sodium retention with corticosteroids), increased effects of digoxin, decreased effects of antihypertensives Lily of the valley (Convallaria majalis) Congestive heart failure Increased (side) effects of quinidine, calcium, saluretics, laxatives, glucosteroids, β-​blockers, calcium channel blockers, and digitalis Mistletoe (Viscum album) Anticancer drug Increased effects of CNS depressants, antihypertensives, and cardiac drugs Nettle (Urtica dioica) Diuretic May potentiate effects of other diuretics Pumpkin seed (Curcubita pepo) Anthelmintic, diuretic Can increase effect of diuretics Sage (Salvia officinalis) Antispasmodic Interaction with antiepileptics, may increase risk of seizure, decreased effect with antiglycaemics St John’s wort (Hypericum perforatum) Antidepressant Increased effects of digoxin MAO inhibitors or serotonin uptake inhibitors, decreased effect of drugs metabolized by the cytochrome P450 enzyme system Valerian (Valeriana officinalis) Hypnotic Increased effects of CNS depressants and hypnotics Yew (Taxus spp.) Antirheumatic, anticancer Chemotherapeutic agents may potentiate its effects a Plant source in brackets. b Not comprehensive. 206 section 2  Background to medicine Spinal manipulation Description In most cultures, spinal manipulation has been practised by bone­ setters for centuries. Today such therapeutic techniques are practised by chiropractors, osteopaths, physiotherapists, doctors, and other health professionals. Spinal manipulation is the hallmark therapy for chiropractors who use it to adjust ‘subluxations’, malalignments of the spine claimed to be at the root of all health problems. During spinal manipulation vertebral joints are often moved beyond their physiological range of motion but not far enough to destroy joint structures. A  typical technique is the short-​lever, high-​velocity thrust which is used by chiropractors on most patients. Mode of action Many chiropractors believe that vertebral ‘subluxations’ adversely affect human health and that consequent spinal manipulation will improve it. The mechanism of action is, however, unclear. Some theories hold that it breaks fibrous adhesions within joints, that it affects mechanoreceptors of the joint, or that it modulates central nervous system excitability. Efficacy Most of the trial data pertain to back pain. A Cochrane review of spinal manipulation found no evidence that it is superior to standard treatments for acute or chronic back pain, but some evidence that it is better than harmful interventions or sham treatments. For all other indications (e.g. neck pain, headache, dysmenorrhoea, colic, asthma), the current best evidence fails to indicate effectiveness. Safety Several prospective studies have shown that spinal manipulation leads to transient, mild adverse effects such as local pain in about 50% of all patients. In addition, serious adverse effects such as arterial dissection, stroke, and death have been reported in about 700 cases. Chiropractors claim that the incidence of such complications is exceedingly low. Due to significant underreporting, this may not be so. At present, the true incidence of complications is not known and no adequate reporting systems for adverse effects are in place. Other forms of CAM CAM is a highly diverse field comprising more than 150 different forms of therapeutic and diagnostic methods (see Table 2.22.1). FURTHER READING Assendelft WJJ, et al. (2003). Spinal manipulative therapy for low back pain. A  meta-​analysis of effectiveness relative to other therapies. Ann Intern Med, 138, 871–​81. Capasso F, et al. (2003). Phytotherapy: a quick reference to herbal medi- cine. Springer Verlag, Berlin. Cochrane reviews related to complementary medicine. https://​cam. cochrane.org/​cochrane-​reviews-​related-​complementary-​medicine Derry CJ, et al. (2006). Systematic review of systematic reviews of acu- puncture published 1996–​2005. Clin Med, 6, 381–​6. Ernst E (2002). A systematic review of systematic reviews of homeop- athy. Br J Clin Pharmacol, 54, 577–​82. Ernst E (2006). Acupuncture—​a critical analysis. J Intern Med, 259, 125–​37. Ernst E (2008). Chiropractic:  a critical evaluation. J Pain Symptom Manage, 35, 544–​62. Ernst E, Smith K (2018). More Harm than Good?: The Moral Maze of Complementary and Alternative Medicine. Springer. Ernst E, et al. (2008). Oxford handbook of complementary medicine. Oxford University Press, Oxford. Mathie R, Fok Y, Viksveen P, To A, Davidson J (2019). Systematic Review and Meta-Analysis of Randomised, Other-than-Placebo Controlled, Trials of Non-Individualised Homeopathic Treatment. Homeopathy, 108(2), 88–101. Newman M (2018). Is cancer crowdfunding fuelling quackery? BMJ, 362, k3829. Reisman S, Balboul M, Jones T (2019). P-curve accurately rejects evidence for homeopathic ultramolecular dilutions. PeerJ, 23(7), e6318. Rubinstein S, et al. (2019). Benefits and harms of spinal manipula- tive therapy for the treatment of chronic low back pain: systematic review and meta-analysis of randomised controlled trials. BMJ, 364, l689. Singh S, Ernst E (2008). Trick or treatment? Alternative medicine on trial. Bantam, London. SECTION 3 Cell biology Section editors: John D. Firth, Christopher P. Conlon, and Timothy M. Cox 3.1 The cell  209 George Banting and Jean Paul Luzio 3.2 The genomic basis of medicine  218 Paweł Stankiewicz and James R. Lupski 3.3 Cytokines  236 Iain B. McInnes 3.4 Ion channels and disease  246 Frances Ashcroft and Paolo Tammaro 3.5 Intracellular signalling  256 R. Andres Floto 3.6 Apoptosis in health and disease  266 Mark J. Arends and Christopher D. Gregory 3.7 Stem cells and regenerative medicine  281 Alexis J. Joannides, Bhuvaneish T. Selvaraj, and Siddharthan Chandran 3.8 The evolution of therapeutic antibodies  296 Herman Waldmann and Greg Winter 3.9 Circulating DNA for molecular diagnostics  299 Y.M. Dennis Lo and Rossa W.K. Chiu 2.3 The Global Burden of Disease Measuring the hea 2.3 The Global Burden of Disease: Measuring the health of populations 43 ESSENTIALS To make the best decisions to improve health, policymakers need reliable, up-​to-​date information on the major challenges facing their country. The Global Burden of Disease study facilitates this by providing comprehensive and scientifically rigorous estimates of the causes of death and illness across the globe. It examines a total of 84 risk factors and the amount of health loss attribut- able to each or combinations of them. Analysis over time reveals a shift from disease burden dominated by communicable, maternal, neonatal, and nutritional causes to a burden increasingly made up of non​communicable diseases and injuries. By making compari- sons between countries or subnational units like states or coun- ties, the Global Burden of Disease can highlight areas of particular success or challenge, providing opportunities to examine what is working, or what is not. Introduction To improve health by making the best decisions, policymakers need reliable, up-​to-​date information on the major challenges facing their country. The Global Burden of Disease (GBD) is a large-​scale en- terprise dedicated to expanding the quantitative evidence base for health by producing estimates of deaths, prevalence, and incidence by disease for all countries and by identifying the major risks that impinge on population health. The GBD study was created to provide comparable, compre- hensive, and scientifically rigorous estimates of the causes of death and illness across the globe and aims to answer the following ques- tions:  What are the world’s major health problems? How well is society addressing these problems? Additionally, how do we best dedicate resources to get the maximum impact in improving popu- lation health in the future? In the early 1990s, when the World Bank commissioned the ori- ginal GBD study, researchers were surprised to discover that the an- swers to these questions were elusive. For some parts of the world, data were sparse or non​existent. In summing disease-​specific mor- tality estimates by organizations whose mission was to combat a given disease, GBD researchers found a total that was considerably greater than the number of people, globally, who had actually died in a particular year. The GBD approach not only makes sure that deaths attributed to different causes sum to the total number of deaths—​it goes beyond mortality estimates and looks at causes of disability, or non​fatal con- ditions. Increasingly, around the world, people are living longer and are exposed to disease risks that are more common with increasing age. Examining disabling conditions, such as back pain, dementia, or diabetes is therefore more important than ever. With the diversity of health conditions facing different populations, having compar- able measurements is essential. The GBD approach allows policy- makers to directly compare the public health impact of diseases that often are fatal, such as cancers, and conditions that primarily disable, such as depression or anaemia. Underlying some of these conditions are risk factors including al- cohol and tobacco use, dietary factors, occupational exposures, and air pollution. The GBD 2017 study examined 84 risk factors and the amount of health loss attributable to each or combinations of them. This information can help decision-​makers to identify opportunities to promote population health by preventive measures. After the original GBD study, disease burden estimates by world regions were produced on a regular basis in the 2000s by the World Health Organization. In 2007 the Institute for Health Metrics and Evaluation (IHME) was established at the University of Washington in Seattle with funding from the Bill & Melinda Gates Foundation to make these estimates by country and to regularly update esti- mates. IHME published a comprehensive update, the Global Burden of Diseases, Injuries, and Risk Factors Study 2010, in December 2012 in a dedicated issue of The Lancet. Since then, IHME has com- mitted to producing annual updates of these results for the whole time period from 1990s onward and for an increasing number of countries and subnational units such as the provinces of China and the states of India, Mexico, Brazil, and the United States. New risks, diseases, and their disabling outcomes are added based on policy interest. For instance, the GBD 2017 includes new estimates for type 1 and type 2 diabetes mellitus, and bullying victimization as a risk factor for depression and anxiety. Governments and funding partners want to maximize the im- pact of their healthcare spending. By pinpointing the leading causes 2.3 The Global Burden of Disease: Measuring the health of populations Theo Vos, Alan Lopez, and Christopher Murray 44 SECTION 2  Background to medicine of health loss, including risk factors for diseases, GBD can guide how health services are planned and how resources are directed to diseases and risks. By making comparisons between countries or subnational units like states or provinces, GBD can highlight areas of particular success or challenge, providing opportunities to examine what is working, or what is not. Methods GBD quantifies health loss from hundreds of diseases, injuries, and risk factors. In order to achieve this, a large effort is made to col- lect all available data on deaths and the epidemiology of more than 300 diseases and injuries in 195 countries and territories, by age and sex, from 1990 to the present. This large-​scale effort is coordinated by IHME. Health data for many countries are sparse, and they are often messy. Researchers at IHME with guidance from a global con- sortium of researchers—​more than 3300 in over 140 countries—​ identify every possible source of data for a country or condition and work to gain access to them. Once in hand, the data are often ad- justed to account for known differences in measurements between data sources. The next step is synthesizing the data—​taking what is known and making sense of it. Sophisticated statistical model- ling approaches are used to estimate health loss for every disease and country. Every estimate in GBD is estimated 1000 times over to account for the impact of uncertainty from sampling errors, meas- urement errors, and the choice of models. Causes of health loss are classified at a first level into three broad categories in the GBD:  communicable, maternal, neonatal, and nutritional diseases; non​communicable diseases; and injuries. The first of these encompasses diseases like HIV, tuberculosis, malaria, diarrhoea, lower respiratory infections, iron-​deficiency anaemia, and preterm birth complications. Non​communicable diseases include heart disease, stroke, diabetes, cancer, depression, and asthma. Injuries range from self-​harm (suicide) to road in- juries to animal bites. In order to account for death and disease, GBD uses the disability-​ adjusted life year (DALY). DALYs equal the sum of years of life lost to premature mortality and years lived with disability. To calculate years of life lost, a choice was made to set a norm for long life. This norm was set by looking at the lowest observed mortality rates at any given age in any population greater than five million. Years of life lost are computed as the remaining life expectancy if an indi- vidual had not died but continued to live at low risk of dying. This works out as almost 88 years of remaining ‘ideal’ life expectancy for an infant that has died, 39 years for someone who died at age 50, and still another 2.2 years for someone dying at age 100. It reflects an assumption that everyone, all over the world, deserves to live a long life in full health. Years lived with disability take into account the prevalence of disability in a population as well as its severity. To make more than 2500 disabling outcomes of diseases and injuries included in the GBD comparable, researchers mapped these into 234 homogenous ‘health states’ for which they wrote short lay descriptions of the main symptoms and functional limitations such as pain, inability to move, see, or hear, or changes in mood. These lay descriptions were put to over 60 000 respondents in country and internet surveys, each time in a randomly chosen pair, followed by the question ‘who is the healthier’. From over half a million of these judgements, dis- ability weights were derived that indicate the relative severity of a health state as a number between 0 and 1. The disability weights in GBD vary from 0.003 for mild vision loss to 0.778 for a person with schizophrenia with fulminant psychosis. As we combine years of life lost (YLLs) and years lived with disability (YLDs) into the DALY measure by simple addition, it means that we equate the health loss experienced by 333 cases of mild vision loss and a little more than one person with schizophrenia in a year to a year of life lost due to a death. Another factor that is considered in creating GBD estimates is that the world’s population is growing. An increase in the number of deaths from a particular disease, then, might simply be explained by the fact that there are more people alive who may then contract the disease. One way to account for these changes in population is to calculate rates of each condition, usually expressed as deaths or DALYs per 100 000 people. In some cases, an apparent increase is revealed as a decrease when considered this way. For example, in the Democratic Republic of Congo between 1990 and 2017, the number of deaths from tuberculosis increased by 45% but the rate of tuber- culosis deaths per 100 000 actually decreased by 32% over the same period. While the absolute burden in terms of number of people suf- fering from TB increased, the proportion of people in the population who are suffering actually decreased. The increase in the number of deaths from TB indicates to health service planners that they need to expand tuberculosis treatment resources even though the decline in rates indicates that the existing TB control measures may be having success. Age-​standardizing also allows for more accurate comparison across countries and time frames. This technique applies observed age-​specific rates of a condition to a standard age distribution and avoids distortion when one population is older or younger than another. Results GBD 2017 generated 38 billion data points. In order to make these results more accessible to researchers, policymakers, students, health workers, journalists, and others, IHME created a group of interactive data visualization tools that allow people to explore health trends for different countries and regions. One of these, GBD Compare, shows numbers and rates of deaths, YLLs, YLDs, and DALYs by dis- eases and risk factors. The main view is a ‘tree map’, which is basic- ally a square pie chart showing the proportional distribution of the overall burden by underlying causes. Additional views can be added showing a map of the world, time plots, age plots, and ranking lists. Each can interactively be explored by country, age, sex, year, cause, or risk factor. They can be found at https://vizhub.healthdata.org/ gbd-compare/. Looking at global patterns in causes of death and disability over time reveals a shift from disease burden dominated by communic- able, maternal, neonatal, and nutritional causes (shown in red) to a burden increasingly made up of non​communicable diseases (blue) and injuries (green, Fig. 2.3.1). This pattern has been partly driven by improvements in healthcare and greater access to vaccines and life-​saving medical interventions that contributed to healthcare 2.3  The Global Burden of Disease: Measuring the health of populations 45 successes, such as the nearly 90% decrease in health loss from mea- sles and tetanus globally between 1990 and 2017. Meanwhile, heart disease, stroke (intracerebral haemorrhage and ischaemic stroke), type 2 diabetes, low back pain, and chronic obstructive pulmonary disease (COPD) climbed in ranking over this period to occupy 6 of the top-​10 slots for causes of DALYs (per 100 000). The rise in non​communicable diseases can be explained by sev- eral factors. Economic development has propelled some countries from low-​ to middle-​income status, often bringing improvements in healthcare, water quality and sanitation, and living conditions. At the same time, as countries gain higher economic status, air pollu- tion may increase, and changes in diet and levels of physical activity typically occur. Factors like these have led to an epidemiologic tran- sition away from communicable, maternal, neonatal, and nutritional disorders and towards a much greater burden of non​communicable diseases. The exception to this trend has been the HIV/​AIDS epi- demic: in 1990, HIV/​AIDS caused the 39th-​highest rate of deaths (per 100 000) globally. In 2005, at the peak of the epidemic, this rank had jumped to fifth place, and by 2017 it had dropped to 13th due to increased access to antiretroviral treatment. Greater economic development has also led to a demographic transition, meaning that birth and death rates, typically high in low-​ income countries, decreased as industrialization occurred. This re- sults in population growth, as death rates decrease faster than birth rates, and later, population ageing. Policymakers must plan for the results of these changes by expanding health services and antici- pating the need to care for the changing spectrum of prevalent dis- eases and comorbidities that are more common at older ages. At the regional and country levels, more variation can be seen. High-​income countries, including the United States, Australia, Western Europe, and Japan, exhibit a disease burden dominated by non​communicable diseases. Life expectancy in these countries is typically high: between 78 and 84 years for both sexes combined in 2017 (Fig. 2.3.2), and the leading causes of health loss are all non-​ communicable diseases:  ischaemic heart disease, low back pain, lung cancer, COPD, and Alzheimer’s disease and other dementias are the top five. By contrast, life expectancy in sub-​Saharan Africa ranged from 52 to 71 years; this region also has the greatest burden of communicable diseases including malaria, diarrheal diseases, HIV/​AIDS, and tuberculosis among the top ten causes of DALYs Leading causes 1990 Leading causes 2017 1 Lower respiratory infections 1 Ischaemic heart disease 2 Diarrhoeal diseases 2 Lower respiratory infections 3 Preterm birth complications 3 COPD 4 Ischaemic heart disease 4 Diarrhoeal diseases 5 Neonatal encephalopathy 5 Preterm birth complications 6 COPD 6 Low back pain 7 Drug-susceptible tuberculosis 7 Intracerebral haemorrhage 8 Measles 8 Diabetes type 2 9 Malaria 9 Neonatal encephalopathy 10 Intracerebral haemorrhage 10 Ischaemic stroke 11 Other neonatal disorders 11 Migraine 12 Low back pain 12 Malaria 13 Protein-energy malnutrition 13 HIV/AIDS other 14 Drowning 14 Lung cancer 15 Ischaemic stroke 15 Drug-susceptible tuberculosis 16 Congenital heart anomalies 16 Falls 17 Self-harm by other means 17 Age-related hearing loss 18 Dietary iron deficiency 18 Major depression 19 Migraine 19 Other neonatal disorders 20 Pedestrian road injuries 20 Self-harm by other means 23 Dietary iron deficiency 22 Falls 27 Pedestrian road injuries 23 Lung cancer 29 Congenital heart anomalies 26 Diabetes type 2 36 Drowning 28 Major depression 38 Protein-energy malnutrition 30 Age-related hearing loss 73 Measles 38 HIV/AIDS other Cause type: Communicable, maternal, neonatal, and nutritional Noncommunicable Injuries Fig. 2.3.1  Top 20 causes of DALYs per 100 000 globally, both sexes, all ages, 1990 and 2017. 46 SECTION 2  Background to medicine <55 55-59 60-64 65-69 70-74 75-79 ≥80 Caribbean LCA DMA ATG TTO GRD VCT TLS MDV BRB SYC MUS COM Persian Gulf W Afr E Med MLT SGP Balkan Peninsula TON WSM FSM KIR FJI VUT SLB MHL Fig. 2.3.2  Life expectancy at birth, both sexes, 2017. (ATG, Antigua and Barbuda; VCT, St Vincent and the Grenadines; BRB, Barbados; COM, the Comoros; DMA, Dominica; E Med, Eastern Mediterranean; FJI, Fiji; FSM, Federated States of Micronesia; GRD, Grenada; KIR, Kirbati; MDV, Maldives; MHL, Marshall Islands; MLT, Malta; MUS, Mauritius; LCA, St Lucia; SGP, Singapore; SLB, Solomon Islands; SYC, Seychelles; TON, Tonga; TTO, Trinidad and Tobago; TLS, Timor Leste; VUT, Vanuatu; W Afr, West Africa; WSM; Western Samoa.) 2.3  The Global Burden of Disease: Measuring the health of populations 47 lost in 2017. Preterm birth complications, protein-​energy malnutri- tion, neonatal encephalopathy from birth trauma and asphyxia, and neonatal sepsis were also among the top 10 causes of health loss in sub-​Saharan Africa in 2017. Between 1990 and 2017, the absolute number of deaths from most causes increased (Fig. 2.3.3). For example, cardiovascular disease caused 11.9 million deaths in 1990 and 17.8 million in 2017, a 50% increase (Fig. 2.3.3). Looking at the numbers of DALYs, mental dis- orders were responsible for 82 million DALYs in 1990 and 123 mil- lion in 2017, a 50% increase (Fig. 2.3.4). In terms of age-​standardized rates, however, deaths from cardiovascular disease dropped by 30% over this period, and DALY rates from mental and substance abuse disorders changed marginally by 1.7%. The explanation for this dif- ference is that cardiovascular deaths mainly occur in older people. Despite a drop in the age-​by-​age rates of disease, population ageing is causing the number of deaths to increase. In the case of mental disorders, the rates of disease have not changed much over time and the increase in DALY numbers is largely due to the increase in the world population size. China, in the period 1990 to 2017, provides a clear example of the epidemiological transition. In 1990, the top 10 causes of DALYs per 100 000 included two neonatal conditions along with lower re- spiratory infections and diarrheal diseases (Fig. 2.3.5). By 2017, however, all of the top-​10 causes were non​communicable diseases, including ischaemic heart disease, stroke, diabetes, lung cancer, stomach cancer, low back and neck pain, and age-​related hearing loss. Injuries follow a less predictable pattern as countries move through the epidemiological transition. In this example of China, the decrease in DALY rates from drowning, by 79%, and suicide by means other than firearms, by 67%, were remarkable. Risk factors have undergone a similar transition globally. In 1990, the top five risks, in terms of rates of DALYs, were child wasting, short gestation, low birth weight, smoking, and high blood pres- sure. Twenty-​seven years later, short gestation had dropped to number five, while high blood pressure had taken over the number-​ one ranking. Smoking, high fasting plasma glucose, and high body mass index came in at numbers two, three, and four, respectively (Fig. 2.3.6). This trend in risk factors reflects the epidemiological transition, where more health loss now results from poor diets and overweight and obesity than inadequate nutrition. Deaths and DALYs related to smoking have increased in countries like Bangladesh and Indonesia but have decreased in most high-​income countries. A notable exception to these trends is sub-​Saharan Africa, where many countries continue to experience a large burden of com- municable, neonatal, and nutritional conditions. In 1990, these made up all of the ten highest-​ranked diseases in terms of DALYs per 100 000. By 2017, a few prominent changes had occurred: HIV/​AIDS shot to number four, and measles dropped from 4th place to 18th, but the leading causes remained the same: lower respiratory infections, malaria, diarrheal diseases, HIV/​AIDS, neo- natal encephalopathy, preterm birth complications, tuberculosis, protein-​energy malnutrition, and neonatal sepsis. Trends in risk factors diverge from most other regions as well, with childhood wasting at the top of the list, followed by short gestation, low birth weight, unsafe sex, unsafe water, and unsafe sanitation. These find- ings underscore the continuing need for health services focused on communicable diseases and maternal and child health, as well as infrastructure for clean water and sanitation in sub-​Saharan Africa (Fig. 2.3.7). 0 10M 20M 30M 40M 50M 60M 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 Year Deaths in millions Cause of death HIV/AIDS and sexually transmitted infections Respiratory infections and tuberculosis Enteric infections Neglected tropical diseases and malaria Other infectious diseases Maternal and neonatal disorders Nutritional deficiencies Neoplasms Cardiovascular diseases Chronic respiratory diseases Digestive diseases Neurological disorders Mental disorders Substance use disorders Diabetes and kidney diseases Skin and subcutaneous diseases Musculoskeletal disorders Other noncommunicable diseases Transport injuries Unintentional injuries Self−harm and interpersonal violence Fig. 2.3.3  Global number of deaths by cause, both sexes, all ages, 1990–​2017. 48 SECTION 2  Background to medicine Epidemiological trends in disease burden are best identified by examining rates of deaths and DALYs; however, the absolute number of people dying and suffering from various conditions also informs healthcare needs. The distribution of medical specialties might need to shift to accommodate changes in population and in the leading causes of death and illness. A decreasing burden of childhood illnesses would mean less need for paediatricians and children’s hospitals over time. Larger numbers of elderly people Leading causes 1990 Leading causes 2017 1 Lower respiratory infections 1 Ischaemic heart disease 2 COPD 2 Intracerebral haemorrhage 3 Intracerebral haemorrhage 3 COPD 4 Preterm birth complications 4 Ischaemic stroke 5 Ischaemic heart disease 5 Lung cancer 6 Drowning 6 Diabetes type 2 7 Neonatal encephalopathy 7 Neck pain 8 Self-harm by other means 8 Age-related hearing loss 9 Diarrhoeal diseases 9 Stomach cancer 10 Ischaemic stroke 10 Low back pain 18 Preterm birth complications 13 Stomach cancer 21 Neonatal encephalopathy 15 Lung cancer 23 Lower respiratory infections 16 Low back pain 26 Self-harm by other means 19 Neck pain 28 Drowning 21 Age-related hearing loss 78 Diarrhoeal diseases 29 Diabetes type 2 Cause type: Communicable, maternal, neonatal, and nutritional Noncommunicable Injuries Fig. 2.3.5  Leading causes of DALYs per 100 000, both sexes, all ages, China, 1990 and 2017. 0 500M 1B 1.5B 2B 2.5B 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 Year DALYs in billions Cause of DALY HIV/AIDS and sexually transmitted infections Respiratory infections and tuberculosis Enteric infections Neglected tropical diseases and malaria Other infectious diseases Maternal and neonatal disorders Nutritional deficiencies Neoplasms Cardiovascular diseases Chronic respiratory diseases Digestive diseases Neurological disorders Mental disorders Substance use disorders Diabetes and kidney diseases Skin and subcutaneous diseases Sense organ diseases Musculoskeletal disorders Other noncommunicable diseases Transport injuries Unintentional injuries Self−harm and interpersonal violence Fig. 2.3.4  Global number of DALYs by cause, both sexes, all ages, 1990–​2017. 2.3  The Global Burden of Disease: Measuring the health of populations 49 Occupational injury Low physical activity Lead Vitamin A deficiency Low omega−3 Child underweight Iron deficiency Low vegetables Secondhand smoke Handwashing Unsafe sanitation Drug use Low nuts and seeds Unsafe sex Household air pollution Impaired kidney function Unsafe water Low fruit High sodium Low whole grains Ambient particulate matter Child wasting High LDL Alcohol use Low birth weight Short gestation High body mass index High fasting plasma glucose Smoking High blood pressure 0 500 1k 1.5k 2k 2.5k DALYs per 100 000 Risk factor Cause of DALY HIV/AIDS and sexually transmitted infections Respiratory infections and tuberculosis Enteric infections Other infectious diseases Maternal and neonatal disorders Nutritional deficiencies Neoplasms Cardiovascular diseases Chronic respiratory diseases Digestive diseases Neurological disorders Mental disorders Substance use disorders Diabetes and kidney diseases Sense organ diseases Musculoskeletal disorders Other noncommunicable diseases Transport injuries Unintentional injuries Self−harm and interpersonal violence Fig. 2.3.6  Leading risk factors in terms of DALYs per 100 000, both sexes, all ages, 2017. with non​communicable diseases will require expansion of specialty care for conditions affecting this population, including neurologists, geriatricians, and elder care facilities. Subnational studies, which are done at the state or province level, provide an even more granular level of detail, revealing greater disparities within countries than between neighbouring countries and allowing decision-​makers to implement more targeted inter- ventions. These have been conducted so far in China, Mexico, the United Kingdom, Brazil, India, Russia, Iran, Indonesia, Japan, the United States, Kenya, South Africa, Ethiopia, Sweden, Norway, and New Zealand. China’s subnational study uncovered some striking differences between provinces. In Shanghai, the age-​standardized death rate for communicable, maternal, neonatal, and nutritional diseases was 15.1 per 100 000 in 2017, similar to rates in high-​income countries like Switzerland and Italy, but lower than rates in the United States and Canada. In China’s western provinces like Xinjiang, however, the death rate is several times higher than Shanghai’s and comparable to death rates in Morocco, Vietnam, and El Salvador.   Leading risk factors 1990 Leading risk factors 2017 1 Child wasting 1 Child wasting 2 Unsafe water 2 Short gestation 3 Short gestation 3 Low birth weight 4 Child underweight 4 Unsafe sex 5 Vitamin A deficiency 5 Unsafe water 6 Unsafe sanitation 6 Unsafe sanitation 7 Low birth weight 7 No access to handwashing 8 No access to handwashing 8 Household air pollution 9 Child stunting 9 Child underweight 10 Household air pollution 10 Vitamin A deficiency 14 Child stunting 12 Unsafe sex Risk factor: Behavioural Environmental Fig. 2.3.7  Leading risk factors in terms of DALYs per 100 000, both sexes, all ages, sub-​Saharan Africa, 1990 and 2017. 50 SECTION 2  Background to medicine Conclusion The Global Burden of Disease enterprise provides powerful tools for clinicians and policymakers alike, to identify both large-​scale trends and local disparities. Being aware of these factors can help decision-​ makers plan for future healthcare needs and make changes that will reduce exposure to harmful risk factors. Healthcare providers, staff at health ministries and international health agencies, researchers, and others can also contribute to the ongoing GBD effort as collab- orators. For more information on joining the GBD network, please visit http://​www.healthdata.org/​gbd/​call-​for-​collaborators. Collaborating with a worldwide network of researchers strengthens both the data-​gathering effort and the quantitative analysis by bringing together experts from a variety of disciplines. IHME and its collaborators are expanding the list of diseases, injuries, and risk factors included in GBD and routinely updating the GBD estimates. Continual updates will ensure that the international community has access to high-​quality estimates in the timeliest fashion. Through sound measurement, it will provide the foundational evidence that will lead to improved population health. FURTHER READING Foreman KJ, et  al. (2018). Forecasting life expectancy, years of life lost, and all-​cause and cause-​specific mortality for 250 causes of death: reference and alternative scenarios for 2016–​40 for 195 coun- tries and territories. Lancet, 392(10159), 2052–​90. GBD 2016 Healthcare Access and Quality Collaborators (2018). Measuring performance on the Healthcare Access and Quality Index for 195 countries and territories and selected subnational locations: a systematic analysis from the Global Burden of Disease Study 2016. Lancet, 391, 2236–​71. GBD 2017 DALYs and HALE Collaborators (2018). Global, regional, and national disability-adjusted life-years (DALYs) for 359 diseases and injuries and healthy life expectancy (HALE) for 195 countries and territories, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet, 392, 1859–1922. IHME (2018). GBD Compare. https://vizhub.healthdata.org/gbd- compare/ Murray CJL, Lopez AD (2017). Measuring global health: motivation and evolution of the Global Burden of Disease Study. Lancet, 390, 1460–​4. 2.4 Large- scale randomized evidence Trials and me 2.4 Large- scale randomized evidence: Trials and meta- analyses of trials 51 ESSENTIALS Reliable detection or refutation of realistically moderate effects on major outcomes often requires large-​scale randomized evidence As long as doctors start with a healthy scepticism about the many ap- parently striking claims and counterclaims that appear in the medical literature, trial results do make sense. The main enemy of common sense is overoptimism:  there are a few striking exceptions where treatments for serious disease really do turn out to work extremely well, but in general most claims of vast improvements from new ther- apies turn out to be evanescent. Clinical trials generally need to be able to detect or to refute real- istically moderate (but still worthwhile) differences between treat- ments in long-​term disease outcome. Large-​scale randomized evidence should be able to detect such effects, but medium-​sized trials or medium-​sized meta-​analyses can, and often do, yield false-​ negative or exaggeratedly positive results. If the results from such studies seem too good to be true then they probably are: conversely, unpromising evidence can be misleading if it is from a study of inad- equate size, or from one particular subgroup of a large study with a clearly favourable overall result. Realistically moderate expectations of what a treatment might achieve (or, if one treatment is to be compared with another, of how large any difference between the main effects of these two treatments is likely to be) should foster studies that can discriminate reliably be- tween (1) a difference in outcome that is realistically moderate but still worthwhile, and (2) a difference in outcome that is too small to be of any material importance. To assess moderate effects reliably, avoid both moderate biases and moderate random errors To demonstrate or refute realistically moderate differences in out- come, studies must guarantee (1)  strict control of bias—​which, in general, requires proper randomization and appropriate statistical analysis, with no unduly ‘data-​dependent’ emphasis on specific parts of the overall evidence; and (2) strict control of the play of chance—​ which, in general, requires large numbers of patients with the out- come of interest, rather than a lot of detail on each patient. The conclusion is obvious: moderate biases and moderate random errors must both be avoided if moderate benefits are to be assessed reliably. This leads to the need for large numbers of properly randomized pa- tients with properly analysed data, which in turn should lead to some large but simple randomized trials (or ‘mega-​trials’) and to large sys- tematic overviews (or ‘meta-​analyses’) of all related randomized trials. Other forms of evidence may be untrustworthy Non​randomized evidence, unduly small randomized trials, unduly small meta-​analyses of trials, and undue emphasis on particular sub- groups (or on particular trials) are all inferior sources of evidence about current patient management or as foundations for future research strategies because they often cannot discriminate reliably between moderate (but worthwhile) differences and negligible dif- ferences in outcome, and the mistaken clinical conclusions that they engender could well result in the undertreatment, overtreatment, or other mismanagement of millions of future patients worldwide. Benefits of large-​scale randomized evidence Hundreds of thousands of premature deaths each year could be avoided by seeking appropriately large-​scale randomized evidence about various widely practicable treatments for the common causes of death, and by disseminating this evidence appropriately. The value of large-​scale randomized evidence is illustrated by the trials of fibrinolytic therapy for acute myocardial infarction; of antiplatelet therapy for a variety of vascular conditions; of endocrine therapy for early breast cancer; and of drug therapy for lowering blood pressure. In these examples, proof of benefit that could not have been achieved by either small-​scale randomized evidence or non-​ randomized evidence has led to widespread changes in practice that are now preventing hundreds of thousands of premature deaths each year, and appropriately large-​scale randomized evidence could substantially improve the management of many important, but non-​ fatal, medical conditions. Moderate (but worthwhile) effects on major outcomes are generally more plausible than large effects Some treatments have large, and hence obvious, effects on survival: for example, it was clear without the need for any randomized trials that prompt treatment of diabetic coma or cardiac arrest can save 2.4 Large-​scale randomized evidence: Trials and meta-​analyses of trials Colin Baigent, Richard Peto, Richard Gray, Natalie Staplin, Sarah Parish, and Rory Collins 52 SECTION 2  Background to medicine lives, and more recently the introduction of protease inhibitors for the treatment of HIV infection led to a reduction in AIDS-​related morbidity and mortality that was large enough to be obvious even without randomized evidence; indeed, the remarkable effectiveness of antiretroviral drugs can be seen from the sudden reversal, after the mid-​1990s, of the upward trend in mortality among US men aged 30–​34 (see Fig. 2.4.1), the chief cause of which was HIV/​AIDS. However, perhaps in part because of these striking successes, for the past few decades the hopes of large treatment effects on mortality and major morbidity in many serious diseases have been unrealis- tically high. Of course, treatments do quite commonly have large effects on various less fundamental measures: certain drugs clearly reduce blood pressure, blood cholesterol, or blood glucose; many tu- mours or leukaemias can be controlled temporarily by radiotherapy or chemotherapy; and, in acute myocardial infarction, lidocaine (lignocaine) can prevent many arrhythmias and fibrinolytic therapy can dissolve many thrombi. However, although such effects on inter- mediate outcomes may be large, the net effects on mortality may be much more modest. In general, if substantial uncertainty remains about the efficacy of a practicable treatment, its effects on major endpoints are probably either negligibly small, or only moderate, rather than large. Indirect support for this possibly pessimistic conclusion comes from many sources, including: the previous few decades of disappointingly slow progress in the curative treatment of common chronic diseases of middle age; the heterogeneity of each single disease, as evidenced by the unpredictability of survival duration even when apparently similar patients are compared with each other; the variety of dif- ferent mechanisms in certain diseases that can lead to death, only one of which may be appreciably influenced by any one particular therapy; the modest effects often suggested by meta-​analyses (see later) of various therapies, and, in certain special cases, observa- tional epidemiological studies of the strength of the relationship between a particular disease and the factor that the treatment will modify (e.g. blood pressure, blood cholesterol, or blood glucose; see later on in this chapter). Having accepted that only moderate reductions in mortality are likely with many currently unevaluated interventions, how worth- while might such effects be if they could be detected reliably? To some clinicians, reducing the risk of early death in patients with myocar- dial infarction from 10 per 100 patients down to 9 or 8 per 100 pa- tients treated may not seem particularly worthwhile, and if such a reduction was only transient, or involved an extremely expensive or toxic treatment, this might well be an appropriate view. Worldwide, however, several million patients a year suffer an acute myocardial infarction, and if just one million were to be given a simple, non​toxic, and widely practicable treatment that reduced the risk of early death from 10% down to 9 or 8% (that is, a proportional reduction of 10 or 20%), this would avoid 10 or 20 thousand deaths. (For example, at least a million patients a year now receive fibrinolytic therapy for acute myocardial infarction, which is avoiding about 20 000 early deaths a year.) Such absolute gains are substantial, and might consid- erably exceed the number of lives that could be saved by a much more effective treatment of a much less common disease. Reliable detection or refutation of moderate differences requires avoidance of both moderate biases and moderate random errors If realistically moderate differences in outcome are to be reliably detected or reliably refuted, then errors in comparative assessments of the effects of treatment need to be much smaller than the differ- ence between a moderate, but worthwhile, effect and an effect that is too small to be of any material importance. This in turn implies that moderate biases and moderate random errors cannot be tol- erated. The only way to guarantee very small random errors is to study really large numbers, and this can be achieved in two main ways: by making individual studies large, and by combining infor- mation from as many relevant studies as possible in a systematic meta-​analysis (Table 2.4.1). However, it is not much use having very small random errors if there are moderate biases, so even the large sizes of some non​randomized analyses of computerized hos- pital records, where the complex factors involved in the decision to treat a person with a particular drug may not be recorded in suffi- cient detail, cannot guarantee medically reliable comparisons be- tween the effects of different treatments (see later). For example, the choice of treatment may be strongly affected by subtle patient characteristics that are correlated with the prognosis. (A crude 1950 1960 1970 1980 1990 2000 2010 0% 0.1% 0.2% 0.3% 0.4% 0.5% 0.6% Male Female 0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 5-year risk Death rate/1000 UNITED STATES 1950−2015: Males & Females All medical mortality at ages 30−34 Source: WHO mortality & UN population estimates Fig. 2.4.1  Mortality trends in the United States of America among men and women aged 30–​34 during the period 1950–​2015. Antibacterial drugs caused a big decrease in mortality around the middle of the century in both sexes. The increase in AIDS-​related mortality since the early 1980s caused a sharp increase in all-​cause mortality, particularly in men, which continued until it was spectacularly reversed by effective antiretroviral drug combinations in the mid-​1990s. 2.4  Large-scale randomized evidence: Trials and meta-analyses of trials 53 illustration of such problems is provided by the old joke ‘What’s the most dangerous place in the world?’ ‘Bed—​look at the number of people who die in bed!’.) Avoiding moderate biases Proper randomization avoids systematic differences between the types of patient in different treatment groups. The fundamental reason for randomization is to avoid moderate bias, by ensuring that each type of patient can be expected (but for the play of chance) to have been allocated in similar proportions to the dif- ferent treatment strategies that are to be compared. This means that only random differences should affect the final comparisons of outcome. Non​randomized methods, in contrast, cannot generally guarantee that the types of patient given the new study treatment do not differ systematically in any important ways from the types of patient given any other treatment(s) with which the new study treatment is to be compared. For example, moderate biases might arise if the study treatment was novel and doctors were afraid to use it for the most seriously ill patients, or, conversely, if they were more ready to use it for those who were desperately ill. There may also be other ways in which the severity of the condition differentially affects the likelihood of being assigned to different treatments by the doctor’s choice (or by the patient’s choice, or by any other non-​ random procedure). It might appear at first sight that by collecting enough infor- mation about various prognostic features it would be possible to make some mathematical adjustments to correct for any such dif- ferences between the types of patients who, in a non​randomized study, receive the different treatments that are to be compared. The ill-​conceived hope is that such methods, which are often car- ried out on routinely collected healthcare data, might achieve comparability between those entering the different treatment groups, but they cannot be guaranteed to do so, and often fail ser- iously. The difficulty is that some important prognostic factors may be unrecorded, while others may not have been discovered yet or be difficult to assess exactly and hence difficult (or even impossible) to adjust for reliably. Although there are examples of non​randomized studies in which the estimated effects of treat- ment appear quantitatively close to those observed in analogous randomized trials, there are many examples where they are not, being either quantitatively incorrect—​so that drugs appear ei- ther misleadingly promising or of misleadingly low efficacy—​or even qualitatively incorrect, when a harmful drug might appear effective (or vice-​versa). The machinery of a properly randomized trial No foreknowledge of what the next treatment will be In a properly randomized trial, the decision to enter a patient is made in ignorance of which of the trial treatments that patient will, once entered, be allocated. The treatment allocation is then made known after trial entry has been decided upon. (The purpose of this sequence is to ensure that foreknowledge of what the next treatment is going to be cannot affect the decision as to whether to enter the patient; if it did, those to be allocated one treatment might differ systematically from those to be allocated another.) Ideally, any major prognostic features should also be irreversibly recorded before the treatment is revealed, particularly if these are to be used in any treatment analyses. For, if the recorded value of some prognostic factor might be affected by knowledge of the trial treatment allocation, then treatment comparisons within subgroups defined by that factor might be moderately biased. In particular, treatment comparisons just among ‘responders’ or just among ‘non​responders’ can be extremely misleading un- less the response is assessed before treatment allocation (which it can sometimes be, if all patients have a ‘run-​in’ period on ac- tive treatment before randomization, partly to assess the response to treatment and partly to exclude those who seem during this prerandomization run-​in unlikely to participate wholeheartedly in the main post-​randomization study). No bias in patient management or in outcome assessment An additional difficulty, in both randomized and non​randomized comparisons of various treatments, is that there might be system- atic differences in the use of other treatments (including general supportive care) or in the assessment of major outcomes. A non-​ randomized comparison may well suffer from moderate biases due to such systematic differences in ancillary care or assessment, par- ticularly if it merely involves the retrospective review of medical records. In the context of a randomized comparison, however, it is generally possible to devise ways to keep any such biases small. For example, placebo tablets may be given to control-​allocated patients and certain subjective assessments may be ‘blinded’ (although this is less important in studies assessing mortality). ‘Intention-​to-​treat’ analyses with no post-​randomization exclusions Even in a properly randomized trial, unnecessary biases may be introduced by inappropriate statistical analysis. One of the most important sources of bias in the analysis is undue concentration on just part of the evidence; that is to say, on ‘data-​derived subgroup Table 2.4.1  Requirements for reliable assessment of moderate effects: negligible biases and small random errors NEGLIGIBLE BIASES (i.e. guaranteed avoidance of moderate biases) Proper randomization (non​randomized methods might suffer moderate biases) Analysis by allocated treatment (including all randomized patients: ‘intention-​to-​treat’ analysis) Chief emphasis on overall results (no unduly data-​dependent emphasis on particular subgroups) Systematic overview of all relevant randomized trials (no unduly data-​dependent emphasis on particular studies) SMALL RANDOM ERRORS (i.e. guaranteed avoidance of moderate chance fluctuations) Large numbers in any new trials (to be really large, trials should be ‘streamlined’) Systematic overview of all relevant randomized trials (which yields the largest possible total numbers) 54 SECTION 2  Background to medicine analyses’ (see next). Another easily avoided bias is caused by the post-​randomization exclusion of patients, particularly if the type (and hence prognosis) of those excluded from one treatment group differs from those excluded from another. Therefore, one of the fundamental statistical analyses of a trial that should be made available is an analysis that compares all those originally allocated one treatment (even though some of them may not have actually received it) with all those allocated the other treat- ment. This is sometimes referred to as an ‘intention-​to-​treat’ ana- lysis. Additional analyses can also be reported; for example, in describing the frequency of some very specific side effect it may well be preferable to record its incidence only among those who actually received the treatment. (This is because strictly random- ized comparisons may not be needed to assess extreme relative risks.) However, in assessing moderate effects on the main out- come of interest such ‘on-​treatment’ analyses can be misleading, and ‘intention-​to-​treat’ analyses are generally a more trustworthy guide as to whether there is any real difference between the trial treatments in their effects on long-​term outcome. No unduly data-​dependent emphasis on results in particular subgroups Treatment that is appropriate for one patient may be inappropriate for another. Ideally, therefore, what is wanted is not only an answer to the question ‘Is this treatment helpful on average for a wide range of patients?’, but also an answer to the question ‘For which recog- nizable categories of patient is this treatment particularly helpful?’ However, this ideal is difficult to attain directly because the direct use of clinical trial results to assess proportional risk reductions in particular subgroups of patients is surprisingly unreliable. Of course, patients who already have a very good prognosis anyway and are at low absolute risk cannot have a large absolute benefit (for even if a small risk is halved the absolute benefit is small). Classification of patients as being at low (or high) risk of an adverse disease out- come is often a reliable guide as to which patients can expect little absolute gain even if the trial treatment works as expected (and as to which patients might expect a worthwhile gain). This low-​risk/​ high-​risk split may not require support from formal subgroup analyses—​indeed, it could even be damaged by such analyses. For, even if the proportional effects of treatment in specific subgroups are importantly different, standard subgroup analyses are so insensitive that they may well fail to demonstrate these differences. Moreover, even if there are highly significant differences between the propor- tional risk reductions produced by the trial treatment in different subgroups, and the results seem to suggest that the treatment works in some subgroups but not in others (thereby giving the appearance of a ‘qualitative interaction’), this may still not be good evidence for subgroup-​specific treatment preferences. The play of chance often produces qualitatively wrong answers in particular subgroups in trials (or in meta-​analyses of trials) that could, if interpreted incau- tiously, lead to millions of people being treated inappropriately, or untreated inappropriately. Questions about such ‘interactions’ between patient character- istics and the effects of treatment are easy to ask, but are surpris- ingly difficult to answer reliably. Apparent interactions can often be produced by the play of chance and, in particular subgroups, can mimic or obscure some of the moderate treatment effects that might realistically be expected. To demonstrate this, a subgroup analysis was performed based on the astrological birth signs of patients ran- domized in the very large Second International Study of Infarct Survival (ISIS-​2) trial of the treatment of acute myocardial infarc- tion. Overall in this trial, the 1-​month survival advantage produced by aspirin was conclusively demonstrated (804 vascular deaths among 8587 patients allocated aspirin versus 1016 among 8600 al- located no aspirin; 23% proportional reduction, two-​sided p value <0.000001). However, when these analyses were subdivided into 12 subgroups by the patients’ astrological birth signs (in mediaeval as- trology, the ‘birth sign’ is determined by the month of birth; for ex- ample, ‘Libra’ means born 24 September to 23 October, and ‘Gemini’ means born 22 May to 21 June) to illustrate the unreliability of sub- group analyses, aspirin appeared totally ineffective for those born under Libra or Gemini (Table 2.4.2). It would obviously be unwise to conclude from such a result that patients born under the astro- logical birth sign of Libra or Gemini should not be given aspirin if they have a heart attack. However, similar conclusions based on ‘exploratory’ data-​derived subgroup analyses, which from a purely statistical viewpoint are no more reliable than these, are often re- ported and believed, with inappropriate effects on worldwide clin- ical practice. There are three main remedies for this unavoidable conflict be- tween the reliable subgroup-​specific conclusions that doctors and patients want and need, and the statistically unreliable findings that direct subgroup analyses can usually offer. However, the extent to which these remedies are helpful in particular instances is one on which informed judgements differ. First, where there are good a priori reasons for anticipating that the proportional effects of treatment might be very different in different circumstances then a limited number of subgroup analyses may be prespecified in the study protocol, along with a prediction of the direction of such proposed interactions. (For example, it was expected that the benefits of fibrinolytic therapy for acute myocardial infarction would be greater the earlier such patients were treated and so some studies prespecified that the analyses would be subdivided by the number of hours from the onset of symptoms to treatment:  see later.) These prespecified Table 2.4.2  False-​negative mortality effect in a subgroup defined only by the mediaeval astrological birth sign: the ISIS-​2 (1988) trial of aspirin among over 17 000 patients with acute myocardial infarction Astrological birth sign No. of 1-​month deaths (aspirin versus placebo) Statistical significance Libra or Gemini 150 vs. 147 NS All other signs 654 vs. 869 2p <0.000001 Any birth signa 804 vs. 1016 (9.4%) (11.8%) 2p <0.000001 a Appropriate overall analysis for assessing the true effect in all subgroups. Mediaeval astrology divides birth dates into 12 ‘birth signs’ (which depend only on the day and month of birth, not the year of birth). To demonstrate the potential unreliability of other subgroup analyses, the ISIS-​2 patients were divided into 12 subgroups according to their astrological birth sign, and the apparent effects of aspirin were calculated separately in each of these 12 subgroups. Because of the play of chance, the apparent effects differed from one subgroup to another (ranging from no apparent effect of aspirin in two subgroups (Libra and Gemini: see text for definition) to aspirin apparently halving mortality in another (Capricorn)). 2.4  Large-scale randomized evidence: Trials and meta-analyses of trials 55 subgroup-​specific analyses can then be taken somewhat more ser- iously than other subgroup analyses, but they can still yield im- portantly wrong answers. The second approach is to emphasize chiefly the overall results of a trial (or, better still, of all such trials) for particular outcomes, as a guide to—​or at least a context for speculation about—​the qualita- tive results in various specific subgroups of patients, and to give less weight to the actual results in each separate subgroup. This is clearly the right way to interpret the astrological ‘findings’ in Table 2.4.2, but, if used sensibly, it is also likely in many other circumstances to provide the best assessment of whether one treatment is better than another in particular subgroups. The proportional effect of treat- ment as estimated by the overall results may well provide a useful approximation to the proportional effects of treatment in particular subgroups. Of course, any such extrapolation of overall results to particular subgroups needs to be performed in a sensible way, keeping in touch with medical realities. In particular, if a treatment has substantial side effects, it may be obviously inappropriate for low-​risk patients. (In this case, the side effects in a particular subgroup and the pro- portional benefit in that subgroup should be estimated separately, but the estimation for both might be more reliable if based on an appropriate extrapolation from the overall results rather than on the results in that one subgroup alone.) The third approach is to be influenced, in discussing the likely ef- fects on mortality in specific subgroups, not only by the mortality analyses in these subgroups but also by the analyses of recurrence-​ free survival or some other major ‘surrogate’ outcome. For, if the overall results are similar but much more highly significant for recurrence-​free survival than for mortality, subgroup analyses with respect to the former may be more stable and may provide a better guide as to whether there are any major differences between sub- groups in the effects of treatment. The appropriate interpretation of apparently different results in different subgroups of the randomized evidence is still one of the most difficult matters of judgement in the interpretation of random- ized evidence; at present, many clinicians and regulatory agencies pay too much attention to irregularities in apparent effects that are consistent with chance. Avoiding moderate random errors The need for large-​scale randomization To distinguish reliably between the two alternatives that (a) there is no worthwhile difference in survival or that (b) treatment confers a moderate, but worthwhile, benefit (e.g. 10 or 20% fewer deaths), not only must systematic errors be guaranteed to be small (see earlier) compared with such a moderate risk reduction, but so too must any of the purely random errors that are produced just by chance. Random errors can be reliably avoided only by studying very large numbers of patients and hence large enough numbers of ‘endpoints’. However, it is not sufficiently widely appreciated just how large clin- ical trials need to be in order to detect moderate differences reli- ably. This can be illustrated by a hypothetical trial that is actually quite inadequate—​even though by some standards it is moderately large—​in which a 20% reduction in mortality (from 10 to 8%) is sup- posed to be detected among 2000 heart attack patients (1000 treated and 1000 controls). In this case, one might predict about 100 deaths (10%) in the control group and 80 deaths (8%) in the treated group. However, if this difference were to be observed it would not be con- ventionally significant (p = 0.1); indicating that even if there is no real difference between the effects of the trial treatments, it would still be relatively easy for a result at least as extreme as this to arise by chance alone. Although the play of chance might well increase the difference enough to make it conventionally significant (e.g. 110 deaths vs. 70 deaths, 2p <0.001), it might equally well dilute, oblit- erate (e.g. 90 deaths vs. 90 deaths), or even reverse it. The situation in real life is often even worse, as the average trial size may include only a few dozen events rather than the several hundred (or few thou- sand) that would ideally be needed to guide the future treatment of millions. Mega-​trials: How to randomize large numbers One of the chief techniques for obtaining appropriately large-​scale randomized evidence is to make trials extremely simple, and then to invite hundreds of hospitals to collaborate. The first of these large streamlined trials (or mega-​trials) were the ISIS and GISSI studies of heart attack treatment in the 1980s, and many other mega-​trials have now been successfully undertaken, not only in the field of cardiology—​where numerous large trials have now been performed—​but also in other specialties where treatment might be expected to have only moderate effects on morbidity and mortality from a common disease or injury. Many such mega-​trials have pro- duced medically important results that would not otherwise have been reliably obtained. However, in terms of medically significant findings, what has been achieved so far is only a fraction of what would be possible if this research strategy could be more widely adopted. Any obstacle to simplicity is an obstacle to large size, and so it is worth making enormous efforts at the design stage to sim- plify and streamline the process of entering, treating, and assessing patients. Many trials would be of much greater scientific value if they collected 10 times less information, both at entry and during follow-​up, on 10 times more patients. Since those responsible for entering patients into trials are generally busy people, it is particu- larly important to simplify the entry of patients, otherwise rapid re- cruitment may prove difficult (see later). Likewise, when allocating resources within large-​scale trials, it is important to direct them to where it chiefly matters, namely the recruitment of large num- bers of patients and counting how many suffer the main outcomes of interest, whereas large sums of money may often be wasted on inappropriate audits or unnecessary or excessively frequent meas- urements, the analysis of which will not contribute to answering the main study question. Simplification of entry procedures for trials: The ‘uncertainty principle’ For ethical reasons, patients cannot have a commonly available treatment chosen at random if either they or their doctor are (for any reasons) already reasonably certain that another treatment is preferable. Hence, randomization can be offered only if both doctor and patient feel substantially uncertain as to which of the trial op- tions is best. The question then arises: ‘Which categories of patients 56 SECTION 2  Background to medicine about whose treatment there is such uncertainty should be offered randomization?’ The obvious answer is all of them, welcoming the heterogeneity that this will produce. (For example, either the treat- ment of choice will turn out to be the same for men and women, in which case the trial might as well include both, or it will be different, in which case it is particularly important to study both.) In appropri- ately large trials, patient homogeneity is generally a defect while het- erogeneity is generally a strength. Consider, for example, the trials of immediate fibrinolytic therapy for acute myocardial infarction. Some had restrictive entry criteria that allowed inclusion of only those patients who presented between 0 and 6 h after the onset of pain, and those trials contributed almost nothing to the key question of how late such treatment can still be useful. In contrast, the trials with wider and more heterogeneous entry criteria that included some patients with somewhat longer delays between pain onset and hospitalization answered this question prospectively, and were able to show that fibrinolytic therapy can have definite protective effects not only when given 0 to 6 but also when given 7 to 12 hours after the onset of pain (see later). This approach of randomizing the full range of patients in whom there is substantial uncertainty as to which treatment option is best was used in the first Medical Research Council Asymptomatic Carotid Surgery Trial (ACST-​1). Narrowing of the carotid artery (which is rapidly detectable by ultrasound) can eventually cause a stroke, or even a succession of strokes. It can be dealt with surgi- cally by carotid endarterectomy, but in the 1990s there was much uncertainty as to whether such surgery, with its inherent periopera- tive risks, was appropriate for individuals with severe carotid artery narrowing who were currently asymptomatic (that is, had not had a stroke in the past few months). The ACST was therefore designed to compare a policy of immediate carotid endarterectomy with a policy of ‘watchful waiting’ in asymptomatic patients with substantial ca- rotid artery narrowing. If a patient was prepared at least to consider surgery seriously, then the neurologist and surgeon responsible for that individual’s care considered in their own undefined way what- ever medical, personal, or other factors seemed to them to be rele- vant, including, of course, the patient’s own preferences and values. Eligibility for randomization was defined by the ‘uncertainty prin- ciple’ (Fig. 2.4.2): 1. If they or the patient were reasonably certain, for any reason, that they did wish to recommend immediate surgery for that particular patient, the patient was not eligible for entry into the ACST. 2. Conversely, if they or the patient were reasonably certain, for any reason, that they did not wish to recommend immediate surgery, the patient was likewise not eligible for entry into the trial. 3. If, but only if, the doctor(s) and patient were substantially uncertain what to recommend, the patient was automatically eligible for randomization between immediate versus no im- mediate surgery (with all patients receiving whatever their doctors judged to be the best available medical care, which generally included advice to stop smoking, low-​dose aspirin, and treatment of hypertension; nowadays, it would probably also include a statin). In ACST-​1, there were substantial differences between individual doctors in the types of patients about whom they were uncertain (in terms of the severity of carotid stenosis—​which was generally recorded on ultrasound as 70%, 80%, or 90% blockage—​age, gen- eral health, and various other characteristics). This guaranteed that no category of patient about which there was widespread uncer- tainty would be wholly excluded, and hence guaranteed that the trial would yield at least some direct evidence in a variety of typical pa- tients. As a result of the wide and simple entry criteria adopted by ACST-​1, 3120 patients were randomized (which was more than in any previous vascular surgery trial), so the study was able to provide some clear answers about who needed carotid endarterectomy. In asymptomatic patients younger than 75 years of age, with carotid diameter about 70% or more on ultrasound, immediate carotid end- arterectomy halved the net 5-​year stroke risk from about 12% to 6% (even though this 6% included the 3% perioperative hazard). For patients with only moderate carotid artery stenosis on ultrasound, the 5-​year risks of carotid stroke (excluding perioperative hazard) were 2% versus 9%, while among those with tighter stenosis the risks were 3% versus 10%, suggesting about as much benefit in moderate as in tight stenosis. The ‘uncertainty principle’ simultaneously meets the require- ments of ethicality, heterogeneity, simplicity, and maximal trial size, and should be widely used. It states that the fundamental eligibility criterion is that both doctor and patient should be substantially uncertain about the appropriateness of each of the trial treatments for that particular patient. With such uncertainty as the fundamental criterion of eligibility, informed consent can often be simplified. For, the degree of ‘informed consent’ that is appropriate in a randomized comparison of two established treat- ments governed by the ‘uncertainty principle’ should probably not differ greatly from that which is applied in routine practice outside trials when treatment is being chosen haphazardly—​or, to put it another way, ‘double standards’ between trial and non​trial situations are inappropriate. The haphazard nature of many non-​ randomized treatment choices is reflected in the wide variations in practice between and within countries. Even when a practice is similar it may be similarly wrong: for example, before the ISIS-​2 results became available (see later), few doctors routinely used fi- brinolytic therapy for acute myocardial infarction. Provided that trials are governed by the ‘uncertainty principle’, there is an ap- proximate parallel between good science and good ethics. Indeed, in such circumstances, excessively detailed consent procedures (which can be distressing and inhumane, and so would not be con- sidered appropriate in routine non​trial clinical practice) would not be humane or ethically appropriate in trials. Excessively de- tailed consent procedures are, unfortunately, quite common, but their chief purpose is to protect doctors against lawyers rather than to protect patients against anything. This ‘uncertainty principle’ is just one of many ways of simplifying trials and thereby helping them to avoid becoming enmeshed in a mass of wholly unnecessary traditional complexity. If randomized trials can be substantially simplified (which, it must be admitted, requires a reversal of the current trend towards unnecessary complexity), as has already been achieved for a few major diseases, and hence made very much larger, then they will continue to play an appropriately central role in the development of rational criteria for planning treatment strategies and reducing death and disability. 2.4  Large-scale randomized evidence: Trials and meta-analyses of trials 57 Minimizing both bias and random error: Meta-​analyses of randomized trials Archie Cochrane was one of the first people to emphasize the need to organize, by specialty, the results from all relevant randomized trials, and the Cochrane Library brings together in a single place a large number of systematic reviews (many of which include meta-​ analyses of randomized trials) summarizing the available evidence about a wide range of therapeutic questions. When several trials have all addressed much the same question, the traditional pro- cedure of only a few of them becoming widely known may be a source of serious bias, since chance fluctuations for or against treat- ment may affect which trials become well known and widely cited. To avoid this problem, it is appropriate to base inference chiefly on a meta-​analysis of all the results from all of the trials that have ad- dressed a particular type of question (or on an unbiased subset of such trials), and not on some potentially biased subset of these trials. Carotid artery stenosis detected by ultrasound, but, as yet, no clinical evidence of stroke from it. Should patient be offered immediate carotid surgery? Doctor(s) or patient reasonably certain (no matter why) that immediate surgery is not appropriate: Ineligible Doctor(s) or patient reasonably certain (no matter why) that immediate surgery is appropriate: Ineligible Doctor(s) and patient substantially uncertain whether to risk immediate surgery: Uncertainty implies eligibility Telephone to randomize Group 1: Allocated No immediate surgery (unless or until a clear indication is thought to have arisen) Group 2: Allocated Immediate carotid surgery (unless definite contraindication is thought to have been discovered, or patient [or doctor] changes their mind) Over the next few years only a small number get carotid surgery 90% get carotid surgery (median delay: 1 month) Statistical comparisons of various outcomes over 5 to 10 years: 100% of Group 1 vs. 100% of Group 2, i.e. ‘intention to treat analysis’ Fig. 2.4.2  Example of the ‘uncertainty principle’ to define eligibility for trial entry: the chief eligibility criterion for the Asymptomatic Carotid Surgery Trial (ACST) was that doctors and patients should be substantially uncertain whether to risk immediate carotid surgery. Partly because this criterion was appropriately flexible, ACST-​1 became the largest-​ever trial of vascular surgery, showing that the long-​term benefits of carotid artery surgery could eventually outweigh the immediate hazards. ACST-​2 (http://​www.nds.ox.ac.uk/​acst) is now randomizing surgery versus carotid stenting where the doctor(s) and patient are substantially uncertain which to prefer. 58 SECTION 2  Background to medicine Such meta-​analyses will also minimize random errors in the assess- ment of treatment since, in general, far more patients are involved in a meta-​analysis than in any contributory individual trial. The separate trials may well be heterogeneous in their entry cri- teria, their treatment schedules, their follow-​up procedures, their methods of treating relapse, and so on. In view of this heterogeneity, at one extreme each trial might be considered in virtual isolation from all others, while at the opposite extreme the results from all trials could be combined, largely ignoring any heterogeneity. Both these extreme views have some merit, and the pursuit of each by different people may prove more illuminating than too definite an insistence on any one particular approach. However, the heterogen- eity of the different trials merely argues for careful interpretation of any meta-​analyses of different trial results, rather than arguing against meta-​analyses. Whatever the difficulties in interpreting meta-​analyses may be, without them it is difficult to avoid moderate selective biases and substantial random errors, both of which could obscure any moderate treatment effects, or, conversely, imply an ef- fect where none exists. Which meta-​analyses are trustworthy? Since the 1970s, a rapidly increasing number of meta-​analyses of the results of randomized trials have been reported, not all of which are trustworthy. When considering how reliable a given one might be there are two fundamental questions: What is the potential for bias, and what is the potential size of purely random errors? To answer the first question, consideration must be given to whether biases might exist within individual trials (e.g. because of an unreliable method of randomization or because of post-​randomization exclusions from the main analyses), and whether the subset of trials under consid- eration might be a biased subset of all relevant trials that have been performed (as might arise, for example, if certain trials were aban- doned because of unpromising findings, or remained unpublished for this or any other reason). The simplest approach to meta-​analysis is merely to have collected and tabulated the published data from whatever randomized trial reports are easily accessible in the literature, and sometimes this may suffice. At the opposite extreme, extensive efforts may have been made by those organizing the meta-​analysis to locate every poten- tially relevant randomized trial, including those never published, to collaborate closely with the trialists to seek individual data on each patient ever randomized into those trials, and then (after extensive checks and corrections of such data) to produce, in collaborative re- analyses with those trialists, agreed analyses and publications. The results of some of the largest such collaborative re-​analyses will be described later: the Anti Thrombotic Trialists’ (ATT) Collaborative Group, the Fibrinolytic Therapy Trialists’ (FTT) Collaborative Group, and the Early Breast Cancer Trialists’ Collaborative Group (EBCTCG). Collaboration of the original trialists in the meta-​ analysis process, with collection of detailed data from each indi- vidual trial participant, can help to avoid or minimize the biases that could be produced by missing trials (e.g. owing to the greater likeli- hood of extremely good, or extremely bad, results being particularly widely known and published), by inappropriate post-​randomization withdrawals, or by the failure to allocate treatment properly at random. If randomization was performed properly in the first place, the post-​randomization withdrawals can often be followed-​up and restored to the study for an appropriate ‘intention-​to-​treat’ analysis. Knowledge of the exact methods of treatment allocation (backed up by checks on whether the main prognostic factors recorded are non-​ randomly distributed between the treatment groups in a particular trial) may help to identify trials that were so improperly randomized that they should be excluded from a meta-​analysis of the properly randomized trials. Meta-​analyses based on individual patient data may also provide more information about treatment effects than the more usual overviews of grouped data, for they allow more detailed analyses—​indeed, if they are really large then they may actually yield statistically reliable subgroup analyses of the effects of treatment in particular types of patient. Conversely, even a perfectly conducted meta-​analysis of an inter- vention with moderate effects on a major clinical outcome may not be reliable if the trials were all small. There are two reasons for this. First, when the true effect of an intervention is only moderate, most small trials will fail to reach statistical significance, and may be less likely to be published (or otherwise available) than the few with results that are misleadingly extreme. Hence, a meta-​analysis consisting exclusively of small trials is particularly prone to bias. Secondly, the random errors may be too large to allow reliable in- terpretation. A meta-​analysis that includes a total of only 100 deaths will have random errors about as great as a single trial with only 100 deaths. For these reasons small-​scale evidence, whether from a meta-​analysis or from one trial, is often unreliable and may well be found in retrospect to have yielded wrong answers. What is needed is large-​scale randomized evidence; it does not matter much whether the totality of the evidence comes from a properly conducted meta-​ analysis of several trials or one properly conducted trial with such clear results that no further trials were done. The practical medical value of large-​scale randomized evidence will be illustrated by a few examples. Examples of information about the efficacy and safety of widely used drugs that could have been reliably established only by large-​scale randomized evidence Evidence of benefit from a single very large trial In the ISIS-​2 trial, half of 17 000 patients with suspected acute myocardial infarction were allocated aspirin tablets (162 mg/​day for 1 month, which virtually completely inhibits cyclo-​oxygenase-​ dependent platelet activation) and half were allocated placebo tab- lets. Before 1988, when the ISIS-​2 results were published, aspirin was not routinely used in the treatment of acute myocardial infarction, and no other major trial had (or has subsequently) compared aspirin with an untreated control group in cases of suspected acute myocar- dial infarction. However, the effects of 1 month of aspirin were so definite in ISIS-​2 (804/​8587 vascular deaths among those who were allocated aspirin versus 1016/​8600 among those who were not) that even the lower 99% confidence limit would have represented a worth- while benefit from this simple and inexpensive treatment (Fig. 2.4.3). As a result, worldwide treatment patterns changed sharply when the ISIS-​2 results emerged in 1988, and aspirin is now routinely used for the majority of emergency hospital admissions with suspected acute myocardial infarction not only in Europe and America but throughout Asia. In the United Kingdom, for example, two British 2.4  Large-scale randomized evidence: Trials and meta-analyses of trials 59 Heart Foundation surveys found cardiologists reporting that rou- tine aspirin use in acute coronary care had increased from under 10% in 1987 to over 90% in 1989. Worldwide, the annual number of patients with suspected myocardial infarction who would nowadays be given such treatment must be several million a year, suggesting that aspirin is already preventing several tens of thousands of prema- ture deaths each year in this clinical context alone. However, if the ISIS-​2 trial had been a factor of 10 smaller (i.e. 1700 instead of 17 000 patients), then exactly the same proportional reduction in mortality as shown in Fig. 2.4.3 would not have been conventionally signifi- cant and, therefore, would have been much less likely to influence medical practice—​indeed, the result might by chance have appeared exactly flat, greatly damaging future research on aspirin in this con- text. (In fact, during the early interim monitoring of the ISIS-​2 trial results by the independent Data Monitoring Committee there was no apparent difference in mortality based on a few hundred deaths.) Likewise, if the ISIS-​2 trial had been non​randomized, then it might well have produced the wrong answer since, in a nonrandomized study, doctors might tend to give active treatment to patients who are particularly ill, or who are rather different in various other ways from those not given active treatment. In addition, even if a non-​ randomized study did happen to produce an unbiased answer, it would have been impossible to be sure that it had actually done so, so again a non​randomized study might have had much less influence on medical practice than ISIS-​2. In the ISIS-​2 trial, aspirin significantly reduced the 1-​month mortality, but it also significantly reduced the number of non​fatal strokes and non​fatal reinfarctions that were recorded in hos- pital. Combining all these three outcomes into ‘vascular events’ (i.e. stroke, death or reinfarction), 10% of those who were allo- cated aspirin versus 14% of the controls suffered a vascular event in the month after randomization (Table 2.4.3)—​an absolute dif- ference of 40 events per 1000 treated (or, perhaps more relevantly, 40 000 per million). The randomized trials of aspirin, or of other antiplatelet regimens, in other types of high-​risk patients (e.g. a few years of aspirin for those who have survived a myocardial infarc- tion or stroke) were not as large as ISIS-​2, and so, taken separately, most yielded false-​negative results. However, when the results from many such trials are combined, statistically definite reductions in ‘vascular events’ are seen (Table 2.4.3). Since such treatments do not appear to increase non​vascular mortality, all-​cause mortality Cumulative number of vascular deaths 1000 800 600 400 200 0 Days from randomization 0 7 14 21 28 35 Placebo tablets: 1016 vascular deaths (11.8%) Aspirin: 804 vascular deaths (9.4%) Fig. 2.4.3  Effect of administration of aspirin for 1 month on 35-​day mortality in the 1988 ISIS-​2 trial among 17 000 patients with acute myocardial infarction. (Absolute survival advantage: 24 (SE 5) lives saved per 1000 patients allocated aspirin, 2p <0.00001. The COMMIT trial in 46 000 such patients has since shown aspirin plus clopidogrel to be slightly more effective than aspirin alone.) Table 2.4.3  Summary results of (a) trials of aspirin (or other antiplatelet drugs), and (b) trials of adding a P2Y12-​receptor antagonist to aspirin Type of patient (and study) Mean duration (total randomized) Stroke, heart attack, or vascular death (a) Antiplatelet vs. control Antiplatelet Control Difference Acute heart attack (ISIS-​2) 1 month (20 000) 10% 14% 40 per 1000 (2p <0.00001) Acute stroke (CAST) 1 month (40 000) 9% 10% 10 per 1000 (2p = 0.001) Previous heart attack (ATT) 2 years (20 000) 13% 17% 40 per 1000 (2p <0.00001) Previous stroke/​TIA (ATT) 2.5 years (23 000) 18% 22% 40 per 1000 (2p <0.00001) Other high-​risk (e.g. angina, peripheral vascular disease) 1 year (20 000) 8% 10% 20 per 1000 (2p <0.00001) (b) Aspirin plus P2Y12-​receptor antagonist vs. aspirin alone Aspirin + P2Y12-​receptor antagonist Aspirin alone Acute coronary syndrome (CURE) 9 months (13 000) 9% 11% 20 per 1000 (2p <0.001) Acute heart attack (CCS2-​COMMIT) 1 month (46 000) 9% 10% 10 per 1000 (2p = 0.002) Previous heart attack (PEGASUS-​TIMI 54) 33 months (21 000) 8% 9% 10 per 1000 (2p <0.001) 60 SECTION 2  Background to medicine is also significantly reduced. Subsequently, a combination of aspirin and a P2Y12-​receptor antagonist such as clopidogrel (which inhibits platelet activation through different pathways) has been shown to be more effective than aspirin alone in acute myocardial infarc- tion or acute coronary syndrome (Table 2.4.3), and this has paved the way for ‘dual antiplatelet therapy’ to become standard for such conditions. The large-​scale randomized evidence on antiplatelet drugs that is summarized in Table 2.4.3 has changed clinical practice world- wide, and may already have affected the treatment of hundreds of millions of patients in ways that, at low cost, have prevented millions of strokes, heart attacks, or vascular deaths. Small randomized trials and small meta-​analyses of trials, or non​randomized studies (how- ever large), could not possibly have provided appropriately reliable evidence about such moderate risk reductions. Evidence of hazard from a single large trial It is not sufficiently appreciated that identical arguments about the need for large-​scale randomized evidence apply equally when there is the possibility that a treatment is the cause of a moderate increase in adverse outcomes. The drug niacin decreases blood low-​density lipoprotein (LDL) cholesterol and triglycerides, while increasing high-​density lipoprotein (HDL) cholesterol. It was shown to be ef- fective for the prevention of coronary events in the Coronary Drug Project conducted in the 1970s, but a more recent trial (AIM-​HIGH) had been stopped early because of a lack of efficacy. Many patients are unable to take niacin because it causes uncomfortable flushing and other adverse effects. The second Heart Protection Study—​Treatment of HDL to Reduce the Incidence of Vascular Events (THRIVE) was designed to assess the benefits and hazards of extended-​release niacin in ­combination with the drug laropiprant, which had been shown to reduce flushing in up to two-​thirds of patients and hence, it was hoped, would improve adherence. The trial randomized over 25 000 patients to niacin-​laropiprant versus placebo and followed them for about 4 years. There was no significant effect on major vas- cular events, defined in this trial as a major coronary event (non​fatal ­myocardial infarction or coronary death), stroke of any type, or coronary or non​coronary revascularization (13.2% vs. 13.7%, rate ratio 0.96, 95% confidence interval 0.90–​1.03, p = 0.29), but the trial also provided very important information about several adverse ef- fects of niacin. Table 2.4.4 summarizes the results from the trial for selected serious adverse events (which, essentially, means that the event caused hospitalization or, rarely, death). While some of these were known adverse effects of niacin (e.g. diabetes-​related, gastrointestinal, musculoskeletal, and skin-​related disorders), other adverse events that were in excess in the THRIVE trial had not been suspected. Allocation to niacin-​laropiprant re- sulted in moderately raised risks of serious infections (22% rela- tive increase, corresponding to an absolute excess of 14 cases per 1000 participants treated with niacin-​laropiprant for 4 years) and of bleeding (38% relative increase, corresponding to an absolute excess of 7 cases per 1000 participants treated with niacin-​laropiprant for 4 years). A significant excess of serious infections was also seen with niacin alone in the smaller AIM-​HIGH trial of 3500 patients (8.1% vs. 5.8, p = 0.008), but there were relatively few serious bleeding events and the excess observed with niacin alone was not significant (3.4% vs. 2.9%, p = 0.36), although it is consistent with the THRIVE result. The discovery of these new adverse effects in THRIVE, to- gether with the new information about the magnitude of known adverse effects, has led to a reappraisal of the role of niacin in the management of blood lipids. For not only is niacin now known to be ineffective at preventing vascular events (at least in circumstances similar to those tested in THRIVE), but it is also a much more haz- ardous drug than had previously been appreciated. This illustrates the value of large-​scale randomized evidence for studying drug safety when, as is typical, the hazards requiring detection are of only moderate magnitude. Definite result from a very large meta-​analysis of trials: Benefit from ‘adjuvant’ therapy with tamoxifen for patients with hormone-​sensitive (ER+) ‘early’ breast cancer By definition, in ‘early’ breast cancer all detectable deposits of dis- ease are limited to the breast and the local or regional lymph nodes, and can be removed surgically. However, experience shows that undetectably small deposits of breast cancer cells may remain else- where that eventually cause clinical recurrence at a distant site, per- haps after a delay of several years, which is then usually followed by death from disease. If the original tumour was ‘ER positive’ (i.e. if the tumour cells were still expressing the oestrogen receptor pro- tein) then the distant deposits of cancer cells that spread from it be- fore it was removed may also be ER positive, and may be continually stimulated by circulating hormones. Therefore, among women who Table 2.4.4  Summary results of the HPS2-​THRIVE trial: effects of niacin-​laropiprant on selected serious adverse events and diabetes Event type Niacin-​laropiprant (n = 12 800) Placebo (n = 12 800) Rate ratio (95% confidence interval) Absolute excess (%) ± SE Serious adverse event (%) Gastrointestinal event 4.8% 3.8% 1.28 (1.13–​1.44) 1.0 ± 0.3 Musculoskeletal event 3.7% 3.0% 1.26 (1.10–​1.44) 0.7 ± 0.2 Skin-​related event 0.7% 0.4% 1.67 (1.20–​2.34) 0.3 ± 0.1 Infection event 8.0% 6.6% 1.22 (1.12–​1.34) 1.4 ± 0.3 Bleeding event 2.5% 1.9% 1.38 (1.17–​1.62) 0.7 ± 0.2 New-​onset diabetes 5.7% 4.3% 1.32 (1.16–​1.51) 1.3 ± 0.3 Adapted from The New England Journal of Medicine, The HPS2-​THRIVE Collaborative Group, Effects of Extended-​Release Niacin with Laropiprant in High-​Risk Patients, 371, 210. Copyright © 2014 Massachusetts Medical Society. Reprinted with permission. 2.4  Large-scale randomized evidence: Trials and meta-analyses of trials 61 have had breast cancer removed by surgery (or by surgery and radio- therapy), there have been many trials of ‘adjuvant’ daily treatment with tamoxifen, a drug that blocks the oestrogen receptor. Some in- volved only 1 to 2 years of treatment, some involved about 5 years, some compared 5 versus 1 to 2 years and more recent trials have compared 10 versus 5 years of tamoxifen. In total, more than 100 000 women have been randomized in several dozen such trials, some of which are still being followed-​up for long-​term outcomes. Taken separately, most of these tamoxifen trials have been too small to provide reliable evidence about long-​term survival. However, if the results of all of them are combined in various ways, some very definite differences emerge: 1 to 2 years of tamoxifen is better than nothing, 5 years is better than 1 to 2 years, and 10 years is better still for delaying or avoiding the recurrence of ER positive breast cancer. Meta-​analysis has also established that newer endo- crine therapies (such as aromatase inhibitors) reduce recurrence rates even further, although, unlike tamoxifen, they can only be used in postmenopausal women. Fig. 2.4.4a shows the results from the trials of about 5 years of tamoxifen. Allocation to active treatment produces a 13% absolute difference in the 15-​year risk of recurrence (34 vs. 47%), and a 9% absolute difference in survival (25 vs. 34%; both 2p <0.00001). Most of the effect on recurrence is seen during the first 5 years, while tamoxifen was still continuing to be given, but most of the effect on breast cancer mortality comes after this period (Fig. 2.4.4a). Indeed, the difference in the 15-​year probability of death from breast cancer is about three times as great as that seen after 5 years. Reliable assessment of the moderate improvements in long-​term survival in early breast cancer that are produced by tam- oxifen (and by radiotherapy and chemotherapy) would have been impossible without such a meta-​analysis of all trials, with updated follow-​up data provided periodically, because each of the trials was too small on its own to answer these questions convincingly. Collaborative meta-​analyses that involve hundreds of trialists from all around the world can also foster international acceptance of the totality of the randomized evidence. In the case of breast cancer this has, since the mid-​1980s, helped lead to widespread adoption in many countries of a succession of improvements in treatment (earlier detection, better local control, progressively better chemo- therapy and, in ER-​positive disease, progressively better endocrine therapy). These have, in aggregate, resulted in a sustained fall in na- tional mortality rates (Fig. 2.4.4b). Promising meta-​analysis of small trials confirmed by large trials: benefit from fibrinolytic therapy in acute myocardial infarction If a recent thrombus has just blocked a coronary artery, thereby causing acute myocardial ischaemia or infarction, fibrinolytic drugs (such as streptokinase or tissue plasminogen activator) can some- times rapidly dissolve the thrombus, restoring the flow of blood and reperfusing the heart muscle. These drugs were first introduced into clinical research in the late 1950s, but the trials of fibrinolytic therapy for suspected acute myocardial infarction in the 1960s and 1970s were too small to be statistically reliable (none involved even 1000 patients). So, by the early 1980s the haemorrhagic side effects were obvious, the benefits had not been convincingly dem- onstrated, and such treatments were generally considered to be definitely dangerous, probably fairly ineffective, and hence inappro- priate for routine coronary care. Although meta-​analyses published in the mid-​1980s of the previous small trials (which had involved a total of only c.6000 patients in 24 trials) indicated a statistically def- inite benefit, they were not really believed by cardiologists and so such treatments were still not widely used. The situation was saved by two large randomized trials, GISSI-​1 and ISIS-​2, which together involved about 30 000 patients (and by their aggregation with the seven other randomized trials that each involved more than 1000 patients, yielding a total of 60 000; see next). In ISIS-​2, not only were patients randomly allocated to receive aspirin or placebo tablets as described earlier (Fig. 2.4.3), but they were also separately allocated to receive intravenous streptokinase or a placebo infusion. In this ‘factorial’ design (which allows the separate assessment of more than one treatment without any material loss in the statistical reliability of each comparison), one-​quarter of the patients were allocated aspirin alone, one-​quarter were allocated streptokinase alone, one-​quarter were allocated both streptokinase and aspirin, and one-​quarter were allocated neither (that is, they were given placebo tablets and a pla- cebo infusion). Streptokinase, like aspirin, produced a highly signifi- cant reduction in mortality, and the combination of streptokinase and aspirin was highly significantly better than either aspirin alone or streptokinase alone (Fig. 2.4.5). The results shown in Fig. 2.4.5 might suggest that there was no need to collect any more randomized evidence about fibrinolytic therapy, but this ignores the potential hazards of such treatment and the heterogeneity of patients. Taken separately, even ISIS-​2 (the largest of these trials) was not large enough for statistically re- liable subgroup analyses, but when the nine largest trials were all taken together, they included a total of about 60 000 patients, half of whom had been randomly allocated fibrinolytic therapy. Those entering a coronary care unit with a diagnosis of suspected or def- inite acute myocardial infarction range from patients who are al- ready in cardiogenic shock with low blood pressure and a fast pulse (half of whom will die rapidly) to those who have merely had a history of chest pain and no very definite changes on their electro- cardiography (ECG) (of whom ‘only’ a small percentage will die be- fore discharge). Fibrinolytic therapy often causes blood pressure to fall: should it be used in patients who are already dangerously hypo- tensive? It occasionally causes serious strokes: Should it be used in patients who are elderly or hypertensive, and therefore already have an above-​average risk of cerebral haemorrhage (or who have only slight changes on their ECG, and therefore have only a low risk of cardiac death)? Finally, if a coronary artery has been totally occluded for long enough, the heart muscle that it supplies will have been ir- reversibly destroyed: How many hours after the heart attack starts is fibrinolytic treatment still worth risking—​3? 6? 12? 24? These questions needed to be answered reliably before appropriate and generally accepted indications for and against such an immedi- ately hazardous, but potentially effective, therapy could be devised. To address them, the main fibrinolytic therapy trialists collaborated in a systematic meta-​analysis of the randomized evidence, based on individual patient data. On review of the 60 000 patients random- ized between fibrinolytic therapy and control in trials of more than 1000 patients, some of the therapeutic questions were relatively easy to answer satisfactorily. For example, it appears that most of those whose ECG is still normal (or shows a pattern that indicates only a small immediate risk of death) can be left untreated, leaving open the option of starting fibrinolytic treatment urgently if their ECG changes suddenly for the worse over the next few hours or days. 62 SECTION 2  Background to medicine ≈ 5 years tamoxifen vs. Not RECURRENCE ER+ (a) 50 Control 47·3% 33·9% ≈ 5 years tamoxifen 25·0% ≈ 5 years tamoxifen Control 34·3% 15–y gain 13·4% (SE 1·1) Logrank 2p < 0·00001 15–y gain 9·3% (SE 1·1) Logrank 2p < 0·00001 40 30 20 10 0 5 16·6 26·5 8·9 12·2 26·2 18·6 41·0 29·1 10 15 years 0 5 10 15 years % ± SE 0 Recurrence 50 40 30 20 10 % ± SE 0 Breast cancer mortality ≈ 5 years tamoxifen vs. Not BREAST CANCER MORTALITY ER+ UK and USA, 1950–2014: Breast cancer mortality at ages 45–54 (b) Source: WHO mortality & UN population estimates Death rate/10 000 women, age standardized* Further MODERATE effects are still worthwhile and achievable LARGE effect on UK/USA breast cancer mortality by combining several MODERATE effects. 1950 1970 1990 2010 0 2 4 6 UK USA 0.6% 0.4% 0.2% 0% 10-year risk Mean of annual rates in the two component 5–year age groups smoothed in 5-year calendar periods Fig. 2.4.4  (a) Effects of about 5 years of tamoxifen versus not in ER-​positive disease: 15-​year probabilities of recurrence and of breast cancer mortality (10 000 women in the 2005 worldwide EBCTCG meta-​analysis). (b) Female breast cancer mortality in the United Kingdom and United States of America at ages 45 to 54 during the period 1950 to 2014. (The UK breast screening programme has little effect on mortality at these ages.) 2.4  Large-scale randomized evidence: Trials and meta-analyses of trials 63 Conversely, among those who already had ‘high-​risk’ ECG changes when they were randomized, the absolute benefit of immediate fi- brinolytic therapy was, if anything, slightly greater than is indicated by Fig. 2.4.5. Age, sex, blood pressure, heart rate, diabetes, and a pre- vious history of myocardial infarction could not identify reliably any subgroup that would not, on average, have their chances of survival appreciably increased by treatment. By contrast, the longer that fibrinolytic treatment for such pa- tients was delayed, the less benefit it seemed to produce. Among the 45 000 whose ECG showed definite ST-​segment elevation or bundle-​branch block, the benefit was greatest (about 30 lives saved per 1000) among those randomized between 0 and 6 h after the onset of pain (Fig. 2.4.6). However, the mortality reduction was still substantial and signifi- cant (about 20 per 1000; 2p <0.003) for the patients whose hospital admission had been delayed for some hours and who were therefore randomized 7–​12 h after the onset of pain. Indeed, even if patients were randomized 13–​18 h after the onset of pain, there still appeared to be some net reduction in mortality (about 10 per 1000, but not statistically definite). The regression line in Fig. 2.4.6 reinforces these separate subgroup analyses in a more reliable way. Yet, before these large trials it was forcefully, but mistakenly, argued that such treatments could not possibly be of any worthwhile benefit if given more than about 3 or 4 h after the onset of symptoms. Such detailed inferences are difficult enough with large-​scale properly randomized evidence and would be impossible without it. Because of their unknowable biases (see earlier), non​randomized database analyses are simply not a viable alternative to large-​scale randomized evidence. Nor would randomization of ‘only’ several thousand patients have been sufficient. The availability of large-​scale randomized evidence, in this case a meta-​analysis involving about 6000 deaths among 60 000 patients, has been essential in determining which particular types of patient derive net benefit from fibrinolytic therapy (or from surgical opening of acutely occluded arteries). Promising meta-​analysis of small trials refuted by large trials: Lack of significant benefit from magnesium infusion in acute myocardial infarction In animal studies, infusion of a magnesium salt can limit the myo- cardial damage arising from sudden experimental blockage of a cor- onary artery. By the early 1990s, there was considerable optimism that a simple, inexpensive magnesium infusion might prove benefi- cial after acute myocardial infarction. Twelve small trials, involving between them a total of only about 2000 patients, had addressed this question, and their aggregated results indicated a highly statistically significant—​but implausibly large—​halving of risk (72/​1199 deaths among those allocated magnesium versus 151/​1191 among the controls, 2p <0.00001). At this time some argued that such results constituted proof beyond reasonable doubt that magnesium was of sufficient value to justify its widespread usage without seeking fur- ther randomized evidence, but others remained sceptical, arguing that the apparent results were far too good to be true. Two trials, one (LIMIT-​2) involving 2000 patients and one (ISIS-​4) involving 58 000, were then set up to test the possible effects of mag- nesium more reliably. The first yielded a moderately promising re- sult (Fig. 2.4.7), indicating avoidance of about one-​quarter of the early deaths, but with the 99% confidence interval including the pos- sibility that magnesium had no beneficial effect on early mortality. The second (which had continued in spite of intense lobbying of its Data Monitoring Committee to stop the trial), however, yielded a completely unpromising result, so that the overall evidence, by that time based on over 60 000 randomized patients, indicated no net effect on mortality. Nevertheless, some cardiologists remained hopeful that mag- nesium might prove to be effective among specific subgroups. Accordingly, the MAGnesium In Coronaries (MAGIC) trial Percentage dead 10 5 0 0 1 2 3 4 5 Weeks from starting treatment Aspirin only Streptokinase only Routine hospital care alone 13% dead (568/4300) Routine care + combination of both streptokinase and aspirin 8% dead (343/4292) Fig. 2.4.5  Effects of a 1-​h streptokinase infusion, and of aspirin for about one month, on 35-​day mortality in the 1988 ISIS-​2 trial among 17 000 patients with acute myocardial infarction who would not normally have received either treatment, divided at random into four similar groups to receive aspirin only, streptokinase only, both or neither. 40 30 20 10 0 6 12 Hours from symptom onset to randomization 18 Loss of benefit per hour of delay to randomization: 1.6 SD 0.6 per 1000 patients 3000 14 000 12 000 9000 7000 50 24 0 Lives saved per 1000 allocated fibrinolytic therapy (± SD) Fig. 2.4.6  Benefit versus delay (0–​1, 2–​3, 4–​6, 7–​12, or 13–​24 h) in the nine largest randomized trials of fibrinolytic therapy versus control in patients with acute myocardial infarction. One-​month mortality results for 45 000 patients with ST elevation or bundle-​branch block when randomized, showing the definite net benefit even for the 9000 randomized 7–​12 hours after the onset of pain. 64 SECTION 2  Background to medicine subsequently randomized 6000 patients, all of whom had received reperfusion therapy within the past few hours, to magnesium versus placebo but this also found no evidence of any net benefit. It is interesting to consider what this sequence of magnesium trial results (Fig. 2.4.7) might mean for those wishing to interpret other randomized evidence. Our interpretation is that if something seems too good to be true then it probably is—​or, more formally, that big benefits are much less plausible than moderate benefits. None of the 12 small trials had sufficient power to detect a moderate effect on mortality, and although their aggregated results indicated that mortality could be reduced by more than half, such an effect is too extreme to be plausible, and could be misleading even though it is highly significant. The LIMIT-​2 trial then suggested that magnesium might reduce mortality by about a quarter, a result that is somewhat more plausible but not clearly significant. The success of the ISIS-​4 and MAGIC trials in refuting the implausibly large benefit suggested by the 13 smaller trials reinforces our point that often, when trying to distinguish between the two medically realistic possibilities of a moderate effect or no effect, only large-​scale evidence suffices. Even the LIMIT-​2 trial, which recruited 2000 patients, was in retrospect too small. (Another important methodological point is that ‘random effects’ methods for meta-​analysis can produce importantly wrong answers: applied to the 15 separate trials in Fig. 2.4.7, a standard ‘random effects’ meta-​analysis yields a summary odds ratio of 0.67 [95% CI 0.52–​0.85; 2p <0.001], suggesting—​clearly incorrectly—​ that magnesium reduces mortality by about one-​third!) Trials in their epidemiological context: Blood pressure, stroke, and heart disease Quantitative epidemiological evidence about the effects of long-​term differences in risk factors such as blood pressure or blood choles- terol level can help in interpreting the results from trials of the ef- fects of reducing these risk factors for only a few years. For example, appropriate meta-​analyses of prospective observational epidemio- logical studies indicate that, throughout the range of usual systolic blood pressure in the populations studied (about 115–​180 mm Hg systolic blood pressure, SBP), a lower value was associated with a lower risk of ischaemic heart disease, with no apparent ‘threshold’ below which the relationship reversed (Fig. 2.4.8). This analysis suggests that, in later middle age (60–​69 years), 10 mm Hg lower Trial name 12 small trials LIMIT– 2 ISIS–4 (1995) MAGIC (2002) Total 99% or 95% confidence intervals 2853/ (8·3%) 34482 2844/ (8·2%) 34487 72/1199 90/1159 2216/29011 475/3113 472/3100 2103/29039 11 8/1157 151/1191 0·44 (SE 0·10) 0·74 (SE 0·13) 1·06 (SE 0·03) 1 ·00 (SE 0·07) 1.00 (SE 0.03) 2p>0·1; NS Magnesium better 0 0.5 1.0 1.5 2.0 Magnesium worse Treatment effect 2p > 0·1; NS, adverse Deaths/Patients Allocated magnesium Allocated control Ratio of death rates Magnesium : Control Fig. 2.4.7  Effect of a magnesium infusion on 1-​month mortality among patients with acute myocardial infarction. Ratio of the death rate in the treatment group to that in the control group is plotted for each trial (as a black square with area proportional to the amount of statistical information) along with its 99% confidence interval (horizontal line). A stratified overview of the results of all these trials (and its 95% confidence interval) is represented by an open diamond. 40–49 50–59 60–69 70–79 80–89 Age at risk Effect of 10 mm Hg ↓ SBP 18% ↓ risk 27% ↓ risk 31% ↓ risk 120 140 160 180 Usual systolic BP (mm Hg) Hazard ratio (95% CI) 1 2 4 8 16 32 64 128 Fig. 2.4.8  Stroke and ischaemic heart disease mortality rate in each decade of age versus usual systolic blood pressure (SBP, mm Hg) at the start of that decade, in a systematic overview of 61 prospective studies involving 1 million adults. Hazard ratios are plotted with a group-specific CI derived only from the variance of the log hazard in that one category (including the reference category), and each square has area inversely proportional to the effective variance of the log mortality rate. 2.4  Large-scale randomized evidence: Trials and meta-analyses of trials 65 SBP is associated with about 27% less ischaemic heart disease (IHD) mortality (and about 35% less stroke mortality: Prospective Studies Collaboration, data not shown). By the mid-​1990s, several trials had been conducted to deter- mine whether a few years of blood pressure reduction in middle age would reduce the risk of stroke and of coronary heart disease. Partly because of imperfect compliance, the mean difference in SBP between the treatment and control groups in these trials was only about 10 mm Hg. Even if such trial treatments would eventually produce about 27% less coronary heart disease after many years (as seen in observational studies), the effects seen within the 2 or 3 years that are available on average between randomization and death in a 5-​year trial might well be somewhat smaller (perhaps only about 15%, for example). But, considered separately, none of the trials recorded enough coronary heart disease events (or enough vascular deaths) for statistically reliable assessment of a 15% risk reduction. For stroke, the trials provide direct and highly significant evi- dence that most, or all, of the risk reduction associated with 10 mm Hg lower usual SBP appears soon after the blood pressure is lowered (Fig. 2.4.9). In contrast, the significant reduction in coronary heart disease seen in the trials (16% SD 4, 95% CI 8–​23%; 2p = 0.00001) seems to fall somewhat short of the difference of about 27% suggested by the observational evidence. However, the coronary heart disease reduction in the trials is still substantial and real (2p = 0.00001). Taken together, Figs. 2.4.8 and 2.4.9 suggest that antihypertensive regimens that produce differences of much more than 10 mm Hg SBP will reduce stroke by more than half and heart disease by more than a quarter. They also suggest that the proportional risk reduction produced by a given absolute reduction in SBP will be approximately independent of the initial SBP. Results from large anonymous trials are relevant to real clinical practice A clinician is used to dealing with individual patients, and may feel that the results of large trials somehow deny their individuality. This is almost the opposite of the truth, for one of the main reasons why trials have to be large is just because patients are so different from one another. Two apparently similar patients may run entirely different clinical courses, one remaining stable and the other progressing rap- idly to severe disability or early death. Consequently, it is only when really large groups of patients are compared that the proportion of patients with truly good and bad prognosis in each can be relied on to be reasonably similar. One commonly hears statements such as: ‘If the effect of a treatment isn’t obvious in a couple of hundred patients then it isn’t worth knowing about’. But, the accumulation since 1980 of large-​scale randomized evidence of such moderate effects with treatments for heart disease, stroke, breast cancer, intestinal cancer and various other conditions has transformed medical practice, and may already have avoided millions of catastrophic disabilities, recur- rences of cancer, and premature deaths. It is also said that what is really wanted is not a blanket recommen- dation for everybody, but rather some means of identifying those few individuals who really stand to benefit from therapy. If any cri- teria (e.g. a short-​term response to a non​placebo-​controlled course of some disease-​modifying agent) can be proposed that are likely to discriminate between people who will and will not benefit, then these can be recorded prospectively at entry and the eventual trial result subdivided with respect to them. However, there is a danger in too detailed an analysis of the apparent responses of small sub- groups chosen for separate emphasis because of the apparently re- markable effects of treatment in these subgroups. Even if an agent brought no benefit, it would have to be acutely poisonous for it not to appear disproportionately beneficial in one or two such subgroups! Conversely, if an intervention really avoids an approximately similar proportion of the risk in each category of patient, it will, by chance alone, appear not to work in some category or categories of patient. The surprising extent to which this happens is evident from the ex- ample in Table 2.4.2. A large anonymous trial will at least still help to answer the practical questions of whether, on average, a policy of widespread treatment (except where clearly contraindicated) is pref- erable to a general policy of no immediate use of the treatment (ex- cept where clearly indicated). Moreover, without really large trials it is difficult to see how else many such questions relating to the effects of treatments on death or disability (or other major outcomes) are to be resolved reliably. Trials are at least a practical way of making some solid progress, and it would be unfortunate if desire for the perfect (that is, knowledge of exactly who will benefit from treat- ment) were to become the enemy of the possible (that is, knowledge of the average direction and approximate size of the effects of treat- ment in many large categories of patient). Trial (or group of trials) Numbers of events treat : control Odds ratios & 95% confidence limits (treat : control) TREATMENT ←BETTER TREATMENT WORSE→ (i) Strokes (ii) Coronary heart disease (CHD) events 38% SD 4 reduction 2p < 0.00001 16% SD 4 reduction 2p = 0.00001 STROKE 35–40% LOWER CHD 20–25% LOWER Reductions in risk associated epidemiologically with a LONG-TERM difference of 5–6 mm Hg DBP: 0.5 1.0 HDFP trial MRC 35–64 trial SHEP trial MRC 65–74 trial 13 other trials ALL TRIALS 102:158 60:109 105:162 101:134 157:272 525:835 275:343 222:234 104:142 128:159 205:226 934:1104 HDFP trial MRC 35–64 trial SHEP trial MRC 65–74 trial 13 other trials ALL TRIALS (Heterogeneity χ4 2 = 4.2, NS) (Heterogeneity χ4 2 = 4.2, NS) Fig. 2.4.9  Reduction in the odds of stroke and coronary heart disease in all unconfounded randomized trials of antihypertensive drug treatment (mean systolic blood pressure differences of about 10 mm Hg for 5 years). Conventions are as for Fig. 2.4.7. 66 SECTION 2  Background to medicine FURTHER READING Adjuvant Tamoxifen: Longer Against Shorter (ATLAS) Collaborative Group (2013). Long-​term effects of continuing adjuvant tamoxifen to 10 years versus stopping at 5 years after diagnosis of oestrogen receptor-​positive breast cancer: ATLAS, a randomised trial. Lancet, 381, 805–​16. Antithrombotic Trialists’ Collaboration (2002). Collaborative meta-​ analysis of randomised trials of antiplatelet therapy for prevention of death, myocardial infarction, and stroke in high-​risk patients. BMJ, 324, 71–​86. Asymptomatic Carotid Surgery Trial (ACST) (2010). 10-​year stroke prevention after successful carotid endarterectomy for asymptom- atic stenosis (ACST-​1): a multicentre randomised trial. Lancet, 376, 1074–​84. Chalmers I (1994). The Cochrane Collaboration: preparing, maintain­ ing and disseminating systematic reviews of the effects of health care. Ann N Y Acad Sci, 703, 156–​63. Chen ZM et al. for the COMMIT (ClOpidogrel and Metoprolol in Myocardial Infarction Trial) collaborative group (2005) Addition of clopidogrel to aspirin in 45,852 patients with acute myocar- dial infarction: randomised placebo-​controlled trial. Lancet, 366, 1607–​21. Cochrane AL (1979). 1931–​1971: a critical review, with particular ref- erence to the medical profession. In: Teeling-​Smith G, Wells N (eds) Medicines for the year 2000, pp. 1–​11. Office of Health Economics, London. Collins R, MacMahon S (2001). Reliable assessment of the effects of treatment on mortality and major morbidity I: clinical trials. Lancet, 357, 373–​80. Collins R, Peto R (1994). Antihypertensive drug therapy: effects on stroke and coronary heart disease. In: Swales JD (ed) Textbook of hypertension, pp. 1156–​64. Blackwell Science, Oxford. Collins R, et al. (1987). Avoidance of large biases and large random errors in the assessment of moderate treatment effects: the need for systematic overviews. Stat Med, 6, 245–​50. Collins R, Doll R, Peto R (1992). Ethics of clinical trials. In: Williams CJ (ed) Introducing new treatments for cancer: practical, ethical and legal problems, pp. 49–​65. John Wiley & Sons Ltd, Chichester. Dowsett M, et  al. (2010). Meta-​analysis of breast cancer outcomes in adjuvant trials of aromatase inhibitors versus tamoxifen. J Clin Oncol, 28, 509–​18. Early Breast Cancer Trialists’ Collaborative Group (EBCTCG) (2005). Effects of chemotherapy and hormonal therapy for early breast cancer on recurrence and 15-​year survival: an overview of the ran- domised trials. Lancet, 365, 1687–​717. Early Breast Cancer Trialists’ Collaborative Group (EBCTCG) (2015). Aromatase inhibitors versus tamoxifen in early breast cancer:  patient-​level meta-​analysis of the randomised trials. Lancet, 386, 1341–​52. Fibrinolytic Therapy Trialists’ Collaborative Group (1994). Indications for fibrinolytic therapy in suspected acute myocardial infarction: collaborative overview of early mortality and major morbidity re- sults from all randomised trials of more than 1000 patients. Lancet, 343, 311–​22. HPS2-​THRIVE Collaborative Group (2014). Effects of extended re- lease niacin with laropiprant in high-​risk patients. N Engl J Med, 371, 203–​12. ISIS-​2 (Second International Study of Infarct Survival) Collaborative Group (1988). Randomised trial of intravenous streptokinase, oral aspirin, both, or neither among 17,187 cases of suspected acute myo- cardial infarction: ISIS-​2. Lancet, 332, 349–​60. ISIS-​4 (Fourth International Study of Infarct Survival) Collaborative Group (1995). ISIS-​4: a randomised factorial trial assessing early oral captopril, oral mononitrate, and intravenous magnesium sul- phate in 58050 patients with suspected acute myocardial infarction. Lancet, 345, 669–​85. Law M, Morris J, Wald N (2009). Use of blood pressure lowering drugs in the prevention of cardiovascular disease: meta-​analysis of 147 randomised trials in the context of expectations from prospective epidemiological studies. BMJ, 338, b1665. MacMahon S, Collins R (2001). Reliable assessment of the effects of treatment on mortality and major morbidity II:  observational studies. Lancet, 357, 455–​62. Magnesium in Coronaries (MAGIC) Trial Investigators (2002). Early administration of intravenous magnesium to high-​risk patients with acute myocardial infarction in the Magnesium in Coronaries (MAGIC) Trial: a randomised controlled trial. Lancet, 360, 1189–​96. MRC Asymptomatic Carotid Surgery Trial (ACST) Collaborative Group (2004). Prevention of disabling and fatal strokes by successful carotid endarterectomy in patients without recent neurological symptoms: randomised controlled trial. Lancet, 363, 1491–​502. Prospective Studies Collaboration (2002). Age-​specific relevance of usual blood pressure to vascular mortality: a meta-​analysis of indi- vidual data for one million adults in 61 prospective studies. Lancet, 360, 1903–​13. Prospective Studies Collaboration (2007). Blood cholesterol and vas- cular mortality by age, sex and blood pressure: meta-​analysis of indi- vidual data from 61 prospective studies with 55,000 vascular deaths. Lancet, 370, 1829–​39. Woods KL, et  al. (1992). Intravenous magnesium sulphate in sus- pected acute myocardial infarction: results of the second Leicester Intravenous Magnesium Intervention Trial (LIMIT-​2). Lancet, 339, 1553–​58. Yusuf S, Collins R, Peto R (1984). Why do we need some large, simple randomized trials? Statistics in Medicine, 3, 409–​20. 2.5 Bioinformatics 67 2.5 Bioinformatics 67 ESSENTIALS Bioinformatics may be defined as ‘conceptualizing biology in terms of molecules and applying “informatics techniques” (e.g. applied mathematics, computer science and statistics) to understand and organize the information associated with these molecules, on a large scale’. Clinical bioinformatics may be defined as ‘the clinical application of bioinformatics-​associated sciences and technologies to under- stand molecular mechanisms and potential therapies for human diseases’. To achieve these aims: (1) data must be curated to facilitate stand- ardized access to existing information and to allow the submission of new entries to data sets; (2) analysis tools should be developed drawing upon both computational and biological/​clinical expertise; (3) all analyses must be interpreted in a biologically/​clinically mean- ingful manner. Introduction If clinical bioinformatics is to deliver the integration of molecular and clinical data and thereby translate research knowledge into ef- fective ‘personalized’ medicine, then two broad constituencies need to be supported. Clinicians at the point of care need to understand and integrate, perhaps via decision support mechanisms, entities such as genotype/​phenotype correlations, biomarker discovery, and pharmacogenomics; while researchers require accurate, struc- tured and (ideally) coded clinical data, as well as biological reference data sets. Ever accelerating technological advances and precipitous falls in the cost of high-​throughput technologies (e.g. whole genome sequencing, expression profiling, high resolution image processing and others) means that there is a veritable deluge of available data. Accompanying falls in the cost of the substantial computational power and associated data storage needed also mean that there is opportunity for meaningful analysis. Ever faster turnaround times (currently measured in hours) mean that it is now feasible to intro- duce next-​generation sequencing (NGS) into workflows directly contributing to patient care. NGS technologies allow the identification of single nucleo- tide polymorphisms, point mutations, and insertions or deletions (indels) as well as larger structural changes such as translocations, rearrangements, inversions, duplications, and copy number vari- ations. When investigating somatic mutations e.g. in tumours, comparison with germline samples facilitates variant detection. In rare diseases the comparison within a given trio, proband and both parents, serves a similar process. The need for defined metrics to inform strict criteria-​based quality assurance is crucial if a clinical bioinformatics pipeline is to be setup. NGS has additional capabilities to investigate cellular proper- ties over and above the determination of genomic sequence alone. Epigenomics deals with the chemical modifications of nucleic acids (e.g. 5′ methylation, and the consequent effect on gene expres- sion). NGS offers the potential to identify changes across the entire genome by capturing epigenetic information from multiple genes simultaneously. Given that for some tumours epigenetic status re- flects the overall prognosis, such analyses may provide substantially enhanced prognostic information. However, simply aggregating patient-​specific clinical data with genetic, expression, or other data will not automatically lead to better clinical outcomes. Clinical data is often unstructured and incomplete, being spread across multiple paper and electronic sys- tems, hence clinically meaningful semantic vocabulary standards are needed (see next). At a cellular level it is clear that biology is not solely dependent on the genome sequence alone, and in 2012 it was estimated that the biological function of approximately half of all human genes remained unknown. Projects such as the Encyclopaedia of DNA Elements (ENCODE), currently building a comprehensive list of the functional elements in the human genome, and the Kyoto Encyclopaedia of Genes and Genomes (KEGG), which supports machine executed models of system-​level biological pathways, are important. Without these the ability to in- terpret analyses and to draw relevant biomedical patient-​specific conclusions will be severely impeded. 2.5 Bioinformatics Afzal Chaudhry 68 SECTION 2  Background to medicine Components needed for clinical bioinformatics Data storage Flexible, extensible data warehousing is essential to accommodate the volume of clinical and biomedical information. The ability to support multiple data sources containing heterogeneous data sets is crucial, and data structures must also be able to accommodate sparse data sets as it is unlikely that any individual clinical record will contain information on all possible concepts. Typical warehouse designs are built upon a ‘dimensional fact’ model. Here, a fact is a concept relevant to decision-​making (e.g. an observation made at a specific point in time such as a blood pressure measurement), while a dimension describes some attri- bute of that fact (e.g. the blood pressure was measured with the patient supine). Use of a common set of semantic terms to support data aggregation/​interoperability Control of the metadata dictionary describing all of the facts in the warehouse is essential. For example, for laboratory tests, normal ranges, and assay types may change over time and/​or may vary from one laboratory to another. It is impossible then to meaning- fully aggregate data over time or from multiple laboratories un- less the results (facts) are interpreted using the relevant metadata (dimensions). Examples of such metadata include the struc- tured vocabulary/​ontologies listed in Table 2.5.1. Hierarchical ontological terminologies reflecting clinical meaning such as SNOMED CT are preferred over more epidemiological orien- tated classification systems such as ICD-​10. Dimensions should ideally be described using elements from a defined archetype rep- resented in a definition set such as the openEHR reference model. When using natural language processing technology to extract structured standardized data from unstructured information, the extraction should be ‘supervised’ by the metadata dictionaries to allow data from text-​based records to be amalgamated with that from a coded record. Meaningful analytical tools There are multiple data sets and tools to support the analysis of bio- informatics data (Tables 2.5.2 and 2.5.3). Typically, these focus on assessing similarities between molecular sequences based on align- ment. Sequence-​based data sets significantly outnumber structural-​ based data sets because of the relative ease by which sequence data can be obtained. Additional analyses, often using specialized soft- ware, are needed over and above simple alignment analyses to detect clinically relevant structural genomic alterations. Protein orientated databases are often categorized as either pri- mary, detailing the linear amino-​acid sequence, or secondary, containing derived information. Secondary-​based analyses may consider motifs or electrostatic interactions that are contiguous in three-​dimensional space but not in the linear sequence. Some macromolecular three-​dimensional structure databases contain a hierarchical taxonomy to help identify evolutionary relationships. Ultimately the most value is seen by combining data from multiple sources—​clinical, sequence, structure, expression, and function (or as many of these as exist). This may not always be straightforward due to variations in nomenclature and data formats, although web-​ based gateways supporting the traversal of multiple databases are becoming more effective. Examples of clinical/​research areas benefitting from clinical bioinformatics strategies See Table 2.5.4. Oncology research Oncology research has tended to focus on single gene and single pathway analysis. However, NGS offers both multiple simultaneous analyses and extremely high sequence coverage thus substantially increasing sensitivity. International consortia such as the Cancer Genome Atlas are sequencing thousands of cancers to generate data sets across different cancer subtypes. Computational theories Table 2.5.1  Clinical classifications/​terminologies/​structured vocabularies available in the United Kingdom Name (acronym) Full name Clinical related entities described URL (accessed November 2018) ICD-​10 International Classification of Disease 10 Diagnoses https://www.who.int/classifications/icd/ icdonlineversions/en/ SNOMED CT Systematized Nomenclature of Medicine—​ Clinical Terms Symptoms, signs, diagnoses http://​www.ihtsdo.org/​snomed-​ct dm+d NHS Dictionary of Medicines and Devices Medication http://​dmd.medicines.org.uk/​ NLMC National Laboratory Medicine Catalogue Laboratory investigations https://​nlmc.x-​labsystems.co.uk/​ LOINC Logical Observation Identifiers Names and Codes Laboratory investigations https://​loinc.org/​ NICIP National Interim Clinical Imaging Procedure code set Radiological investigations https://digital.nhs.uk/services/terminology-and- classifications/national-interim-clinical-imaging- procedure-nicip-code-set OPCS 4.7 Office of Population Censuses and Surveys Classification of Interventions and Procedures version 4.7 Procedures https://isd.digital.nhs.uk/trud3/user/guest/group/0/ pack/10​ HPO Human Phenotype Ontology Phenotypic abnormalities encountered in human disease https://​hpo.jax.org/​app/​ 2.5  Bioinformatics 69 including pathway network analysis and graph theory can be used to model tumour-​related regulatory networks and interactions, allowing complex interactions to be understood. The predictive power of multigene biomarker panels, now potentially scaled into the many thousands of genes analysed simultaneously as opposed to just tens of genes, can be profoundly enhanced (e.g. in one study a panel of 2300 genes could discriminate adenocarcinoma of the lung from healthy tissue with 100% accuracy). Pharmacogenomics As the genetic/​molecular basis of the metabolism and mechanism of action of drugs becomes increasingly understood, so therapy can be individually tailored to some degree. Recent examples include trastuzumab for HER2-​positive breast cancer and imatinib for chronic myeloid leukaemia and conditions associated with tyrosine kinase-​ based mutations. NGS can identify somatic variants which help to direct therapy (e.g. as resistant tumour clones emerge—​melanoma with the BRAF mutation V600E is susceptible to vemurafenib while the p61 BRAF V600E variant is not). In haematology the potential to stratify an individual’s sensitivity to warfarin (VKORC1 and CYP2C9 gene polymorphisms) will help to guide appropriate dosing and avoid potentially life-​threatening events. Infectious diseases The far smaller size of viral and bacterial genomes makes it possible to sequence the genome of infecting pathogens. In the case of the 2009 H1N1 influenza pandemic, bioinformatics tools were able to describe within a few hours of the first identification of a novel muta- tion a possible mechanistic explanation by which it was able to mani- fest such a severe phenotype. Computational analysis of genome sequence and protein structures can help in identifying likely drug susceptibility (e.g. the enterohaemorrhagic O104:H4 E. coli outbreak in Germany in 2011), while individual infecting strains can be typed and traced over both time and geographical distribution so sup- porting more appropriate and economical public health strategies. Digital imaging Even among the most experienced histopathologists there can be considerable interobserver variation in certain conditions. Objective algorithms to identify tumour grading and to search for other tissue-​ based measures of disease activity using level sets, fractal analysis, and machine learning can improve diagnosis. The adaptation of astronomical algorithms coupled with their application to large an- notated study cohorts is likely to provide a powerful set of analytical tools. In dermatology, texture analysis, neural network framework-​ based analyses, data mining of skin images and computer-​based reconstruction of the skin surface have all been used to support re- search into reliable diagnostic strategies. Table 2.5.2  Publicly available databases of biological knowledge Domain Database URL (accessed November 2018) Nucleotide DDBJ https://​www.ddbj.nig.ac.jp GenBank https://www.ncbi.nlm.nih.gov/genbank/ Genome COGs https://www.ncbi.nlm.nih.gov/COG/ Entrez genome https://www.ncbi.nlm.nih.gov/genome GeneCensus http://bioinfo.mbb.yale.edu/genecensus/ Protein—​primary NRDB https://www.ncbi.nlm.nih.gov/protein OWL http://130.88.97.239/OWL/index.php SWISS-​PROT https://www.uniprot.org/uniprot/?query=reviewed:yes Protein—​secondary Pfam http://pfam.xfam.org/ PRINTS http://130.88.97.239/PRINTS/index.php PROSITE https://prosite.expasy.org/ Protein—​macromolecular CATH http://​www.cathdb.info/​ PDBeFold http://www.ebi.ac.uk/msd-srv/ssm/​ Protein Data Bank https://www.rcsb.org/ SCOP http://scop.mrc-lmb.cam.ac.uk/scop/index.html Functional/​systems biology ENCODE https://genome.ucsc.edu/ENCODE/​ KEGG https://www.genome.jp/kegg/​ Vocabulary Gene Ontology http://​geneontology.org/​ Integrated systems/​web gateways InterPro https://www.ebi.ac.uk/interpro/ Uniprot http://​www.uniprot.org/​ Table 2.5.3  Major research institutions providing access to a wide range of bioinformatics databases and analysis tools EMBL-​European Bioinformatics Institute https://www.ebi.ac.uk/ NCBI (National Centre for Biotechnology Information) http://​www.ncbi.nlm.nih.gov/​ Wellcome Trust Sanger Institute https://www.sanger.ac.uk/science/tools​ 70 SECTION 2  Background to medicine Conclusions The need to deliver safe, timely, sustainable, and patient-​centric care along with the need for evidence-​based strategies means that any new technology first has to demonstrate clear translational research benefits before it can be adopted into routine practice. We now have the means to generate and analyse data sets that can lead to such breakthroughs, and this continues to develop at a breakneck pace. If the most is to be made of these new data sets and analyses then not only must appropriate clinical data be available for cor- relation, but there will also be a need for more structured training programmes and curricula to train clinicians in the analysis, interpretation, and use of such data. The UK Health Education England Genomics Education Programme Clinical Bioinformatics group reported in early 2015 and has recommended a series of steps and programmes for a range of healthcare staff to address the long-​term goal of establishing a workforce fit for genomic medicine. The implementation of such recommendations is awaited. FURTHER READING Raza S (2014). Defining the role of a bioinformatician. http://www. phgfoundation.org/briefing/defining-the-role-of-a-bioinformatician​ Slade I, Burton H (2016). Preparing clinicians for genomic medicine. Postgraduate Medical Journal, 92, 369–​71. Table 2.5.4  Examples of clinical/​research areas benefitting from clinical bioinformatics strategies Clinical/​research area Example DNA/​RNA sequencing and expression profiling Comprehension of biomolecular pathways underlying malignant transformation Biomarker identification Improved classification, early diagnosis, prognostication, and tailoring of therapy Pharmacogenomics/​proteomics Tailoring of therapy—​likelihood of therapeutic benefit as well as likely propensity to side effects Pathogen genome/​protein sequence/​structure and function Description of putative mechanisms underpinning phenotypic manifestations Susceptibility to antimicrobial therapy Epidemiological analysis to identify environmental factors and support the relevant control mechanisms Digital imaging Machine learning based improvements in cellular/​tissue analysis and diagnosis Image analysis of three-​dimensional geometric/​structural anatomy using both visible (e.g. glaucoma), and non​visible spectra (e.g. infrared analysis of meibomian gland morphology in dry eye syndromes) 2.6 Principles of clinical pharmacology and drug t 2.6 Principles of clinical pharmacology and drug therapy 71 ESSENTIALS In its widest sense a drug is any chemical entity that can perturb a biological system. For the purposes of drug therapy the biological system is the human body and the perturbation is exploited to aid the diagnosis, treatment, or even cure of a disease process. Historically, chemical entities have been plant derived or synthetic small (organic) molecules (<1000 of Dalton in size). However, increasingly new drugs are large biomolecules (up to 100 000 Dalton; e.g. monoclonal antibodies or recombinant proteins). These biologics are expected to exceed small molecules in the drug pipelines of the pharmaceutical industry in the coming decades. The role of clinical pharmacology is to provide the scientific basis for rational prescribing: ‘patients may recover in spite of drugs . . . or because of them’ (Gaddum). This sums up the dilemma facing any doctor who prescribes a drug to a patient: it may be the explicit in- tention to do the patient some good, but the drug may actually harm and on rare occasions even kill them. Principles of clinical pharmacology Drug therapy can be considered under four headings:  (1) pharmaceutical—​is the drug getting from the formulation into the patient?; (2) pharmacokinetic—​how does the drug dose, formulation, frequency, and route of administration affect the drug concentration in the body, and the way that this concentration changes with time?; (3)  pharmacodynamic—​how does a drug produce its pharmaco- logical effects?; (4) therapeutic—​is the pharmacological effect trans- lated into a therapeutic effect? Adverse drug reactions Definition and causes—​an adverse drug reaction is any unwanted or harmful reaction experienced following administration of a drug, or combination of drugs, under normal conditions of use that is sus- pected of being related to the drug (or combination). If a causal connection with a drug cannot be established, then the injury is referred to as an adverse drug event. An adverse drug reaction may be (1)  dose-​related—​usually due to an exaggeration of a known pharmacological effect of the drug and to an extent predictable; or (2) non-​dose-​related—​often caused by immunological or pharma- cogenetic mechanisms and hence largely unpredictable. Clinical importance—​adverse drug reactions are responsible for 1 to 4% of acute hospital admissions, affect 5 to 20% of inpatients at some time during their admission, and are responsible for up to 3% of inpatient deaths. Pharmacovigilance is the subspecialty of clinical pharmacology devoted to the detection and evaluation of adverse drug reactions. Drug interactions A drug interaction occurs when the effects of one drug are altered by the effects of another drug, usually resulting in an adverse drug re- action. Drugs likely to precipitate interactions typically (1) are highly protein bound (e.g. aspirin and sulphonamides); or (2) induce drug metabolism (e.g. phenytoin, carbamazepine, and rifampicin); or (3) inhibit drug metabolism (e.g. cimetidine, metronidazole, anti-​HIV protease inhibitors, and triazole antifungals). The drugs most likely to be affected by drug–​drug interactions are those with a steep dose–​ response curve and a low therapeutic index. A rational basis for prescribing To maximize a drug’s therapeutic potential and minimize its side ef- fects, a series of checks and balances is needed. The following should be considered before any drug is prescribed: (1) does the patient need a drug at all?—​do the risks outweigh its benefits?; (2) are the benefits of the drug well established, preferably by randomized con- trolled trials?; (3) what drug action is being sought, and what class of drug can best provide it?; (4) what is the most appropriate drug in that class, and in what formulation?; (5) what is the appropriate dose and how long should the drug be prescribed for—​ a single course or for indefinite use?; (6) will this drug interact with other drugs the patient is taking?; (7) can this drug replace other drugs the patient is taking?; (8) what does the patient need to understand about this drug—​and who will communicate this?; (9) will it be necessary to review the prescription of this drug—​and if so, when, how, and by whom?; (10) does the patient need anything else to derive the most benefit from this drug? Practical prescribing Guidelines and formularies—​in practice, the question checklist here is not completed before every drug is prescribed, as many of the questions are addressed by using appropriate therapeutic guidelines 2.6 Principles of clinical pharmacology and drug therapy Kevin O’Shaughnessy 72 section 2  Background to medicine and formularies. Guidelines provide prerehearsed decision paths for many of the issues raised in the checklist, and formularies tackle the specific question of which drug to prescribe (from within a thera- peutic class). The patient’s drug history—​it is essential to obtain a thorough drug history from the patient before prescribing. Key information that should be obtained includes details of: (1) all the medicines currently being prescribed, including their doses; (2)  any previous medical treatments; (3) any ‘alternative’ treatments (e.g. herbal and homeo- pathic remedies); (4) any self-​prescribed medicines; (5) any history of allergy or adverse reactions to drugs. Prescribing for the individual patient—​guidelines, formularies, and other prescribing aids are not a substitute for an intelligent clin- ical approach. The prescriber needs to establish what the patient’s experience and expectations of drug therapy are, and the patient needs to know the likely consequences—​both good and bad—​of taking any drug that is prescribed. This dialogue is important, since it will often de- cide whether the patient actually takes the drug as prescribed. Patient compliance (or, more correctly, adherence) is a key variable in the pre- scribing process, and one over which the doctor often has least control. Balancing a drug’s therapeutic benefit with its side effects The prescriber checklist implies that balancing the expected bene- fits of a drug against the expected harm is a straightforward process. It is not. By the time a drug reaches the clinic we know a lot about the size of its therapeutic effect, its relation to drug dose, and the proportion of patients likely to show this effect. Measuring harm is a less precise and much slower process. Terms such as ‘risk:benefit ratio’ are widely used but can be very misleading. Benefit is often measured in terms of amount and not its probability or frequency. It can also be measured accurately by a single therapeutic endpoint; harm from a drug encompasses a spectrum of side effects that differ in both their frequency and severity. Some side effects are predictable from the pharmacology of the drug (so-​called type A) and are relatively common and dose-​ dependent. Others are rare and unpredictable (so-​called type B or idiosyncratic). Type B side effects are usually more severe with a higher burden of mortality and morbidity and, because of their low frequency, will only come to light after a drug is licensed for clinical use. It is important to remember that only a few thousand patients are exposed to a drug before a license is granted (and for drugs with orphan status, the patient exposure may be very much smaller). The duration of dosing is also inevitably short when the intention is to give the drug to patients indefinitely. Hence, crucial important information about side effects has to be gathered after the drug is licensed, and it may take many millions of patient-​years of dosing before some drug side effects emerge. Perhaps the best way of comparing a drug’s benefit with its harm is to define them in terms of the number of patients needed to be treated to observe a certain benefit and cause specific harm. These are referred to as the number needed to treat (NNT) and number needed to harm (NNH), respectively. Take, for example, the drug clopidogrel. It is currently given to patients for 12 months after suf- fering an acute coronary syndrome (non-​ST elevation myocardial infarction, NSTEMI) to prevent myocardial infarction. Because of its antiplatelet action it also causes gastrointestinal bleeding. Used in this way to prevent non​fatal myocardial infarction, the NNT for clopidogrel is about 67 and the NNH to cause a major bleeding epi- sode is 100. So, if 100 patients are treated for 1 year, 1.5 non​fatal ­myocardial infarctions will be prevented at the expense of causing one major bleeding episode. This is a much more transparent pres- entation than the relative risk reduction that is often used to high- light drug benefit in clinical trials. These percentage measures inevitably boost the psychological impact of a drug’s effect—​the NNT for clopidogrel equates to a 22% reduction in the frequency of non​fatal myocardial infarction. But even with reliable measures of harm and benefit, how do we decide where the final balance lies? It will depend on other factors such as the severity of the disease and whether there are safer al- ternative drugs. Hence, if we want to treat a disease with a high fa- tality rate using a drug which is highly effective and carries little risk of harm, the balance is clearly in favour of the drug. But if the dis- ease itself carries no mortality or morbidity and we propose using a drug whose effectiveness is low and carries a high risk of harm, the balance is clearly against using the drug. Most clinical decisions to use a drug or not will be in the grey area between these two extremes. Efficacy, effectiveness, and efficiency of drugs These terms are not synonymous and can be easily confused. Efficacy is a pharmacological term. It refers to the ability of a drug to bring about a certain size of an effect at a given concentration or dose. The effect may not be applicable to clinical practice or is only ac- curately measured in a clinical trial. Hence the term is best reserved for the performance of a drug in this setting. Effectiveness refers to the performance of a drug in everyday clinical use, and is defined as the likelihood and extent of the therapeutic effect in a given patient. Efficiency weights drug performance against cost: it is the ratio of effectiveness to cost. Clearly, it is more efficient to use the cheaper of two drugs that are equally effective and safe. The therapeutic index of a drug The therapeutic index is a term taken from animal pharmacology. It is the dose needed to harm over the dose needed to produce the therapeutic response. However, as harm is usually measured by a drug’s lethality, it is not a useful clinical measure. Instead, the index is employed clinically in a very loose and entirely qualitative way. Drugs that produce side effects at doses well outside the clinical dose range are said to have a high therapeutic index and those where the ranges are much closer or even overlap are said to have a low therapeutic index. Hence penicillins have a high therapeutic index; large doses can be given without the worry of adverse effects unless the patient is allergic to penicillins. In contrast, digoxin has a low therapeutic index; the doses causing toxicity overlap with those pro- ducing therapeutic benefit. Drugs with a low therapeutic index in- clude: aminoglycoside antibiotics, anticoagulants, anticonvulsants, antihypertensives, some antiparasitic and antiviral drugs, cardiac glycosides, and cytotoxic and immunosuppressant drugs. To increase the margin of safety for drugs with a low index, dosing can be guided by measuring drug levels in plasma or serum. Such therapeutic monitoring is mandatory for drugs such as lithium and aminoglycoside antibiotics because of additional pharmacokin- etic problems with their use. Anticoagulants are unusual in being 2.6  Principles of clinical pharmacology and drug therapy 73 monitored by their effect on clotting rather than as a plasma drug concentration. Formularies Formularies are lists of medicines for prescribers and pharmacists, intended to guide the choice and facilitate the dispensing of medi- cines. Many give details of the formulation and doses of drugs. Each formulary is produced primarily for a particular group, usually the prescribers in one country or region or institution, or even one prac- tice. Most formularies are restrictive (i.e. they make a narrow choice of medicines from all those available). This is typical for the formu- lary of a hospital, or of a health maintenance organization. A hospital formulary lists only the preparations that are stocked in the hospital pharmacy; a health maintenance organization formulary only those that the organization will pay for. The British National Formulary (BNF) is probably the best known and most widely used formulary of all, but is unusual in including all medicines available for prescrip- tion in the United Kingdom, whether they are good choices or not. However, every section of the BNF has concise and critical ‘notes to facilitate the selection of suitable treatment’ that precede the list of available agents. This invaluable resource is revised biannually and available online (https://​www.medicinescomplete.com/​about/​). The WHO ‘Model List of Essential Drugs’ In many developing countries, having limited health budgets means that large sections of the population have no access to drugs or healthcare, and governments cannot afford to provide necessary drugs. To help them to use their limited funds in the best ways, the World Health Organization (WHO) has since 1977 published a regularly updated Model List of Essential Drugs. It is updated every two years and currently in its 18th edition (http://​www.who.int/​ medicines/​publications/​essentialmedicines/​en/​index.html). Essential drugs on the list are intended to ‘provide safe, effective treatment for the majority of communicable and non​communicable diseases’. The WHO list is a ‘model’ list that can be adapted to meet the needs of the local health economy. Hence, over 150 countries have an es- sential list based on the WHO model. It is a salient fact that the first WHO list contained 208 drugs and, in the intervening 30 years, it has not doubled in size. There may be some clear lessons here for the drug lists of developed countries. Medicines management Rapidly escalating costs of providing drugs within healthcare sys- tems and the need to maximize drug safety mean that the concept of medicines management is now widespread. By bringing together clinical, pharmacy, and financial skills, drugs that are considered es- sential can be prescribed in the most cost-​effective and safest way. In the United Kingdom, drugs are assessed at a national level by the National Centre for Health and Clinical Excellence (NICE) before deployment within the National Health Service (NHS). NICE con- siders in detail the evidence for a drug’s alleged benefit and weights this against known side effects and economic modelling of its total cost versus benefit within the NHS. The principles of clinical pharmacology Drug therapy can be considered under four headings—​pharma­ceutical, pharmacokinetic, pharmacodynamic, and therapeutic—​each of which addresses a pertinent question about drug therapy (Fig. 2.6.1). The pharmaceutical process The pharmaceutical step is concerned with the question, ‘Is the drug getting from the formulation into the patient?’ The route of drug administration is usually a more crucial choice than how it is formulated. Nevertheless, formulation can greatly af- fect the rate and extent of drug absorption. Repackaging short-​acting drugs into ‘sustained’ or ‘modified’ release formulations to slow re- lease of the drug into the gut allows them to be taken once daily. Morphine, calcium channel blockers (e.g. diltiazem and nifedipine), theophylline, and l-​dopa have been widely reformulated in this way. Other drugs are formulated for specific routes of administration: glyceryl trinitrate is available for sublingual (as a spray and tablets), buccal, and transdermal (as a paste or patch) use. To understand the differences between these various routes, it is necessary to under- stand the concept of systemic availability. Systemic availability Systemic availability, commonly called bioavailability, is the pro- portion of administered drug that reaches the systemic circulation and is available for distribution to the site of drug action. If a drug is given intravenously, it will enter the systemic circulation directly (i.e. it is said to be 100% bioavailable). The same drug given orally, or by any other route, must be absorbed first (which may be an in- complete process), and possibly metabolized, before entering the systemic circulation. Metabolism can occur in the gut wall and liver following oral ad- ministration, although it can occur at any site of drug administra- tion. The cells lining the gut also express drug efflux pumps (e.g. P-​glycoprotein) and transporters that can actively reduce the extent of absorption of a drug. These processes, together with incomplete absorption, ensure that most drugs have a bioavailability of less than 100% when given orally. In some cases, the reduction is so large that a drug has zero bioavailability and is clinically ineffective. Oral in- sulin or benzylpenicillin are good examples of this problem, with their instability in the stomach preventing significant absorption. Complete presystemic metabolism also explains why glyceryl tri- nitrate and buprenorphine are orally effective only if given sublin- gually or as a buccal patch. Absorption from the mouth allows the drug to bypass gut wall and liver metabolism. Special drug formulations Most drugs are given orally; oral formulations include syrups, or- dinary (instant release) tablets, capsules, and modified release for- mulations. However, drugs can be given by other routes, including sublingually, buccally, rectally, transdermally, by inhalation, and by injection intravenously, subcutaneously, intramuscularly, or locally. Modified release formulations Most conventional instant release tablets are designed to disintegrate in the stomach or proximal small bowel, so that absorption is com- plete within a few hours on ingestion. Modified or sustained release 74 section 2  Background to medicine formulations are oral formulations that allow a drug to be released over long periods (12–​24 h typically) relative to the half-​life of the drug. The intention of the prolonged and slowed release is to smooth out the concentration profile of the drug in the blood and extend its duration of action. They include formulations of theophylline, ni- fedipine, diltiazem, morphine, and lithium. Prescriptions of these drugs should specify the exact formulation, as formulations differ in systemic availability and may not be interchangeable. Sublingual, buccal, rectal formulations Drugs that are absorbed through the oral or rectal mucosa avoid first-​pass metabolism in the liver by uptake into veins that drain directly into the systemic circulation. For example, sublingual gly- ceryl trinitrate is rapidly effective as a sublingual tablet, but if the tablet is swallowed, the remainder is not bioavailable because of high presystemic metabolism. Rectal administration achieves a direct ef- fect on the local bowel wall (e.g. corticosteroids in ulcerative colitis), but is also useful for achieving high blood levels rapidly when intra- venous access is difficult (e.g. diazepam for seizure control). Transdermal formulations Some lipid-​soluble drugs are well absorbed through the skin, and their transdermal delivery via ‘patches’ allows controlled re- lease over many hours or days. Examples are glyceryl trinitrate in the long-​term treatment of angina pectoris, transdermal hyoscine for travel sickness, oestradiol as hormone replacement therapy, buprenorphine for pain control, and nicotine for smoking cessation. Inhaled formulations The lung provides a huge surface area (c. 100 m2) for drug absorp- tion, but to reach the distal airways, a drug for delivery by inhal- ation must be associated with particles in the 2 to 5 µm range. These can either be solid particles (dry powder devices) or dissolved in small droplets (aerosol devices). For hand-​held aerosol inhalers the drug is dissolved in a volatile hydrocarbon, but in nebulizers the drug is in an aqueous solution that is aerosolized by a jet of air or oxygen. Both aerosols and dry powders are widely used to deliver inhaled corticosteroids and bronchodilators used in the manage- ment of asthma and chronic obstructive pulmonary disease. Even peptides can be delivered by this route, as demonstrated by the li- censing of inhaled insulin. The efficiency of inhalation as a route of administration also explains the ‘success’ of some drugs of abuse (e.g. nicotine and crack cocaine). Subcutaneous, intramuscular, and local injections The rate of absorption of insulin from the site of subcutaneous in- jection is controlled by its physical state (e.g. monomer, crystalline, or non​crystalline), pH, and the presence of zinc ions or protamine PHARMACEUTICAL PROCESS ‘Is the drug getting into the patient?’ Formulations Routes of administration (compliance) PHARMACOKINETIC PROCESS ‘Is the drug getting to its site of action?’ PHARMACODYNAMIC PROCESS ‘Is the drug producing the required pharmacological effect?’ THERAPEUTIC PROCESS ‘Is the pharmacological effect being translated into a therapeutic effect?’ Biliary excretion Rectal or sublingual administration Parenteral administration Disintegration etc. Dissolution ‘First pass’ Elimination Therapeutic/toxic effects Absorption Distribution Plasma proteins Tissues Elimination Hepatic metabolism Renal excretion Other Molecular pharmacology Cell and tissue pharmacology Cell and tissue physiology Organ physiology Clinical effects Extracellular fluids Tissues (sites of action) d n u o b n U Drug in solid dosage form Drug in particulate form Drug in solution Metabolism in gut lumen and gut wall Hepatic metabolism Pharmacological effects Protein- bound Fig. 2.6.1  The four processes of clinical pharmacology in relation to drug therapy. 2.6  Principles of clinical pharmacology and drug therapy 75 (isophane) in the buffer in which it is suspended. It is also affected by altering the amino acid sequence of insulin; allowing recom- binant insulins, which are rapidly (lispro and aspart) or slowly re- leased (Glargine or Detemir) after injection. Soluble insulins have a rapid onset (15–​30 min) and short duration of action (4–​6 h), so they are usually given together with intermediate or long-​acting insulin. Long-​acting insulins act for more than 24 h and can provide the in- sulin background as a once daily injection. The isophane insulins have an intermediate duration of action and are usually given twice daily mixed with soluble insulin. Absorption of a drug after subcutaneous injection is affected by blood flow. Hence, the duration of action of local anaesthetics can be prolonged by vasoconstriction. Adrenaline or felypressin is added to the subcutaneous formulations for this purpose. Smoking also re- duces subcutaneous insulin absorption by causing cutaneous vaso- constriction. Reduced absorption also explains why subcutaneous adrenaline is not advised for the treatment of anaphylaxis. The intramuscular route is a popular parenteral route but can be erratic. Hence, phenytoin and diazepam should not be given by this route for emergency use. Absorption following intramuscular in- jection may be retarded by esterifying a drug to a large lipid mol- ecule. This gives oily formulations that provide long-​lived drug depots in muscle, which are used in the treatment of male hypo- gonadism (testosterone enanthate or undecanoate) and schizo- phrenia (fluphenazine or flupentixol decanoates). Combination formulations in oral therapy Combination products are attractive and may aid compliance (see following paragraphs), but should be used only when at least two criteria are met: • The frequency of administration of the two drugs is the same. • The fixed doses in the combination product are therapeutically and optimally effective in most cases (i.e. it is not necessary to alter the dose of one drug independently of the other). Acceptable combination products include: • Aspirin plus codeine (co-​codaprin) or paracetamol plus dihydro­ codeine (co-​dydramol), pairs of drugs that have different analgesic ac- tions (which synergize) and different adverse effects (which do not). • l-​Dopa plus a peripherally-​acting dopa decarboxylase inhibitor (benserazide or carbidopa); the peripheral action of the decarb- oxylase inhibitor blocks peripheral metabolism of l-​dopa, which is free to enter the central nervous system, where it is converted to dopamine, producing the therapeutic effect in Parkinson’s disease. • Combined oral contraceptives, which contain an oestrogen and a progestogen. • Ferrous sulphate plus folic acid, used to prevent anaemia in pregnancy. • Co-​amoxiclav (amoxicillin plus clavulanic acid); the β-​lactamase inhibitor, clavulanic acid, prevents the breakdown of amoxicillin by bacterial penicillinase, so broadening its spectrum. The patient’s use of a drug: Compliance and concordance Compliance is the extent to which a patient follows a prescribed drug regimen. Some prefer the term ‘concordance’ or ‘adherence’, to make it clear that therapeutic decisions are best arrived at jointly between prescriber and patient and avoiding a patronizing relationship. Rates of adherence are difficult to measure and can be surprisingly low, even in conditions where the drug has a very obvious benefit, such as epilepsy and asthma. Possibly as few as 1 in 6 patients take a drug exactly as prescribed and 1 in 6 take none at all; the remainder take a different dose and/​or a different frequency from that prescribed. Poor adherence is still not well recognized by doctors as a cause of therapeutic failure. The effect of the prescribing regimen Apart from the financial cost to the patient (which in some countries is substantial), the two factors that determine adherence to drug therapy are the complexity of the regimen and the likelihood of side effects. The complexity of the prescribed regimen reflects frequency of administration and the number of drugs prescribed. Generally speaking, the more frequently a drug is prescribed per day, and the more drugs in total are prescribed, the lower is the rate of compli- ance. For most chronic diseases, drugs that can be taken once daily are preferred. Obviously, using several drugs (polypharmacy), where one may be adequate, should be avoided wherever possible. Side effects that a patient attributes to a drug may lead them to stop taking the drug completely. It may be necessary then to persuade them to persevere or switch to an alternative drug. However, giving another drug to simply reduce the side effects of the first drug has to be thought through carefully. For example, giving furosemide to a patient who develops ankle oedema on nifedipine, causing in turn gout or hypokalaemia, is poor practice. But giving omeprazole to a rheumatoid patient who develops acid reflux on their non​steroidal anti-​inflammatory (NSAID) is appropriate. The effect of the illness People with severe mental health problems (e.g. patients with schizophrenia or manic-​depressive psychosis), often take medicines unreliably. Physical disability may cause difficulty even in patients who want to take their medicine. For example, patients with rheumatoid arth- ritis who cannot reach the tablets, or cannot remove the top of a childproof container, cannot take them without help. Sometimes, a good response to treatment leads patients to stop. For example, patients with tuberculosis need long courses of several drugs to eradicate the infection; once the symptoms have resolved, motivation to continue treatment may decline, risking reactivation and the emergence of resistant tuberculosis. Some diseases may promote compliance. Patients with insulin-​ dependent diabetes easily become very ill quite quickly if they forget to take their insulin, and that is likely to make them comply, al- though they may not use it precisely as advised. Patients in whom a β-​blocker or vasodilator has prevented anginal attacks will also be more likely to have good compliance. The patient’s behaviour People tend to forget to take drugs, or cannot be bothered; they may feel no need for treatment (e.g. in asymptomatic hypertension); they may be unclear about the prescribing instructions; they may not want to feel dependent or be thought to be dependent on ‘drugs’. There may be social or physical reasons why they cannot reach a pharmacist, financial difficulties, or everyday inconveniences in carrying and taking the medication. 76 section 2  Background to medicine The doctor’s behaviour The enthusiasm and confidence with which a drug is prescribed, and the extent to which these attitudes are transmitted to the patient, may influence not only compliance but also the response to therapy. This is related to the placebo effect of drug taking. Methods of assessing adherence It is important to assess adherence both in everyday practice and in clinical trials. The most obvious and usually the easiest approach is to ask the patient whether he or she has been taking the drugs, and whether there have been any problems. If the doctor is non-​ judgemental and indicates that difficulties are common, the patient is encouraged to be open. Less directly, one can ask to see the patient’s tablets: this at least confirms that the prescription has been filled. Counting the tablets left in the bottle is a guide to how many have been taken, but some may have been thrown away. Recording devices fitted in the caps of medication containers can record the frequency and exact timing of the opening of the container, and are useful in research. If a patient is vague or untrustworthy, measurement of the drug level in plasma or urine may give some reassurance—​at least on the day they visit the doctor. The list of drugs for which assays are rou- tinely available is small, but a surrogate marker can be used (e.g. the level of thyroid stimulating hormone (TSH) to ensure compli- ance with thyroxine or the international normalized ratio (INR) for patients on warfarin). Alternatively, a pharmacological effect such as heart rate can be measured directly (to assess β-​blocker compliance). Methods of improving adherence Adherence can be improved by supervised administration of the drug, by removing barriers, by simplifying the therapeutic regimen, and by educating the patient on the need to take the medicine, with reinforcement whenever possible. Supervised administration Administration of a drug by the doctor or nurse ensures compli- ance. This is possible in hospital or when occasional administration is required (e.g. intramuscular injections of vitamin B12, long-​acting intramuscular depot injections of neuroleptics, and supervised twice-​weekly antituberculosis therapy). A relative or other carer can give the drug at home and this may be aided by using dosimeter boxes prefilled by a pharmacist. Removing barriers to adherence Adherence may be encouraged by prescribing pleasant-​tasting syrups rather than tablets for children and older people, and by using a drug or formulation that minimizes side effects. Simplification of the therapeutic regimen  The therapeutic regimen can be simplified by reducing both the number of drugs a patient has to take and the frequency of administration. Modified release or com- bination formulations may be useful here. Education and reminders  Educating the patient about why treat- ment is necessary (e.g. treating hypertension or diabetes reduces the risk of serious complications) is time-​consuming but improves com- pliance. In the treatment of certain infections (e.g. tuberculosis) and in typhoid carriers, the wider importance to the community should also be explained. Even in the well-​motivated, reminders to take the treatment may improve compliance. Many drugs for long-​term use are also now dispensed in a ‘calendar pack’ to help compliance. The pharmacokinetic process Pharmacokinetics concerns the complicated question ‘How does the drug dose, formulation, frequency and route of administration affect the drug concentration in the body, and the way that this concentra- tion changes with time?’ The answer will depend on the absorption, distribution, metabolism, and excretion of that drug (otherwise known as ADME). This naturally lends itself to complex mathemat- ical modelling, but the key points can be understood without being overwhelmed by the mathematics. The size and chemistry of biologics poses special problems for both the measurement and understanding of their ADME. Generally they are metabolized to small peptide fragments or amino acids that can be renally cleared or recycled through cellular protein syn- thesis. This process can be very rapid for small peptides and proteins (c.hours), but very slow for large proteins such as IgG that can have a half-​life of 7–​28 days. The uptake of biologics from the subcutaneous injection sites frequently used for biologics is also slow and affected by lymphatic drainage and binding of any Fc-​domains to FcRn re- ceptors on endothelial cells. Small proteins may also be deliberately modified by, for example, PEGylation, glycosylation, or coupling to larger proteins such as albumin to reduce their degradation. Basic pharmacokinetic terms and concepts Before considering the various components of ADME, a few pharma- cokinetic terms need to be explained. Consider a time after admin- istering a drug when its absorption and distribution throughout the various compartments of the body are complete. At this time, there are no concentration gradients across compartments and any fur- ther decline in drug concentration reflects drug elimination, that is, excretion and metabolism of the drug. Several parameters charac- terize this elimination phase of a drug: Volume of distribution (Vd) This can be a confusing term, since it is really a mathematical device and not a true physical volume. Hence, it rarely relates to any par- ticular water compartment (e.g. plasma, 3 litres; extracellular water, 16 litres; total body water, 42 litres). It simply allows the amount of drug remaining in the body to be inferred from the drug concen- tration (concentration = amount/​volume). Drugs have a wide range of values for their Vd and sometimes they are much greater than the total water space. This is typical for drugs that bind to tissues or partition into lipid. Amiodarone is an extreme example of this behaviour with a Vd of approximately 5000 litres. Biologics in sharp contrast usually have a small Vd (e.g. 2–​4 litres) reflecting their con- finement to the blood space. Clearance In the same way that the volume of distribution is a proportion- ality term, so clearance is used to relate the rate of drug excretion 2.6  Principles of clinical pharmacology and drug therapy 77 to the concentration of the drug in the body. It is expressed in units of flow (volume per unit time) and is effectively the volume of the Vd cleared of drug per unit time. A drug can be elimin- ated (or cleared) from the plasma by one or more mechanisms (e.g. through the kidney by filtration and the liver by metabolism). The total clearance of the drug is simply the sum of these different organ-​based clearances. Plasma half-​life During the elimination phase, the drug concentration falls in a predictable way: in a fixed time called the half-​life it falls by 50%, and by a further 50% after another half-​life and so on. Thus, it takes between four and five half-​lives for 95% of the drug to be elimin- ated (see Fig. 2.6.2). For most drugs, the half-​life is constant (except if the kinetics becomes nonlinear—​see following paragraphs). The half-​life is not affected by the starting concentration or amount of the drug, but it is directly affected by the volume of distribution and clearance of a drug (actually proportional to their ratio). Repeated dosing and the ‘plateau principle’ A drug can be given once but it is more common for a drug to be given repeatedly (even indefinitely). After the first dose, the plasma level will rise to its maximum (Cmax) and fall as shown in Fig. 2.6.2. If the dose is not repeated until all the drug has been eliminated (which means the dose interval must be long, e.g. 10 half-​lives), the drug level will oscillate between the Cmax after a single dose and a trough value of zero. The problem with this regimen is that to give a plasma level that is therapeutic (but not toxic), a large part of the dosing interval is spent with a non​therapeutic drug level. To get around this, the drug is given at smaller doses separated by short dose intervals compared to the drug’s half-​life. Because the dose interval is similar to the drug’s half-​life, the first dose of drug is not completely eliminated before the second dose is given. The third dose is given before either the second dose, or the remaining fraction of the first dose is eliminated and so on. Hence, the drug plasma level accumulates and eventually it comes to a steady state where it oscillates between a new peak and trough (substantially greater than zero); ideally, the peak is below the toxic threshold and the trough above the therapeutic threshold throughout the dose interval. This is what is known as the plateau principle and the therapeutic and toxic threshold levels for the drug define its ‘thera- peutic window’. Loading doses The half-​life also dictates the time it takes for a drug to reach its plateau with repeated dosing. In the same way that it takes almost 5 half-​lives to eliminate a drug, it takes almost 5 half-​lives to reach plateau. The delay imposed by the drug’s half-​life in reaching steady state can be unacceptable clinically. Consider, for example, the use of digoxin to treat fast atrial fibrillation. Because of the long half-​ life of digoxin (40 h), it takes over a week to reach plateau, and over twice this if the patient has renal failure because of the increased half-​life. The solution is to give a loading dose of a drug. The loading dose is intended to take the blood concentration rapidly into the 0 6 12 18 [mg/litre] 12 11 10 9 8 7 6 5 4 3 2 1 0 24 30 36 42 48 54 60 Time after dosing (h) Drug concentration in the plasma Therapeutic threshold Toxicity threshold Drug elimination with a t½ of 12 h Cmax 66 72 78 84 90 96 102 108 114 120 Therapeutic window Fig. 2.6.2  Plateau principle. Plasma concentrations of a hypothetical drug after a single oral dose (black line) versus repeated dosing (red line). A smaller dose is used for repeated dosing, but the half-​life is the same. Both approaches give levels within the ‘therapeutic window’, but only with repeat dosing are the levels within the window throughout the dose interval. 78 section 2  Background to medicine therapeutic window. Dosing then resorts to the usual maintenance dose, which is chosen to replace the drug eliminated in each subse- quent dose interval. Absorption and systemic availability Systemic availability, introduced earlier in this chapter, is meas- ured by plotting the blood concentration at various times after dosing. Typical curves are shown in Fig. 2.6.3 for three different formulations (containing the same dose) of a hypothetical drug. The area under each of these curves (abbreviated to AUC) reflects the bioavailability of the formulation and it is actually the same in this simulation. However, it is clear that they produce very different profiles in terms of the maximum concentration (Cmax) and the time to reach Cmax after dosing (tmax). This reflects the decreasing rate of absorption from left to right for the three formulations, and hence the duration over which absorption occurs. Hence the fastest absorption gives the highest Cmax, but this would cause transient toxicity as crosses into the toxic range for this drug. The middle curve takes the drug level above the therapeutic threshold for as long as the first curve but remains below the toxicity threshold. The slowest rate of absorption here gives a Cmax that is too low because the drug concentration is subtherapeutic throughout. Choosing the optimum concentration profile, and hence the for- mulation, depends on the therapeutic effect being sought. Hence, a rapid (instant release) formulation of nifedipine would be needed to treat an episode of angina and is typified by the left-​hand curve in Fig. 2.6.3. The transient flushing and headache this would cause would be an acceptable side effect. However, when drugs are intended to be given repeatedly, and the therapeutic effect is related to a steady state concentration, the flatter profile of the slowest ab- sorption formulations are preferred (right-​hand curve in Fig. 2.6.3). So, for chronic angina prophylaxis, a slow release formulation of ni- fedipine would be preferred to achieve the smoothest profile of drug level in the blood after dosing. The rate of absorption Gastrointestinal motility Drugs are absorbed mainly in the upper small intestine, so altered gastric emptying can affect absorption. For example, in migraine, the rate of absorption of analgesics is reduced because of reduced gastric motility, delaying the response to an oral analgesic. In fact, the shift in the drug concentration–​time profile is similar to the rightward shift produced by reformulation discussed in Fig. 2.6.3. This delay can be reduced by giving metoclopramide to accel- erate gastric emptying. Erythromycin can have a similar effect on gastric emptying. The converse of delayed emptying occurs with antimuscarinics or older antihistamines (e.g. promethazine and di- phenhydramine) or tricyclic antidepressants (e.g. amitriptyline) that have substantial antimuscarinic actions. This can be particularly im- portant in drug overdoses involving these drugs. If a drug dissolves more slowly than the stomach empties, in- creased gastrointestinal motility reduces both the rate and extent of absorption. This effect is exaggerated for sustained release formula- tions, such that in severe diarrhoea enteric-​coated formulations can pass through the gut intact. Proximal ileostomies are also a problem for these formulations. 1 0 10 9 8 7 6 5 4 3 2 Drug concentration in the plasma 0 2 4 6 8 10 12 Time after dosing (h) Therapeutic threshold Toxicity threshold 14 16 18 20 22 24 [h] [mg/litre] Cmax Cmax Cmax Fig. 2.6.3  Effect of slowed absorption on drug kinetics. Plasma concentration profiles for a hypothetical drug formulated to progressively slow absorption (slows left to right). As the absorption is slowed, the peak concentration (Cmax) falls and is delayed. 2.6  Principles of clinical pharmacology and drug therapy 79 Malabsorption Drug absorption is often impaired in patients with malabsorption, but not always. For example, the absorption of propranolol and some antibiotics (co-​trimoxazole and cefalexin) is increased in pa- tients with coeliac disease. Untreated coeliac disease patients have a higher intraluminal pH that can affect the ionization of a drug, while the villous atrophy removes CYP 3A4 that can significantly reduce the first-​pass metabolism of some drugs (e.g. simvastatin). Digoxin, however, is less well absorbed from tablets in patients with coeliac disease, radiation-​induced enteritis, and other gastrointestinal dis- ease, and thyroxine absorption is impaired in coeliac disease. Food Food alters the rate and extent of absorption of many drugs. For ex- ample, eggs impair iron absorption, and milk (or any calcium, alu- minium, magnesium, or ferrous salt) impairs tetracycline absorption by the formation of an insoluble chelate. Such effects are rarely im- portant clinically, unless the drug has a very limited bioavailability, when the effect can be highly significant. Hence, bisphosphonates are only bioavailable in the complete absence of food. Grapefruit juice, and some other citrus juices such as Seville orange and pomelo, can markedly increase the bioavailability of some drugs (Fig. 2.6.4). These include antihistamines (terfenadine), simvastatin, calcium channel blockers (e.g. felodipine, nifedipine, and verapamil), im- munosuppressive drugs (ciclosporin A, sirolimus, and tacrolimus), and PDE5 inhibitors (e.g. sildenafil and vardenafil). Coadministered drugs Drugs affecting gastric emptying can affect absorption of coad­ ministered drugs as mentioned earlier. Anion exchange resins (bile acid sequestrants) used for lipid management avidly bind certain drugs. The resulting reduction in drug absorption is clinically sig- nificant for warfarin, thyroxine, thiazides, and digoxin when taken with cholestyramine. The newer resin, colesevalam, also affects ciclosporin and sulphonylurea absorption. Hence it is usually re- commended these resins are taken at least an hour after the drugs or the interacting drugs taken at least 4 hours after the resin. This strategy can of course be put to good effect in drug overdose, where absorption can often be prevented by adsorbing the drug to swal- lowed activated charcoal (see Chapter 10.4.1). First-​pass metabolism First-​pass metabolism is metabolism that occurs before the drug en- ters the systemic circulation. This may happen in the gut lumen itself (e.g. with benzylpenicillin or insulin), the gut wall (tyramine, chlor- promazine), the lungs (various amines and inhaled glucocorticoids), and the liver, which is the most important site. Many drugs undergo first-​pass metabolism in the liver. It is substantial for certain drugs, including cocaine, desipramine, lig- nocaine, pethidine, morphine, nicotine, nitroglycerin (and other organic nitrates), propranolol, and verapamil. When first-​pass metabolism results in the formation of com- pounds with less pharmacological activity than the parent com- pound, the drug’s efficacy is lower after oral than intravenous administration. In some cases, metabolism is so extensive that oral therapy is impossible. However, such a drug given sublingually, rec- tally, or transdermally, can bypass the liver (see earlier paragraphs). Distribution Protein binding Many drugs are bound to circulating proteins, usually albumin (acidic drugs), but also globulins (hormones), lipoproteins, and α1-​acid glycoprotein (basic drugs). Only free drugs (i.e. not protein bound) can bind to cell surface receptors, cross cell membranes to access intracellular drug targets, or distribute to other tissues where they may be metabolized and excreted (e.g. by the kidney). Thus, changes in protein binding can sometimes cause changes in drug distribution. However, such changes are only important if the drug is extensively bound to plasma proteins (>90%) and not widely dis- tributed to other tissues. Phenytoin and warfarin are the two drugs most frequently affected. The binding of drugs to albumin may be changed in renal im- pairment (the explanation for this is unknown), hypoalbuminaemia (drug binding is reduced when the plasma albumin concentration falls below 25 g/​litre), the last trimester of pregnancy (during which protein binding is reduced partly because of hypoalbuminaemia), and displacement by other drugs. Because α1-​acid glycoprotein is an acute-​phase protein, its levels are affected by trauma, surgery, inflammatory diseases (e.g. Crohn’s disease and rheumatoid arthritis), and infections. Hence, the binding of quinine is increased in malaria. Tissue distribution The extent of drug distribution to the tissues of the body varies widely. The drugs with the widest distributions will have the lar- gest volumes of distribution (see earlier paragraphs). Some water-​ soluble drugs are limited to one or more of the water compartments (vascular and extra-​ and intracellular), while others are bound 30 25 20 15 10 5 0 0 2 4 6 8 10 12 24 Time (h) Simvastatin (ng/ml) Fig. 2.6.4  Effect of grapefruit juice (closed symbols) or water (open symbols) taken daily for 3 days on the plasma levels after a single 40 mg dose simvastatin. The effect relies on furanocoumarin components of grapefruit juice that directly reduce P450 protein expression (CYP 3A4 principally) in the gut wall. From Lilja JL, Neuvonen M, Neuvonen PJ (2004). Effects of regular consumption of grapefruit juice on the pharmacokinetics of simvastatin. Br J Clin Pharmacol, 58, 56–​60, Copyright © 2004, John Wiley and Sons. 80 section 2  Background to medicine extensively in tissues. The distribution of drugs to different tissues is influenced by plasma-​protein binding, specific receptor sites in tissues (e.g. the binding of cardiac glycosides to the Na+,K+-​ATPase in cell membranes throughout the body), regional blood flow (well-​ perfused organs, such as the heart, kidneys, and liver accumulate drugs more than poorly perfused organs, such as fat and bone), lipid solubility (lipid-​soluble drugs enter tissues more readily than charged compounds with poor lipid solubility), and active transport across cell membranes (adrenergic neuron-​blocking drugs such as guanethidine accumulate in noradrenergic nerve terminals through the noradrenaline transporter, uptake-​1) or active transport across epithelia. The importance of epithelial transport has been appreciated with the discovery of specific transporters. These fall into two families: ei- ther solute carriers (SLC) or ATP-​binding cassette (ABC) trans- porters. P-​glycoprotein is the best understood ABC transporter and was first identified in tumour cells where it confers multidrug resist- ance (MDR) by promoting the extrusion of many structurally unre- lated anticancer drugs. But this drug efflux pump is widely expressed in normal tissues such as small intestine (brush border membrane of enterocytes), kidney (brush border membrane of proximal tubule cells), liver (canalicular membrane of the hepatocytes), and at the blood–​brain barrier (capillary endothelial cells). The latter explains, for example, why drugs such as ivermectin, digoxin, loperamide, and domperidone are rapidly pumped out of the cerebrospinal fluid, so effectively excluding them from the brain. In some diseases, drug distribution is altered for unknown reasons. In renal failure, for example, the distribution of drugs such as insulin and digoxin is decreased as well as their protein binding. In cardiac failure, the distribution of some antiarrhythmic drugs, such as disopyramide, is also reduced. Obesity and malnutrition influence the distribution of drugs that are highly fat soluble (e.g. inhalational anaesthetics). Metabolism Most drugs are metabolized in the liver. Examples of other sites are suxamethonium in the plasma; insulin and vitamin D in the kid- neys; and acetylcholine and catecholamines at their corresponding synapses and nerve endings. Drug metabolism occurs in two phases: • Phase I  metabolism involves chemical alteration of the basic structure of the drug (e.g. by oxidation, reduction, or hydrolysis). This results in free groups such as –​OH or –​NH2 that are con- jugated in phase II. Many phase I reactions are catalysed by en- zymes from the P450 family (see following paragraphs). Examples of phase I reactions include the N-​demethylation of diazepam to desmethyldiazepam, an active metabolite with a long duration of action, and the oxidation of ethanol to acetaldehyde. • Phase II metabolism involves conjugation (e.g. by sulphation, glucuronidation, methylation, or acetylation). Some drugs are conjugated without prior phase I  transformation, while others undergo phase I metabolism before conjugation can occur. The products of conjugation are more water soluble and therefore more easily cleared by the kidney or biliary system. They are usually, although not always, pharmacologically inactive (e.g. morphine-​6-​glucuronide is an active metabolite of morphine that accumulates in renal failure). Examples of phase II reactions are the glucuronidation of paracetamol and the N-​acetylation of hydralazine and procainamide. A conjugated product may some- times be further metabolized. For example, oestrogens are ex- creted via the bile, deconjugated in the gut by bacteria, and then reabsorbed (so-​called enterohepatic recycling). The end result of drug metabolism is inactivation, but during the process, compounds with pharmacological activity are often formed. Alternatively, inactive drugs (prodrugs) may be metabol- ized to active ones. The antiparkinsonian drug l-​dopa, for example, is a typical prodrug. It has no action on dopamine receptors until it enters the central nervous system and is metabolized to dopamine. Other examples include diamorphine and codeine (which are rap- idly metabolized to diacetylmorphine and morphine, respectively) and many angiotensin-​converting enzyme (ACE) inhibitors (e.g. enalapril is converted to its active form enalaprilat). Some drugs also have toxic metabolites. For example, norpethidine is a metabolite of pethidine that can cause fits, and acrolein is a bladder irritant formed from cyclophosphamide. The normally minor metabolic pathway for paracetamol forming N-​acetyl-​p-​ benzoquinone imine (NABQI) is important in overdose because of the saturation of other detoxification pathways; this metabolite causes the hepatotoxicity that follows paracetamol overdose. Most of the phase I reactions in the liver are carried out by en- zymes of the large cytochrome P450 superfamily of proteins. They are all prefixed by ‘CYP’ and tens of different isoforms are recog- nized in the human liver. The commonest is the CYP3A4 isoform through which over 50% of marketed drugs are metabolized. Others such as CYP2D6 have special importance because they are poly- morphic, that is, they are encoded by CYP genes that have common variants that affect enzyme activity. The 10 major isoforms and some of the commoner drugs that are substrates for them are shown in Table 2.6.1. Certain drugs also selectively block CYP isoforms, which re- duce the elimination of drugs through these enzymes. Examples of these are cimetidine (1A2 and 2D6), amiodarone (2C9, 2D6, and 3A4), fluoxetine (2D6), ketoconazole (3A4 and 2C19), and ritonavir (3A4). Conversely, drug elimination through some CYP pathways is enhanced by drugs that act as inducers that increase enzyme activity by directly activating transcription of the corres- ponding CYP genes. The classical inducers include phenobarbitone (2B6, 3A4), rifampicin (2B6, 2C9, and 3A4), and antiepileptics such as phenytoin and carbamazepine (both 3A4). Tobacco and ethanol are also inducers, albeit for minor CYP isoforms (1A2 and 2E1, respectively). A full list of CYP inhibitors and inducers (so-called Flockhart table) is available at this link: https://drug-interactions. medicine.iu.edu/Main-Table.aspx Other factors that affect hepatic drug metabolism are hep- atic blood flow (for drugs that are rapidly cleared), liver disease (only important in extensive liver disease or when there is ar- teriovenous shunting), and age. The metabolism of some drugs is impaired in old people and in babies younger than about 6 months, particularly premature babies. In both cases, this is due to reduced activity of the microsomal enzymes that includes CYP and non​CYP enzymes. For example, in neonates uridine diphosphoglucuronosyl transferase (EC 2.4.1.7), which conju- gates chloramphenicol, is relatively inactive; neonates eliminate chloramphenicol slowly, and may suffer peripheral circulatory 2.6  Principles of clinical pharmacology and drug therapy 81 collapse (the ‘grey syndrome’) when given the drug in weight-​ related doses that do not harm adults. Excretion The kidney is a major route of drug excretion. Other, usually minor, routes include the lungs (important for paraldehyde and gaseous anaesthetic gases), breast milk, sweat, tears, and genital secretions (alarming if the orange-​red discoloration caused by rifampicin is not mentioned), saliva, and bile. Excretion in bile can be prominent for some drugs and this can lead to the reabsorption of some of the excreted compounds from the gut—​a process called enterohepatic recycling. This recycling affects drugs such as oestradiol, rifampicin, and tetracyclines, and can substantially prolong their duration in the body. Drugs excreted in bile as water-​soluble conjugates (after phase II metabolism—​see earlier) may also be deconjugated by gut bacteria to release parent drug so facilitating enterohepatic recycling (e.g. chloramphenicol, digitoxin, indomethacin, and valproic acid). Renal excretion of drugs involves three separate processes: glomerular filtration, passive tubular reabsorption, and active tubular secretion (often through an SLC transporter). Thus: Total Table 2.6.1  The main cytochrome P450 isoforms (in bold) and their common substrate drugs 1A2 2C9 2D6 3A4,5,7 Clozapine NSAIDs β-​Blockers Macrolide antibiotics Imipramine Diclofenac Metoprolol Clarithromycin Mexiletine Ibuprofen Propafenone Erythromycin Naproxen Naproxen Timolol NOT azithromycin Theophylline Celecoxib Antidepressants Antiarrhythmics 2B6 Sulphonylureas Amitriptyline Quinidine Bupropion Glibenclamide Clomipramine Benzodiazepines Cyclophosphamide Glipizide Desipramine Diazepam Efavirenz Tolbutamide Duloxetine Midazolam Ifosfamide Angiotensin II blockers Fluoxetine Triazolam Methadone Irbesartan Imipramine Immune modulators 2E1 Losartan Paroxetine Ciclosporin Anaesthetics NOT valsartan Venlafaxine Tacrolimus Enflurane NOT candesartan Antipsychotics HIV protease inhibitors Halothane Phenytoin Haloperidol Indinavir Isoflurane Sulfamethoxazole Risperidone Ritonavir Methoxyflurane Tamoxifen Opiates Saquinavir Sevoflurane Warfarin Codeine Antihistamines 2C19 oxycodone Astemizole Paracetamol Proton pump inhibitors Tramadol Chlorpheniramine Ethanol Omeprazole Dextromethorphan Calcium channel blockers 2C8 lansoprazole Flecainide Amlodipine Paclitaxel Pantoprazole Mexiletine Diltiazem Torsemide Antiepileptics Ondansetron Felodipine Amodiaquine Diazepam Tamoxifen Nifedipine Repaglinide Phenytoin Verapamil Phenobarbitone Statins Atorvastatin Simvastatin NOT pravastatin Others Amitriptyline Citalopram Clomipramine Clopidogrel Cyclophosphamide proguanil NOT rosuvastatin Imatinib Methadone Pimozide Quinine Sildenafil Source: http://​www.medicine.iupui.edu/​CLINPHARM/​ddis/​clinical-​table 82 section 2  Background to medicine renal clearance = Clearance by filtration + Clearance by secretion − Retention by reabsorption. If a drug is mainly metabolized to inactive compounds, renal func- tion will have little impact on its elimination. However, if the drug itself or an active metabolite is excreted unchanged via the kidneys, changes in renal function will substantially affect its elimination. Glomerular filtration All drugs are filtered at the renal glomerulus, although molecules larger than 2 kDa are not freely filtered (e.g. insulin concentration in the ultrafiltrate is 0.89 × plasma) and proteins as large as albumin (69 kDa) are not filtered at all by a normal glomerulus. The extent of filtration is directly proportional to the glomerular filtration rate (GFR = 120 ml/​min) and to the fraction of unbound drug in the plasma (fu), that is, rate of clearance by filtration = fu × GFR. If the total renal clearance of a drug is equal to fu × GFR, then it is cleared principally by filtration (it could be affected by the other two mechanisms, secretion and reabsorption, but they would have to cancel each other out). Examples of drugs whose clearance is similar to the GFR (after correction for protein binding) are digoxin, gen- tamicin, vancomycin, methotrexate, and ethambutol. As creatinine is cleared mainly by filtration, the creatinine clearance is useful in estimating the clearance rates of these drugs; although there is some secretion, which explains its tendency to overestimate GFR. Passive tubular reabsorption Drugs are passively reabsorbed by the renal tubules. The elimination of drugs with very low rates of renal clearance (i.e. approaching urine flow rate, or about 1–​2 ml/​min) will be significantly affected by changes in urine flow rate (because a doubling of flow rate will in- crease their rate of clearance by 1–​2 ml/​min, i.e. twofold). However, for weak acids and weak bases that are non​polar in their union- ized form, the main factor affecting passive reabsorption is the pH of the renal tubular fluid (which can vary between 4.5 and 8), be- cause the extent of their ionization (and therefore of their passive reabsorption) depends on the urine pH in relation to the pKa of the drug. For example, in an alkaline urine, weak acids with a pKa below 7.5, such as salicylate (pKa 3), are more highly ionized, and there- fore less well reabsorbed. The reverse is true for weak bases with a pKa greater than 7.5, such as methamphetamine (pKa 10), whose re- absorption is reduced, and whose clearance is therefore enhanced, by an acid urine. These principles are put to good use in the treat- ment of overdose (see Chapter 10.4.1). Renal failure alters passive reabsorption indirectly, by changing both urine flow rate and pH. Active tubular secretion If the renal clearance of a drug exceeds GFR then there must be active secretion of the drug into the renal tubule. The active trans- port of organic anions and cations is dependent on specific trans- porter proteins (SLC and ABC transporters) in the cells lining the proximal tubule (detailed in https://www.ncbi.nlm.nih.gov/ pubmed/28210973). There are broadly two functional groups: or- ganic anion and organic cation transporters. The organic anion transporters are responsible for the secretion of many β-​lactam antibiotics, NSAIDs, antivirals, and ACE inhibitors, as well as acidic glutathione and glucuronide-​conjugated drug metabolites. Probenecid is a generic inhibitor of anion transport and anionic transport capacity can be measured by p-​aminohippuric acid (PAH) excretion. Substrates for the organic cation transporters include en- dogenous cations (e.g. guanidine, choline, N-​methylnicotinamide, and monoamine neurotransmitters), cationic toxins (e.g. 1-​methyl-​ 4-​phenylpyridinium, MPP+), and cationic drugs (e.g. cimetidine, procainamide, quinidine, vecuronium, cardiac glycosides). Both ci- metidine and trimethoprim can affect cationic transport. Blockade of kidney drug transporters explains several drug–​drug interactions (see following paragraphs). Nonlinear kinetics Most drugs show linear kinetics (i.e. an increase in dose will cause a similar-​fold increase in plasma concentration). There are excep- tions, however, and when a drug shows nonlinear behaviour (e.g. a twofold increase in dose produces a tenfold increase in plasma level) it implies that some aspect of the drug’s elimination pathway has saturated. Some tricyclic antidepressants and the selective serotonin reuptake inhibitor (SSRI) paroxetine show this behaviour. But the best-​known example is phenytoin (see Fig. 2.6.5), whose oxidation through CYP 2C19 in the liver is saturated within the therapeutic range. Inspection of the curves in Fig. 2.6.5 shows how small in- crements in the dose of phenytoin can produce large increments in its plasma level and hence neurotoxicity. In fact, many drugs taken in overdose saturate their elimination pathways, and this is clearly shown by drugs such as ethanol, salicylate, and paracetamol. The pharmacodynamic process Pharmacodynamics addresses the question ‘How does a drug pro- duce its pharmacological effects?’ Of course, this concerns both ad- verse as well as therapeutic effects. Drugs have many different mechanisms of action but the overwhelming majority are mediated through receptors, which they either block or activate. 0 0 100 200 300 Daily dose of PHENYTOIN (mg) 400 500 600 25 50 75 Serum [PHENYTOIN] (μmol/litre) 100 125 150 175 200 Fig. 2.6.5  Non​linear kinetics. The plasma level achieved after repeated phenytoin dosing is shown versus the corresponding phenytoin dose for three hypothetical patients. The plots are not straight lines but rise steeply within the therapeutic range (between the dashed lines) as the CYP2C19 metabolic pathway for phenytoin saturates. 2.6  Principles of clinical pharmacology and drug therapy 83 Actions via direct effects on receptors Receptors are proteins situated either in cell membranes or within the cellular cytoplasm. For each receptor type there is usually an en- dogenous molecule (or ligand) that binds to the receptor. There are broadly four types of receptors:  G protein-​coupled receptors, ion channels, nuclear receptors, and tyrosine kinase receptors. Drug mol- ecules target the same binding site on the receptor as the endogenous ligand and may activate the receptor, block the binding of the en- dogenous ligand, or a have mixture of both effects. These actions are referred to as agonism, antagonism, and partial agonism, respectively. This terminology is made clearer by considering these actions in terms of the µ opioid receptor. Morphine acts as an agonist at µ opioid receptors to cause analgesia. This µ receptor activation mimics the ac- tion of endogenous analgesic peptides such as enkephalin and endor- phin. Compare this with naloxone, which behaves as an antagonist at the same receptor and blocks its activation by either morphine or other opiates. Importantly, in the absence of an agonist, a pure antag- onist such as naloxone has no pharmacological action through the µ opiate receptor. This contrasts with buprenorphine that behaves as a partial agonist at the µ receptor. Hence, in the absence of an agonist such as morphine, buprenorphine is able to activate the µ receptor, but it is not able to produce an analgesic effect as large as morphine (this explains the ‘ceiling’ effect seen clinically). Yet, in the presence of a full agonist such as morphine, buprenorphine can block µ receptor activa- tion (this explains why buprenorphine can trigger opiate withdrawal). Receptor subtypes In many cases, a receptor will have subtypes, and drugs may have sub- type selectivity. For example, there at least three subtypes of opioid receptors: µ, κ, and δ. The µ and δ are both involved in analgesia, gastrointestinal motility, and respiratory depression, and κ is involved in analgesia, sedation, and miosis. These receptors are variably distrib- uted in the nervous system. Most opiates act at µ receptors, but none is completely selective and they may act at other subtypes. Long-​term effects of drugs at receptors During long-​term therapy, the effects of a drug may be altered by adaptive responses, usually accompanied by either increases (‘upregulation’) or decreases (‘downregulation’) in receptor num- bers. Such changes may explain both the therapeutic and adverse effects of drugs. Examples include: • The therapeutic response to antidepressants (e.g. an SSRI) that may involve changes in receptors within the central nervous system secondary to the increased synaptic levels of neurotransmitter caused by these drugs. This probably explains why the therapeutic response to antidepressants takes a few weeks to emerge. • The way in which the response to l-​dopa in Parkinson’s disease changes during long-​term administration (e.g. producing the ‘on–​off’ effect). • Withdrawal syndromes that may occur because long-​term changes become unopposed when the drug is withdrawn (e.g. after the long-​ term use of opiates or benzodiazepines). Actions via direct effects on second messengers When an agonist stimulates a membrane-​bound receptor, its effect is usually signalled in one of two ways:  either through a so-​called second messenger (e.g. cAMP, diacylglycerol, or inositol trisphosphate) or by changing the activity of an ion channel linked to the receptor. Some drugs may act by affecting second messen- gers directly. For example, some drugs block phosphodiesterases that normally metabolize the second messengers cAMP and cGMP. Several types of phosphodiesterases regulate cAMP levels and theo- phylline and caffeine work by inhibiting them non​specifically. In contrast, type 5 phosphodiesterase (EC 3.1.4.17) specifically de- grades cGMP in smooth muscle and is selectively inhibited by drugs such as sildenafil and vardenafil. Actions via indirect alterations of the effects of endogenous receptor ligands Drugs may oppose the physiological effects of the endogenous ligand. For example, glucagon is a physiological antagonist of the actions of insulin; hence its use to treat hypoglycaemia. Other drugs act indirectly on a receptor by altering the levels of the endogenous ligand for that receptor. They can do this in a number of ways: Increase in endogenous release Some drugs enhance release of the endogenous ligand. For example, amphetamine and tyramine increase the release of dopamine and noradrenaline, respectively, from nerve terminals. Prevention of endogenous release or synthesis The release of many neurotransmitters from nerve terminals is regulated by inhibitory receptors activated by the neurotransmitter itself. Hence, α2-​receptors on noradrenergic nerve terminals re- duce noradrenaline release into the synapse. Clonidine and α-​ methyldopa reduce the release of noradrenaline by activating these receptors (α-​methyldopa actually works through its metabolite α-​methylnoradrenaline) and is the basis of their antihypertensive action. Drugs can also reduce the production of the endogenous ligand rather than affect release of a preformed store of the ligand. For ex- ample, angiotensin II (which activates the angiotensin II receptor to cause vasoconstriction and aldosterone release) is produced enzymatically from angiotensin I. ACE inhibitors block the con- verting enzyme (cleaving angiotensin I to angiotensin II) and renin inhibitors (e.g. aliskiren) block the enzymatic cleavage of angio- tensin I from angiotensinogen. Inhibition of endogenous reuptake Many neurotransmitters are pumped back into their corresponding nerve terminals. The transporters that carry out this reuptake are targeted by certain psychoactive drugs. Many antidepressants, for example, block the reuptake of noradrenaline (such as amitrip- tyline), or 5-​hydroxytryptamine (serotonin, 5HT) (such as fluox- etine), or both (such as venlafaxine). Reuptake of γ-​aminobutyric acid (GABA) is blocked by the anticonvulsant tiagabine. Inhibition of endogenous metabolism Drugs can also increase the effect of an endogenous ligand by blocking its metabolism. Examples of these include: cholinesterase inhibitors for acetylcholine (e.g. neostigmine), vigabatrin for GABA, and monoamine oxidase inhibitors (MAO) inhibitors for catechol- amines and 5HT (e.g. tranylcypromine). 84 section 2  Background to medicine The MAO inhibitors have some selectivity for the two forms of MAO (types A and B), which explains their use both as antidepres- sants and antiparkinsonian agents. For example, selegiline select- ively inhibits monoamine oxidase type B. This results in blockade of dopamine metabolism in the central nervous system and enhances the action of l-​dopa in parkinsonism. However, because MAO in the gut and liver is predominantly type A, selegiline does not pro- duce the ‘cheese reaction’ (due to tyramine and other amines, see following paragraphs) seen with non​selective MAO inhibitors (e.g. tranylcypromine). In contrast, the antidepressant moclobemide se- lectively inhibits MAO A. Moclobemide is also unusual in being a reversible inhibitor of MAO. All other MAO inhibitors block MAO irreversibly, so that recovery requires synthesis of new enzyme. This explains the slow offset of their action (7–​10 days) and why moclobemide does not produce a ‘cheese reaction’. Actions by inhibiting the movement of ions Since cations (such as sodium, potassium, and calcium) and anions (such as chloride and iodide) have so many important roles in the maintenance of normal cellular function, inhibition of their trans- port is an important mechanism of drug action. This movement of cations and anions across membranes can be either through trans- porters or channels. Diuretics Most diuretics reduce sodium reabsorption in the renal tubules by targeting specific transporters or ion channels. Hence, loop diuretics (e.g. furosemide and bumetanide) block the Na–​K–​Cl transporter (NKCC2) in the ascending limb of Henle’s loop. Thiazide diuretics inhibit Na–​Cl cotransporter (NCC) in the distal convoluted tubule and potassium-​sparing diuretics (amiloride and triamterene) block the epithelial sodium channels in the collecting ducts (ENaC). Calcium channel antagonists The calcium antagonists, such as verapamil, diltiazem, and the dihydropyridines (e.g. nifedipine), inhibit the transport of calcium via voltage-​operated calcium channels (of the l-​type). They are able to exert different effects in different tissues (e.g. verapamil slows atrio- ventricular nodal conduction in the heart but nifedipine does not) because of separate binding sites within the l-​type calcium channel. Drugs acting on potassium channels The potassium permeability of cell membranes affects their mem- brane potential and is controlled by potassium (K) channels. They are a very large and diverse group of ion channels and drugs may either open or close (or block them). Drugs that open potassium channels include vascular smooth muscle relaxants, such as minoxidil and nicorandil (targeting KATP channels). The same KATP channels are closed by sulphonylureas such as gliclazide. Of the many potassium channels in the heart, the iKr channel plays a key role as it is involved in repolarization of the myocyte. Blockade of the iKr channels explains why many drugs cause prolongation of the QT interval and are hence arrhythmogenic (e.g. terfenadine). Drugs acting on chloride channels Intracellular chloride plays a key role in neuronal excitability. Hence, the inhibitory effect of the neurotransmitter GABA on neurons is due to chloride entry into neurons through GABA-​activated chloride channels. Progabide behaves as a GABA agonist and GABA-​activated chloride currents are increased by benzodiazepines and barbiturates. The antiseizure activity of these drugs is directly attributed the neuronal hyperpolarization that results. Some anaesthetic agents also probably rely on effects on chloride currents for their general anaesthetic effect. They form a diverse group of agents, such as the halogenated hydrocarbons (halothane, trichloroethylene), and non​halogenated agents (nitrous oxide, ether, cyclopropane), and were previously thought to influence neuronal excitability by non-​specific effects on the lipid phase of the cell membrane, changing its biophysical properties and hence the kinetics of ion channels. More recent evidence has shown that they specifically affect currents through GABA/​glycine-​coupled ion channels. Others (such as ketamine and nitrous oxide) antagonize the excitatory NMDA-​coupled ion channel. Actions via enzyme inhibition Drugs often act by directly inhibiting enzymes. Inhibitors of cholin- esterase and MAO (see earlier paragraphs), for example, have been in therapeutic use for more than half a century. The metabolism of purines involves the oxidation of xanthine and hypoxanthine to uric acid by xanthine oxidase. Hence, blockade of this enzyme with allopurinol prevents excessive uric acid produc- tion during tumour lysis and reduces the frequency of gout. The cardiac glycosides act by inhibiting the Na/​K pump. This changes the distribution of sodium across excitable membranes es- pecially in the heart: and the secondary change in intracellular cal- cium causes the cells’ contractility to increase. Other drugs that act via enzyme inhibition include warfarin (vitamin K epoxide reductase), aspirin, and other NSAIDs, targeting the cyclo-​oxygenase (COX) enzymes involved in prostaglandin synthesis, ACE inhibitors, disulfiram (alcohol dehydrogenase), some anticancer drugs such as cytarabine (DNA polymerase), and imatinib (chronic myelogenous leukaemia-​specific tyrosine kinase), and some anti-​infective drugs (bacterial or viral enzymes; e.g. tri- methoprim inhibits bacterial dihydrofolate reductase, the quin- olones inhibit bacterial DNA gyrase, and zidovudine and didanosine inhibit the reverse transcriptase of HIV). Danazol and stanozolol are examples of drugs that inhibit an en- zyme indirectly—​they stimulate the production of an inhibitor of C1 esterase and are used to treat hereditary angio-​oedema, in which there is reduced plasma activity of the inhibitor. Actions via enzyme activation or direct enzymatic activity Some drugs either activate enzymes or are enzymes themselves. The clotting and fibrinolytic factors are enzymes, and certain drugs that act on clotting and fibrinolysis do so by increasing their activity. Heparin acts by activating antithrombin III. The thrombolytic drugs streptokinase, alteplase, and tenecteplase activate plasminogen. Deficiencies of clotting factors can be treated by replacing defi- cient enzymes of the clotting pathway (e.g. factor VIII in patients with haemophilia and fresh frozen plasma in warfarin toxicity). Pancreatic enzymes are used in treating malabsorption in patients with chronic pancreatic insufficiency. Rasburicase is used to prevent the hyperuricaemia that accom- panies the tumour lysis syndrome during the treatment of acute 2.6  Principles of clinical pharmacology and drug therapy 85 leukaemias and lymphomas. It is recombinant urate acid oxidase that directly catalyses the breakdown of uric acid to the more soluble allantoin. Actions via other miscellaneous effects Chelating agents Drugs that chelate metals can be used to hasten their removal from the body (see Chapter  10.4.1). Calcium sodium edetate chelates many divalent and trivalent metals and is used to treat poisoning, particularly with lead. Dimercaprol chelates some heavy metals and is used to treat mercury and gold poisoning. Desferrioxamine chelates iron and is used in treating iron poisoning and the iron overload that occurs with repeated blood transfusion (as in thal- assaemia). Penicillamine chelates copper and is used in treating hepatolenticular degeneration (Wilson’s disease); it is also used to complex cystine and thus prevent renal damage in cystinuria. Osmotic diuretics Mannitol is freely filtered at the glomerulus but the renal tubules reabsorb relatively small amounts. It therefore increases the concen- tration of osmotically active solute in the tubular fluid, the subse- quent influx of water massively augments urine flow rates. Replacement of vitamins and minerals Some drugs are used simply to replace deficiencies (e.g. ferrous salts in iron deficiency anaemia and hydroxocobalamin (vitamin B12) in vitamin B12 deficiency). Stereoisomerism and drug action Stereoisomerism (chirality) of organic compounds is due to asym- metry in one or more of their atoms (usually carbon), resulting in two or more three-​dimensional structures (enantiomers) that cannot be superimposed on each other. Several different terminologies are used to describe the two chiral partners: R and S (from the Latin rectus = right and sinister = left), (+) -​d and (–​)-​l and d and l (from the Latin dexter = right and laevus = left). Examples of drug enantiomers are R-​warfarin and S-​warfarin, d-​glucose (dextrose) and l-​glucose (laevulose), and d-​ propranolol and l-​propranolol. Of all synthetic drugs used in clinical practice, almost 50% are chiral and about 90% of those are marketed in their racemic form (i.e. as an equal mixture of the two enantiomers). Examples include d,l-​ propranolol and R,S-​warfarin. Naproxen is one of the few examples of a synthetic compound that is marketed as a single enantiomer (hence an enantiopure drug). In contrast, naturally occurring and semisynthetic compounds are almost all chiral and almost all are marketed as a single isomer. Examples include d-​glucose (dextrose) and the naturally occurring amino acids (e.g. l-​dopa), l-​thyroxine, and l-​noradrenaline. Enantiomers often have different pharmacological actions. For example, l-​propranolol is a β-​blocker, whereas d-​propranolol has membrane-​stabilizing activity like that of local anaesthetics; l-​ sotalol is a β-​blocker, whereas d-​sotalol has antiarrhythmic effects like those of amiodarone. Sometimes these differences between the pharmacology of en- antiomers can separate the therapeutic and adverse effects of a ra- cemate. For example, R-​thalidomide is responsible for the sedative effects of R,S-​thalidomide, while the teratogenic effect resides in the S enantiomer. However, this cannot be exploited clinically to limit thalidomide toxicity, because the two enantiomers spontaneously interconvert in the body. This so-​called ‘chiral inversion’ also occurs in some NSAIDs such as ibuprofen. The interaction of a drug with its target (such as a receptor or en- zyme) is often stereo-​selective, so enantiomers may show marked differences in potency. For example, S-​warfarin is some five times more potent an anticoagulant than R-​warfarin, and S-​citalopram is 30 times more potent as an SSRI than R-​citalopram. In some cases, one enantiomer is completely inactive therapeutically (e.g. levofloxacin is the active antimicrobial enantiomer in ofloxacin with dextrofloxacin being completely inactive). The enantiomers may have different pharmacokinetics. For ex- ample, the half-​lives of S-​warfarin and R-​warfarin average around 30 h and 50 h and they are metabolized to 7-​hydroxywarfarin and warfarin alcohols, respectively. This is important in some drug interactions with warfarin, because drugs inhibiting warfarin me- tabolism (such as metronidazole) primarily affect the more potent enantiomer, S-​warfarin. Omeprazole provides a further example. The single enantiomer of omeprazole, S-​omeprazole, has almost twice the bioavailability of R-​omeprazole. Hence, esomeprazole (S-​omeprazole) gives plasma levels of omeprazole 70 to 90% higher than racemic omeprazole. However, both enantiomers are converted to the same active intermediate and since this lacks a chiral centre the enantiomers are equiactive if they achieve the same plasma level. Because of these caveats, despite the appeal of prescribing single-​ drug enantiomers, in most instances the single enantiomer has failed to show a substantial clinical advantage over the racemate. When comparing racemates to single enantiomers in clinical trials, it is critical they are compared at comparable doses of the active enantiomer. The therapeutic process The question associated with the therapeutic process is, ‘Is the pharmacological effect being translated into a therapeutic effect?’ Translation of pharmacological effect into therapeutic effect during short-​term therapy The short-​term therapeutic and toxic effects of drugs occur as a re- sult of the pharmacological actions discussed earlier. However, the translation of molecular and cellular pharmacological effects into the therapeutic or toxic effect is not a simple process, and involves several translational stages at different pharmacological and physio- logical levels. Take, for example, the action of salbutamol, a β2-​adrenoceptor agonist, in the treatment of asthma. Salbutamol stimulates bronchial β2-​adrenoceptors, and so increases the activity of adenylate cyclase; this is its pharmacological effect at the molecular level. The increase in adenylate cyclase activity raises the intracellular concentration of cAMP that leads to relaxation of the bronchial smooth muscle cells; this is the cellular effect. This leads to dilatation of the bronchioles, reduces the resistance to air flow in the bronchial tree, and improves gas exchange; this is the effect on tissue and whole organ function. 86 section 2  Background to medicine Finally, the patient feels less breathless and oxygen saturation may improve; this is the desired clinical effect. This analysis of the short-​term effects of a drug teaches us sev- eral things about drug action: how drug action may be modified; how therapeutic and adverse effects may be mediated via different pharmacological effects; the relation between the pharmacological effects of a drug and the rate of onset or duration of its therapeutic action; and drug–​disease interactions. How to modify drug action It is often possible to modify drug action positively or negatively. For example, a methylxanthine derivative, such as theophylline, which blocks cAMP breakdown by its inhibition of phosphodiesterase, should potentiate the action of salbutamol. This turns out to be both a beneficial and an adverse interaction—​beneficial because theo- phylline enhances the therapeutic action of salbutamol, adverse because it enhances the hypokalaemia (by stimulating Na+/​K+-​ ATPase) and tachycardia (by activating cardiac β-​adrenoceptors) that salbutamol causes. Different pharmacological actions may mediate therapeutic and adverse or other effects Some drugs have more than one molecular mechanism of action, and different therapeutic effects of a drug may result from different actions. For example, tetracycline acts against bacteria by interfering with their protein synthesis, but in acne, it helps by interfering with sebum production in sebaceous glands. Similarly, a therapeutic effect may be brought about by one pharmacological action and an adverse effect by another. For ex- ample, the antibacterial action of erythromycin is due to inhibition of bacterial ribosomal function, but its gastrointestinal side effects are due to activation of motilin receptors in the human gut. Other macrolides such as azithromycin are much weaker motilin mimetics and hence produce much less vomiting and diarrhoea. A drug may also produce therapeutic and adverse effects through the same molecular mechanism but in different tissues. For example, the inhibition of β2-​adrenoceptors within muscle spindles by pro- pranolol reduces benign essential tremor, but blocking the same receptors in the lung causes bronchoconstriction in susceptible in- dividuals and impairs glycogenolysis in the liver (which can delay a diabetic’s recovery from hypoglycaemia). The relation between the pharmacological actions of a drug and the rate of onset and duration of its effects The rate of onset of a drug’s effects depends not only on its pharma- cokinetics (i.e. the time it takes for a therapeutic concentration of drug to build up at its site of action), but also on how long it takes for the full pharmacodynamic sequence of events to unroll. In the case of salbutamol, the time between β2-​adrenoceptor stimulation and bronchodilatation is of the order of a few minutes. However, for other drugs, the sequence of events takes much longer. For ex- ample, corticosteroids bind to an intracellular receptor protein in the cytoplasm of target cells to form a steroid–​receptor complex. This complex translocates to the nucleus, where it binds to regu- latory sequences on target genes to cause RNA transcription. The induction of de novo protein synthesis by RNA transcription and translation takes several hours, explaining the slow onset of cortico- steroid effects. Similarly, the duration of action of a drug is related not only to the time it takes for the drug to be cleared from the body, but also to the duration of its pharmacological effects. For example, aspirin inhibits COX-​1 by acetylating a serine moiety at the active site of the enzyme. As platelets cannot synthesize new protein, the re- covery from aspirin requires the appearance of new platelets from the marrow. This process can take 7 to 10 days to restore peripheral platelet function. Drug–​disease interactions Because of the complex links between the pharmacological effects of a drug and its therapeutic or adverse effects, the pathophysiology of the disease being treated, or of other coincidental diseases, can vari- ously impact the way in which the pharmacological effect is trans- lated into a therapeutic effect. The use of digoxin in cardiac failure exemplifies this. Digoxin inhibits the activity of the membrane-​bound Na+,K+-​ATPase. Pump inhibition increases the intracellular concentration of so- dium, which secondarily raises intracellular calcium to produce a positive inotropic effect. The various steps in this process are af- fected by drug–​disease interactions. Hypokalaemia, for example, is a common side effect of diuretic use to manage fluid overload in heart failure. Low extracellular potassium increases pump block by raising the affinity of digoxin for the sodium pump. This risks cal- cium overload and hence digoxin toxicity. Coincident diseases also affect digoxin’s therapeutic effect. In hyperthyroidism, the Na+,K+-​ ATPase is upregulated, resulting in resistance to the inhibitory ef- fects of digoxin. So, increasing the dose may cause digoxin toxicity without ever producing a therapeutic effect. In patients with chronic cor pulmonale, tissue hypoxia may also lead to Na/​K pump inhib- ition and cardiac arrhythmias without increasing myocardial con- tractility. In patients with hypertrophic obstructive cardiomyopathy, although digoxin increases the rate of myocardial contractility, a rise in cardiac output is prevented because left ventricular outflow re- mains obstructed. Thus even when it can be shown that a drug is having its ex- pected action at a particular pharmacological or physiological level, it cannot automatically be assumed that it will have a thera- peutic effect. Interactions with circadian rhythms (chronopharmacology) Most physiological functions follow a circadian rhythm, so some drug effects are liable to differ at different phases of the rhythm. In some instances, the difference is dramatic. The timing of cortico- steroid dosing is a good example. Peak ACTH release from the pi- tuitary occurs at night leading to plasma cortisol peaks at around 08.00 a.m. when ACTH levels are at trough. So, exogenous cortisol in the morning will have no impact on ACTH release. But given in the evening it will completely inhibit night-​time ACTH. For this reason, cortisol given once daily in the morning causes much less pi- tuitary inhibition than the same dose given in the evening, or spread throughout the day. The cholesterol-​lowering effect of some statins is also affected by the time of administration because cholesterol synthesis has a cir- cadian rhythm with most synthesis occurring at night. Short-​acting statins are hence more effective if given at night (e.g. simvastatin, half-​life c.3 h). For longer-​acting statins there is no discernible dif- ference, as the drug level will still be high enough during the night 2.6  Principles of clinical pharmacology and drug therapy 87 even with morning dosing (e.g. atorvastatin half-​life c.20 h with even longer-​lived active metabolites). Translation of pharmacological effect into therapeutic effect: Long-​term therapy During prolonged therapy, adaptation may develop to the short-​ term pharmacological effects of the drug with several consequences. Therapeutic effects through adaptation In immunization, by adapting to an initial immunological challenge, the immune system develops the ability to respond to a subsequent similar challenge (e.g. tetanus immunization). Although tricyclic antidepressants rapidly inhibit reuptake of noradrenaline and 5-​HT in the brain, the therapeutic effect of these drugs takes 1 to 2 weeks to become evident. In certain brain regions, there is adaptation to the increased concentrations of neurotransmit- ters in the synaptic cleft with reduction in numbers of postsynaptic receptors. Part of this adaptive downregulation probably explains their therapeutic effects. Tolerance: Increasing ineffectiveness of therapy Tolerance is a state of decreased responsiveness to a drug, resulting from previous exposure, either to the same drug or to one with similar short-​term effects. For example, it can develop to the vasoconstricting effects of ephedrine nose drops, used to treat vasomotor rhinitis: as ephedrine acts by releasing noradrenaline from sympathetic nerve endings, noradrenaline depletion will reduce the effectiveness of ephedrine. Patients taking long-​term glyceryl trinitrate, particularly from transdermal patches, become tolerant to its acute effects. To avoid this, a patch should be applied for no longer than 18 h. This effect probably reflects depletion of tissue sulphydryl groups by oxida- tion to disulphide groups. Some oral preparations of isosorbide mononitrate are formulated to release their contents over 18 h for the same reason. Physiological tolerance by homoeostatic mechanisms Secondary hyperaldosteronism is a physiological response to so- dium loss produced by loop or thiazide diuretics. The enhanced potassium excretion that it causes may be reduced by using a potassium-​sparing diuretic (e.g. amiloride) or the aldosterone an- tagonist spironolactone. Another type of physiological tolerance occurs in patients given the carbonic anhydrase inhibitor acetazolamide. This causes both a diuresis and a kaluresis. However, these effects are only sustained for a matter of days, because the large amounts of bicarbonate lost from the kidney causes a metabolic acidosis. Interestingly, topical carbonic anhydrase inhibitors do not show the same tolerance when they are used in chronic glaucoma. Metabolic tolerance Metabolic tolerance results from faster metabolic clearance of the drug. The commonest cause is induction of hepatic P450 en- zymes by drugs such as barbiturates, phenytoin, carbamazepine, or rifampicin. Occasionally, drugs may induce their own metabolic clearance—​a phenomenon called ‘autoinduction’ (e.g. carbamaze- pine and artemisinin). Withdrawal syndromes A common, but not inevitable, outcome of an adaptive response to long-​term drug use is a withdrawal response either when the drug is withdrawn or when an antagonist is given. A withdrawal syndrome occurs in opiate users when the opiate is withdrawn or an antagonist, such as naloxone, is given. The symp- toms consist of yawning, rhinorrhoea, and sweating, followed by shivering and goose flesh (‘cold turkey’); later, nausea, vomiting, diarrhoea, and hypertension may occur. The acute syndrome sub- sides within a week, but the anxious and disturbed sleep patterns may last for several weeks or months. This syndrome can be avoided by introducing increasing doses of methadone as the opiate is with- drawn; methadone has a longer half-​life than opiates such as heroin and causes much less withdrawal when it is eventually discontinued. Delirium tremens is a feature of alcohol withdrawal in chronic alcohol abusers. This syndrome consists of disorientation and visual hallucinations. Withdrawal of benzodiazepines after long-​term therapy may result in a disturbance of sleep pattern (rebound in- somnia associated with abnormal sleep patterns), agitation, restless- ness, and occasionally epileptic convulsions. The risk of angina pectoris, myocardial infarction, and ar- rhythmias is increased in patients with ischaemic heart disease when β-​adrenoceptor antagonists are withdrawn after long-​term use. This may be due to upregulation in the number of cardiac β-​adrenoceptors, with increased sensitivity to the β-​adrenergic effects of sympathetic stimulation. Long-​term therapy with corticosteroids suppresses pituitary secretion of ACTH, leading to adrenal cortical atrophy. When treatment is suddenly withdrawn, ACTH secretion may take sev- eral weeks or months to recover. During this time patients risk an Addisonian crisis if stressed (e.g. if they have a myocardial infarction or are operated on). Adverse effects directly due to adaptation Patients taking a neuroleptic drug (e.g. fluphenazine or haloperidol) continuously for a period of years commonly develop abnormal movements (known collectively as tardive dyskinesia). The face, mouth, and tongue are often affected, with stereotyped sucking and smacking of the lips, lateral jaw movements, and darting move- ments of the tongue. Occasionally more widespread dyskinesia may resemble choreoathetosis. The long-​term blockade of central dopa- mine receptors is thought to lead to increased central sensitivity to the effects of dopamine; this partly reflects increases in the number of dopamine receptors. The risk of tardive dyskinesia is much lower in the newer atypical agents such as quetiapine and risperidone; the atypical clozapine has even been suggested to ameliorate established tardive dyskinesia. Adverse drug reactions An adverse drug reaction can be defined as ‘an unwanted or harmful reaction experienced following administration of a drug, or com- bination of drugs, under normal conditions of use and is suspected as being related to the drug (or combination)’. Sometimes, the term is broadened to include all adverse reactions whether the drug is dosed appropriately or not. Hence the term ‘adverse drug event’, 88 section 2  Background to medicine which includes drug prescription and dispensing errors and failures of patient compliance, is also used. Incidence The scale of the problem is probably still underestimated. Data suggests that: • 1 to 4% of acute hospital admissions are due to an adverse drug reaction • 5 to 20% of inpatients suffer an adverse drug reaction at some point in their admission • up to 3% of deaths in hospital inpatients are due to an adverse drug reaction In addition to the morbidity and mortality, adverse drug reactions are hugely expensive for healthcare systems. In the United Kingdom the cost probably exceeds several billion pounds annually, and in the United States of America some estimates have exceeded 100 billion dollars annually. Classification Dose-​related adverse reactions Dose-​related adverse reactions are usually due to an exaggeration of a known pharmacological effect of the drug. The pharmacological effect that produces the adverse reaction may be responsible for the therapeutic effect (e.g. hypoglycaemia following insulin administra- tion), or be a parallel effect (e.g. the anticholinergic action of tri- cyclic antidepressants, producing a dry mouth or urinary retention). Dose-​related adverse reactions may occur because of variations in the pharmaceutical, pharmacokinetic, or pharmacodynamic properties of a drug, often due to a drug–​disease interaction or a pharmacogenetic characteristic of the patient. These mechanisms are illustrated in the following paragraphs. Pharmaceutical problem Adverse reactions can be caused by a contaminant (e.g. pyrogens or even bacteria in intravenous formulations). This is obviously a hazard for illicit drugs that are used intravenously: not only are they dissolved under non​sterile conditions, they may also be ‘cut’ with other drugs (e.g. quinine, caffeine, and procaine). Febrile reactions can occur routinely with some manu- factured drugs given intravenously (e.g. amphotericin B and bisphosphonates), but otherwise fever should be treated very sus- piciously and the drug and giving set should be sent for microbio- logical screening. Out-​of-​date formulations may sometimes cause adverse reac- tions because of degradation products. For example, outdated tetra- cycline may cause Fanconi’s syndrome, because it is degraded to anhydrotetracycline and epiandrotetracycline. The omission of the preservative citric acid from tetracycline formulations has reduced the risk of this effect, but has not removed it completely. Very occasionally a drug has been incorrectly labelled by the manu- facturer. Of more concern is the rise of counterfeit medicines (see Chapter 2.10). They are thought to account for 15% of drug sales world- wide and in parts of Asia and Africa the figure probably exceeds 50% of sales. Counterfeit agents frequently contain none of the active drug, or subtherapeutic doses, and may also contain additional chemicals or drugs that are harmful. For example, ethylene glycol has been used in the manufacture of fake paracetamol syrups (for its sweetness and vis- cosity) and caused a number of deaths, especially in children. Pharmacokinetic variation There is often enormous variation in rate of drug elimination be- tween individuals. This variation is greatest for drugs cleared by hep- atic metabolism and is determined by several factors, which may be genetic, environmental (diet, smoking, alcohol), or hepatic (blood flow and intrinsic drug-​metabolizing capacity). On top of this vari- ability, pharmacogenetic or hepatic abnormalities may be associated with specific adverse reactions. In addition, renal and cardiac dis- ease can change drug pharmacokinetics. The impact of pharmaco- genetics is discussed in the following paragraphs. The reserve of the liver parenchyma is large, so adverse reactions due to impaired hepatic metabolism are uncommon. Nevertheless, in patients with severe liver disease caution is needed when a drug has a low therapeutic index or is subject to extensive first-​pass me- tabolism. For example, severely impaired hepatocellular function can reduce the clearance of drugs such as phenytoin, theophylline, and warfarin. The portosystemic shunting seen in advanced cirrhosis can also dramatically increase the bioavailability of drugs normally cleared rapidly by the liver (e.g. morphine and other narcotic anal- gesics, propranolol, and chlormethiazole). Drugs that the kidneys excrete unchanged, or whose active metabolites are excreted, will accumulate in renal failure. Important examples include digoxin, atenolol, lithium, aminoglycoside antibiotics, and vancomycin. Pharmacodynamic variation The variability in pharmacodynamic response to a drug may be compounded by concomitant disease. The patient with cirrhosis is a good example: impaired hepatocellular function can reduce the synthesis of clotting factors; the presence of oesophageal and gastric varices imposes a further risk of upper gastrointestinal bleeding and patients with alcoholic liver disease may have additional thrombo- cytopenia and impaired platelet function. Judging the response to an anticoagulant, antithrombotic, or antiplatelet drug in this setting is very difficult, and the risk of haemorrhage is high. A cirrhotic patient is also at risk of exaggerated sedation and en- cephalopathy from opiates or long-​acting benzodiazepines that are cleared by the liver. The hypokalaemic effects of diuretics or amphotericin carry a similar risk. Patients with cirrhosis also have inappropriate salt and water retention that can be worsened by drugs such as NSAIDs, corticosteroids, and carbamazepine. The pharmacodynamic effects of some drugs may be altered by changes in fluid and electrolyte balance. For example, both hypo- kalaemia and hypercalcaemia potentiate the toxic effects of di- goxin. Hypocalcaemia prolongs the action of muscle relaxants such as tubocurarine. Fluid depletion and hypovolaemia enhances the hypotensive effects of antihypertensive drugs. Non-​dose-​related adverse reactions Non-​dose-​related adverse drug reactions are caused by immuno- logical and pharmacogenetic mechanisms. Allergic drug reactions are unrelated to the usual pharmacological effects of the drug, and frequently show a delay between the first ex- posure to the drug and the subsequent adverse reaction. Very small doses of the drug may elicit the reaction once allergy is established. 2.6  Principles of clinical pharmacology and drug therapy 89 The reaction disappears on withdrawal; and the illness is often rec- ognizable as a form of immunological reaction (e.g. rash, serum sickness, anaphylaxis, asthma, urticaria, angio-​oedema). Factors associated with an increased risk of allergic drug reac- tions include a history of allergic disorders (patients with a history of atopic disease and those with hereditary angio-​oedema) and HLA status (e.g. the risk of severe skin reactions to carbamazepine and allopurinol is strongly associated with specific alleles of the HLA B locus). Drug allergy and its manifestations are classifiable according to the classification of hypersensitivity reactions (i.e. into four types, I–​IV). Type I reactions (anaphylaxis; immediate hypersensitivity) In type I reactions, the drug or metabolite interacts with IgE mol- ecules fixed to cells, particularly tissue mast cells and basophil leucocytes. This triggers a process that leads to the release of pharma- cological mediators (a cocktail of histamine, 5-​HT, kinins, and ara- chidonic acid derivatives including leukotrienes), which cause the allergic response. Clinically, type I reactions manifest as urticaria, rhinitis, bron- chial asthma, angio-​oedema, and anaphylactic shock. Drugs likely to cause anaphylactic shock include penicillins, local anaesthetics, and iodide-​containing radiographic contrast media. Type II reactions (cytotoxic reactions) In type II reactions, a circulating antibody of the IgG, IgM, or IgA class interacts with a hapten (drug) combined with a cell membrane constituent (protein), to form a hapten–​protein/​antigen–​antibody complex. Complement is then activated and cell lysis occurs. Most examples are haematological: thrombocytopenia from quinidine or its enantiomer quinine (‘gin and tonic purpura’), and occasionally rifampicin; ‘immune’ neutropenia, which can be difficult to distin- guish from neutropenia occurring as a direct toxic effect on the bone marrow—​phenylbutazone, carbimazole, tolbutamide, anticonvul- sants, chlorpropamide, and metronidazole have all been incrimin- ated; and the haemolytic anaemias that penicillins, cephalosporins, rifampicin, and quinidine can also produce by this mechanism. Type III reactions (immune-​complex reactions) In type III reactions, antibody (IgG) combines with antigen to form immune complexes that deposit in tissues; complement is then ac- tivated, causing capillary endothelial damage. Serum sickness, with fever, arthritis, enlarged lymph nodes, urticaria, and maculopapular rashes, is the typical drug reaction of this type. Penicillins, sulphonamides, and antithyroid drugs may cause it. Another type III reaction is the acute interstitial nephritis caused by penicillins, some NSAIDs, and some diuretics. Type IV reactions (cell-​mediated or delayed hypersensitivity reactions) In type IV reactions, T lymphocytes are sensitized by a hapten–​ protein antigenic complex. When the lymphocytes meet the antigen, an inflammatory response ensues. Examples are the contact derma- titis caused by topical local anaesthetics and antihistamines, and topical antibiotics and antifungal drugs. Rashes caused by a type IV mechanism in response to sulphonamides and thiacetazone are more common in people infected with HIV. Anaphylactoid and pseudoallergic reactions Anaphylactoid reactions resemble type I allergic reactions clinic- ally, but the mast cell and basophil activation is not IgE-​dependent. Instead, the cells are triggered directly, and drugs capable of doing this include:  succinylcholine, morphine, d-​tubocurarine, vanco- mycin (hence ‘red man’ syndrome on rapid intravenous admin- istration), and N-​acetylcysteine. They are generally less severe than allergen-​mediated anaphylaxis, and emergency treatment is the same. Aspirin can also trigger an attack of asthma by a non​immune mechanism. The inhibition of airway COX-​1 enzyme by as- pirin is thought to remove the inhibitory effect of prostaglandin E2 and divert arachidonic acid towards production of cysteinyl-​ leukotrienes (especially LTC4). This is a very powerful constrictor of airway smooth muscle (c. 1000-​fold the potency of histamine), and this mechanism explains why aspirin-​sensitive asthmatics are often sensitive to other NSAIDs (although COX-​2 selective drugs may be relatively safe). It does not appear to explain why half of aspirin-​sensitive asthmatics are also sensitive to the yellow food dye tartrazine (E102). In some patients, ampicillin, and its derivative amoxicillin, causes a maculopapular erythematous rash resembling the toxic erythema that can occur in penicillin hypersensitivity. However, this ampi- cillin rash is not immunological in origin. It can be distinguished from true penicillin hypersensitivity by its later onset after the first dose (typically 10–​14 days compared with 7–​10 days in penicillin hypersensitivity, though they overlap) and non​recurrence if re-​ exposed to ampicillin. Unlike penicillin hypersensitivity, it carries no increased risk of anaphylaxis to penicillin. An ampicillin rash oc- curs in about 1% of the normal population, but at a much higher fre- quency in some groups of patients: it occurs almost invariably (and can be a useful diagnostic pointer) in patients with some viral in- fections (e.g. infectious mononucleosis, cytomegalovirus infection, measles), lymphomas, and leukaemias. The risk is also increased in patients taking allopurinol. Other manifestations of allergic reactions Drugs may cause other adverse reactions that do not fit clearly into the earlier hypersensitivity classification, but where there is a strong suspicion of an immune basis. Drug fever as an isolated phenomenon can occur with anti- biotics (penicillins, cephalosporins, isoniazid, sulphonamides, and vancomycin), anticonvulsants (phenytoin and carbamazepine), α-​ methyldopa, hydralazine, and quinidine. The height or periodicity of the fever is not a useful clue that it is drug-​induced, but all drug-​ induced fevers defervesce rapidly on drug withdrawal (c.24–​48 h). Fever is also a manifestation of neuroleptic malignant syndrome, a rare and serious idiosyncratic adverse reaction to neuroleptic therapy (either initiation or dose-​escalation). In neuroleptic malig- nant syndrome, the fever is accompanied by rigidity, reduced con- sciousness, and autonomic disturbance. It resembles another rare syndrome, malignant hyperpyrexia, although here the fever and ri- gidity follow sensitization to volatile halogenated anaesthetic gases (e.g. halothane) or suxamethonium. The aetiology of neuroleptic malignant syndrome is still unclear, but patients with malignant hyperthermia have mutations either in the ryanodine receptor or skeletal muscle l-​type calcium channel. 90 section 2  Background to medicine A syndrome mimicking systemic lupus erythematosus, often involving joints and skin but generally sparing the kidneys, may follow long-​term treatment with hydralazine, procainamide, pheny- toin, or ethosuximide. Drug-​induced lupus is partly dose-​related and, in the case of procainamide and hydralazine, is affected by the acetylator status of the patient. Both drugs are metabolized by N-​ acetylation which is controlled by the polymorphic enzyme NAT2 (see following paragraphs). The two variants cause either slow or fast clearance of the drug and slow acetylators are at increased risk of drug-​induced lupus. Asthma occurring as a pseudoallergic reaction to NSAIDs and tartrazine is noted in the earlier paragraph. Other adverse drug reac- tions in the lung include pneumonitis associated with drug-​induced lupus (see earlier paragraph), pulmonary eosinophilia, and fibrosing alveolitis. Eosinophilic granulomatosis with polyangiitis has been associated with the use of cysteinyl-​leukotriene receptor antagonists (e.g. montelukast), but it now seems unlikely that the syndrome is actually caused by them. Rather their introduction was frequently accompanied by withdrawal of corticosteroids that probably un- covered a pre-​existing disease. Jaundice may occur as an allergic response to some drugs through either cholestasis (e.g. with phenothiazines, erythromycin, and chlorpropamide) or generalized liver damage (e.g. with halothane, isoniazid, and monoamine oxidase inhibitors). Long-​term effects causing adverse drug reactions Some adverse effects during long-​term therapy are related to both the duration of treatment and the dose. Adaptive changes These are the basis of some adverse reactions such as the develop- ment of tolerance and physical dependence to opiates, and tardive dyskinesia in patients receiving long-​term neuroleptic therapy for schizophrenia. Rebound phenomena When adaptive changes occur during long-​term therapy, sudden withdrawal of the drug may result in rebound reactions. Examples include the typical syndromes that occur after the sudden withdrawal of opiates or of alcohol (delirium tremens). Sudden withdrawal of barbiturates may result in restlessness, confusion, and convulsions, and a similar syndrome in which anxiety features predominate may occur after the sudden withdrawal of benzodiazepines. Similarly, pa- tients who abruptly stop an SSRI may complain of a constellation of symptoms 3 to 4 days later, which include headache, insomnia, diz- ziness, paraesthesia, sweating, and flu-​like symptoms. The withdrawal of some antihypertensive drugs may result in rebound hypertension, but is especially common with clonidine. Sudden withdrawal of β-​adrenoceptor antagonists may result in re- bound tachycardia and arrhythmia, sometimes precipitating myo- cardial ischaemia. Acute adrenal insufficiency can occur when corticosteroids are stopped abruptly. The risk depends on the potency and duration of corticosteroids used, but not the route of administration—​it has even been reported after stopping high-​dose topical or inhaled glucocorticoids. Reversal of the effects of unfractionated heparin with protamine sulphate may be associated with rebound hypercoagulability and an increased risk of thromboembolism. However, this risk may be justi- fied if heparin overdosage has caused life-​threatening haemorrhage. Importantly, protamine sulphate will not reverse the effect of low-​ molecular-​weight heparin and stopping or reversing oral anticoagu- lants (such as warfarin) does not lead to rebound hypercoagulability. Other long-​term effects Chloroquine may accumulate in the corneal epithelium (causing a keratopathy) and in the retina (causing a pigmentary retinopathy and blindness). The former occurs in most patients on long-​term therapy; the latter is less common but more serious. The risk in- creases with daily doses of more than 2.5 mg/​kg (as the free base) and chloroquine should only be used in inflammatory arthropathies where the safer hydroxychloroquine has failed. Amiodarone also accumulates extensively in tissues. Almost all patients on long-​term amiodarone develop photosensitization from skin deposition. They also develop microdeposits in the cornea, al- though these are rarely symptomatic. It also accumulates in other tissues but pulmonary alveolitis, neuropathy, and hepatocellular im- pairment are relatively uncommon. Delayed effects causing long-​term adverse drug reactions Carcinogenesis The long-​term effects of oestrogen therapy on cancer risk are complex and depend on whether they are administered to pre-​ or postmenopausal women. Long-​term oestrogen exposure through the oral contraceptive pill probably increases the risk of breast cancer but the effect is not consistent across all studies. The administration of hormone replacement therapy (HRT) to postmenopausal woman also increases their risk of breast cancer, although it appears that the breast cancer risk from combined oestrogen–​progesterone HRT is greater than from oestrogen-​only HRT (used in women after hyster- ectomy). The incidence of endometrial carcinoma is also increased in women taking oestrogen HRT for menopausal symptoms. In contrast, the oral contraceptive pill protects against endometrial cancer and the effect persists for many years after taking it. The risk of colon cancer may also be reduced by HRT but not the oral contraceptive pill. Anabolic steroids carry a risk of both benign and malignant hep- atic tumours on long-​term administration. The risk is greatest for the 17-​alkylated derivatives such as oxymetholone. The latter is now largely restricted for palliative use in cachectic states such as HIV wasting syndrome, but is still used illicitly by bodybuilders. Various anticancer drugs increase the risk of secondary solid tu- mours and haematological malignancy. For example, cyclophos- phamide containing chemotherapy regimens increase the risk of bladder cancer. The risk of secondary acute myeloid leukaemia and myelodysplastic syndromes is substantially increased in patients re- ceiving chemotherapy for Hodgkin’s or non-​Hodgkin’s lymphoma and testicular cancers. Alkylating agents, especially older regimens using the mustard mechlorethamine, carry the highest risk from lymphoma chemotherapy, and etoposide is the greatest risk from testicular cancer chemotherapy. Immunosuppressive drug regimens are also widely associated with an increased risk of lymphoma and solid tumours in organ trans- plant recipients. These regimens are typically based on calcineurin inhibitors, such as ciclosporin. Recent data suggests that regimens 2.6  Principles of clinical pharmacology and drug therapy 91 based on a non​calcineurin inhibitor, sirolimus, may reduce the risk of lymphoma and solid tumours in these patients. Adverse drug reactions associated with reproduction Some drugs impair fertility. For example, cytotoxic drugs can cause permanent ovarian failure with amenorrhoea. Sperm pro- duction may be reversibly impaired by sulphasalazine (espe- cially in slow metabolizers), gonadotropin hormone antagonists, methotrexate, and androgens. In fact, the reversible azoospermia achieved with depot formulations of testosterone esters and pro- gestogens is being developed for male contraception. Cytotoxic drugs (especially alkylating agents) can reversibly or irreversibly affect sperm production depending on the age of administration (the prepubertal testis is relatively insensitive), the doses used, and duration of exposure. Teratogenesis  Teratogenesis occurs when a drug taken early in pregnancy causes a developmental abnormality in a fetus. Exposure to a teratogen in the first trimester of pregnancy, and particularly the period of organogenesis (weeks 2–​8 of gestation), is most likely to cause structural abnormalities. The central nervous system is vul- nerable throughout pregnancy. For a drug to be teratogenic it must first cross the placenta. As a general rule, the drugs that do this have a low molecular weight, are poorly ionized at physiological pH, and are very lipophilic. Hence, heparin (even low-​molecular-​weight heparin) is a large, highly charged molecule and d-​tubocurarine is a small, ionized, and hydrophilic molecule; neither crosses the placenta. The placenta also expresses large numbers of transporter proteins including the ABC efflux pump P-​glycoprotein (MDR-​1). The efflux pumps are expressed predominantly on the maternal-​facing surface of the pla- cental villi and form part of the maternal–​fetal barrier. So, whether a drug crosses the placenta depends not just on physiochemical properties, but also its affinity for the efflux pump. The existence of functional polymorphisms in these pumps probably explains the wide variability observed in fetal drug concentrations, inci- dence of teratogenesis, and drug failure in pregnancies exposed to therapeutic drugs. Since most drugs that are proven teratogens in animals will not enter human development, it is unclear how well animal teratogen- icity testing predicts human teratogenicity. A drug will have been tested for teratogenicity in rodents and one other non​primate species before it can be registered. However, negative results from animal testing should not reassure anyone that the drug is safe in human pregnancy. Indeed, thalidomide itself is not teratogenic in rodents and the New Zealand rabbit is the only common labora- tory mammal that shows similar sensitivity to humans. Women of childbearing age are excluded from preregistration testing, so the teratogenic risk for humans of a drug at the time of its regis- tration is usually unknown. This explains the comments that exist in package inserts and other literature on new drugs, discouraging their use in pregnancy. It also explains why obstetricians often em- ploy drugs that seem to other clinicians to be obsolete. For example, in pregnancy-​related hypertension and pre-​eclampsia, hydralazine and α-​methyldopa are still first-​line drugs. In non​pregnant hyper- tension, they would not even be fourth-​line choices. Both drugs have, however, been in obstetric use for a half century without asso- ciated teratogenicity. Adverse drug reactions on the fetus during the later stages of preg- nancy  Some drugs that are not teratogenic may have adverse effects on the fetus if given later in pregnancy. Table 2.6.2 lists some im- portant drugs that should be avoided or used with care during later pregnancy (some throughout the whole duration of pregnancy). Given the uncertainty about teratogenic risk, what should be done if a woman of childbearing potential is given a drug, and then finds out days or weeks later that she is pregnant? First, it is important to identify the drug and the exact time of exposure to it. If it is a known or a likely teratogen, the relation between the time of ex- posure and the likely time of conception should be determined. Even if the precise date of conception is known, dating the preg- nancy by ultrasound is advisable if a suspected teratogen has been taken. If exposure to a known teratogen has occurred during the first 8 weeks of pregnancy, detailed ultrasound examination of the fetus may detect structural abnormalities, and serum and amniotic α-​fetoprotein concentrations measured to screen for neural tube de- fects. Any advice on termination of a pregnancy should be based on a consideration of the risk of fetal abnormality from both published information and investigation of the individual case. Adverse reactions to drugs in breast milk  Some drugs cause ad- verse reactions in babies after ingestion by the mother and excretion in her breast milk. Excretion of drugs into breast milk is important because 90% of women take at least one prescribed drug during the first week after delivery. This may discourage many women from breastfeeding unless they can be reassured that the benefits for the baby of being breastfed far outweigh any risks for the overwhelming majority of drugs (see Table 2.6.3). The factors affecting drug excretion into breast milk are the same as those that govern drug transfer across the placenta. The only important difference is that breast milk has a lower pH and higher lipid content than plasma. Like the placenta, this passive transfer only explains a fraction of the total transport capacity. Active drug transport also oc- curs through transporters and efflux pumps expressed in the breast epithelium, which explains why some drugs are present in breast milk at levels higher than expected (e.g. atenolol and benzylpenicillin). Table 2.6.2  Drugs with proven or very high teratogenic risk in human pregnancy Drug Associated risk ACE inhibitorsa Fetal renal toxicity/oligohydramnios Alcohol Fetal alcohol syndrome Carbamazepine Neural tube defects?/​Features of hydantoin syndrome Diethylstilboestrol Vaginal adenocarcinoma in female offspring Isotretinoin Craniofacial and cardiac anomalies Lithium Cardiac defects including Ebstein’s anomaly Methotrexate Fetal aminopterin syndrome Misoprostol Moebius syndrome Phenytoin Fetal hydantoin syndrome Tetracycline Decidual teeth staining Thalidomide Phocomelia Valproic acid Neural tube defects Warfarin Stippled epiphyses/​nasal hypoplasia a ATII receptor antagonists carry similar risk. 92 section 2  Background to medicine Neonates and young babies may be at risk of adverse drug reactions because clearance pathways for a drug are immature. The GFR is only 25% of the adult value at birth (based on body weight) and only reaches the adult range at 3–​6 months. Tubular secretion measured by PAH clearance is also impaired (10% of adult capacity until 6 months), which explains the slow elimin- ation of frusemide in neonates (it is secreted by the PAH trans- porter). Phase I metabolism through CYP enzymes are isoform dependent: 3A7 is the predominant isoform in utero; it is replaced by 3A4 postpartum, but it takes more than 1 month to reach adult levels; CYP 1A2 does not appear until 3 months; CYP2D6 and E1 appear after birth but rapidly achieve adult levels within hours of delivery. Drugs that should be given to breastfeeding mothers with caution are included among the drugs in Table 2.6.3. A more extensive list can be found in the British National Formulary and the Physician’s Desk Reference. If the safety of a drug is in serious doubt and it is not possible to identify an alternative, breastfeeding can be temporarily suspended while the drug is given. Detecting adverse drug reactions: Pharmacovigilance The importance of adverse reactions for drug regulatory bodies and the pharmaceutical industry has led to the evolution of a subspecialty of clinical pharmacology devoted to the detection and evaluation of adverse drug reactions. This specialization is called pharmacovigilance. Because drugs reach the market based on the experience of dosing just a few thousand patients, capturing as many adverse drug reac- tions as possible afterwards is of crucial importance. This activity is called postmarketing surveillance, which usually relies on self-​ reporting by health professionals and in some countries, the gen- eral public as well. In the United Kingdom, reporting by healthcare workers operates through the yellow card scheme run by the Medicines and Healthcare Products Regulatory Agency (MHRA) and the Commission on Human Medicines (CHM). It is a spon- taneous reporting scheme in the sense that it is voluntary and used to rely on the ‘yellow card’ in the back of every BNF being filled out by the healthcare worker. This can now be done online (https://​ yellowcard.mhra.gov.uk/​). The scheme aims to collect (1) serious or fatal adverse drug reactions from all drugs; (2) all adverse drug re- actions from newly licensed drugs (designated with a black triangle in the BNF); and (3) all adverse drug reactions in children under 18. Spontaneous reporting schemes rely on goodwill and are prone to considerable underreporting. Rare or unusual side effects are easily detected with them (e.g. the withdrawal of cerivastatin because of rhabdomyolysis and pergolide because of valvular heart disease). On the other hand, serious adverse reactions involving effects that occur frequently in the population receiving the drug are not so easily de- tected. The thrombotic risk of COX-​2 selective NSAIDs (coxibs), es- pecially rofecoxib (Vioxx), is a timely reminder of this serious flaw in spontaneous reporting schemes. If used over long periods, coxibs cause an excess of stroke and myocardial infarction, but both of these are common events in patients receiving NSAIDs. Most physicians, for example, would not immediately conclude that a 65-​year-​old pa- tient who died from a stroke or myocardial infarction while on a coxib was the victim of an adverse drug reaction. It is a sobering reminder that rofecoxib was only withdrawn after some 20 million people were exposed to it and many thousands had probably died from taking it. Despite the limitations of spontaneous reporting, it is a cru- cial part of the World Health Organization’s efforts to coordinate pharmacovigilance on a global scale as opposed to local efforts such as the yellow card scheme. Its database (http://​www.who-​umc.org/​) currently contains some 11 million reports (May 2015), and 122 countries had signed up to the scheme by September 2015. There is also a Europe-​wide (or rather European Union-​wide) pharmacovigi­ lance programme run by the European Medicines Agency, called EudraVigilance (https://www.ema.europa.eu/en/human-regulatory/ research-development/pharmacovigilance/eudravigilance). The dif- ference in this scheme is that adverse drug reaction notification by the pharmaceutical companies operating within the European Union is not voluntary, by a statutory requirement of their marketing authorization. It is too early to say how the performance of these two systems compares in terms of the early detection of ad- verse drug reactions. The probity of the pharmaceutical industry itself has also been seriously questioned in recent years in the process of alerting the regulatory authorities about suspected adverse drug reactions. Large Table 2.6.3  Effects of common maternal drugs on breastfed infants or milk production Acebutolol/​atenolol Cyanosis, hypotension, bradycardia Amoxicillin None Aspirin Metabolic acidosisa Bendrofluazide May reduce milk production Bromocriptine Suppresses lactation Caffeine Irritability if >2–​3 cups of coffee/​day Carbamazepine None Carbimazole Goitre Ciprofloxacin None Digoxin None Diltiazem None Fluconazole None Isoniazid None Labetalol None Levothyroxine None Lithium Plasma level up to 50% mother Nalidixic acid/​nitrofurantoin Haemolysis in G6PD deficient infanta Morphine None Oestrogen (in oral contraceptive pill) May reduce milk production Pethidine None Phenobarbitone/​primidone Sedation Phenytoin Methaemoglobinaemiaa Prednisolone None Propylthiouracil None Valproic acid None Verapamil None Warfarin None a Single case reports. From American Academy of Pediatrics (2001). Policy statement. (http://​aappolicy. aappublications.org/​cgi/​content/​full/​pediatrics;108/​3/​776). 2.6  Principles of clinical pharmacology and drug therapy 93 observational studies commissioned by the pharmaceutical com- panies to capture adverse reactions have been deliberately hidden (by Bayer in the case of aprotinin) or methodologically questioned (by Merck in the case of Vioxx) when they confirmed suspected ad- verse reactions. These and other cases have highlighted the need for regulatory agencies to be able to compel pharmaceutical companies to: (1) carry out the necessary controlled trials to investigate the safety of a new drug; and (2) make disclosure of these studies man- datory. Alternatively, healthcare systems should probably commis- sion the necessary studies. These would certainly be cost-​effective considering how much they will have been spent (e.g. managing the excess of strokes and myocardial infarction that followed the intro- duction of coxibs). Prevention of adverse drug reactions: The role of the patient Most people who take either a prescribed drug or one purchased over the counter do not usually expect unwanted effects from it, al- though they may have been alerted to potential adverse reactions by the doctor, pharmacist, or packet insert. The information from any of these sources should warn about all potential adverse drug reac- tions, so that the patient can: • assess the potential disadvantages of the drug, before deciding to take it; • connect an adverse reaction with the taking of the drug and take appropriate action. However, what is usually lacking is advice about how to minimize or avoid adverse drug reactions. This advice is as relevant for the patient as it is for the prescriber. The prescriber in particular should consider the following: 1. Are there dose-​dependent effects? If there are, they can be pre- vented or minimized by keeping the drug dosage as low as possible. It is always worth considering whether dosing below the pharmaceutical company’s recommended range may be ap- propriate for individual patients. This is especially the case in older people or for non​white ethnic groups (who may metab- olize the drug differently) as both are often underrepresented in preregistration drug trials. If adverse reactions are more likely with continued use of the drug, then the duration of use should be limited (e.g. with neuroleptic drugs). 2. Are drug interactions likely? If there are known interactions, they may be prevented by ensuring interacting drugs are avoided. This list should include agents that could be taken by the patient but are not prescribed. For example, it may be necessary to avoid con- sumption of grapefruit juice or herbal remedies such as St John’s wort if the drugs are substrates for CYP 3A4 or P-​glycoprotein (and the patient is taking simvastatin bought over the counter). 3. There may be serious adverse reactions that are unknown and/​ or undetected during preregistration trials. Drugs that have been prescribed over many years have an established margin of safety and should be used in preference to newer drugs. If there is a compelling reason to use a new drug, the patient should be aware that it is a new drug and there should be closer monitoring for adverse reactions. 4. Adverse reactions are more likely when a prescriber is using a drug they are not familiar with. The prescriber might consider asking for a second opinion about the appropriateness of the drug and problems with its use if they are not sure. 5. Some individuals are predisposed to adverse drug reactions. This is usually a genetic susceptibility. They may, for example, be poor drug metabolizers, porphyriacs, or have G6PD deficiency. If this is known, patients should be aware of drugs they should avoid, and they should tell other prescribers of the problem before they take any new drug. Drug interactions A drug interaction occurs when the effects of one drug are altered by the effects of another drug. Usually this results in an adverse reac- tion, but in a few cases, it may actually prove beneficial. Interactions form up to 10% of all adverse drug reactions, but crucially, among patients who die from an adverse drug reaction, about one-​third of deaths are due to interactions. Drugs likely to precipitate interactions often have one or more of the following properties: • They are highly protein bound (e.g. aspirin and sulphonamides). • They induce drug metabolism (e.g. phenytoin, carbamazepine, and rifampicin). • They inhibit drug metabolism (e.g. cimetidine, metronidazole, and triazole antifungals). The drugs most likely to be affected by drug–​drug interactions are also those with a steep dose–​response curve and a low therapeutic index, which includes aminoglycoside antibiotics, warfarin, and other coumarins, anticonvulsants, antihypertensive drugs, cardiac glycosides, cytotoxic and immunosuppressant drugs, oestrogen-​ containing oral contraceptives, and some centrally acting drugs. The most frequent interactions occur with warfarin and other coumarins. Drug interactions fall into three basic types (see Table 2.6.4). Pharmaceutical interactions These involve physiochemical effects between a drug and the solu- tion it is mixed with or between two drugs when they are mixed together (usually for injection). The interaction results in the drug either precipitating from solution or being inactivated in some other way. There are numerous examples, but some general principles can help to avoid many of them: give intravenous drugs by bolus injec- tion if possible or via an infusion pump; do not add drugs to infu- sion solutions other than dextrose or saline; only mix drugs in the same infusion solution if they are known to be safe (e.g. potassium chloride with insulin). Pharmacokinetic interactions Pharmacokinetic interactions occur when a drug alters the ab- sorption, distribution, or elimination (metabolism or excretion) of another drug. Absorption interactions One drug substantially altering the absorption of another is rela- tively uncommon but there are important examples. Anion ex- change resins, for example, bind warfarin, digoxin, and thyroxine very avidly (see earlier paragraphs). Other examples are given in Table 2.6.4. 94 section 2  Background to medicine This type of interaction may occasionally be beneficial. Hence, metoclopramide increases gastric emptying and speeds the absorp- tion of analgesics (such as ibuprofen and paracetamol) used to treat acute migraine. Activated charcoal also binds many drugs in the gut lumen, so preventing their absorption or enterohepatic recycling. This is widely exploited in drug overdoses (see earlier paragraphs and Chapter 10.4.1). It is also used to accelerate excretion of the antirheumatoid drug leflunomide if it causes a serious adverse re- action because of the extremely long half-​life (2 weeks) of its active metabolite. Protein-​binding displacement interactions Displacement of one drug by another from its binding sites on plasma proteins will cause an increase in the circulating concentra- tion of unbound drug. This is only important if the displaced drug is highly protein bound (>90%) and has a small volume of distribution (that will exaggerate the rise in free drug concentration). The drugs concerned are warfarin, phenytoin, and tolbutamide. The most common precipitant drugs in protein-​binding dis- placement interactions are sulphonamides, salicylates, and chloral hydrate and some of its congeners (because of their metabolite, trichloroacetic acid). In addition, valproate specifically displaces phenytoin. However, displacement interactions are generally relatively un- important. This is because the rise in free drug concentration in- creases the drug’s clearance, so that the total concentration actually falls if the displacing drug is given chronically. Displacement is only a problem if the initial rise in free drug concentration itself causes toxicity; fortunately, if this is not serious it will be transient. Interactions through induction of metabolism Induction of P450 enzymes is a major source of drug inter- actions. Drugs that induce drug metabolism include barbiturates, carbamazepine, griseofulvin, phenytoin, and rifampicin (see Table 2.6.4). The herbal preparation of St John’s wort also induces P450 isoforms. Table 2.6.4  Mechanisms of drug interactions, with important examples Mechanism Example Outcome Physiochemical Calcium gluconate plus sodium bicarbonate Precipitation of calcium carbonate in infusion solution Pharmacokinetic Altered absorption Reabsorption of oestrogens reduced by antibiotics Contraceptive failure Gastric emptying increased by metoclopramide Increased rate of absorption of simple analgesics Fluoroquinolones (e.g. ciprofloxacin) and agents containing divalent/​trivalent cations (e.g. antacids or Fe salts) or sucralfate Reduced absorption Altered protein binding Displacement of phenytoin by aspirin (not low-​dose aspirin) Phenytoin toxicity (transient) Increased metabolism Oestrogen metabolism increased by carbamazepine, phenytoin, rifampicin Contraceptive failure Reduced metabolism Warfarin metabolism inhibited by amiodarone, metronidazole, cimetidine Warfarin toxicity Theophylline metabolism inhibited by erythromycin or fluoroquinolones (e.g. ciprofloxacin) Theophylline toxicity Phenytoin/​carbamazepine metabolism inhibited by cimetidine, erythromycin, fluconazole, or isoniazid Phenytoin/​carbamazepine toxicity Amine metabolism inhibited by monoamine oxidase inhibitors (includes tyramine in foods and phenyl-​propanolamine in cold cures) Acute severe hypertension Reduced renal elimination Penicillin/​cephalosporin excretion reduced by probenecid Prolonged duration of antibiotic action Lithium excretion reduced by diuretics or NSAIDs Lithium toxicity Digoxin excretion reduced by amiodarone, quinidine, verapamil Digoxin toxicity Pharmacodynamic Shared mechanisms Vitamin K competes with warfarin for epoxide reductase Reversal of the effect of warfarin Naloxone displaces opioids from opioid receptors Reversal of opioid toxicity PDE5 inhibitors (e.g. sildenafil) and nitrates Profound hypotension Statin and niacin, fibrates (especially gemfibrozil) Risk of rhabdomyolysis/​renal failure SSRIs and tramadol, triptans (e.g. sumatriptan) or monoamine oxidase inhibitors (e.g. selegiline) Risk of serotonin syndrome Parallel mechanisms Psychoactive drugs and alcohol Increased sedation Cytotoxic drugs acting at different stages of the cell cycle Therapeutic potentiation in cancer chemotherapy Warfarin and antiplatelet drugs (aspirin and other NSAIDs or clopidogrel) Increased risk of bleeding and impaired haemostasis 2.6  Principles of clinical pharmacology and drug therapy 95 Interactions through inhibition of metabolism Most of the interactions under this heading involve drugs that in- hibit specific P450 pathways, for example, through CYP3A4 or 2D6. Some examples are shown in Table 2.6.4. Metabolism through non-​CYP metabolic pathways can also be af- fected. Hence, the interaction of allopurinol with azathioprine and 6-​mercaptopurine results from its inhibition of xanthine oxidase. Both 6-​mercaptopurine and azathioprine (which is metabolized to 6-​mercaptopurine) are metabolized by xanthine oxidase. The interaction of MAO inhibitors with dietary tyramine is an- other example of a non-​CYP pathway. Tyramine in the diet usu- ally has very low bioavailability because of degradation by MAO in the gut wall and liver. MAO inhibitors inactivate the enzyme both here and within sympathetic nerve endings, which increase their noradrenaline content. Hence, tyramine ingested after an MAO in- hibitor has substantially increased bioavailability, and the resulting plasma levels are sufficient to displace the enhanced noradrenaline stores from sympathetic nerve terminals. The hypertensive crisis this causes is referred to as the ‘cheese reaction’. These interactions may again be sometimes useful. The protease inhibitor ritonavir, for example, is a potent CYP3A4 inhibitor and is added to antiretroviral drug regimens containing other protease inhibitors to specifically inhibit their metabolism and so increase bioavailability (which may otherwise be low). This is referred to as PK-​enhanced drug formulation. Excretion interactions Most interactions involving drug excretion occur in the kidneys, although secretion into the gut through P-​glycoprotein is im- portant for some drugs (e.g. digoxin, cyclosporin). The latter ex- plains drug interactions with substances such as St John’s wort and grapefruit juice. Interactions due to the alteration of active renal drug secretion are shown in Table 2.6.4. Some drugs that are weak acids or bases are passively reabsorbed along the nephron. The extent of reabsorp- tion is pH-​dependent and can be exploited following overdose to increase drug elimination in the urine (see earlier paragraphs). Pharmacodynamic interactions Pharmacodynamic interactions are very common and occur when one drug alters the effect of another. This can arise because the two drugs act on the same drug target or pathway. Alternatively, the interacting drugs may produce the same effect but by entirely sep- arate mechanisms. Some examples are shown in Table 2.6.4. Interactions through a shared mechanism When there is a shared drug target (such as a receptor) the inter- action may arise because of antagonism or combined agonism at this site. Many antagonistic interactions are therapeutically benefi- cial. Examples are the reversal of the effects of benzodiazepines with flumazenil or warfarin with vitamin K. In contrast, synergistic inter- actions are often adverse. Hence, nitroglycerin and sildenafil are useful drugs to treatment angina and erectile failure, respectively; nitroglycerin being metabolized to nitric oxide (NO) and sildenafil augmenting the effect of endogenous NO. Combining the two, how- ever, causes devastating hypotension. Interactions through parallel mechanisms Two drugs may have similar pharmacological or toxic effects but they are achieved through different mechanisms. For example, the bleeding risk of warfarin is increased by drugs that affect platelet function (e.g. NSAIDs, clopidogrel, or glycoprotein IIb/​IIIa antag- onists), reduce platelet number (drug-​induced thrombocytopenia) or cause gastrointestinal ulceration (e.g. NSAIDs). Exploiting parallel pathways can also be beneficial. Many chemo- therapy regimens exploit pharmacodynamic interactions of their component drugs. And combinations of antibiotics are used rou- tinely even when a single organism is being targeted (e.g. penicillin plus an aminoglycoside in bacterial endocarditis and multiple-​drug regimens for tuberculosis). Pharmacogenetics and pharmacogenomics The variability between individuals in terms of both the pharmaco- kinetics and pharmacodynamics of a drug is partly determined by genetic variation. The variability can occur in a single gene (hence pharmacogenetics) or arise from the interplay of a variety of genes (pharmacogenomics). The realization that the human genome is an important player has fuelled expectation that drug prescription may in future be ‘personalized’ based on the patient’s genotype. Pharmacokinetic variability Several important pathways of drug metabolism can affect drug pharmacokinetics through polymorphic variation in the enzymes involved. These include oxidation, acetylation, S-​methylation, and suxamethonium hydrolysis (Table 2.6.5). Oxidation About 80% of oxidative drug metabolism occurs through the cyto- chrome P450 or CYP enzyme family of enzymes (see earlier para- graphs). There are some 57 genes in humans encoding these enzymes and they are named with a number and letter for the family and sub- family to which they belong (e.g. 3A4 or 2D6). Many of these genes are polymorphic, with the common wild-​type gene being mutated to produce a change in gene function; usually a reduction. The alleles for these different polymorphisms or gene variants are designated by a number after an asterisk (e.g. 2D6*4 for the commonest variant of 2D6 that reduces enzyme function). Drugs that are metabolized through polymorphic CYP enzymes show drug metabolism in the population that has a bimodal (or occasionally trimodal) distribution. An example of this is shown in Fig. 2.6.6, where the metabolic ratios for debrisoquine are shown in the urine of over 1000 patients (debrisoquine is an ob- solete antihypertensive but is still used as a ‘probe’ drug to define metabolic phenotype for CYP enzymes). Most are extensive met- abolizers with low-​to-​intermediate drug:metabolite ratios in their urine, but a significant minority are poor metabolizers with high ratios. In this case, the poor-​metabolizer status is due to 2D6 gene variants that reduce 2D6 enzyme activity. Genetically, the defect is very heterogenous with around 100-​point mutations, promoter variants, and deletions now recognized (see http://​www.imm.ki.se/​ cypalleles). The frequency of the poor-​metabolizer phenotype 96 section 2  Background to medicine varies across different ethnic groups:  it is around 5 to 10% in northern Europeans but only approximately 1% in the Chinese. Drugs that are commonly metabolized through 2D6 include co- deine, dextromethorphan, flecainide, metoprolol, nortriptyline, propafenone, timolol, and tramadol (see Table 2.6.1). The dose-​ related adverse reactions of these drugs (e.g. central nervous system toxicity with nortriptyline or bradycardia and hypo- tension with the β-​blockers) are more likely in patients with the poor-​metabolizer phenotype. Quinidine is an inhibitor of the 2D6 enzyme and is able to convert an extensive to a poor metabolizer. Some patients even have a super-​ or ultrametabolizer pheno- type. This phenotype occurs through duplication of the 2D6 gene and shortening of the half-​life of 2D6-​metabolized drugs in these patients is directly related to the degree of duplication (Fig. 2.6.7). They are relatively uncommon compared to the poor metabol- izers (see Fig. 2.6.6), but their status has important clinical conse- quences. A 2D6 drug will appear to be relatively ineffective in an ultrametabolizer unless very large doses are used, and they may even be suspected of poor compliance. In contrast, if the drug is a prodrug, ultrametabolizers are characteristically very sensitive to its effects. Hence, they may show opiate intoxication from what appear to be low therapeutic doses of codeine because of its rapid and complete metabolism to morphine. Although they are rare in Europe, 30% of East Africans may be ultrametabolizers. The other CYP enzymes are not as well studied as 2D6, but poor metabolizers have been identified for 2C9 (metabolizing losartan, phenytoin, tolbutamide, and warfarin) and 2C19 (metabolizing omeprazole). There is a commercial microarray available to geno- type individuals for the commoner variations in the 2D6 and 2C19 genes (it is called the AmpliChip P450) that has been approved by the FDA, but the technology in this area is evolving rapidly Table 2.6.5  Pharmacogenetic variation affecting drug metabolism Metabolizer phenotype Enzyme responsible Common deficiency gene variant Frequency in white people Comments Suxamethonium apnoea Butyrylcholinesterase (CHE) CHE70 D>G 1:3000 Many other variants encoding CHE with reduced or no activity Poor metabolizer CYP2D6 2D64 (premature stop codon) 5–​10% Commonest allele for PM phenotype Slow acetylator N-​acetyltransferase (NAT2) NAT2*5B c.40% carry it but compound heterozygotes as likely as *5B homozygotes 4 is the fast allele and behaves dominantly Thiopurine sensitivity Thiopurine S-​ methyltransferase (TPMT) TPMT3A (154A>T, 240T>C) 5% are carriers (homozygous rate c.1:250) 20 other variants but 95% of cases due to *2 or *3(A–​D) Irinotecan sensitivity UDP-​glucuronyl transferase 1 (UGT1A1) (TA)7–​TATA c.40% Promoter variant reduces UGT1A1 expression 0 0 0.01 0.10 1 10 100 40 Ultrarapid metabolism Poor metabolism Extensive metabolism Cutoff 80 120 Debrisoquin:4-Hydroxydebrisoquin metabolic ratio No. of subjects Fig. 2.6.6  Pharmacogenetics of CYP2D6. Urinary metabolic ratios of debrisoquin to its metabolite, 4-​hydroxydebrisoquin, for more than 1000 Swedish subjects. Poor metabolizers with no or reduced CYP2D6 activity are separated by the cut-​off box from extensive metabolizers. Adapted from Bertilsson L, et al. (1992). Pronounced differences between native Chinese and Swedish populations in the polymorphic hydroxylations of debrisoquin and S-​mephenytoin. Clin Pharmacol Ther, 51, 388–​97. Morphine Morphine-3-glucuronide 0 0 10 20 30 40 50 60 70 80 90 2 4 6 8 10 12 14 16 18 20 22 24 0 0 1 2 3 4 2 4 6 8 10 12 14 16 18 20 22 24 Concentration [μg/litre] Concentration [μg/litre] PM EM UM Fig. 2.6.7  Effect of 2D6 genotype on the plasma concentration of morphine and its active metabolite following a 30 mg dose of codeine. Ultrametabolizers with duplication of the 2D6 wild-​type gene show higher levels compared to extensive metabolizers carrying single copies. Poor metabolizers for comparison have only loss-​of-​function or non​functional copies. Reprinted by permission from Springer Nature: The Pharmacogenomics Journal. Kirchheiner J, et al. (2007). Pharmacokinetics of codeine and its metabolite morphine in ultra-​rapid metabolizers due to CYP2D6 duplication. Pharmacogenomics J, 7, 257–​65, Copyright © 2006, Springer Nature. 2.6  Principles of clinical pharmacology and drug therapy 97 Acetylation Some drugs are cleared by acetylation. These include dapsone, hydralazine, isoniazid, procainamide, and some sulphonamides. The enzyme involved is the hepatic enzyme N-​acetyltransferase (NAT2, EC 2.3.1.5) and if drug:metabolite ratios for a drug cleared by acetylation are measured, they distribute bimodally in the population. Subjects producing the highest ratios are described as being fast acetylators (cf. extensive metabolizers for the 2D6 pathway). Again, the genetic basis for the biochemical phenotype is very heterogenous, and the frequency of fast acetylators varies across different ethnic groups: 40% in northern Europeans, 85% in Japanese, and 5% among the Inuit. Slow acetylators require lower doses of drugs that are cleared by acetylation than fast acetylators. Slow acetylators are also more likely to develop the lupus erythematosus-​like syndrome caused by isoniazid, hydralazine, and procainamide, and the peripheral neuropathy caused by isoniazid (isoniazid actually causes deg- radation of pyridoxine). The interaction between isoniazid and phenytoin, in which isoniazid inhibits phenytoin metabolism causing phenytoin toxicity, is also more frequent among slow acetylators. Disease associations with polymorphic metabolism As some diseases may be related to the effects of environmental chemicals that have carcinogenic metabolites, it is of interest that polymorphic acetylation, hydroxylation, and sulphoxidation have other clinical associations. The evidence linking acetylator status with the risk of bladder cancer is probably the best-​ established association; slow acetylators having increased risk. Other weaker associations may exist between debrisoquine metabolizer status (2D6) and risk of parkinsonism in poor debrisoquine hydroxylators, of bronchogenic carcinoma in exten- sive debrisoquine hydroxylators, and of primary biliary cirrhosis in poor sulphoxidizers. Glucuronidation The antitumour agent irinotecan is a prodrug. Its active metabolite, SN-​38, has about 1000-​fold higher activity than irinotecan itself and is inactivated by glucuronidation through UGT1A1. Patients homozygous for a polymorphism in the promoter of UGT1A1 are much more likely to develop severe neutropenia after irinotecan because of defective glucuronidation of SN-​38. The same poly- morphism is also involved in Gilbert’s syndrome. Methylation The thiopurines (6-​mercaptopurine and 6-​thioguanine) are me- tabolized by S-​methylation through the enzyme thiopurine S-​methyltransferase (TPMT, EC 2.1.1.67). The metabolism of azathioprine is also affected by this enzyme as azathioprine is re- duced to 6-​mercaptopurine after dosing (i.e. it is a prodrug). High levels of TPMT are found in red cells, and the activity is trimodal in the population with high, intermediate, and low levels of TPMT detectable. Because low levels of TPMT (present in c.1:300 whites) reduces metabolic clearance of thiopurines, these subjects are ex- posed to high levels of 6-​thioguanine nucleotides. This leads to severe myelosuppression if they are given standard doses of these drugs. Hence, thiopurines can only be used safely in subjects with low TPMT if they are given very low doses. There are some 20 gene variants that affect the activity of TPMT and carriers for TPMT variants are common—​overall frequencies are typically 5 to 10%. The commonest variant (TPMT*3A) is ac- tually a double mutant that switches two amino acids and produces an unstable enzyme that explains the low level of TPMT in homo- zygotes. There is considerable ethnic variation in the frequency of variants (e.g. the *3A is not seen among the Chinese). Suxamethonium hydrolysis Suxamethonium (succinylcholine) is metabolized in the plasma by the non​specific esterase pseudocholinesterase (EC 3.1.1.8; also called butyrylcholinesterase). Normally this happens quickly, which explains why neuromuscular blockade with this drug lasts only a few minutes. Some patients, however, have very slow clear- ance of suxamethonium due to low plasma pseudocholinesterase activity. This manifests as prolonged neuromuscular blockade or apnoea after the use of suxamethonium, which can last several hours or longer. Suxamethonium apnoea is usually very rare, but is common in Inuit (up to 10%). Several different gene defects can cause pseudo- cholinesterase deficiency: the dibucaine-​resistant, fluoride-​resistant, and ‘silent’ gene types. Patients with reduced pseudocholinesterase also show increased sensitivity to the non​depolarizing blocker mivacurium. Interestingly, pseudocholinesterase deficiency does not appear to affect the conversion of the prodrug bambuterol (which is also a substrate for pseudocholinesterase) to its active me- tabolite terbutaline. Pharmacodynamic defects Some biochemical abnormalities make individuals peculiarly sensi- tive or resistant to the effects of certain drugs. All have a genetic basis. Red cell enzyme defects (see Chapter 22.6.11) Adverse drug reactions may affect people whose red cells are de- ficient in glucose-​6-​phosphate dehydrogenase (G6PD). If their red cells are exposed to an oxidizing drug, they lose their oxygen-​ carrying capacity as haemoglobin is oxidized to methaemoglobin and they eventually haemolyse. Drugs commonly implicated are doxorubicin, nalidixic acid, nitrofurantoin, primaquine, and sulphamethoxazole. There is a longer list of drugs that may cause haemolysis in some G6PD-​deficient individuals, depending on their genotype. This is because the common African variant gives higher red cell G6PD levels than the Mediterranean form, so mild oxidative stress is better tolerated. This list includes aspirin (low dose is usually safe), chloramphenicol, l-​dopa, isoniazid, quinine and related com- pounds, trimethoprim, and vitamin K. Porphyria (see Chapter 12.5) The hepatic porphyrias, acute intermittent porphyria and porphyria cutanea tarda, are characterized by abnormalities of haem biosyn- thesis. Certain drugs may precipitate an attack of porphyria espe- cially cytochrome P450 inducers (e.g. barbiturates, carbamazepine, and rifampicin). The quality of data as to the safety (or not) or other drugs is variable. There is a useful web database that rates drug safety on a five-​point scale (http://​porphyriadrugs.com/​). 98 section 2  Background to medicine Malignant hyperthermia This potentially fatal complication of general anaesthesia follows ex- posure to halogenated anaesthetic gases and suxamethonium (see earlier paragraphs). Vitamin D-​resistant rickets Three varieties of rickets are resistant to the effects of vitamin D (cholecalciferol):  familial hypophosphataemic rickets, vitamin D dependency, and Fanconi’s syndrome (see Chapter 21.16). Warfarin sensitivity Sensitivity to warfarin varies widely in the general population. This has been explained on the basis of common gene variants in its target enzyme vitamin K epoxide reductase (EC 1.1.4.2, encoded by VKORC1) and the CYP2C9 gene responsible for its metabolism. Pharmacogenomics and the prospect of ‘personal prescribing’ The realization that the pharmacokinetics and pharmacodynamics of a drug may be genetically determined has raised the prospect of tailoring the drug to the patient. Predefining the generic variants that a patient has before a drug is given could, in principle, avoid many adverse drug reactions and interactions. The sequencing technology to perform the necessary genotyping is now available, and many drug trials now incorporate this technology to identify genotypic signatures that both affect the therapeutic response to the drugs and predict adverse reactions. However, we are some way off truly ‘personalized prescribing’. Monitoring drug therapy Monitoring drug therapy usually involves trying to measure the clin- ical response directly. If this is difficult, or is not related directly in time to a dose of the drug, a surrogate measure of the response may be required. In some cases, it may be necessary to resort to measure- ment of the plasma concentration of the drug. Monitoring the therapeutic effects of drugs Some events can be directly monitored in the individual patient. Examples of therapeutic events that can be monitored in the indi- vidual include frequency of seizures during anticonvulsant drug therapy, muscle power during treatment of myasthenia gravis, the frequency of attacks of angina pectoris, and body weight during di- uretic therapy. Preventive measures in medicine often cannot be monitored in the individual patient and their impact has to be predicted from popula- tion studies (usually a clinical trial). Examples include the frequency of infections after immunization, the reduction in NSAID-​induced peptic ulceration with a proton pump inhibitor, or the prevention of the complications of myocardial infarction by the use of thromb- olysis and aspirin. Monitoring the pharmacodynamic effects of drugs In some circumstances, the pharmacological effect of a drug can be carefully measured, followed sequentially, and used as a guide to drug therapy even though it may not be correlated precisely with the therapeutic effect. Examples include the effect of insulin on the blood glucose concentration in diabetes mellitus, anticoagulants on the prothrombin time, bronchodilators on FEV1 and peak flow rate in asthma, and cancer chemotherapy on tumour markers. Monitoring drug pharmacokinetics (therapeutic drug monitoring) This is useful for drugs where: • the clinical evidence of therapeutic or toxic effects is difficult to measure or interpret • the relation between dose and plasma concentration is unpredictable • the drug has a low therapeutic index • the plasma concentration of the drug is a good predictor of response There are only a handful of drugs that meet these requirements (see Table 2.6.6). For these drugs, monitoring drug levels can be used to individualize therapy (e.g. at the start of drug dosing, when the relation between dose and plasma concentration in the individual is uncertain or rapid changes in renal function alter the relation between dose and plasma concentration), to monitor toxicity and to assess compliance (see earlier section, ‘The patient’s use of a drug: Compliance and concordance’). Phenytoin Plasma concentrations of phenytoin in the toxic range are quite well related to its acute neurotoxic effects, but not to its long-​term adverse reactions, such as gingival hyperplasia, hirsutism and acne, and folate and vitamin D deficiencies. At low dosages it takes about 2 weeks of maintenance therapy to reach steady state after a change in dose, but because of its non​linear kinetics (Fig. 2.6.5), the half-​life lengthens at higher plasma concentrations; so it can take up to 3 weeks or longer in some patients to reach steady state. For this reason the dosage should not be changed frequently. Provided the sample is not taken too soon after a dose (i.e. within 1 to 2 h), the time of sampling for phenytoin is not critical, as peak–​trough fluctuation is small between doses. Digoxin Plasma digoxin concentrations correlate well with toxic effects but not with the therapeutic effect within the therapeutic dosage range. The time of blood sampling should be at least 6 h after the last dose. During regular maintenance dosage without a loading Table 2.6.6  Drugs that commonly require therapeutic concentration monitoring and their reference ranges Drug Concentration below which a therapeutic effect is unlikely Concentration above which a toxic effect is more likely Gentamicina 5 µg/​ml (peak) 12 µg/​ml (peak), 2 µg/​ml (trough) Digoxin 1.0 nmol/​litre 3.8 nmol/​litre Ciclosporinb 80–​200 nmol/​litre 170–​300 nmol/​litre Lithium 0.4 mmol/​litre 1.0 mmol/​litre Phenytoin 40 µmol/​litre 80 µmol/​litre Theophylline 55 µmol/​litre 100 µmol/​litre a Conventional dosage regimens. b Actual range will vary between laboratories. 2.6  Principles of clinical pharmacology and drug therapy 99 dose, steady state is reached after about 7 days (normal renal func- tion) to more than 14 days (functionally anephric). The pharma- cological response to a given plasma level of digoxin is dependent on thyroid function (hyperthyroidism decreases responsiveness and hypothyroidism increases it) and the plasma potassium (hypo- kalaemia increases responsiveness and hyperkalaemia reduces it). Children younger than 6 months have lower plasma digoxin con- centrations at a given dose than older children and adults, and they are also more resistant to the pharmacodynamic actions of digitalis; so plasma digoxin concentrations cannot be clearly interpreted in this age group. Lithium The therapeutic range is 0.4 to 1 mmol/​litre. In the range of 1 to 1.5 mmol/​litre, the incidence of both acute toxicity and long-​term adverse effects is increased. Concentrations above 1.5 mmol/​litre should be avoided. Blood samples should be taken at least 12 h after the last dose. It takes about 3 days for steady state to be reached during regular maintenance therapy, but patients vary widely. It may take up to a week. Monitoring the plasma lithium concentration is necessary for several reasons. Lithium is nephrotoxic and excreted by the kid- neys, so if toxicity occurs it is self-​perpetuating. Systemic avail- ability varies from person to person and is altered by diarrhoea. It also varies widely between formulations, which cannot be used interchangeably. Sodium balance also affects renal excretion of lithium. For example, diuretic-​induced renal sodium loss reduces renal lithium excretion and can precipitate toxicity. Aminoglycoside antibiotics Gentamicin is the most widely used aminoglycoside antibiotic, but the principles hold for other aminoglycosides. The relation between the plasma concentration of gentamicin and its therapeutic efficacy is complicated by the fact that different organisms have different sensitivities to the antibiotic. Gentamicin is renally excreted, so in renal impairment it accumulates unless the dose frequency (and eventually the dose itself) is reduced. The toxic effects of gentamicin on the inner ear and kidneys are related to the ‘peak’ concentration (usually taken 1 h after an intramuscular injection or the start of an intravenous infusion) and the ‘trough’ concentration (taken just before the next dose). These should be measured after three or four doses, or sooner if there is renal impairment. A peak plasma concentration of 5 to 9 mg/​litre is generally considered necessary, although when gentamicin is used together with benzylpenicillin to treat bac- terial endocarditis, lower plasma gentamicin concentrations may be effective. Bacteriological measurement of in vitro inhibitory concentrations will help to guide therapy. If there is uncertainty, expert advice on dosing and target plasma concentrations should be sought. Theophylline Plasma theophylline concentrations correlate well with therapeutic and toxic effects. Measurement is essential in any patient who has been taking oral theophylline before it is given intravenously. It is also important in smokers who usually have increased theophylline clearance and hence require higher maintenance doses. Ciclosporin Ciclosporin is generally measured in whole blood, and the result of the assay may depend on whether the measurement technique is by immunoassay or high-​performance liquid chromatography. The time to steady state is about 2 days and samples should be taken at trough (just before the next dose). The whole-​blood concentra- tion of ciclosporin can be affected by reduced absorption (due to diarrhoea or reduced bile-​salt production) or altered bioavailability (due to liver disease, coadministered drugs such as ketoconazole and rifampicin, grapefruit juice, or St John’s wort). FURTHER READING Clinical pharmacology Brown MJ, Sharma P, Mir FA, Bennett PN (2019). Clinical pharma- cology, 12th Edition. Elsevier, North Holland. Ritter JM, et al. (2008). A textbook of clinical pharmacology, 5th edition. Arnold, London. Pharmacological effects of drugs Brunton LL, et al. (2012). Goodman & Gilman’s the pharmacological basis of therapeutics, 12th edition. Mc-​Graw Hill, London. Rang HP, et  al. (2019). Rang & Dale’s pharmacology, 9th edition. Elsevier, Amsterdam. Pharmacokinetics Gibaldi M, Perrier D (1982). Pharmacokinetics, 2nd edition. Marcel Dekker, New York. Rowland M, Tozer TN (1995). Clinical pharmacokinetics. Concepts and applications, 3rd edition. Lea & Febiger, Philadelphia, PA. Adverse effects of drugs Aronson JK (ed) (2015). Meyler’s side effects of drugs, 16th edition. Elsevier, Amsterdam. Websites Medicines and Healthcare Products Regulatory Agency (MHRA). https://​www.gov.uk/​government/​organisations/​medicines-​and- ​healthcare-​products-​regulatory-​agency/​ UL Cochrane Centre. http://​www.cochrane.co.uk/​ World Health Organization (1994). Guide to Good Prescribing. http://​ whqlibdoc.who.int/​hq/​1994/​WHO_​DAP_​94.11.pdf 2.7 Biological therapies for immune, inflammatory, 2.7 Biological therapies for immune, inflammatory, and allergic diseases 100 ESSENTIALS Therapeutic monoclonal antibodies and related molecules are increasingly used to treat immune-​mediated and inflamma- tory diseases. They interact very precisely with a soluble or cell-​ bound ligand to have three predominant effects:  neutralization of proinflammatory cytokines or growth factors; modulation of intercellular interactions; or depletion of pathogenic cells. They deliver significantly enhanced specificity in comparison to trad- itional synthetic drugs and have delivered substantial improve- ments in clinical outcomes in many immune, inflammatory, and allergic diseases. However, there are no biomarkers to help de- cide which biological class to try first in a particular patient, hence one of the main challenges for the future is the identification of ‘stratifiers’ to guide therapy. Introduction A biological therapy is a medicinal product that is produced from a biological source, usually a living cell, in contrast to a synthetic, small molecule drug. Biological therapies include growth factors, hor- mones, and other recombinant therapeutic proteins, but this chapter will focus on therapeutic monoclonal antibodies (mAbs) and related molecules in the context of immune-​mediated and inflammatory diseases (Table 2.7.1). The overriding concept underpinning these molecules is that they are ‘targeted therapies’, developed to interact very precisely with a soluble or cell-​bound ligand. In this way they have significantly enhanced specificity in comparison to traditional synthetic drugs. In the 30 years since the first mAb was licensed for acute trans- plant rejection the field has expanded rapidly (see Chapter 3.8). These complex molecules are now a multibillion-​dollar industry and occupy an enlarging space in the global pharmaceutical market. Growth has been catalysed by increased knowledge of dis- ease pathologies but also by advances in protein engineering and bioprocessing, which have improved the efficiency of therapeutic protein production as well as the quality of the products. The development of therapeutic antibodies and derivatives In the 1970s Kohler and Milstein developed technology that enabled the isolation and immortalization of B-​cell clones (hybridomas) producing a particular type of antibody. The product of each clone was a mAb of unique and defined specificity, which raised hopes of modern-​day ‘magic bullets’ to target cancers and other diseases. The first licensed mAb was OKT3, a murine mAb against the antigen CD3 on T cells. This reversed steroid-​refractory transplant rejection with significant success, despite a cytokine release syn- drome precipitated by T-​cell activation. A further limitation was immunogenicity, a consequence of the recipient’s immune system viewing the murine mAb as a foreign protein. The resulting immune (antiglobulin) response neutralized the mAb and accelerated its clearance, frustrating attempts to retreat patients and occasionally causing hypersensitivity reactions. Concurrent advances in recombinant DNA technology, how- ever, enabled the cloning, manipulation, and recombination of antibody genes from hybridomas (Fig. 2.7.1). The next generation of therapeutic mAbs were chimeric, whereby the murine variable (V) region, responsible for target-​binding, remained intact but was linked to a human constant (C) region, with an incremental reduc- tion in ‘foreignness’. In a further iteration, the complementarity-​ determining regions (CDRs) of the murine mAb (the V-​region elements that determined specificity) were left intact but embedded in a human framework, providing ‘humanized’ mAbs. Ultimately, ‘fully human’ mAbs were produced, either from transgenic animals whose genomes had been manipulated to produce human anti- bodies upon immunization; or by in vitro techniques such as phage display technology. As well as reducing potential immunogenicity these techniques enabled the engineering of antibody C-​regions, to enhance or reduce effector function, and thereby the fate of the mAb target. Antibody fragments were also generated, both intact Fab fragments and smaller elements that retained target-​binding capacity; other technologies enabled the generation of antibody-​drug conjugates and bispecific antibodies of dual specificity. Receptor-​fusion proteins resemble 2.7 Biological therapies for immune, inflammatory, and allergic diseases John D. Isaacs and Nishanthi Thalayasingam 2.7  Biological therapies for immune, inflammatory, and allergic diseases 101 Table 2.7.1  Currently licensed biological therapies for immune, inflammatory, and allergic diseases Drug name Target Structure Licensed indication(s) Route of administration Notes Abatacept CD80/CD86 Fusion protein of CTLA4 EC domain and human IgG1 Fc RA, JIA SC (RA); IV (RA, JIA) Fc is modified to prevent cytotoxicity Adalimumab TNF Fully human mAb RA; JIA; CD; UC; AS; PsA; Ps; Hidradenitis suppurativa, uveitis SC Biosimilar available Alemtuzumab CD52 Humanized mAb MS IV Also used extensively for lymphoma/ leukaemia and conditioning of solid organ and bone marrow transplant recipients Anakinra IL-1 Recombinant human IL-1 receptor antagonist RA; CAPS; Still’s disease SC Recombinant form of naturally occurring IL-1 receptor antagonist Atacicept BLyS, APRIL Fusion protein of TACI EC domain and human IgG1 Fc SLE SC In development Belimumab Blys Fully human mAb SLE IV Benralizumab IL-5 receptor Humanized mAb Eosinophilic asthma SC Brodalumab IL-17RA Fully human mAb Ps SC Canakinumab IL-1β Fully human mAb Gout; JIA (systemic); CAPS; Still’s disease SC Certolizumab Pegol TNF Humanized mAb Fab' fragment linked to PEG RA; AS; PsA; Ps SC Denosumab RANKL Fully human mAb Osteoporosis SC Cases of osteonecrosis of the jaw and atypical femoral fractures have been reported Etanercept TNF Lymphotoxin Fusion protein of human p75 TNF receptor EC domain and human IgG1 Fc RA; AS; JIA; PsA; Ps SC Ineffective in CD Biosimilar available Etrolizumab β7 integrin Humanized mAb IBD SC In development Golimumab TNF Fully human mAb RA; AS; PsA; UC SC Guselkumab IL-23 (p19 subunit) Fully human mAb Ps SC Infliximab TNF Chimeric mAb RA; CD; UC; PsA; AS, Ps IV Biosimilar available Ixekizumab IL-17A (both IL-17A and IL-17A/F) Humanized mAb PsA; Ps SC Can exacerbate IBD Mepolizumab IL-5 Humanized mAb Eosinophilic asthma SC Natalizumab α4 integrin Humanized mAb MS IV Increased susceptibility to PML Ocrelizumab CD20 Humanized mAb MS IV Omalizumab IgE Humanized Mab Allergic asthma; chronic spontaneous urticaria SC Dose determined by serum IgE and body weight in allergic asthma Reslizumab IL-5 Humanized mAb Eosinophilic asthma IV Rituximab CD20 Chimeric mAb RA; GPA; MPA; NHL; CLL IV Dosing regimen differs between rheumatological and haematological indications Biosimilar available Sarilumab IL-6 receptor Fully human mAb RA SC Secukinumab IL-17A Fully human mAb Ps, PsA, AS SC Can exacerbate IBD Tildrakizumab IL-23 (p19 subunit) Humanized mAb Ps SC Tocilizumab IL-6 receptor Humanized mAb RA; JIA; GCA IV (RA, JIA); SC (RA) Ustekinumab IL-12 & IL-23 (p40 subunit) Fully human mAb Ps, PsA; CD SC Inhibits Th1 and Th17 pathways Vedolizumab α4β7 integrin Humanized mAb CD, UC IV Abbreviations: CD, cluster of differentiation; CTLA4, cytotoxic T-lymphocyte associated protein 4; EC, extracellular; Ig, immunoglobulin; Fc, fragment crystallisable; RA, rheumatoid arthritis; JIA juvenile idiopathic arthritis; SC, sub-cutaneous; IV, intravenous; TNF, tumour necrosis factor; mAb, monoclonal antibody; CD, Crohn’s Disease; UC, ulcerative colitis; AS, ankylosing spondylitis; PsA, psoriatic arthritis; Ps, psoriasis; MS, multiple sclerosis; IL, interleukin; CAPS, cryopyrin-associated periodic syndrome; BLyS, B-lymphocyte stimulator; APRIL, a proliferation-inducing ligand; TACI, transmembrane activator and CAML interactor; SLE, systemic lupus erythematosus; RA, receptor A; Fab, fragment antigen binding; PEG, polyethylene glycol; RANKL, receptor activator of nuclear factor kappa-B ligand; IBD, inflammatory bowel disease; PML, progressive multifocal leukoencephalopathy; GPA, granulomatosis with polyangiitis; MPA, microscopic polyangiitis; NHL, non-Hodgkin lymphoma; CLL, chronic lymphocytic leukaemia; GCA, giant cell arteritis; Th, T helper cell. 102 section 2  Background to medicine mAbs but comprise the extracellular domain of a cell surface receptor fused to an immunoglobulin (Ig) Fc to form a soluble molecule that is specific for the receptor’s ligand (Fig. 2.7.2). For further discussion see account by H Waldmann and G Winter in Chapter 3.8. Classification The structure of a therapeutic mAb can be discerned by its no- menclature and is comprised of: prefix, substem A, substem B, and suffix. The prefix is unique to the drug and the suffix, -​mab, indicates that it is a mAb. Substem B provides the species upon which the Ig sequence is based: u for human, o for mouse, xi for chimeric, and zu for humanized. Substem A indicates the class of drug (e.g. -​l(i)-​ for immunomodulatory, k(i) for interleukin and t(u) for tumour). Some examples are provided in Table 2.7.2. Targets for biological therapies in immune-​mediated inflammatory diseases In immune-​mediated inflammatory diseases, biological therapies act in three predominant ways: to neutralize proinflammatory cyto- kines or growth factors; to modulate intercellular interactions; or to deplete pathogenic cells (see Fig. 2.7.3 and Table 2.7.1). Cytokine and growth factor targets Tumour necrosis factor (TNF) TNF is a pivotal cytokine mediator in a variety of immune-​mediated inflammatory diseases. It exists in soluble and transmembrane forms, and binds to type 1 and type 2 receptors to mediate a var- iety of proinflammatory and cell survival effects. TNF blockade was the first of the modern-​day biological therapy paradigms and has been licensed for adults and children with a variety of rheum- atological conditions (rheumatoid arthritis, ankylosing spondyl- itis, psoriatic arthritis, juvenile idiopathic arthritis), as well as for psoriasis, Crohn’s disease, ulcerative colitis, uveitis and hidradenitis suppurativa (Table 2.7.1). There are five anti-​TNF drugs currently licensed. Two are fully human mAbs, one is a chimeric mAb, one (certolizumab) is a PEGylated Fab’ fragment and the last is a sol- uble receptor. The latter (etanercept) also binds lymphotoxin α3 but there is nothing to suggest this is relevant to its efficacy or toxicity. Certolizumab is the only anti-​TNF manufactured in E. coli, which should increase yields over mammalian cell culture. The PEG moiety increases the half-​life by reducing renal clearance and proteolysis. TNF blockade was licensed for rheumatoid arthritis at the start of the 21st century and was revolutionary. Patients who had been re- fractory to all treatment modalities suddenly experienced previously unimagined symptomatic improvements in joint pain and stiffness, and fatigue. Furthermore, TNF blockade prevented the relentless joint damage seen with more traditional therapies. Consequently, Heavy chain Light chain CDRs VARIABLE REGION CONSTANT REGION (a) (b) MURINE CHIMERIC -XIMAB HUMANISED -ZUMAB ‘FULLY HUMAN’ -MUMAB Fig. 2.7.1  The different types of therapeutic antibody. (a) Basic antibody structure. (b) Basic structure of a murine, chimeric, humanized, and human monoclonal antibody. Red indicates murine sequence and black indicates human sequence. CDR, complementarity-​determining region. Reproduced from: Isaacs JD. Antibody engineering to develop new antirheumatic therapies. Arthritis Res Ther 2009, 11, 225. disulphide bonds VH VL CH1 CL (b) VH VL CH1 CL VH VL CH1 CL CH2 CH2 CH3 CH3 Hinge (a) Fc (c) CH2 CH2 CH3 CH3 receptor fragment receptor fragment Fig. 2.7.2  The domain structures of an antibody molecule and derivatives. (a) An antibody molecule. (b) A fragment antigen-​binding (Fab) fragment. (c) A receptor-​immunoglobulin fusion protein. CH, heavy chain constant domain; CL, light chain constant domain; Fc, fragment crystallizable; VH, heavy chain variable domain; VL, light chain variable domain. NB: A Fab’ fragment is a Fab fragment plus the hinge region. Reproduced from: Isaacs JD. Antibody engineering to develop new antirheumatic therapies. Arthritis Res Ther 2009, 11, 225. 2.7  Biological therapies for immune, inflammatory, and allergic diseases 103 patients experienced less disability than previous generations with retained employment prospects and reduced need for joint surgery. Registry data demonstrated a slightly enhanced risk of serious in- fection (about 20% higher than with synthetic antirheumatic drugs such as methotrexate) but no enhanced risk of malignancy. An un- expected complication was the emergence of tuberculosis (TB), often disseminated and with unusual histology. Because of their targeted nature biological therapies have provided unexpected in- sights into pathophysiology. In the case of TB, we learnt that TNF was essential for the maintenance of granulomas, which in turn con- trolled latency in individuals with prior exposure. The dissolution of granulomas with TNF blockade thereby released TB bacilli into the bloodstream with extrapulmonary reactivation. Patients prescribed TNF blockade are now routinely screened for prior TB exposure and treated with appropriate prophylaxis where indicated. The soluble receptor etanercept is less likely to reactivate TB. The precise reason is unclear but may reflect a reduced capacity to disrupt granulomata due to an incapacity to perform so-​called ‘inside-​out’ signalling upon binding cell surface TNF. TNF blockade is also associated with an enhanced susceptibility to opportunistic intracellular pathogens such as listeria, salmonella, and some fungi. The effects of TNF blockade were similarly impressive in patients with severe Crohn’s disease, particularly those with fistulae and peri- anal disease in which healing was rapid and complete. When used in the first two years of disease, around 50% of patients achieve remis- sion and even mucosal healing. Perhaps with parallels to TB reactiva- tion, etanercept is not effective in this granulomatous disease of the gastrointestinal (GI) tract. TNF blockade is also effective in ulcera- tive colitis. Psoriasis provided a further successful disease target, at a time when systemic treatments were relatively limited. Infliximab provided 75% improvement in the Psoriasis Area and Severity Index (PASI) in more than 80% of patients, and 90% improvements in 50–​ 60%. These drugs were also effective in psoriatic arthritis and an- kylosing spondylitis, for which therapeutic options were even more limited than for rheumatoid arthritis. In both diseases they provide significant improvement in disease activity, including spinal inflam- mation and function. In contrast, TNF blockade was not successful in other indica- tions characterized by TNF excess, perhaps illustrating homeostatic functions of TNF. These included multiple sclerosis and congestive heart failure, in which trials were stopped prematurely for disease worsening. Uncommon and unexpected adverse effects of TNF blockade include neutropenia and vasculitis and, paradoxically, psoriasis can be triggered in some patients. Drug-​induced lupus can also occur, including renal and cerebral manifestations. IL-​6 IL-6 is a key local and systemic inflammatory mediator. Its receptor exists in membrane-​bound and soluble forms. After binding IL-​6, the soluble receptor pairs with its signalling component, gp130, on any cells that express it. Tocilizumab binds both forms of the IL-​6 receptor, effectively preventing IL-​6 signalling. The protean effects of IL-​6 include: activation of B and T cells, and macrophages; the differentiation of macrophages into osteoclasts; synovial fibroblast proliferation and synovial neovascularization; production of plate- lets from megakaryocytes; production of acute phase reactants and hepcidin from the liver; and systemic effects such as fever and Table 2.7.2  Mab nomenclature Prefix Substem A Substem B Suffix Infliximab Inf li Xi Mab Tocilizumab Toci li Zu Mab Canakinumab Cana ki nu Mab Rituximab Ri tu xi Mab (a) (b) (c) Fig. 2.7.3  Potential mechanisms of action of immunomodulatory mAbs and receptor-​immunoglobulin fusion proteins. (a) Neutralization of soluble mediators. (b) Cell lysis. (c) Inhibition of intercellular or cell: endothelial interactions. 104 section 2  Background to medicine fatigue. Il-​6 levels are elevated in rheumatoid arthritis, particularly within the joint, and administration of tocilizumab provides re- ductions in disease activity similar to TNF blockade. In line with hepatic effects of IL-​6 there is a robust normalization of the acute phase response and, in patients with anaemia of chronic disease, a rise in haemoglobin as hepcidin levels fall. Elevated platelet counts also fall. Dyslipidaemia may occur in patients receiving tocilizumab with elevations in HDL, LDL, and triglycerides, usually without a rise in atherogenic index. Nonetheless lipid rises should be treated according to relevant guidelines. Diverticular perforations have oc- curred in patients receiving tocilizumab, which should therefore be used with caution where there is a history of intestinal ulceration or diverticulitis. The aetiology is unclear but could reflect an important homeostatic role of IL-6. Reversible neutropenia and transaminase elevation are also common. In other respects, the efficacy and ad- verse event profile of IL-6 blockade resemble those of TNF blockade. In particular there is a slightly enhanced risk of serious infections. IL-​1 IL-​1α and IL-​1β are proinflammatory and bind the IL-​1 receptor (IL-​1R). The naturally occurring IL-​1 receptor antagonist (IL-1Ra) competes with IL-​1α and β for binding to IL-​1R. IL-​1 inhibition can be achieved with anakinra (recombinant IL-1ra), or with canakinumab (anti-​IL-​1 mAb). Anakinra effectively blocks IL-​1α and IL-​1β but its short half-​life requires daily dosing. It is licensed for rheuma- toid arthritis and for the autoinflammatory cryopyrin-​associated periodic syndromes (CAPS), in which the activated inflammasome results in increased secretion of IL-​1β. These include neonatal-​ onset multisystem inflammatory disease (NOMID), chronic in- fantile neurological, cutaneous, articular syndrome (CINCA), Muckle–​Wells syndrome, and familial cold autoinflammatory syndrome (FCAS). In rheumatoid arthritis, despite elevated IL-​1 levels in plasma and synovial fluid, the efficacy of anakinra is less than that of the other biological drugs. Canakinumab is an anti-​IL-​ 1β human mAb, licensed for the treatment of CAPS, systemic ju- venile idiopathic arthritis (JIA), and gouty arthritis. In gout, urate crystals are ingested by neutrophils and monocytes, activating the inflammasome with the release of IL-​1β. IL-​17 IL-​17A is the signature cytokine of Th17 T cells which, in health, protect against bacterial and fungal infections, particularly candid- iasis. Other members of this cytokine family include IL-17F, IL-17C, and IL-17A/​F. The various family members bind receptors that share the subunit IL-​17RA. Th17 T cells are also associated with a number of autoimmune diseases including rheumatoid arthritis, psoriasis, inflammatory bowel disease, and ankylosing spondylitis. The func- tions of IL-​17 overlap with those of TNF and IL-​6, while also stimu- lating the production of additional proinflammatory cytokines and chemokines. Akin to IL-​6, IL-​17 promotes osteoclastogenesis and angiogenesis. Secukinumab is a fully human anti-​IL-​17A antibody. The drug penetrates the skin and reduces local inflammation asso- ciated with psoriasis. In a head-​to-​head study its efficacy exceeded that of etanercept, with 54% of patients achieving PASI 90 and 24% PASI 100 after 12 weeks. It is also an effective treatment for psoriatic arthritis and ankylosing spondylitis. While IL-17A is abundantly present in the gut mucosa in inflammatory bowel disease (IBD), secukinumab was ineffective in Crohn’s disease and associated with infectious complications. Furthermore, IBD exacerbations were re- ported in trials of secukinumab for other indications. Blockade of IL-​17A is associated with infections, including mucosal and, occa- sionally, systemic candidiasis. Brodalumab targets IL-17-​RA, thereby antagonizing the different IL-​17 family members; it is licensed for use in moderate to severe plaque psoriasis. B-​lymphocyte stimulator B-​lymphocyte stimulator (BLyS), also known as B-​cell activating factor, is a B-​cell survival factor and a member of the TNF super- family. Levels are elevated in certain autoimmune diseases such as systemic lupus erythematosus (SLE), where there is an association with disease activity. Inhibition of BLyS leads to impaired B-​cell survival and reduced differentiation of B cells into plasma cells. Belimumab is a fully human anti-​BLys mAb. It is the only biological drug to be approved in the treatment of SLE, with a licence for pa- tients with active, autoantibody positive disease. Administration of belimumab is associated with reductions in IgG levels, including autoantibodies; longer-​term administration is associated with a fall in circulating B cells. Bacterial infections are more common in pa- tients receiving belimumab. Atacicept (TACI-​Ig) is a fusion protein comprising the ligand binding portion of the BLyS receptor TACI (transmembrane activator and CAML interactor) and a human IgG1 Fc. As well as binding BLyS, TACI also binds a proliferation-​inducing ligand (APRIL), which is an additional B-​cell growth factor. Hence atacicept has a broader anti-​B-​cell profile than belimumab, and is currently under investigation in SLE. IL-23 Th17 differentiation is partly controlled by IL-​23, produced by den- dritic cells and by Th17 cells themselves. IL-​23 comprises two sub- units, p40 shared with IL-​12 (important in the differentiation of Th1 cells), and p19. Ustekinumab is a fully human anti-​p40 mAb that blocks the actions of both IL-​23 and IL-​12. It is effective for treating severe plaque psoriasis and psoriatic arthritis although showed inferiority to secukinumab in a head-​to-​head trial in psor- iasis. It is also licensed for use in Crohn’s disease, in contrast to IL-17 blockade. Guselkumab, which blocks IL-23 p19 (and therefore spe- cific for IL-23 vs. IL-12) is in development for similar indications, and is licensed for use in moderate to severe plaque psoriasis. Modulation of intercellular interactions A second strategy for biological therapies involves the inhibition of cell interactions, which can prevent activation or migration. Costimulation blockade T cells require two critical signals for their activation. The first is the recognition, by the T-​cell receptor, of their cognate antigen (a complex of major histocompatibility complex (MHC) molecule and peptide) on an antigen presenting cell (APC). The second is the rec- ognition of CD80 and CD86 on the APC by CD28, which provides an essential costimulatory signal. Following activation the T-​cell upregulates CTLA4, which displaces CD28 from CD80 and CD86 due to its higher affinity. CTLA4 imparts a negative signal to the T-​cell, thereby acting as a ‘brake’ to activation, returning the T-​cell to its resting state. Abatacept, or CTLA4-​Ig, is a fusion protein of the extracellular domain of CTLA4 and a modified Fc region of human IgG1 (Fig. 2.7.4). This molecule competes with CD28 for binding to 2.7  Biological therapies for immune, inflammatory, and allergic diseases 105 CD80 and CD86, thereby blocking costimulation and inhibiting the activation of naïve T cells. The Fc modification is designed to pre- vent abatacept from killing APCs to which it binds. It is an effective agent in patients with rheumatoid arthritis, polyarticular juvenile idiopathic arthritis, and psoriatic arthritis. It can be administered by monthly infusion or weekly subcutaneous injection. Adhesion molecule blockade Natalizumab is a humanized anti-​α4-​integrin mAb which blocks the interaction of α4β1 integrin with its receptor vascular cell ad- hesion molecule-​1 (VCAM-​1) and α4β7 integrin with mucosal addressin cell adhesion molecule-​1 (MadCAM-​1). This prevents the transendothelial migration of integrin-​expressing leucocytes into inflamed tissues. In multiple sclerosis VCAM-​1 is upregulated on CNS endothelium and possibly glial cells and natalizumab is an effective therapy, administered by four-​weekly infusion. However, its use is associated with an increased risk of progressive multifocal leukoencephalopathy (PML), the opportunistic infection associated with John Cunningham (JC) virus. Patients receiving natalizumab for multiple sclerosis must be fully informed of this risk and carry a patient alert card. While also effective in Crohn’s disease, by inhibiting leucocyte adhesion to MadCAM-​1 on gut endothelial cells, natalizumab is rarely used for this indication as a result of the PML risk. In contrast, vedolizumab is a humanized anti-​α4β7 mAb. α4β7 integrin is expressed preferentially on a specific subset of gut-​ homing memory T cells that cause inflammation in inflammatory bowel disease. Vedolizumab therefore blocks leucocyte trafficking to the gut, with a licence for both Crohn’s disease and ulcera- tive colitis. Etrolizumab is an anti-​β7 humanized mAb that binds both α4β7 and αEβ7 integrins. The latter is selectively expressed on gut mucosal intraepithelial T cells, where it binds E-​cadherin. Hence etrolizumab can prevent T cells trafficking to the gut but may also block their retention. It is currently under investigation in inflammatory bowel disease, having demonstrated efficacy in a phase 2 trial in ulcerative colitis. Depletion of pathogenic cells Alemtuzumab, which targets CD52 on lymphocytes, monocytes, macrophages, and NK cells, was the first humanized therapeutic mAb. It is unusual in having its origins in a rat rather than a mouse mAb. It is potently cytotoxic and, while first developed to treat leu- kaemia and lymphoma, it was subsequently studied in a range of autoimmune diseases. It is licensed for relapsing remitting mul- tiple sclerosis, administered as two short, annual treatment cycles. Its use is paradoxically associated with the development of sec- ondary autoimmunity. The most common manifestation is thy- roid autoimmunity, but more serious complications have included thrombocytopenic purpura, antiglomerular basement membrane disease, and other haematological cytopenias. Administration of alemtuzumab is also associated with significant infusion reactions, which can be reduced with appropriate premedication that includes a glucocorticoid. Oral prophylaxis for herpes virus infections is also recommended for at least one month after each treatment cycle. Rituximab is a chimeric human IgG1 anti-​CD20 mAb, originally developed for the treatment of B-​cell malignancies but subsequently established as an effective treatment for rheumatoid arthritis and certain types of vasculitis. It is potently cytotoxic, probably a conse- quence of efficient antibody-​dependent cellular cytotoxicity, comple- ment-​mediated cytotoxicity, and apoptosis induction. In rheumatoid arthritis it is administered as two intravenous infusions spaced two weeks apart, again with prophylaxis against infusion reactions. CD20 is expressed on all B cells but not on haematopoietic stem cells, pro-​B cells, or plasma cells. Consequently, after initial depletion B cells reconstitute and there is minimal loss of humoral immunity due to survival of plasma cells. Reconstitution occurs typically after 6 months, but with differential rates in distinct B-​cell subsets. Retreatment for rheumatoid arthritis is usually administered when MHC TCR CD28 CD80/86 Antigen presenting cell T cell Signal 1 Signal 2 (a) CD80/86 Antigen presenting cell T cell Signal 1 (b) CTLA4 MHC TCR CD28 MHC CD80/86 Antigen presenting cell TCR T cell Signal 1 (c) CD28 CTLA4-Ig Fig. 2.7.4  The structure and mode of action of abatacept. (a) T cells require two signals for full activation. Signal 2 (coactivation) derives from the interaction of CD28 with B7.1 and B7.2. (b) CTLA4 is upregulated following T-​cell activation and competes with CD28 due to its higher affinity, de-​activating the T-​cell. (c) Abatacept (CTLA4-​Ig) is a fusion protein of the extracellular domain of CTLA4 and an IgG Fc. 106 section 2  Background to medicine symptoms start to recur but at least 6 months after the initial course. There is no recommended upper limit to the number of rituximab courses that can be administered but repeated courses are associ- ated with secondary hypogammaglobulinaemia, most commonly IgM but occasionally IgG. The latter may be associated with an en- hanced infection risk and the risk:benefit ratio of continued treat- ment should be considered before each course of therapy. PML cases have also been reported in patients who have received rituximab but most commonly in patients with leukaemia or lymphoma who have received additional immune suppression. Nonetheless all patients should be counselled about this risk and receive a patient alert card. Rituximab is also licensed for use in granulomatosis with polyangiitis, microscopic polyangiitis, and pemphigus vulgaris. Immunogenicity The development of chimeric, humanized, and fully human mAbs was expected to reduce immunogenicity associated with murine components of antibody molecules. While this has largely been the case, immunogenicity cannot be eliminated altogether. MAb CDRs are intrinsically immunogenic, being created by gene recombination events and somatic mutations. There are also Ig allotypes which vary between individuals. Antidrug antibodies (ADAs) can be difficult to measure, hence their importance and extent is yet to be fully es- tablished. Nonetheless they can undoubtedly interfere with efficacy by neutralizing an mAb’s binding site or accelerating its clearance. Clinically this is most likely to manifest as secondary non​response, where an individual loses the benefit of a previously effective therapy. Hypersensitivity reactions also occur, such as infusion reactions as- sociated with anti-​infliximab ADAs, but anaphylactic reactions are fortunately rare. Soluble receptors are less immunogenic as they contain minimal foreign sequence. Immunogenicity is influenced by many factors such as the primary mAb sequence, its formulation (particularly the presence of aggre- gates), the route of administration and precise treatment regime, disease under treatment, and coadministered therapy. For example, anti-​TNF mAbs are more immunogenic in rheumatoid arthritis than in ankylosing spondylitis or psoriatic arthritis; and the frequency of ADAs in rheumatoid arthritis patients is reduced by coadministration of methotrexate. Even with such cotherapy, however, almost a third of patients receiving the fully human mAb adalimumab generated ADAs, most commonly within the first 6 months of therapy, which significantly reduced efficacy. Measurement of circulating drug is often easier than measuring ADAs, and a number of algorithms have been proposed whereby measurement of drug levels with or without ADAs should render mAb therapy more personalized and cost-​ effective. Nonetheless, controlled trials with economic analyses are ideally needed before so-called ‘therapeutic drug monitoring’ can be recommended in routine practice. Biosimilars The availability of biological therapies has been restricted or ra- tionalized in most parts of the world, due in large part to their high cost. However, as the patents protecting the original biological drugs expire, there is the potential for biosimilar molecules to be developed at lower cost. Unlike synthetic small molecule drugs, it is not possible to create an exact replica of a therapeutic mAb. This is because many features, particularly post-​translational modifica- tions, depend less on the primary amino acid sequence and more on cell culture conditions and downstream manufacturing, pro- cessing, and production. Furthermore, some of these features are difficult to characterize precisely, and a given batch of therapeutic mAb is also characterized by ‘microheterogeneity’—​a mixture of closely related but distinct molecules. Consequently, both the Food and Drug Administration (FDA) and the European Medicines Agency (EMA) have developed regulations for the development of biosimilar biological products. These rely heavily on in vitro com- parisons of the physicochemical characteristics and biological ac- tivities of the originator and biosimilar biologicals, and less on data from clinical trials in humans. While reducing the cost of biosimilar development these guidelines mean that biosimilars can be licensed having been administered to relatively few patients. Because it is im- possible for a biosimilar to be identical to the originator biological, this limited clinical experience has led to concerns over safety and efficacy. There are additional questions around: indication extrapo- lation (biosimilars can be licensed for the same indications as the originator biological without requiring clinical trial data in each indication); differences in immunogenicity compared to the ori- ginator biological; substitution (the potential for a pharmacist to substitute a biosimilar when an originator biological has been pre- scribed); nomenclature and identification of the biosimilar product, particularly when adverse effects occur. While smaller and less complex than mAbs, biosimilar growth factors have been marketed for several years without significant problems. Furthermore, there is batch-​to-​batch variability of originator biologicals, particularly when manufacturing processes change. Nonetheless the launch of biosimilar mAbs has been seen as a major milestone and a number of anti-TNF and rituximab biosimilars are now licensed and in use, so far without unexpected complications. Future developments Stratified medicine There are currently four classes of biological therapy for rheumatoid arthritis: anti-​TNF, anti-​IL-​6R, B-​cell depletion, and costimulation blockade. Surprisingly, the efficacy profile of each class is very similar, with only 10–​20% of established rheumatoid arthritis patients achieving optimal responses. Adverse event profiles are also largely similar. Currently, however, there are no biomarkers available that help the rheumatologist decide which biological class to try first in a particular patient. A similar situation pertains to TNF blockade, anti-p40 blockade, and integrin blockade in IBD. Because biological therapies are costly this has negative health economic implications as well as delaying the achievement of disease control, which may require cycling through more than one agent. Consequently, there has been a major investment in the identification of ‘stratifiers’ to guide therapy examining, for example, genetics, transcriptional and proteomic profiles in blood and, particularly in IBD, tissue charac- teristics. In cancer the tissue often provides a clue, such as the use of trastuzumab in HER2 positive breast cancer. In the respiratory field omalizumab (humanized anti-​IgE) is indicated in allergic asthma pa- tients who have a positive skin test or in vitro reactivity to a perennial 2.7  Biological therapies for immune, inflammatory, and allergic diseases 107 aeroallergen, and dosed according to serum IgE levels. Mepolizumab (humanized anti-​IL-​5) is indicated only for patients with refractory eosinophilic asthma. Once stratifiers are identified, new tests (com- panion diagnostics) will be developed to assist in the most appro- priate allocation of therapy. Antibody fragments and novel species Novel species include bispecific antibodies, which can now be pro- duced using various technologies, as well as smaller antibody frag- ments. Bispecific antibodies permit the simultaneous targeting of two specificities by a single molecule. In inflammatory diseases, for example, there is significant interest in the dual targeting of TNF and IL-​17. Smaller antibody fragments, while suffering from a shorter half-​life, should achieve better tissue penetration. This will be par- ticularly important not only in oncology, but potentially also in in- flammatory diseases. FURTHER READING Baker KF, Isaacs JD (2018). Novel therapies for immune-mediated in- flammatory diseases: What can we learn from their use in rheumatoid arthritis, spondyloarthritis, systemic lupus erythematosus, psoriasis, Crohn’s disease and ulcerative colitis? Ann Rheum Dis, 77, 175–87. Beniwal-​Patel A, Saha S (2014). The role of integrin antagonists in the treatment of inflammatory bowel disease. Expert Opin Biol Ther, 14, 1815–​23. Billiet T, et al. (2014). Targeting TNF-​α for the treatment of inflamma- tory bowel disease. Expert Opin Biol Ther, 14, 75–​101. Dörner T, Kay J (2015). Biosimilars in rheumatology: current perspec- tives and lessons learnt. Nat Rev Rheumatol, 11, 713–​24. Elvin JG, Couston RG, van der Walleb CF (2013). Therapeutic anti- bodies: Market considerations, disease targets and bioprocessing. Int J Pharmaceutics, 440, 83–​98. Garbers C, et al. (2018). Interleukin-6: designing specific thera- peutics for a complex cytokine. Nat Rev Drug Discov, 17(6), 395–412. Hawkes JE, et al. (2018). Discovery of the IL-23/IL-17 signaling pathway and the treatment of psoriasis. J Immunol, 201(6), 1605–13. Isaacs JD (2009). Antibody engineering to develop new antirheumatic therapies. Arthritis Res Ther, 11, 225. Keating GM (2013). Abatacept: a review of its use in the management of rheumatoid arthritis. Drugs, 73, 1095–​19. Kohler G, Milstein C (1975). Continuous cultures of fused cells se- creting antibody of predefined specificity. Nature, 256, 495–​7. Krieckaert C, Rispens T, Wolbink G (2012). Immunogenicity of bio- logical therapeutics: from assay to patient. Curr Opin Rheumatol, 24, 306–​11. Maxwell LJ, et al. (2015). TNF-​alpha inhibitors for ankylosing spon- dylitis. Cochrane Database Syst Rev, 4, CD005468. Reich K, et al. (2008). Comparative effects of biological therapies on the severity of skin symptoms and health-​related quality of life in patients with plaque-​type psoriasis: a meta-​analysis. Curr Med Res Opin, 24, 1237–​54. Schmid AS and Neri D (2019). Advances in antibody engineering for rheumatic diseases. Nat Rev Rheumatol, 15(4), 197–207. Thalayasingam N, Isaacs JD (2011). Anti-​TNF therapy. Best Pract Res Clin Rheumatol, 25, 549–​67. Tiller KE, Tessier PM (2015). Advances in antibody design. Annu Rev Biomed Eng, 17, 191–​216. Winter G, Milstein C (1991). Man-made antibodies. Nature, 349(6307), 293–9. 2.8 Traditional medicine exemplified by traditiona 2.8 Traditional medicine exemplified by traditional Chinese medicine 108 ESSENTIALS Traditional (often synonymous with complementary) medicine can be defined as the knowledge, skill, and practices, based on the theories, beliefs, and experiences indigenous to different cul- tures, which are used in the maintenance of health and the pre- vention, diagnosis, improvement, or treatment of illness. These practices are found in almost every country, and demand for them is increasing. For individual patients, when of proven quality, safety, and ef- ficacy, traditional medicine contributes to the goal of ensuring that all people have access to care that they feel they need. For medical science, investigators in traditional fields have discovered new medicines with which to combat important diseases, includ­ ing the sympathomimetic ephedrine, the antimalarial artemisinin, and arsenic trioxide used in the treatment of promyelocytic leukaemia. The World Health Organization Traditional Medicine Strategy 2014–​2023 has two key goals: to support Member States in harness­ ing the potential contribution to health, wellness, and people-​centred healthcare, and to promote the safe and effective use of traditional medicine through the proper and organized regulation of products, practices, and practitioners. Much benefit will emerge if the science of the West and the rich experience and traditions of medicine in China can work together in an integrated way. Introduction Traditional and complementary medicine are important and often underestimated aspects of healthcare. Not only are the practices found in almost every country; demand is increasing. The im- portance of the approach resides in two qualities: first, when it is of proven quality, safety, and efficacy, traditional and complemen- tary medicine contributes to the goal of ensuring that all people have access to care that they feel they need. Secondly, several in- vestigators in these traditional fields have over the years discovered new medicines with which to combat important diseases. Here we define traditional medicine and discuss several therapeutic contri- butions exemplified by traditional Chinese medicine—​including the strategic issues of disease prevention, health promotion, and affordability. What is traditional medicine? The World Health Organization (WHO) defines traditional medi- cine as follows: ‘Traditional medicine has a long history. It is the sum total of the knowledge, skill, and practices based on the theories, beliefs, and experiences indigenous to different cultures, whether explicable or not, used in the maintenance of health as well as in the prevention, diagnosis, improvement, or treatment of physical and mental illness.’ The terms ‘complementary medicine’ or ‘alternative medicine’ are used interchangeably with traditional medicine in some countries. They refer to a broad set of healthcare practices that are not part of that country’s own tradition or conventional medicine and are not fully integrated into the dominant healthcare system. Traditional Chinese medicine, Ayurveda, Unani, naturopathy, osteopathy, homeopathy, and chiropractice are frequently included in traditional medicine. In China, the earliest initiatives originated from Shennong, thousands of years ago: he examined the proper- ties and tasted many hundreds of herbs. Ayurveda, or the ‘science of life’, was developed by the seers of ancient India. Practitioners and dispensers now use Ayurveda for treatment. Unani medicine is the traditional practice of South Asian countries and its use is growing in other parts of Asia, such as Bangladesh and India. Naturopathy emphasizes prevention and the promotion of optimal health through the process of self-​healing. Homeopathy originated from the belief that a substance which causes the symptoms of a disease in healthy people would cure similar symptoms in those who are unwell. Commonly used treatments and therapeutic techniques for each of these traditional approaches are listed in Box 2.8.1. Certain popu- lations in Africa, Asia, and Latin America continue to use traditional medicine to meet their primary healthcare needs. Meanwhile, in Australia, Europe, and North America, ‘complementary and 2.8 Traditional medicine exemplified by traditional Chinese medicine Fulong Liao, Tingliang Jiang, and Youyou Tu 2.8  Traditional medicine exemplified by traditional Chinese medicine 109 alternative medicine’ is increasingly used in parallel to dominant practices of modern medicine, particularly for managing chronic diseases. In these countries, a desire for more personalized healthcare is likely to explain the increasing take up of these approaches. Since traditional Chinese medicine is long established and pro- vides some notably successful therapies, we provide examples that illustrate how traditional approaches remain so much a feature of medicine in China; in some instances, the treatment has been adopted in the parallel world of modern medicine that is dominant elsewhere. Products in the global pharmacopoeia but which originated from traditional Chinese medicine Ephedrine—​the first drug transferred to the Western pharmacopoeia from Chinese medicine Since the Han dynasty (206 BCE-200 CE), the herb Mahuang (Ephedra sinica) in traditional Chinese medicine has been known to have antiasthmatic properties. A decoction of ephedra is recorded in the classical ‘Treatise on Febrile and Miscellaneous Diseases’ by Zhang Zhongjing (150–​215 CE). Ephedrine was the first drug that origin- ated from traditional Chinese medicine to appear in the Western pharmacopoeia. The first isolation of ephedrine from Ephedra The renowned Japanese organic chemist and pharmacologist Nagai Nagayoshi who had studied in Europe, first succeeded in extracting the active principle, ephedrine, from Ephedra in 1885. He isolated the alkaloid (ephedrine hydrochloride) from the plant and deter- mined its formula and chemical structure. At that time, the pharma- cological effect of ephedrine was best known in the eye in which it causes pupillary dilatation. The agent was largely forgotten until the early 1920s when it was rediscovered by K.K. Chen and C.F. Schmidt. The discovery of ephedrine as a sympathomimetic Dr Chen’s research at Peking Union Medical College sought to understand the pharmacology of chemical components obtained Box 2.8.1  Commonly used therapies and therapeutic techniques in branches of traditional medicine globally Traditional medicine includes diverse health practices, approaches, knowledge and beliefs incorporating plant, animal and/or mineral based medicines, spiritual therapies, manual techniques and exercises, applied singularly or in combination to maintain well-being, as well as to treat, diagnose or prevent illness. Commonly used therapies and therapeutic techniques Chinese Medicine Ayurveda Unani Naturopathy Osteopathy Homeopathy Chiropractic Herbal medicines Acupuncture/acupressure Manual therapies Spiritual therapies Exercises = commonly incorporates this therapy/therapeutic technique = sometimes incorporates this therapy/therapeutic technique = incorporates therapeutic touch Reprinted from Traditional Medicine Growing Needs and Potential—​WHO Policy Perspectives on Medicines, No. 002, May 2002. Copyright © World Health Organization 2002. 110 section 2  Background to medicine from Chinese herbs and was supported by the Rockefeller founda- tion; his work on ephedrine became very well known. Since the iso- lation of ephedrine by Nagai was unknown to him, Chen set out to develop a method of extraction to obtain the active compound from Ephedra, which he isolated in only a few weeks. Later, he col- laborated with Schmidt to determine the pharmacological actions of ephedrine. The agent was found to increase myocardial contractility, induce sustained elevation of carotid arterial blood pressure, con- tract the blood vessels in skin, mucosa, and internal organs, as well as stimulate central awareness. These actions resemble those of adren- aline (epinephrine), but with long-​lasting effects and relatively low toxicity. Chen and Schmidt published their findings in the Journal of Pharmacology and Experimental Therapeutics in 1924. Soon after- wards, the Eli Lilly Company marketed ephedrine (in 1926) for the indications of nasal congestion and bronchial spasm. With the discovery of ephedrine, traditional Chinese medicine had thus borne its first commercial fruit in Western pharmacology. As a sympathomimetic agent, ephedrine is able to cause release of adrenaline and noradrenaline (epinephrine and norepinephrine) and induce relaxation of bronchial smooth muscle. Among other ef- fects, stimulation of ß-​adrenergic receptors by ephedrine appears to be responsible for inducing relaxation of bronchial smooth muscle. Ephedrine became a highly popular and effective treatment for asthma, particularly because, unlike the standard therapy of adren- aline at that time, conveniently it could be administered orally. As a treatment for asthma, ephedrine reached its zenith of popularity in the late 1950s. Artemisinin—​a gift from traditional Chinese medicine Compared with the time taken to introduce many of contemporary discoveries through drug development into widespread clinical use, the discovery of artemisinin largely followed a relatively simple and direct logical route. This route was fortuitously directed by findings that had been long-​reported in the ancient Chinese literature and in this sense, artemisinin, with its unique sesquiterpene lactone gener- ated by phytochemical evolution, is truly a gift from the old practi- tioners of Chinese medicine. In the event, the modern achievement was accomplished by the nationwide collaboration of ‘project 523’ (a national antimalarial project initiated by Chairman Mao Zedong during the war with the United States in Vietnam). This national cooperative project included institutions in Beijing, Shanghai, as well as Shandong, Yunnan, and Guangdong provinces. Each institution contributed to the successful progress of the project in different ways: these in- cluded the identification of the agricultural area where local strains of Artemisia which are rich sources of artemisinin, grew most pro- lifically; the simplified procedure for extraction of artemisinin suitable for large-​scale production; determination of the unique stereo-​structure of artemisinin; development of active artemisinin derivatives; and, not least, the facility for experimental and clinical investigation of the actions of artemisinin and its derivatives. Breakthrough in antimalarial research in the 1970s Malaria caused by Plasmodium falciparum has been a life-​threatening human disease for millennia. After a failed international attempt to eradicate malaria in the 1950s, the disease rebounded, largely due to the emergence of malarial parasites which were resistant to the existing antimalarial drugs, such as chloroquine, which had been introduced in 1947. Resistance to quinine, often the drug of last resort, had been noted in Brazil in 1907 and had been reported from the Thai–​Cambodian border in the mid-​1960s. In 1969, the Institute of Chinese Materia Medica, Academy of Traditional Chinese Medicine was engaged in the national ‘523’ antimalarial project, and the senior author of this chapter (YY Tu) was appointed head of an antimalarial research group. The group collected and selected more than 2000 Chinese herb preparations and identi- fied 640 recipes from ancient literatures that might have some antimalarial activities. More than 380 extracts were evaluated in a living model of malaria in mice infected with Plasmodium berghei, but contradictory to what was recorded in the ancient literature, no promising or reproducible result was obtained. In an effort to explain this, Tu intensively re-​examined the literature:  in A Handbook of Prescriptions for Emergencies authored by Ge Hong (Jin dynasty, 284–​346 CE), the use of Qinghao (the Chinese name of sweet wormwood, Artemisia annua) for alleviating malaria symptoms is described: ‘A handful of Qinghao is immersed in 2 litres of water; wring out the juice and drink it all’ (Fig. 2.8.1). Tu was struck by the possibility that the heating involved in a con- ventional extraction step might have destroyed the active compo- nents and that extraction at a low temperature might be necessary. First extracting by ether reflux, YY Tu and colleagues subse- quently resolved the extracts into the acidic and neutral fractions and finally obtained a non​toxic neutral extract (No. 191). On 4 October 1971, extract 191 was found to be completely effective in suppressing parasitaemia in the P. berghei-​infected mice and later, in monkeys with malaria due to Plasmodium cynomolgi. To accelerate the clinical evaluation of extract 191, Tu and colleagues voluntarily tested the safety of the extract in themselves. Shortly after this, in October 1972, a trial with 30 patients mainly from southern China infected with P. vivax and P. falciparum showed a rapid disappearance of fever and intraerythrocytic parasites. One month later, these results were formally notified to the national ‘523 project’ office, after which further collaborative research on Artemisia annua was promoted. The clear demonstration of anti- malarial efficacy in extract 191, stimulated by the ancient and ac- curate account many centuries earlier by Ge Hong, symbolizes the breakthrough which led to the later discovery and pharmaceutical development of artemisinin. The discovery of artemisinin YY Tu’s research group set out to purify the components with anti- malarial activity that were present in extract 191 of Artemisia annua. In 1972, a colourless crystalline substance was identified with a mo- lecular weight of 282 Da, a molecular formula of C15H22O5 absent ni- trogen, and a melting point of 156–​157°C. The name of ‘Qinghaosu’ (artemisinin, ‘su’ means basic element in Chinese) was given to this newly identified compound. Afterwards, the team found out that only the species of A. annua in the Artemisia genus and its leaves in the alabastrum stage contain abundant artemisinin. The chem- ical properties and behaviour of the compound were identified as a sesquiterpene lactone; finally in 1975, the stereo-​structure of arte- misinin was determined unambiguously in collaboration with col- leagues at the Institute of Biophysics, Chinese Academy of Sciences. The structure was first published in 1977 and the new molecule and the paper were immediately cited by the Chemical Abstracts (C.A. 1977, 87, 98788g). 2.8  Traditional medicine exemplified by traditional Chinese medicine 111 Worldwide attention to artemisinin The successful treatment of several thousand patients with artemi- sinin and its derivatives for malaria in China attracted worldwide attention in the 1980s. In 2002, the WHO announced a switch in the recommendations for malaria treatment to artemisinin combin- ation therapy. In 2006, to prevent malaria parasites from developing resistance to the drug, WHO requested pharmaceutical com- panies to end the marketing and sale of ‘single-​drug’ artemisinin, Artemisinin combination treatment is currently widely used, and has saved many lives globally, and especially in Africa. The remedy impressively reduces the intensity of malaria in Africa due to its activity against the gametocyte stages of P. falciparum in the blood which allow transmission from host to host via the mosquito vector. Murray et al. have examined the global mortality due to malaria over the period 1980–​2010 (see Fig. 2.8.2): since the global peak in 2004, there has been a substantial decrease in malaria deaths that is H H H D C B A 6 7 8 9 10 3 4 2 1 1 5 3 2 0 0 5 0 0 0 4 12 13 11 H CH3 H3C 15 CH3 14 Fig. 2.8.1  From ancient literature to artemisinin molecule. Life: A Handbook of Prescriptions for Emergencies by Ge Hong (Ming dynasty version, 1574 ce); Middle: ‘A handful of Qinghao immersed with 2 litres of water, wring out the juice, and drink it all’ is printed in volume 3 in the fifth column from the right. Right: the molecular structure of artemisinin. Image courtesy of Prof Liao FL. 2000000 1800000 1600000 1400000 1200000 1000000 800000 600000 400000 200000 0 1980 1985 Deaths (n) 1990 1995 Year 2000 2005 2010 ≥70 years 50–69 years 15–49 years 5–14 years <5 years Fig. 2.8.2  Malaria deaths by age from 1980 to 2010. Reprinted from The Lancet, 379(9814), Murray CJ, et al., Global malaria mortality between 1980 and 2010: a systematic analysis, 413–​31. Copyright © 2012, with permission from Elsevier. 112 section 2  Background to medicine attributable to the rapid scale-​up of control activities, including the intensified use of artemisinin combination therapy. Arsenic trioxide and the treatment of acute promyelocytic leukaemia Brief medicinal history of arsenic Arsenic is one of the oldest drugs in Western medicine and trad- itional Chinese medicine, since it was cited by Hippocrates (460–​ 370 BCE) as a treatment of skin ulcer and by the Yellow Emperor’s Internal Classic (263 BCE) for treatment of periodic fever (likely to be attributable to malaria). Arsenic exists in different forms: As4S4, known as realgar; and As2S3, known as orpiment or yellow arsenic; the common term for As2O3 (arsenic trioxide) is white arsenic. The highly poisonous nature of arsenic has long been known and ac- cording to the traditional principle of ‘combating an evil with a toxic agent’ limited amounts of arsenic have been included in traditional medical recipes; it was also recorded by the 16th century Li Shi-​Zhen in the Compendium of Meteria Medica. In the late 18th and early 19th centuries, Fowler’s solution, 1% potassium arsenite (KAsO2) solution, was introduced in clinics in Europe to treat periodic fever, chronic myelogenous leukaemia, Hodgkin’s disease, and many other conditions. Some practitioners in Europe used arsenicals as a ‘tonic’ to improve energy in those who complained of fatigue or reduced libido. However, due to its toxicity, including the development of squamous cancers of the skin, and the advent of modern chemo- therapy, the use of Fowler’s solution and other arsenicals was largely discarded in the early decades of the 20th century. The application of arsenic to acute promyelocytic leukaemia Acute promyelocytic leukaemia is one of the most lethal forms of leukaemia. In the 1970s, the combination of anthracyclines and cytosine arabinoside was adopted as the mainstream treatment for acute promyelocytic leukaemia, but the effects were often not cura- tive and adverse events with bleeding related to disseminated intra- vascular coagulation—​often initiated by chemotherapy. Enlightened by the ancient eastern wisdom, Chinese doctors never stopped their attempts to blaze a new trail. In the 1980s, inspired by the Chinese philosophy that ‘the evil can be educated to be the good’ and the concept of cancer cell differentiation from the Western literature, Dr Zhenyi Wang’s team of Shanghai Institute of Haematology first ap- plied all-​trans retinoic acid to the treatment of acute promyelocytic leukaemia and achieved a complete remission rate of 90%. More im- portantly, this provides a first example that a human cancer can be treated effectively using inducers of cell differentiation rather than cytotoxic agents. The application of arsenic to the treatment of leu- kaemia was another breakthrough initiated by Chinese clinical in- vestigators and physicians. In 1971, Dr Tingdong Zhang and colleagues affiliated to the First Hospital at Harbin Medical University, chanced upon the potential anticancer effect of arsenic trioxide from a countryside herbalist. Accordingly, Zhang and colleagues prepared a solution for par- enteral administration termed ‘Ailin’ (broadly translated from the Chinese as a cancer-​curing mixture): this contained arsenic trioxide and a trace amount of mercury chloride and was administered to patients with chronic myeloid leukaemia. The first report on the ef- fects of Ailin solution was published in 1973 in a local journal en- titled Medicine and Pharmacy of Heilongjiang. In 1976, the authors reported five patients with acute leukaemia who had had a complete remission after administration of Ailin. Later, in 1979, Zhang pub- lished a summary of 55 cases of acute leukaemia treated by Ailin solution or related remedies in which arsenic trioxide had been combined with other agents. All 55 cases were improved to some ex- tent, with a remission rate of 70%; this included 12 patients in whom complete remission occurred. Of note, few or only slight toxic side effects occurred with the small doses of arsenic used. Zhang and col- leagues clearly pointed out that the key component in the Ailin so- lution was arsenic. From 1984 to 1992, they further observed that acute promyelocytic leukaemia was particularly susceptible to ar- senic treatment. In the mid of 1990s, a research group at the Shanghai Institute of Haematology, led by Dr Zhu Chen, established a cooperation with Dr Tingdong Zhang. They then launched the first systematic study of the cellular and molecular mechanisms of arsenic trioxide and confirmed its antileukaemia effects. Grounded in their laboratory findings, Dr Chen’s team conducted a controlled clinical trial of pure arsenic trioxide in relapsed and newly diagnosed patients with acute promyelocytic leukaemia in which high (80%) rates of complete re- mission were observed in both settings. Based on the convincing re- sults of clinical and basic studies by Dr Chen’s group, in August 1999, arsenic trioxide was approved as a new drug by the China Food and Drug Administration. Mechanism of action of arsenic in acute promyelocytic leukaemia Dr Zhu Chen, Dr Saijuan Chen, and colleagues of Shanghai Institute of Haematology investigated the mode of action of arsenic trioxide. A  combination of apoptosis induction and partial differentiation appears to be the cellular mechanisms associated with induction of remission. In molecular terms, the presence of the transloca- tion, t(15;17), causing fusion of the retinoic acid receptor α gene on chromosome 17 with the promyelocytic leukaemia (PML) gene coding for a nuclear protein on chromosome 15, leads to formation of the PML-​RARα fusion protein containing sequences from the PML moiety and retinoic acid receptor α. It is notable most patients with acute promyelocytic leukaemia who harbour PML-​RARα re- spond well to the therapeutic effect of arsenic trioxide, while there is evidence that those very rare patients without the t(15;17) trans- location and therefore without expression of the PML-​RARα fusion protein, such as those with the t(11;17) translocation and resultant PLZF-​RARα, are resistant or relatively resistant to arsenic. It has been shown that arsenic trioxide binds directly to cysteine residues located within the specific RING-​B box-​coiled coil domain of PML-​RARα. The identification of the PML-​RARα fusion protein as a direct target of arsenic trioxide and at in situ concentrations that are considerably lower than those required to induce apoptosis in other cancer cells, indicates a remarkable selectivity of the molecular target. It is worth noting that the wild-​type PML protein is a tumour suppressor in- volved in growth arrest and regulation of apoptosis and senescence. In healthy cells, PML is located in a specific nuclear subdomain (the PML nuclear body). However, in promyelocytic leukaemia, complex formation of PML-​RARα with wild-​type PML breaks up the nuclear microspeckle pattern and the functions of the protein. Arsenic tri- oxide at therapeutic dosage corresponding to in situ concentrations as low as 10-​7 M induces degradation of both the wild-​type PML 2.8  Traditional medicine exemplified by traditional Chinese medicine 113 and PML-​RARα fusion proteins with posttranslational modifica- tion by sumoylation, which appears to restore accumulation of the wild-​type protein in the formerly leukaemic cells. The sumoylation of the fusion protein is then followed by its degradation through the proteasome-​dependent pathway (Fig. 2.8.3). At the level of cells and the whole organism, arsenic trioxide causes partial differentiation of acute promyelocytic leukaemia cells as well as their apoptosis; not- ably it has little effect on other cancer cells under similar conditions. A  striking additional finding is that only the disease-​associated PML-​RARα, but not the wild-​type RARα receptor is degraded upon exposure to the drug. Given the striking effects of arsenic trioxide in a leukaemia that is otherwise very challenging to cure, clinicians worldwide have rapidly adopted the original findings. In the first instance, the agent found favour with clinicians who explored use of parenteral ar- senic in patients who had proved resistant to or had relapsed after receiving recommended first-​line drugs including the targeted mo- lecular therapy of all-​trans retinoic acid which, by also targeting the PML-​RARα oncoprotein, can lead to terminal differentiation of malignant promyelocytes to mature neutrophils. However, it is apparent that this treatment alone does not eradicate the ma- lignant clone. Cytotoxic chemotherapy may lead to remission but through the mechanistic studies of the group of Shanghai Institute of Haematology it is now recognized that achievement of a cure in the great majority of acute promyelocytic leukaemia patients requires the addition of arsenic trioxide. The treatment of acute promyelocytic leukaemia is complex, and involves management and prevention of several potentially life-​ threatening complications such as the promyelocytic leukaemia cell differentiation syndrome, occurring in a small subset of patients within the first three weeks of treatment with all-​trans retinoic acid or arsenic trioxide. However, the past three decades have witnessed a great progress in the treatment of the disease largely as a result of original and modern studies related to the traditional use of medi- cinal arsenic in China. Based on the observations that both arsenic trioxide and all-​trans retinoic acid bind and trigger proteasomal degradation of PML/​ RARα fusion protein but through distinct mechanisms, a syner- gistic targeting strategy has been developed by Dr Zhu Chen and Dr Saijuan Chen’s group for newly diagnosed patients with acute promyelocytic leukaemia. This involves a combination of all-​trans retinoic acid and arsenic trioxide, and incorporates chemotherapy in the postremission treatment. This regimen not only reduces the early deaths caused by bleeding or differentiation syndrome, but more importantly yields up to 90% five-​year disease-​free survival in acute promyelocytic leukaemia. Recent reports from the same group have confirmed the safety of the all-​trans retinoic acid/​arsenic tri- oxide combination therapy, since no obvious long-​term toxicities NB4 cells 0 hr 0.5 hr 6 hr 24 hr PML (a) (b) (d) (c) PML-RARα RARα 0 0 10 20 30 60 0 10 20 30 60 0 10 20 30 60 As2 O3(hr) As2 O3(min) 121 77 181 121 77 181 121 77 181 121 77 kDa kDa kDa 48 77 48 77 Anti-FLAG Anti-FLAG Anti-FLAG Mock Arsenic (ng/mg) 0 5 10 15 20 25 PML P S PML-RARα > RARα > 1 6 12 24 < < < < < Fig. 2.8.3  The effect of arsenic trioxide (ATO) on the posttranslational modification and degradation of PML-​RARα and PML is rapid and specific. (a and b) ATO-​induced degradation of PML-​RARα in the APL cell line NB4 cells, as assessed by Western blotting with RARα antibody and immunofluorescence staining with PML antibody (green). The cells were treated with 1 mM ATO. Scale bar, 10 mm. (c) A time-​course study showing the effects of ATO on PML, PML-​RARα, or RARα. (<) points to parental proteins, (▼) to modified proteins, and (▽​) to degraded fragments. Transfected HEK 293T cells were treated with 2 mM ATO and then lysed in radioimmunoprecipitation assay (RIPA) buffer and fractionated into supernatants (S) and pellets (P) by centrifugation. (d) The arsenic content of pellets from HEK 293T cells that had been mock-​transfected or transfected with a vector encoding PML. Reproduced from Chen Z, Chen SJ, et al. Arsenic trioxide controls the fate of the PML-​RARalpha oncoprotein by directly binding PML. Science, 2010, 328:240–​3. 114 section 2  Background to medicine were observed among patients in whom treatment is stopped. Based on the risk factors for prognosis in acute promyelocytic leukaemia, optimization of the treatment emphasizes the role of arsenic in in- duction, consolidation and maintenance therapy as a substitute to chemotherapy in low-​ and intermediate-​risk patients, and in potential reduction of chemotherapy in the patients with a high-​ risk of complications without adverse impact on the outcome. See Chapter 22.3.3 for further discussion. Evolution of pharmaceutical use and commercial development of medicinal arsenic The all-​trans retinoic acid/​arsenic trioxide/​chemotherapy triad therapy achieves a cure rate of more than 90% and costs less than CNY 100 000 in Mainland, China. Since the new round of health reforms launched in 2009, both all-​trans retinoic acid and ar- senic trioxide have been listed in the China’s National Essential Drug List; moreover, the costs of treatment are covered univer- sally by the health insurance. An oral mixture called Realgar-​ Natural Indigo Formula containing realgar, natural indigo, Salvia miltiorrhiza, and Radix pseudostellariae, which was developed by Dr Shilin Huang of the 210 Military Hospital in the 1980s, can achieve a 5-​year survival rate of 85% by targeting PML-​RARα and orchestrating related signal networks. This formula is also effective and affordable. In the recent years, researchers of Hong Kong University, led by Dr Wing-​Yan Au and Dr Yok-​Lam Kwong, have promoted use of a convenient oral formulation to replace intravenous arsenic in Hong Kong and to avoid exorbitant charges: 150 patients with acute promyelocytic leukaemia have been treated with this alternative, convenient, and affordable first-​line therapy. Introduction to compound formulae and disease prevention in traditional Chinese medicine The role of compound formulae The finding of ephedrine as a sympathomimetic, the discovery of the antimalarial, artemisinin and the exploration of arsenic trioxide for treating acute promyelocytic leukaemia have strictly followed the modern pharmaceutical approach. All the three discoveries were based on a single herbal formula and the application of con- temporary molecular science in translational research to develop modern and effective medicines for important conditions. However, the investigation and use of a single herbal principle for the treatment of a specific disease is the rare case in the use of traditional Chinese medicine today. The reality is that popular treat- ments are characterized by the holistic syndrome differentiation for a patient; this leads to the prescription of a herbal formula composed of a group of well-​tailored herbs that are related by the practitioner to the syndrome for which treatment is requested. The relationship between individual syndromes and the herbal formulae used, while largely symptom-​based and antithetic to the teaching of Western medicine, are established as part of the rich experience in the ad- vancement of traditional Chinese medicine that has evolved over thousands of years. Currently, marketed products of traditional Chinese medicines are mostly patented compound formulae; these account for more than 20% of the whole pharmaceutical market in China. In the 2015 edition of the Chinese pharmacopoeia, 1493 for- mulae and single herb preparations are recorded (of which formulae constitute the majority); 618 medicinal materials and processed herbal slices are the exception. Herbal formulae for treatment of cardiovascular diseases Cardiovascular diseases are common life-​threatening conditions and naturally come to the frequent attention of practitioners and doctors. The term of ‘blood’ in Chinese medicine has a meaning that corresponds closely with that in Western medicine (i.e. the flowing or circulating blood). The syndromes associated with stasis of blood also occur frequently as part of syndrome differentiation, and are linked closely with vascular diseases. The principle of activating or improving the circulation of blood is a therapeutic concern in many aspects of medicine and is an important aspect of traditional Chinese medicine. Since the 1970s, Keji Chen and colleagues have conducted nu- merous clinical trials by applying herbal medicine in efforts to stimu- late the circulation of blood as part of the treatment of coronary heart disease, including anginal attacks and restenosis after percutaneous coronary intervention. In collaboration with physicians in working in 16 hospitals in Beijing, Chen and this clinical research team in- vestigated the use of a herbal compound formula simply termed ‘coronary heart II’ to treat chronic stable anginal pectoris. The for- mula consists of five herbs with properties considered to stimulate or improve the circulation of blood, including Salvia miltiorrhiza and Ligusticum wallichii. The randomized double–​blind controlled clinical trial showed that the compound formula could reduce the frequency of anginal attacks; at the same time laboratory studies indicated that the effect might in part be attributed to antiplatelet activity with increased fibrinolysis. The trial supported clinical use of the formula preparation in the treatment of chronic stable an- gina pectoris compared with the placebo. Tetramethylpyrazine, an antiplatelet drug used in China, was detected in extracts from Ligusticum wallichii contained in the aforementioned formula preparation. In the past 30 years, active treatment for acute myocardial in- farction and coronary syndromes has saved many lives. However, coronary artery restenosis after percutaneous coronary interven- tion continues to present a dilemma. To investigate frequently encountered complications such as plaque rupture and the con- tribution of inflammatory factors to obstructive complications of atheromatous coronary heart disease, Chen and colleagues took a holistic view of the ischaemic complications of blood stasis in the coronary circulation and selected XueFu ZhuYu Tang, a well-​ known Chinese compound formula developed by Wang Qingren (1768–​1831 CE) for various syndromes attributed to blood stasis. Pursuing evidence from pharmacological studies, they used Ligusticum phenols and Paeonia glycosides extracted from the for- mula in a preparation for a multicentre clinical trial carried out in 335 patients with potential coronary arterial restenosis after per- cutaneous coronary intervention. Based on a routine treatment of aspirin and ticlopidine, XS0601 capsule (extracted Ligusticum phenols and Paeonia glycosides) showed significant benefit for preventing restenosis compared with placebo. The restenosis rate 2.8  Traditional medicine exemplified by traditional Chinese medicine 115 was significantly reduced in the XS0601 group as compared with that of the placebo (26.0% compared with 47.2%, p <0.05; see Fig. 2.8.4). At up to 1 year of follow-​up, the Kaplan–​Meier survival curve for freedom from clinical end-​point events showed a sig- nificant difference between XS0601 and the placebo; in addition, follow-​up angiographic findings support the salutary findings in relation to survival outcome. To request approval for compound formulae from American Food and Drug Administration (FDA) is a new effort of narrowing the gap between traditional Chinese medicine and mainstream Western medical practice. The patent medicine, Compound Danshen Dripping Pill in the Chinese pharmacopoeia, is a typ- ical example. The pill, containing extracts from Salvia miltiorrhiza and Panax notoginseng, is mainly used to treat angina and cor- onary heart diseases. So far it is the first Chinese patent traditional medicine to have completed the FDA’s clinical phase II trials and it is encouraging that this medication remains under evaluation in rigorous clinical phase III trials. Also, the drug in tablet form has been approved by drug watchdogs in Canada, Russia, and some Asian/​African countries. The manufacturer’s data shows that more than millions of people worldwide take this medication annually. Disease prevention, health promotion, and traditional Chinese medicine Disease prevention and health promotion are strategic issues in medicine. ‘Preventing disease is far superior to treating a dis- order’ is also one of the sacred principles underlying traditional Chinese medicine. A drawing of physical and breathing exer- cises on silk was unearthed from a Han Tomb at Mawangdui, central China in the 1970s. The finding provided early archaeo- logical evidence of exercise employed in healthcare in China some 2000 years ago (Fig. 2.8.5). For disease intervention and healthcare, biomechano-​pharmacology, a new borderline dis- cipline, pursues the combinatorial effects of traditional medicine with biomechanical factors that are designed to improve blood flow. Many studies have revealed that endothelial release of ni- tric oxide, endothelin-​1, prostacyclin (PGI2), von Willebrand factor, tissue plasminogen activator, intercellular cell adhesion molecule-​1, vascular cell adhesion molecule-​1, endothelial cell growth factor, and inflammatory factors are all regulated by blood shear stress. Therefore, vessel tension, thrombosis, thrombolysis, cell adhesion, angiogenesis, atherosclerosis can be argued on first principles to be influenced by blood shear stress and hence circulatory flow. The argument in Chinese trad- itional medicine is made that cardiovascular and cerebrovas- cular diseases may benefit from interventions that influence the biological responses of endothelium regulated by biomechanical factors. In most cases, sedentary life style reduces blood flow and cardiac output and with this the level of blood shear stress. As a modern interpretation of blood stasis syndrome, decreased blood shear stress is a common pattern of the syndrome. The an- cestors of modern Chinese utilized exercise to prevent this syn- drome and promote health. The great surgeon, Hua Tuo, before 205 CE, not only created five animal-​mimic boxing as a phys- ical fitness exercise, but also realized ‘Diseases are prevented as blood flow is promoted’. This appears today to be a really wise and prescient foresight of the biological effects of blood flow. Pharmacological experiments have recently shown that the joint application of exercise combined with administration of ex- tracts from Chinese medicine may prevent atherosclerosis. For example, the experimental combination of a Yindan Xinnaotong capsule and swimming in experimental rats may prevent athero- sclerosis through a synergistic effect between the capsule and 50 40 26.1 Per patient restenosis Per lesion restenosis New lesion Post-stenting Pre-stenting Follow-up Stent-dilating Restenosis rate (%) 47.2 23.6 39.2 12.3 13.9 Control XS0601 30 20 10 0 Fig. 2.8.4  Restenosis comparison between XS0601 and the controls. Left: Restenosis rates per patient and per lesion and percent of new lesion. XS0601 significantly lowered both per patient and per lesion restenosis as compared with the placebo group (p <0.05). As for new lesions, there was no significant difference between the two groups. Right: Angiographic evidences for a patient in the XS0601 group in the stages of prestenting, stent-​dilating, poststenting, and follow-​up. Image courtesy of Prof Chen KJ. 116 section 2  Background to medicine swimming in improving blood circulation, rheological param- eters in the blood, concentration of proatherogenic lipopro- teins, and the vascular endothelium. Vascular remodelling may contribute to the prevention of atherosclerosis by upregulating smooth muscle protein responses—​certainly, this whole field continues to be an active field of contemporary research in Chinese medicine. Making traditional medicine available and affordable Large swathes of the populations in the world’s poorest coun- tries are those most in need of inexpensive, effective treatments. The WHO estimates that one-​third of the global population still lacks regular access to essential drugs, and that in the poorest parts of Africa and Asia, this figure rises. In these regions, some form of traditional medicine is often a more widely available and more affordable source of healthcare. Clearly to provide a sound basis for efforts to promote traditional medicine and to prevent illness and wasted resources, safe and effective therapies must be identified, The focus should be on safe and effective treatments for diseases which represent the greatest burden for poor popu- lations, in this context for malaria and HIV/​AIDS are of great importance but the value of a safe supply of potable water and simple hygiene also must go hand in hand with treatment for all infectious diseases. Many Member States of WHO have made great efforts to advance traditional medicine, some of which can be attributed to the imple- mentation of the WHO Traditional Medicine Strategy 2002–​2005. In recent years, WHO has collaborated with many countries and areas to develop these programmes and to bring traditional medi- cine into the mainstream healthcare system. WHO has developed technical guidelines and standards and organized trainings/​work- shops in support of Member States. Fig. 2.8.6 shows the progress of Member States regarding established national policies on traditional medicine and national regulations on herbal medicines. The newly established WHO Traditional Medicine Strategy 2014–​2023 has two key goals:  to support Member States in harnessing the potential contribution to health, wellness, and people-​centred healthcare and to promote the safe and effective use of traditional medicine through the proper and organized regulation of products, practices, and practitioners. We very much hope that the implementation of this strategy will accelerate the availability and affordability of traditional medicine in the near future. As stated by Mao Zedong, Chinese medicine and pharma- cology are a great treasure-​house and we should explore them and raise them to a higher level. Fig. 2.8.5  Drawing of physical and breathing exercise on silk unearthed from a Han Tomb at Mawangdui, central China. It is an early archaeological evidence of exercise employed in healthcare in China some two thousand years ago. 2.8  Traditional medicine exemplified by traditional Chinese medicine 117 So much of benefit for humankind will emerge if the science of the West and the rich experience and traditions of medicine in China can work together in an integrated way. Acknowledgements The authors wish to record their gratitude to Professors Chen (Saijuan) and Chen (Zhu) for kindly reviewing this chapter. FURTHER READING Au WY, et al. (2011). Oral arsenic trioxide-​based maintenance regi- mens for first complete remission of acute promyelocytic leukemia: a 10-​year follow-​up study. Blood, 118(25), 6535–​43. Chen GQ, et al. (1997). Use of arsenic trioxide (As2O3) in the treat- ment of acute promyelocytic leukemia (APL): I. As2O3 exerts dose-​ dependent dual effects on APL cells. Blood, 89(9), 3345–​53. Chen KJ, et al. (2006). XS0601 reduces the incidence of restenosis: a prospective study of 335 patients undergoing percutaneous cor- onary intervention in China. Chin Med J, 119(1), 6–​13. Chen KK, Schmidt CF (1924). The action of ephedrine, the active principle of the Chinese drug Ma Huang. J Pharmacol Exp Ther, 24, 339–​57. Hu J, et al. (2009). Long-​term efficacy and safety of all-​trans ret- inoic acid/​arsenic trioxide-​based therapy in newly diagnosed acute promyelocytic leukemia. Proc Natl Acad Sci U S A, 106(9), 3342–​7. Huang ME, et al. (1988). Use of all-​trans retinoic acid in the treatment of acute promyelocytic leukemia. Blood, 72(2), 567–​72. Klayman DL (1985). Qinghaosu (artemisinin): an antimalarial drug from China. Science, 228, 1049–​55. Li GQ, et al. (1984). Randomised comparative study of mefloquine, qinghaosu, and pyrimethamine-​sulfadoxine in patients with falcip- arum malaria. Lancet, 2(8416), 1360–​1. Liao FL, et al. (2006). Biomechanopharmacology: a new borderline discipline. Trends Pharmacol Sci, 27, 287–​9. Lo-​Coco F, et al. (2013). Retinoic acid and arsenic trioxide for acute promyelocytic leukemia. N Engl J Med, 369(2), 111–​21. Luo J, et al. (2015). Compound danshen (salvia miltiorrhiza) dripping pill for coronary heart disease: an overview of systematic reviews. Am J Chin Med, 43(1), 25–​43. Murray CJ, et al. (2012). Global malaria mortality between 1980 and 2010: a systematic analysis, Lancet, 379(9814), 413–​31. Shen ZX, et al. (1997). Use of arsenic trioxide (As2O3) in the treat- ment of acute promyelocytic leukemia (APL): II. Clinical efficacy and pharmacokinetics in relapsed patients. Blood, 89(9), 3354–​60. Sun HD, et al. (1992). Ai-​Lin I treated 32 cases of acute promyelocytic leu- kemia. Chin J Integrat of Chinese and Western Medicine, 12, 170–​1. Tu YY (2011). The discovery of artemisinin (qinghaosu) and gifts from Chinese medicine. Nat Med, 17(10), 1217–​20. Tu YY (2015). Discovery of Artemisinin—​a gift from traditional Chinese medicine to the world. Karolinska Institutet, Nobel Lecture, The Nobel Foundation, Stockholm. Wang JG, et al. (2018). What has traditional Chinese medicine de- livered for modern medicine?, Expert Rev Mol Med, 20, e4. WHO (2013). WHO Traditional Medicine Strategy 2014–​23. https://​ www.who.int/​medicines/​publications/​traditional/​trm_​strategy14_​ 23/​en/​ Zhang TD, Rong FX (1979). Treatment of acute granulocytic leu- kemia by Ailin No.1 and dialectic theory. Medicine and Pharmacy of Heilongjiang, (4), 7–​11. Zhang XW, et  al. (2010). Arsenic trioxide controls the fate of the PML-​RARalpha oncoprotein by directly binding PML. Science, 328(5975), 240–​3. 120 100 80 60 40 20 0 1999a 25 65 39 82 45 92 48 69 110 119 2003b 2005c 2007d 2012e,f Number of MS with TM policy Number of MS regulating herbal medicines Fig. 2.8.6  The national policies on traditional medicine and national regulations on herbal medicines in WHO member states. Reprinted from WHO Traditional Medicine Strategy 2014–​2023, page 21. Copyright © World Health Organization 2013. 2.9 Engaging patients in therapeutic development 1 2.9 Engaging patients in therapeutic development 118 ESSENTIALS Engagement of patients during the development of new treatments has evolved rapidly in the last decade. No longer should patients be viewed as passive subjects of research: they should be active partici- pants in a more open and ambitious ecosystem of academic investi- gators, companies, and government entities. Enhanced engagement of patients throughout the development process provides important benefits for the introduction of useful and effective drugs by chan- ging the understanding of the disease, the measurement of the dis- ease during treatment, and the interpretation of benefit-​risk. This system is critical for the discovery, study, development, regulation, delivery, and use of innovative therapies. Introduction Increasing involvement of patients and their advocacy organizations in therapeutic development provides an opportunity to innovate experimental medicine and render challenging or invasive clinical trials possible. The recent excitement in the field of ‘rare diseases’ has a long history—​but since the accelerated introduction of novel treatments as a result of the introduction of the Orphan Drug Act (first in the United States, in 1983, later followed in Europe in 2001), participation of patients in the whole process of therapeutic devel- opment has burgeoned at many levels. There can be little doubt that this trend of practice substantially improves the development of drugs and promotes a healthy social contract between all parties that can accelerate achievement of one common goal: introduction of ef- fective and safe treatments. For academic investigators, clinicians, companies, and government entities, the goal is common; but the processes undertaken by, and investment of each party is strikingly different. As to the involvement of patient organizations, this may include funding creative breakthrough research at a university that might otherwise not be able to provide adequate support for the neces- sary preclinical or clinical studies. Patient groups have contributed materially to the development of new ways to measure or score the manifestations of a given disease, as well as its response to potential treatments—​not least by helping to recruit patients for exploratory clinical research. More recently, patient groups have also partici- pated in the collection of data on the benefits and risks for treatments relative to their disease, as perceived by the individual experien- cing that condition (the ‘patient perspective’). Finally, many patient groups assist regulatory authorities, such as the European Medicinal Agency (EMA), directly and extensively. This involvement relates to policy changes that are needed to allow early access to therapy or other policies that have significance for patients. Input from patient organizations has broadened the perspective of the regulatory agen- cies as they evaluate new treatments for approval: this involvement is relatively new but provides insights beyond those offered by pro- fessionals in academia and industry—​moreover it helps to maintain the focus of the approval process on those most critically affected by a given condition. Enhanced engagement of patients throughout the development process provides important benefits for the development of useful and effective drugs by changing the understanding of the disease; the measurement of the disease during treatment, and the interpret- ation of benefit-​risk. The understanding of disease is deepened as a result of detailed disease surveys or studies into the natural course of the condition in the absence of specific treatment, the course of the condition treated or in the context of standard care; here again such work is often supported or organized by patient groups. By these means, those who develop drugs, as well as investigative phys- icians gain a sharper and more quantitative understanding of the true burden of illness in a given disease. It is surprising how often these more intensive and systematic studies cast aside historical in- accuracies and mistaken assumptions created decades before as a result of received clinical wisdom. In the era before treatments were introduced, much knowledge was based on available authorities who had, in reality either seen only one subset of a disease and were unaware of its true diversity: lacking the ability to advance clinical care, little experience of the condition gained through long-​term clinical monitoring and follow-​up. To study how a treatment affects a disease, studying the disease at a deep and quantitative level is critical for determining how ex- perimental therapies can have an impact on the clinical course. Such detailed knowledge allows investigators to devise more ef- fective instruments of measurement, and at times radically different 2.9 Engaging patients in therapeutic development Emil Kakkis and Max Bronstein 2.9  Engaging patients in therapeutic development 119 evaluations and determinants of the progression of a disease, by which treatments can be assessed authentically. Many patients now participate in ‘no-​drug’ studies:  here, testing methodologies are studied carefully to develop the best measures by which cap- ture the particularities of a given disease. Lately, one of the more compelling effects of patient engagement has been revealed in the assessment of benefit and risk of treatments. For some devastating disorders, the patient and their family’s view of relative risk and benefit can be starkly different from that of physicians or regulators. The quantitative collective of the risks for a disease relative to the additional effects of treatment, allows investigators and regulators to scrutinize the merits of a treatment while accounting for the risks in a devastating disease at the same time. Orphan medicinal products In recent years, the development of therapies for rare or ultra-​rare diseases, often referred to as ‘orphan drugs’ or medicines is receiving increasing attention. This is a welcome sea change for patients and their respective advocacy organizations, although it is estimated of the 7000 rare diseases, only 5% have marketed therapies available to patients. There are a wide variety of challenges and opportunities that are unique to rare disease therapy development, but economics may be the most pronounced stumbling block. That is, financing the testing and successful development for a small patient population may be difficult to statistically demonstrate (as a consequence of small sample size) and cost prohibitive for investors and others who seek to earn a return on investment. Nonetheless, breakthroughs in genetic testing, gene therapy, enzyme replacement therapy have al- ready demonstrated substantial clinical benefit and potential for pa- tients who previously had little hope of access to effective medicines. While the science around rare diseases has progressed rapidly, in some cases, it has outpaced the ability of regulatory agencies to effectively monitor these developments and account for them in ap- proval decisions. In general, both the Food and Drug Administra­ tion (FDA) and EMA are largely on par for approval of orphan therapies (see Fig. 2.9.1), but the EMA is seen as often providing additional flexibility on the types of data packages submitted to sup- port approval and marketing of an innovative therapy. For example, several orphan therapies have been approved earlier for marketing in Europe, but have been declined or delayed in the United States usually because of issues surrounding the phase 3 studies. In com- plex settings, the EMA has relied on their scientific and medical in- sights into the total package of data to allow a patient population with high unmet medical need access to the drug whereas in the United States, a missed primary endpoint in a phase 3 study has gen- erally not been approved. The contemporary clinical development process While in the minds of many scientists and most employees in com- panies, the clinical trial is the principal focus of their endeavour, the truth remains that clinical trials are indeed another form of experi- ment, with effectiveness and safety still incompletely understood in broader human populations. All the clinical phases of trial pro- grammes are part of a sequence that is in effect an uncertain journey to the ultimate set of goals involving regulatory approval of a safe and effective therapy; and one that is both useful and reimbursed. Before this stage is even contemplated, even the road to initiating an early clinical trial can be long, arduous, and very costly. Identifying a suitable drug target can take years of research and development, with a notably high failure rate. Most early research ideas never make it to clinical trials, and those that do, still have a small possi- bility of reaching an approval. The regulatory process is designed to test safety and efficacy into various stages, and over time these tests do help provide some basic level confidence that a drug is safe and 50 45 40 35 30 25 20 15 10 5 0 2010 2011 2012 2013 Year Approval of new molecular entities (NMEs) 2010–2015 Number of NME approvals 2014 2015 Orphan EMA approvals Orphan FDA approvals All EMA approvals All FDA approvals Fig. 2.9.1  Approval of new molecular entities. 120 section 2  Background to medicine works. No doubt also, however rigorous the regulatory system, it can also fail by approving a drug for a population for which the risks were not well defined, or by failing to approve a drug that might ac- tually benefit some patients, but not those originally studied. Nonetheless, once a suitable small molecule or biological drug candidate has been identified, typically employing in vitro and/​or in vivo models of disease, then extensive pharmacological studies, including toxicology are undertaken. This is to verify the reasonable likelihood of safety of the compound before the clinical trial pro- gramme can be planned in any detail. Once completed, and submitted for scrutiny to regulatory au- thorities, then the clinical trial process can move forward in that country for which regulatory approval for clinical trials has oc- curred. In recent years, patient groups have become well-​organized and highly engaged with the early research process: some, directly fund preclinical research projects through grant programmes or other charitable mechanisms. In the intense world of rare diseases, families of patients have been known to go so far as to form their own biotechnology company in order to recruit the funding re- quired to treat their family member (see, for example, The Cure by Geeta Anand). Clinical trials are typically divided into several phases—​each encompassing a growing cohort of patients. At each stage, there are rich opportunities for patients and their organizations to engage. Phase 1 of clinical development This phase marks the first opportunity for a novel therapy to be administered to patients. The primary objective at this stage is to determine the safety profile of the compound. During phase 1 clin- ical research, to determine how dosing impacts the safety profile, varying doses of the compound or biological agent are administered to small cohorts of patients. Throughout this process, patients are closely monitored for adverse events and researchers will observe how the compound is metabolized in the body. In addition, some clinical trials may also experiment with different routes of admin- istration for the compound (i.e. oral, infusion, sublingual, and so on). It is important to note that if the phase 1 study reveals high or unacceptable toxicity in patients, the trial may be halted and future development may be completely scrapped. Phase 2 of clinical development Once a safety profile has been established using clinical data, the trial can proceed to the next stage in order to explore the clinical effectiveness of the therapy —​‘efficacy’. During this period, clinical researchers are obliged to select various endpoints or health metrics that will unambiguously determine the potential benefit of the pu- tative treatment on patients for whom the licensed product is envis- aged. Clearly the selection of what regulatory authorities will agree to as informative clinical trial endpoints vary widely and depend entirely on the disease in question. Some more traditional measures are: overall survival time, severity of symptoms, and quality of life. Alternatively, if known, clinical researchers may attempt to use a biomarker or other alternative measure, which is often referred to as a surrogate endpoint. In some quarters, surrogate endpoints are becoming increasingly favoured by those who develop innovative medicines; this is because they can provide more precise measures of disease progression or remission and often can be rapidly de- termined in accessible biological fluids using facile modern assays. Sequential measurement of biomarkers, in turn, can expedite and facilitate the clinical trial process, thereby reducing the burden of assessment of traditional endpoints. However, the predictive value of biomarkers for clinical outcomes is frequently questioned, and a high level of proof is often demanded for their acceptance in clinical trial endpoints as opposed to time-​honoured monitoring in clin- ical practice. Improved scientific methods are thus needed to select biomarkers for disease assessment from an often-​overwhelming array of potential biomolecules whose abundance is known to be altered as a subsequence or consequence of the pathogenesis of a given condition. Phase 3 of clinical development The final and most extensive stage of the evaluation of putative new therapies in clinical trials seeks to elucidate whether the experi- mental therapy is safe and effective as intended for use. Phase 3 trials are typically completed in a randomized controlled design in which patients are randomly assigned to receive the experimental therapy or a placebo or in some situations, an established, active therapy. These studies are typically double-​blind, that is both patients and doctors or sponsor are unaware as to which treatment is being ad- ministered. Once the trial is completed, the sponsors will assemble a large package of data to be submitted to the regulatory authority for review. Regulatory review in Europe can take usually about one year: in this time, the authorities are charged to make a determin- ation about whether the benefits of a new therapy outweigh the risks, for if so, the novel therapy can be approved for marketing. Phase 4/​Postmarket surveillance Once a new therapy has been approved for marketing and distribu- tion, sponsors and public health authorities will continue to monitor patients for adverse events. Postmarketing studies may continue for years and new data may emerge that requires regulators to recon- sider the benefit-​risk determination (Fig. 2.9.2). Expanded access/​named patient treatment/​Compassionate use Some patients facing acute, life-​threatening diseases, may seek out experimental treatment outside of the normal clinical trial process for unapproved therapies. This increasingly common scenario is referred to as ‘compassionate use’ or ‘expanded access’ or named patient treatment. In many cases, the impetus to obtain access to experimental treatments is patient-​driven and typically deployed as a final or ‘last-​ditch’ therapeutic measure. The increasing complexity of the regulatory process and with it, the prolonged development period for many drugs, has had consequential effects on patients with deteriorating health. Delays in the development process can cause an emerging a crisis in patients suffering from the effects of severe and progressive diseases, and has led to calls for earlier access. This state of affairs applies particularly to drugs for patients with life-​threatening conditions where the risk of adverse and unwanted effects becomes less of an ethical concern. Advocacy groups sup- porting patients in such a position are exploring various options to enable patients to obtain access to an experimental drug (ostensibly with appropriate clinical surveillance and a practical framework for its administration). The difficulty with this worthy principle is that 2.9  Engaging patients in therapeutic development 121 the clinical development process alongside regulatory approval is in- tended to protect patients from unnecessary risks. In the United States, the number of requests for expanded ac- cess to experimental therapies, a process regulated by the Food and Drug Administration, increased from 1014 in 2010 to 1873 in 2014. This burgeoning, and international trend poses several chal- lenges for patients, the biopharmaceutical industry, and regulatory bodies. The complexities of these interdependent relationships are particularly exposed in instances when patients, their families and advocacy groups deploy large-​scale social media campaigns to pressurize companies into providing early access to experimental therapies that are still in development and under regulatory scru- tiny (see Box 2.9.1). Patient groups partnering with industry to fund drug development Cystic fibrosis is an inherited genetic disorder caused by a muta- tion in the cystic fibrosis transmembrane regulator gene, CFTR; the disease causes progressive damage to the respiratory and digestive systems (see Chapter 18.10). One of the leading patient groups is the Cystic Fibrosis Foundation (CFF), which began funding spe- cific drug development efforts in the late 1990s with investments in a biotechnology company called Aurora Biosciences. In 2001, Aurora was acquired by Vertex Pharmaceuticals and received continued in- vestments from CFF that would prove vital in sustaining the risky cystic fibrosis research programme. This programme would go on to develop a novel pharmacological chaperone therapy, Kalydeco, which was approved for marketing in the United States in 2012, and was the first approved drug designed to treat the cause of cystic fi- brosis as opposed to symptoms. The Foundation will receive a one-​ time pay-​out of $3.3 billion (US) in exchange for the rights to the royalties from the drug. This scenario represents a novel paradigm for funding drug development for a rare disease. CFF is utilizing those revenues to continue investing in further cystic fibrosis re- search and treatment. Patient engagement in drug development: Disease burden, disease measurement, and benefit-​risk Unravelling the true burden of disease requires close input and collaboration with patients. This is especially true for diseases PHASE I TENS HUNDREDS NUMBER OF VOLUNTEERS THOUSANDS NDA/BLA SUBMITTED APPROVAL PHASE II PHASE III PHASE IV Regulatory approval POST-APPROVAL RESEARCH & MONITORING CLINICAL TRIALS REVIEW BASIC RESEARCH DRUG DISCOVERY PRE- CLINICAL POTENTIAL NEW MEDICINES IND SUBMITTED Fig. 2.9.2  The contemporary clinical development process. Box 2.9.1  Case study: Josh Hardy One recent and widely publicized case involved Josh Hardy, who had experienced severe health complications stemming from a rare kidney cancer. After a bone marrow transplant, Josh was infected by an adenovirus that, if untreated, would probably result in death. However, the patient’s responsible physician at St. Jude Children’s Research Hospital recognized that an experimental therapy, brincindofovir had the potential to treat the infection. Chimerix Inc., the company that was developing the treatment, refused to provide access, citing concerns about cost, workload, and sustain- ability of such requests: ‘The company says helping him will slow efforts to get it on the market ...’, as reported by CNN. Chimerix argued that as a small 54-​person company in Durham, North Carolina, diverting their resources to this one case could inhibit their broader effort to get the drug approved as quickly as possible. What followed was a social media firestorm, with thousands of emails, Facebook, and Twitter messages being directed to the com- pany Chief Executive Officer, Dr Ken Moch, and other employees imploring them to ‘Save Josh.’ The company also reported receiving more extreme messages, including death threats. Ultimately, the company relented and with the engagement of the Food and Drug Administration created a new clinical trial to enrol Josh as the first patient rather than provide the treatment through the expanded access pathway. Josh responded well to the treatment and did well for 2 years but later declined and died due to other causes in 2016. 122 section 2  Background to medicine that may not be well-​established in the medical literature, as many assumptions or clinical dogma may turn out to be unfounded or tenuous after further investigation. Disease burden is particularly poorly understood in rare diseases in part due to the limited op- portunity for surveys conducted by investigators, and the failure to invest in tools for measuring disease. Patient engagement in rare diseases is now almost the rule as patient groups fund and develop surveys of disease or natural history studies. The Cystic Fibrosis (CFF) Foundation is one example of an organization that established a network of centres of excellence, developed common methods for measuring and tracking cystic fibrosis (CF), and pub- lished information on the disease that helped all investigators or sponsors considering development treatments. This work has suc- cessfully supported efforts to develop new treatments for CF. Many other rare disease groups are replicating this effort to some degree, and developing data on their disease in the precompetitive space so that diverse therapeutic avenues, predicated on deep phenotyping and extensive clinical knowledge, can be explored. The increasing use of computerized international disease registries has had a useful effect in this field. Quantifying disease severity Disease measurement is one of the more complex areas of de- velopment; and patient groups alone are not usually able to pro- vide all the information concerning clinical measurement or patient-​reported outcomes. Here an important synergy can de- velop, based on consultation and collaboration with experts who are able to develop quantitative instruments. These activities are expensive and often there is a need for industrial sponsor- ship to facilitate completion of such enabling work. Challenges in obtaining access to these data as public resources has led to more efforts to assure that all such natural history data sets do not end up as proprietary information of corporate sponsors, but are available for anyone working on a new treatment or develop- ment of tools. Determination of the true benefit-​risk assessment for any treat- ment requires comprehensive understanding of the risks of un- mitigated disease and a deep appreciation of the trade-​off between one disease risk and another risk—​that which emanates or might emanate from the treatment. In the past, industrial sponsors and regulatory authorities spoke for the patients: it is now clear that there this tradition is an unwarranted presumption that lacks any depth of understanding about the experience of a chronic severe disease or indeed what patients, often supported by their families and the larger disease-​family are willing to do to contribute to the management of their disease and future therapies. A recent example of patient engagement has been the work of a Duchenne muscular dystrophy patients’ group in the United States, called Parent Project Muscular Dystrophy (PPMD). Duchenne muscular dystrophy (DMD) is a genetic disease that leads to severe loss of muscle bulk and function. Males with the disease typically live into their late twenties. In 2014, PPMD, convened a variety of stakeholders from throughout the Duchenne community and de- veloped a comprehensive survey tool to rigorously capture the key aspects of benefit:risk. One critical finding was that most parents would support a treatment that stabilizes their child’s strength for one more year at the cost of one less year of life. That is a trade-​off that would not normally be expected and, in most instances, runs not only counter to the usual traditions of professional ethics and decision-​making, but also provides an unexpected foil to the con- ventional view of what patients really want. After the outcomes of this survey were disseminated, a broad co- alition was assembled, including parents of DMD patients, DMD patients themselves, academic researchers, and physicians, as well as representatives from the biopharmaceutical industry to discuss the implications for the development of treatments. The outcome of this collaboration was a guidance document that explores key areas for drug development including:  benefit-​risk assessments, diagnostic tools, natural history studies, clinical trial designs, and outcome measures including the use of biomarkers. Ultimately, this document was submitted to the Food and Drug Administration for consideration and was incorporated in part as an official guidance; it provides a practical roadmap for the biopharmaceutical industry as it seeks to pioneer useful new treatments. Conclusions Successful treatment of patients is at the heart and goal of medi- cine but the increasing complexity of treatments and under- standing of diseases has led to a beneficial societal metamorphosis which involves patients at all levels of therapeutic development. Including patients throughout the process of drug development enhances understanding of pathogenesis, measurement, and the personal impact of disease; genuine participation focuses attention on the authentic therapeutic targets of the disease. In summary, increasing involvement of patients and cognate organizations and advocacy groups broadly in drug development, has driven important changes to the regulatory process and almost always expedites achievement of the common goal to deliver safe and salutary new therapies. FURTHER READING Clinical trials U.S. Food and Drug Administration. The FDA’s Drug Review Process: Ensuring Drugs Are Safe and Effective. http://​www.fda.gov/​drugs/​ resourcesforyou/​consumers/​ucm143534.htm Patient engagement FasterCures (2011). Back to basics: HIV/​AIDS advocacy as a model for catalyzing change. FasterCures and HCM Strategists (14 June 2011). Kim E, Lo AW (2019). Venture Philanthropy: A Case Study of the Cystic Fibrosis Foundation (April 23, 2019). https://ssrn.com/ abstract=3376673 National Health Council (NHC) and Genetic Alliance (2015). Advancing Meaningful Patient Engagement in Research, Development and Drug Review. (22 September 2015). https://​www.nationalhealthcouncil.org/​ sites/​default/​files/​PatientEngagement-​WhitePaper.pdf Parent Project Muscular Dystrophy, Franson TR, Peay H (eds). Benefit-​Risk Assessments in Rare Disorders, the Case for Therapeutic Development in Duchenne Muscular Dystrophy as the Prototype for New Approaches. http://​www.parentprojectmd.org/​site/​DocServer/​ 2.9  Engaging patients in therapeutic development 123 br_​paper_​v11_​_​2_​.pdf;jsessionid=2C381495CB3753608053FD8D D624B686.app247d?docID=14503 Biomarkers as primary endpoints Institute of Medicine (2010). Evaluation of Biomarkers and Surrogate Endpoints in Chronic Disease. Micheel CM, Ball JR (eds) http://​www. nap.edu/​catalog/​12869/​evaluation-​of-​biomarkers-​and-​surrogate-​ endpoints-​in-​chronic-​disease Kakkis E, et  al. (2015). Recommendations for the development of rare disease drugs using the accelerated approval pathway and for qualifying biomarkers as primary endpoints. Orphanet J Rare Dis, 10, 16. Compassionate use/​expanded access European Medicines Agency. Compassionate Use. http://​www.ema. europa.eu/​ema/​index.jsp?curl=pages/​regulation/​general/​general_​ content_​000293.jsp&mid=WC0b01ac058007e691 U.S. Food and Drug Administration. Expanded Access:  Information for Patients. https://​www.fda.gov/​NewsEvents/​PublicHealthFocus/​ ExpandedAccessCompassionateUse/​default.htm