How do scientists talk about race? For quite some time a small group of geneticists have been engaged in deep conversations about how best to convey the complexities of, and the relationship between race, DNA, and human variation to the general public. We come from different backgrounds—Ewan is the director of the European Bioinformatics Institute, Adam is a geneticist and science writer, Aylwyn is a human evolutionary geneticist, and I am an anthropological geneticist—and nationalities, but are united in our agreement that patterns of human genetic variation do not support the biological division of people into races.
Over the course of a year, we worked together on a statement that best reflects our consensus view of human genetic variation, race, and even the fraught topic of race and IQ. We wanted to correct the misconceptions that many people have about these topics, and directly confront a number of untrue ideas promoted by a small group of pseudo-scientists who refer to themselves as “race realists” or proponents of “human biological diversity” (HBD).
The result is a (rather lengthy) statement which Ewan has posted in its entirety here. I want to summarize its main points here with excerpts, but I encourage everyone to go read the whole thing. We intend for this statement to contribute to the ongoing conversation between scientists, social scientists, scholars in the humanities, the media, and the public.
(Also please note that I kept the original British spellings in excerpts that I quoted from the statement).
The biological race concept emerges from a particular history
Research in the 20th century found that the crude categorisations used colloquially (black, white, East Asian etc.) were not reflected in actual patterns of genetic variation, meaning that differences and similarities in DNA between people did not perfectly match the traditional racial terms. The conclusion drawn from this observation is that race is therefore a socially constructed system, where we effectively agree on these terms, rather than their existing as essential or objective biological categories.
Describing race as a social construct does not undermine its existence, nor its importance; it merely points out that there is no fundamental biological basis for race.
Human population structure is not race
Some people claim that the exquisitely detailed picture of human variation that we can now obtain by sequencing whole genomes contradicts this. Recent studies, they argue, actually show that the older notions of races as biological categories (some dating back to the 18th century) were basically correct in the first place. As evidence for this they often point to the images produced by analyses in studies that seem to show natural clustering of humans into broadly continental groups based on their DNA. But these claims misinterpret and misrepresent the methods and results of this type of research. Populations do show both genetic and physical differences, but the analyses that are cited as evidence for the concept of race as a biological category actually undermine it.
Geneticists use a variety of tools to visualise the subtle and complex patterns of genetic variation between people, and to mathematically cluster them together based on relatedness. Such methods are helpful for exploring data, but have also been the source of wider confusion. For example, Principal Component Analysis (PCA) plots often show distinct, colourful clusters of dots that appear to separate groups of people from different parts of the world. In some cases, these clusters even seem to correspond to traditional racial groupings (e.g. ‘Africans’, ‘Europeans’ and ‘Asians’). It is images such as these which are often deployed as genetic evidence for the existence of separate races. But these methods can be misleading in ways which non-experts – and even some specialists – are unaware of. For example, some of the observed genetic clustering is a reflection of the samples that were included in the study and how they were collected, rather than any inherent genetic structure. DNA sample collection typically follows existing cultural, anthropological or political groupings. If samples are collected based on pre-defined groupings, it’s entirely unsurprising that the analyses of these samples will return results that identify such groupings. This does not tell us that such taxonomies are inherent in human biology.
Traits, IQ, and genetic diversity
‘Human biodiversity’ proponents sometimes assert that alleged differences in the mean value of IQ when measured in different populations – such as the claim that IQ in some sub-Saharan African countries is measurably lower than in European countries – are caused by genetic variation, and thus are inherent. The purported genetic differences involved are usually attributed to recent natural selection and adaptation to different environments or conditions. Often there are associated stories about the causes of this selection, for example that early humans outside Africa faced a more challenging struggle for survival, or that via historical persecution and restriction of professional endeavours, Ashkenazi Jews harbour genes selected for intellectual and financial success.
Such tales, and the claims about the genetic basis for population differences, are not scientifically supported. In reality for most traits, including IQ, it is not only unclear that genetic variation explains differences between populations, it is also unlikely. To understand why requires a bit of background.
(Most genome-wide association studies for detecting variants associated with complex traits such as IQ, known as GWAS) have been carried out in populations sampled from across Europe, and have ancestries consistent with this sampling. In many cases though, only certain subsets of people are included in these analyses – for good scientific reasons. For example, samples of “European” populations used in genetic studies often have excluded up to as many as 30% of self-identified Europeans. This is because some individuals introduce hard-to-model complications into the data, forming distinct sub-clusters or complicating the genetic model. For example, Finns and Sardinians are often excluded as they have quite distinct genetic ancestries compared to many other Europeans, as are some people in India, north Africa, Latino/Hispanics, and many individuals with complex ancestries, despite confident self-identification within their ethnic group. Researchers therefore often exclude them from the set of people used in a particular GWAS analyses, on the basis that their unique population histories can invalidate the statistical models used in these techniques.
This, in turn, can confuse people who read the studies and observe distinct and seemingly ‘natural’ population clusters emerge. If they aren’t familiar with the practice of removing these individuals with more complex ancestries (or don’t read the detailed methods, which are often tucked away in elusive supplementary sections of a published paper), they could easily be misled into thinking that the populations in these analyses are much more distinct than they are in reality. The resulting biases are poorly understood, and the terminology involved can be confusing to non-specialists. Furthermore, while it is clear to GWAS researchers that the results of their analyses tend to be specific to the population studied and their predictions cannot be reliably extended to other populations with very different ancestry, this is not widely recognised or understood by non-specialists.
IQ scores are heritable: that is, within populations, genetic variation is related to variation in the trait. But a fundamental truism about heritability is that it tells us nothing about differences between groups. Even analyses that have tried to calculate the proportion of the difference between people in different countries for a much more straightforward trait (height) have faced scientific criticisms. Simply put, nobody has yet developed techniques that can bypass the genetic clustering and removal of people that do not fit the statistical model mentioned above, while simultaneously taking into account all the differences in language, income, nutrition, education, environment, and culture that may themselves be the cause of differences in any trait observed between different groups. This applies to any trait you could care to look at – height, specific behaviours, disease susceptibility, intelligence.
Not only that, the genetic knowledge we gain from studying our mainly-European pools of participants becomes highly unreliable when it is applied to those with different ancestries. Although it is a common trope to argue that we will have the answer to the question of the genetic basis of group differences in traits “in the next five years”, or “in the next decade”, the advances in genomics reveal that the question is far more complex than we could have imagined, even just a few years ago. Consequently, anyone who tells you that there’s good evidence on how much genetics explain group differences (rather than individual differences) is fooling you – or fooling themselves.
However, there are some strong hints towards the answer. The genetic variants that are most strongly associated with IQ in Europeans are no more population-specific than any other trait. To put it bluntly, the same genetic variants associated with purportedly higher IQ in Europeans are also present in Africans, and have not emerged, or been obviously selected for, in recent evolutionary history outside Africa. Moreover, since it is a complex trait, the genetic variation related to IQ is broadly distributed across the genome, rather than being clustered around a few spots, as is the nature of the variation responsible for skin pigmentation. These very different patterns for these two traits mean that the genes responsible for determining skin pigmentation cannot be meaningfully associated with the genes currently known to be linked to IQ. These observations alone rule out some of the cruder racial narratives about the genetics of intelligence: it is virtually inconceivable that the primary determinant of racial categories – that is skin colour – is strongly associated with the genetic architecture that relates to intelligence.
Finally, multiple lines of evidence indicate that there are complex environmental effects (as might reasonably be expected) on measures of IQ and educational attainment. Many socioeconomic and cultural factors are entangled with ancestry in the countries where these studies are often performed – particularly in the USA, where structural racism has historically and continues to hugely contribute to economic and social disparities. We cannot use populations in these countries to help answer the question of why IQ scores are claimed to be lower in other countries with entirely different social, economic, and cultural histories, nor to answer the role of genetics for alleged differences in IQ measures between groups inside a country with strong societal differences linked to ancestry (for example, the USA). Thus, confident assertions that current GWAS show us that ‘race’ is associated with cognitive function are simply wrong. It is our contention that any apparent population differences in IQ scores are more easily explained by cultural and environmental factors than they are by genetics.
The history of our species is complex and convoluted, and our genomes reflect that. As we delve deeper into the DNA of the people of the world, the science of genetics becomes even more complex too. But we see no scientifically sound evidence that contemporary genetics can be used to recapitulate biological or historical concepts for race. It is our duty and wish that this understanding is spread far and wide.
European Molecular Biology Laboratory, European Bioinformatics Institute
Department of Anthropology, University of Kansas.
Genetics, Evolution & Environment, University College London
Department of Genetics, University of Cambridge
The authors wish to thank Stuart Ritchie for his valuable contributions to our discussion.
**Note: comments for this post are not enabled**