GenomicsSNP Deep Dives

Lactose Intolerance Genetics: The rs4988235 SNP Explained

Learn how rs4988235 in the MCM6 gene controls lactase persistence, why 65% of humans are lactose intolerant, and the remarkable convergent evolution of dairy tolerance.

Ryan Bethencourt
April 9, 2026
9 min read

The Majority of Humans Are Lactose Intolerant

Roughly 65–70% of the global adult population cannot efficiently digest lactose, the sugar in milk. This is not a disease or a deficiency — it is the biological default. All mammals naturally downregulate the enzyme lactase after weaning. What requires explanation is not lactose intolerance, but lactose tolerance: why some populations evolved to keep drinking milk into adulthood.

Note
Evolutionary Perspective: Lactase persistence is one of the strongest signals of recent positive natural selection ever detected in the human genome. The T allele at rs4988235 rose from near-zero to over 80% frequency in Northern Europeans in roughly 7,500 years.

The Genetics: MCM6 and LCT

The LCT gene on chromosome 2 encodes the lactase enzyme. But the key regulatory variant is not in LCT itself — it sits 13,910 base pairs upstream, in intron 13 of the neighboring MCM6 gene. This SNP, rs4988235 (also written as LCT-13910 C>T), falls within an enhancer element that controls LCT transcription.

Genotype and Phenotype

  • TT (homozygous derived): Full lactase persistence. Can digest milk comfortably throughout life. Common in Northern Europeans (up to 90% in Scandinavia).
  • CT (heterozygous): Partial to full lactase persistence. Most CT individuals can tolerate moderate amounts of dairy without symptoms.
  • CC (homozygous ancestral): Lactase non-persistence. Lactase production declines after childhood, typically by age 5–12. Predominant genotype in East Asian, West African, Native American, and most Southern European populations.

Convergent Evolution: The Same Problem, Solved Twice

One of the most fascinating aspects of lactase persistence is that it evolved independently in multiple populations. European lactase persistence traces to the C>T change at rs4988235. But in East African pastoralist groups — the Tutsi, Maasai, and other cattle-herding peoples — completely different SNPs in the same MCM6 enhancer region confer lactase persistence: rs41380347 (G>C) and rs41525747 (C>G).

This is textbook convergent evolution. Two geographically separated populations, both with strong dairying traditions, independently evolved the same phenotype through different genetic mechanisms. The selective pressure — nutritional advantage from milk in pastoral societies — was so strong that evolution found a way twice.

Global Distribution

Lactase persistence frequencies vary dramatically by geography and ancestry:

  • Northern Europe: 80–95% lactase persistent (highest in Scandinavia, Ireland, Britain)
  • Southern Europe: 40–60% (Mediterranean populations have intermediate frequencies)
  • Middle East: 20–50% (higher in pastoralist groups, lower in agricultural societies)
  • East Africa: 30–80% (high in pastoralist Tutsi and Maasai, low in agricultural Bantu groups)
  • West Africa: 5–20%
  • East Asia: 1–5% (extremely low, consistent with historically low dairy consumption)
  • Native American: 0–5%
Tip
Check rs4988235 with the free SNP Lookup tool to see your predicted lactase persistence status and population frequency data.

The Dairy Farming Timeline

Archaeological evidence places the domestication of cattle in the Near East around 10,000 years ago, with dairying practices emerging by roughly 7,500 years ago in Central Europe. Ancient DNA studies show that early European farmers did not carry the rs4988235 T allele — lactase persistence only became common after centuries of selection. The Linearbandkeramik farmers of 5500 BCE were drinking milk but likely processing it into cheese and yogurt (which contain less lactose) before the persistence allele became widespread.

Why Symptoms Vary

Even among CC homozygotes, the severity of lactose intolerance varies substantially. Several factors beyond genotype influence symptoms:

  • Gut microbiome: Colonic bacteria that ferment lactose produce varying amounts of gas depending on microbial composition. Regular dairy exposure can shift the microbiome toward more efficient lactose fermentation with fewer symptoms.
  • Dose: Most lactose-intolerant individuals can tolerate 12–15 grams of lactose (one cup of milk) without significant symptoms, especially when consumed with other foods.
  • Dairy type: Aged cheeses and yogurt contain much less lactose than milk due to bacterial fermentation during production.
  • Transit time: Faster intestinal transit gives less time for lactase to work, worsening symptoms regardless of genotype.

Clinical and Ancestral Context

Understanding the genetics of lactase persistence matters for personalized nutrition and avoids the common misconception that lactose intolerance is abnormal. For individuals of East Asian, West African, or Native American descent, lactose intolerance is the expected phenotype, not a disorder. Dietary guidelines that emphasize dairy as a primary calcium source have been criticized for not accounting for this genetic variation.

Frequently Asked Questions

What causes lactose intolerance genetically?

Lactose intolerance is the ancestral default in humans. After weaning, most people gradually stop producing lactase, the enzyme that digests milk sugar. Lactase persistence (the ability to digest milk as an adult) is caused by regulatory variants upstream of the LCT gene, most notably rs4988235 in the MCM6 gene. The T allele at this position keeps the LCT gene active into adulthood.

What does rs4988235 C/T mean?

At rs4988235, the C allele is ancestral and associated with lactase non-persistence (lactose intolerance in adulthood). The T allele is a derived mutation that maintains lactase production. TT and CT genotypes typically retain the ability to digest lactose, while CC individuals usually become lactose intolerant after childhood.

Why is lactose tolerance more common in Europeans?

Lactase persistence became advantageous when Northern European populations domesticated cattle roughly 7,500 years ago. Individuals who could digest milk year-round had a significant nutritional and survival advantage, driving strong positive selection for the T allele at rs4988235. This is one of the strongest known examples of recent natural selection in humans.

Do all lactose-tolerant populations carry the same variant?

No. This is a remarkable example of convergent evolution. European lactase persistence is driven by rs4988235 (C>T), but East African pastoralist populations independently evolved different lactase persistence variants: rs41380347 (G>C) and rs41525747 (C>G) in the same MCM6 enhancer region. Different mutations, same outcome, same selective pressure from dairying.

Can I check my lactose tolerance genotype?

Yes. If you have raw genotyping data from 23andMe or AncestryDNA, you can look up rs4988235 using SciRouter's free SNP Lookup tool. A TT or CT result suggests you likely produce lactase as an adult, while CC suggests you may be lactose intolerant. Keep in mind that the actual degree of symptoms also depends on gut microbiome composition and dietary habits.

Try this yourself

500 free credits. No credit card required.