INTRODUCTION TO POPULATION GENETICS (2024)

INTRODUCTION TO POPULATION GENETICS

In this and the next few lectures we will be dealingwith population genetics which generally views evolution as changesin the genetic makeup of populations. This is a somewhat reductionist approach:if we could understand the combined action of the forces that change genefrequencies in populations, and then let this run over many generationswe might understand long term trends in evolution. Continuing debate: canthe processes of microevolution account for the patterns of macroevolution?Population genetics is an elegant set of mathematical models developedby largely by R. A. Fisher and J. B. S. Haldane in England and Sewall Wrightin the US. Continues to be developed by many mathematical, theoreticaland experimental biologists today (see J. Crow and M. Kimura Introductionto Population Genetics Theory).

In very simple terms, population genetics involvesanalyses of the interactions between predictable, "deterministic"evolutionary forces and unpredictable, random, "stochastic" forces.The deterministic forces are often referred to as "linear pressures"because they tend to push allele frequencies in one direction (up, downor towards the middle). Important forces of this nature are selection,mutation, gene flow, meiotic drive (unequal transmission of certainalleles [a form of selection]), nonrandom mating (also a form ofselection). The primary stochastic evolutionary force is genetic driftwhich is due to the random sampling of individuals (and genes) in smallpopulations. It is important to realize that the deterministic forces mayact together or against one another (e.g., selection may "try"to eliminate an allele that is pushed into the population by recurrentmutation). Moreover, deterministic forces may act with or against geneticdrift, to determine the frequencies of alleles and genotypes in populations(e.g., gene flow tends to hom*ogenize different populations while drifttends to make them different). Hence, the interaction of these forces iswhat we are really interested in (a later lecture), but since this canget very complex mathematically, we will start by analyzing one force ata time.

To begin we need to understand some simple populationgenetic "bookkeeping." Consider a locus with two alleles(alternative forms of the DNA sequence that "reside" at thatlocus, e.g., one from mother other from father). Now consider a populationof N individuals (N=population size); this means that there are2N alleles in the population. We can thus talk about genotypefrequencies and allele frequencies. In a population of N = 100individuals, if there are 25 AA, 50 Aa and 25 aa, then the genotype frequenciesare f(AA) = 0.25, f(Aa) = 0.50 and f(aa) = 0.25. If we count up the individualalleles there are 200 of them (because there are 100 diploid individuals).Hence to determine the frequency of the "A" allele we have tocount each individual "A" allele that is specified in each diploidgenotype. We get f(A) = (25+25+50) / 200 = 0.5. We generally refer to thefrequency of the "A" allele as f(A) = p; the frequency of the"a" allele is f(a) = q. Note that p = (1-q) because the sum ofthe allele frequencies must be 1.0. Common "language errors"in learning population genetics are to refer to the "p" allelewhen you really mean the "A" allele, or to say "the frequencyof the p allele" when you really mean: "...p, the frequency ofthe "A" allele..." Got it?? Good.

Since evolution is change in the genetic makeup ofa population over time, a general approach to modeling this is to determinethe allele and genotype frequencies in the next generation (p_t+1)that result from the action of a force on those frequencies in the currentgeneration (p_t). Thus :

p_t -> evolution happens-> p_t+1

Consider a simplistic life cycle where thegenotypes (a single locus way of referring to adults) produce gametes.These gametes mate to form new genotypes (=adults). See 5.1, pg.93 and 5.3, pg. 99. The relationship between allele frequencies (sometimescalled "gene" frequencies) and genotype frequencies is determinedby the Hardy Weinberg Theorem which defines the probabilities bywhich gametes will join to produce genotypes. Consider a coin toss: probabilityof a head = 0.5; of a tail = 0.5; prob. of two heads = 0.5x0.5 = 0.25;prob. of one head and one tail = 0.5x0.5 = 0.25, etc. Each coin is analogousto the type of allele you can get from one of your diploid parents; thetossing of two coins is analogous to the mating of two individuals to producefour possible genotypes (but heads,tails is the same as tails,heads). Nowconsider a roll of the dice. The probability of each face is 1/6, and isactually analogous to cases where more than two different alleles existin the population at a given locus. The probability of any combinationis 1/6 x 1/6 = 1/36. But recall that there can be more than one way toget many of the combinations (2,3 is the same as 3,2). The general expressionfor the number of genotypes that can be assembled from n different allelesis: [n(n+1)/2].

Assumptions of Hardy Weinberg: 1) diploidsexual population 2) infinite size, 3) random mating, 4)no selection, migration or mutation. This is a Null Model; obviouslysome of these assumptions will not hold in real biological situations.The theorem is useful for comparison to real-world situations where deviationsfrom expectation may point to the action of certain evolutionary forces(e.g., mutation selection, genetic drift, nonrandom mating, etc.). Usea Punnet square to determine genotype frequencies: f(AA) = p²,f(Aa) = 2pq, f(aa) = q² and p²+ 2pq + q² = 1 Learn this: One generation of randommating restores Hardy Weinberg equilibrium. H-W equilibriumis when the genotype frequencies are in the proportions expected basedon the allele frequencies as determined by the relation p² +2pq + q². This is derived more thoroughly in table 5.1, andaccompanying text, pg. 94.

Example: consider a sample of 100 individuals withthe following genotype frequencies:

	Observed Genotype Frequencies	Allele count	Allele frequency	Expected genotytpe frequencies under H-W
BB	0.71	142 B	p = 156/200 = 0.78	p² = (.78)² = 0.61
Bb	0.14	14 B, 14 b		2pq = 2(.78)(.22) = 0.34
bb	0.15	30 b	q = 44/200 = 0.22	q² = (.22)² = 0.05

Observed are different from expected, thus some forcemust be at work to change frequencies.

Substitute the fitnesses (w_ii) in conditionI above into the expression Dp= p_t+1 - p_tand prove for yourself that the equationson page 101 (eqn. 5.5) is related to the expression for p_t+1shown above. First three are directional in that selection stops only whenallele is eliminated. In I the elimination process slows down because asq becomes small the a alleles are usually in heterozygote state and thereis no phenotypic variance. In II selection is slow at first because withq small most genotypes are AA so there is low phenotypic variance; as selectioneliminates A alleles q increases and the frequency of the favored genotype(aa) increases so selection accelerates. III is like the worked examplerun to fixation/loss. IV is known as balancing selection due tooverdominance (heterozygotes are "more" than either hom*ozygote).Both alleles maintained in population by selection. This is an exampleof a polymorphic equilibrium (fixation/loss is also an equilibriumcondition but it is not polymorphic). The frequencies of the alleles atequilibrium will be:

p_equil = t/(s + t); q_equil= s/(s+t).

Classic example = sickle cell anemia. A=normalallele; S=sickle allele. S should be eliminated because sickle cell anemialowers fitness. S is maintained where malarial agent (Plasmodium falciparum)exists because AS heterozygotes are resistant to malaria. Note that S alleleis very low frequency where there is no malaria (the selective coefficientof S is different because the environment is different). See figure5.8, pg. 120; table 5.9, pg. 119.

Another way that genetic variation can be maintainedis through multiple niche polymorphism (polymorphism maintainedby environmental heterogeneity in selection coefficients). If differentgenotypes are favored in different niches, patches or habitats, both allelescan be maintained.

	AA	Aa	aa
habitat 1	1.0	0.8	0.5
habitat 2	0.5	0.8	1.0

Heterozygotes will have the highest averagefitness although they are not the most fit in either habitat (see figure5.12, pg. 124). The same dynamics would apply to temporal heterogeneity(spring and fall; winter and summer) assuming that selection did not eliminateone allele during the first period of selection. Classic example of temporalheterogeneity: third chromosome inversions of Drosophila pseudoobscurastudied by T. Dobzhansky. Different chromosomal arrangements ("Standard"and "Chiricahua") show reciprocal frequency changes during theyear.

Yet another way to maintain variation by selectionis through frequency dependent selection.

If an allele's fitness is not constant but increasesas it gets rare this will drive the allele back to higher frequency. Seefigure 5.9, pg. 121. Example: allele may give a new or distinct phenotypethat predators ignore because they search for food using a "searchimage" (e.g., I like the green ones).

Most (by no means all) evolutionary biologists believethat selection plays a major role in shaping organic diversity, but itis often difficult to "see" selection. One reason is that selectioncoefficients can be quite small (1-s ~1) so the response to selection issmall. When selection coefficients are large Dpcan be large, but the problem here is that with directional selection fixationis reached in a few generations and we still can't "see" selectionunless we are lucky enough to catch a population in the middle of the periodof rapid change.

What affects the rate of change under selection?Recall that Dp = p_t+1- p_t

Dp = [(w_AAp2 + w_Aa pq)/(w_AA p2 + w_Aa 2pq + w_aaq2)] - p . With some simple algebra we can rearrange this

equation to: Dp= (pq[p(w_AA - w_Aa) + q(w_Aa - w_aa)])/(w_AAp2 + w_Aa 2pq + w_aa q2)

Note that Dpwill be proportional to the value of pq. This value (pq) will be largestwhen p=q=0.5 or, in English, when the variance in allele frequencyis greatest. This is a simplified version of the main point of the fundamentaltheorem of natural selection modestly presented by R. A. Fisher.

It states that the rate of evolution is proportionalto the genetic variance of the population. In the above example wehave not explicitly defined the fitnesses wiis or the dominance relationshipsand these can have a major effect on Dpas written above.

Another important observation for looking at thisDp equation and pluggingin some values is that selection always increases the mean fitness ofthe population. For example with p=0.4, q=0.6 and w_AA=1,w_Aa=0.8 and w_aa=0.6, the mean fitness (w'bar') =0.76. After one generation of selection p' = 0.463 and q' = 0.537. Recalculatingw'bar' we get wbar_t+1 = 0.78, which is greater than 0.76. Whenwill this process stop? At fixation (or equilibrium with overdominance).

This treatment of the algebra of natural selectionillustrates what selection alone can do to allele and genotype frequencies.In the next lectures we will consider other evolutionary forces (mutationgene flow, genetic drift), how they act alone, and eventually, how theyinteract with each of the other evolutionary forces.

FAQs

INTRODUCTION TO POPULATION GENETICS? ›

Population genetics is concerned with genetic differences within and across populations, and the dynamics of how populations evolve as a result of the propagation of genetic mutations occurring within the germlines of individuals.

What is the introduction to the concepts of population genetics? ›

Introduction

Population genetics is defined as the sub-area of biology that studies the distribution and change in frequency of alleles. The population genetics is also the basis of evolution, and it has been established as a science; its main founders were JBS Haldane, Sir Ronald Fisher, and Sewall Wright.

Read On ›

What is population genetics in genetics? ›

Population genetics is the study of genetic variation within and among populations and the evolutionary factors that explain this variation. Its foundation is the Hardy - Weinberg law, which is maintained as long as population size is large, mating is at random, and mutation, selection and migration are negligible.

View Details ›

What is the genetic definition of a population? ›

In population genetic theory, the concept of a population is often defined as a group with a common gene pool from which individuals choose mates with whom they reproduce (King et al., 2014), often assumed not to experience immigration or emigration (e.g., in some models, a population is characterized by a set of ...

What is the main cause of genetic variation in a population? ›

Sources of Genetic Variation

Gene duplication, mutation, or other processes can produce new genes and alleles and increase genetic variation. New genetic variation can be created within generations in a population, so a population with rapid reproduction rates will probably have high genetic variation.

See Details ›

What are mutations in population genetics? ›

Mutations are changes to an organism's DNA that create diversity within a population by introducing new alleles. Some mutations are harmful and are quickly eliminated from the population by natural selection; harmful mutations prevent organisms from reaching sexual maturity and reproducing.

Genotypes	Zygote	-----> ----->	Adult	-----> ----->	Gametes
AA	p²		l_AA p²		m_AA l_AA p²
Aa	2pq		l_Aa 2pq		m_Aa l_Aa 2pq
aa	q²		l_aa q²		m_aa l_aa q²

	AA	Aa	aa
I	1	1	1 - s	selection against recessive
II	1 - s	1 - s	1	selection against dominant
III	1	1 - hs	1 - s	incomplete dominance (0<h<1)
IV	1 - s	1	1 - t	selection for heterozygotes

INTRODUCTION TO POPULATION GENETICS (2024)

FAQs

INTRODUCTION TO POPULATION GENETICS? ›

What is the main cause of genetic variation in a population? ›

References