INTRODUCTION TO POPULATION GENETICS (2024)

INTRODUCTION TO POPULATION GENETICS

In this and the next few lectures we will be dealingwith population genetics which generally views evolution as changesin the genetic makeup of populations. This is a somewhat reductionist approach:if we could understand the combined action of the forces that change genefrequencies in populations, and then let this run over many generationswe might understand long term trends in evolution. Continuing debate: canthe processes of microevolution account for the patterns of macroevolution?Population genetics is an elegant set of mathematical models developedby largely by R. A. Fisher and J. B. S. Haldane in England and Sewall Wrightin the US. Continues to be developed by many mathematical, theoreticaland experimental biologists today (see J. Crow and M. Kimura Introductionto Population Genetics Theory).

In very simple terms, population genetics involvesanalyses of the interactions between predictable, "deterministic"evolutionary forces and unpredictable, random, "stochastic" forces.The deterministic forces are often referred to as "linear pressures"because they tend to push allele frequencies in one direction (up, downor towards the middle). Important forces of this nature are selection,mutation, gene flow, meiotic drive (unequal transmission of certainalleles [a form of selection]), nonrandom mating (also a form ofselection). The primary stochastic evolutionary force is genetic driftwhich is due to the random sampling of individuals (and genes) in smallpopulations. It is important to realize that the deterministic forces mayact together or against one another (e.g., selection may "try"to eliminate an allele that is pushed into the population by recurrentmutation). Moreover, deterministic forces may act with or against geneticdrift, to determine the frequencies of alleles and genotypes in populations(e.g., gene flow tends to hom*ogenize different populations while drifttends to make them different). Hence, the interaction of these forces iswhat we are really interested in (a later lecture), but since this canget very complex mathematically, we will start by analyzing one force ata time.

To begin we need to understand some simple populationgenetic "bookkeeping." Consider a locus with two alleles(alternative forms of the DNA sequence that "reside" at thatlocus, e.g., one from mother other from father). Now consider a populationof N individuals (N=population size); this means that there are2N alleles in the population. We can thus talk about genotypefrequencies and allele frequencies. In a population of N = 100individuals, if there are 25 AA, 50 Aa and 25 aa, then the genotype frequenciesare f(AA) = 0.25, f(Aa) = 0.50 and f(aa) = 0.25. If we count up the individualalleles there are 200 of them (because there are 100 diploid individuals).Hence to determine the frequency of the "A" allele we have tocount each individual "A" allele that is specified in each diploidgenotype. We get f(A) = (25+25+50) / 200 = 0.5. We generally refer to thefrequency of the "A" allele as f(A) = p; the frequency of the"a" allele is f(a) = q. Note that p = (1-q) because the sum ofthe allele frequencies must be 1.0. Common "language errors"in learning population genetics are to refer to the "p" allelewhen you really mean the "A" allele, or to say "the frequencyof the p allele" when you really mean: "...p, the frequency ofthe "A" allele..." Got it?? Good.

Since evolution is change in the genetic makeup ofa population over time, a general approach to modeling this is to determinethe allele and genotype frequencies in the next generation (pt+1)that result from the action of a force on those frequencies in the currentgeneration (pt). Thus :

pt -> evolution happens-> pt+1

Consider a simplistic life cycle where thegenotypes (a single locus way of referring to adults) produce gametes.These gametes mate to form new genotypes (=adults). See 5.1, pg.93 and 5.3, pg. 99. The relationship between allele frequencies (sometimescalled "gene" frequencies) and genotype frequencies is determinedby the Hardy Weinberg Theorem which defines the probabilities bywhich gametes will join to produce genotypes. Consider a coin toss: probabilityof a head = 0.5; of a tail = 0.5; prob. of two heads = 0.5x0.5 = 0.25;prob. of one head and one tail = 0.5x0.5 = 0.25, etc. Each coin is analogousto the type of allele you can get from one of your diploid parents; thetossing of two coins is analogous to the mating of two individuals to producefour possible genotypes (but heads,tails is the same as tails,heads). Nowconsider a roll of the dice. The probability of each face is 1/6, and isactually analogous to cases where more than two different alleles existin the population at a given locus. The probability of any combinationis 1/6 x 1/6 = 1/36. But recall that there can be more than one way toget many of the combinations (2,3 is the same as 3,2). The general expressionfor the number of genotypes that can be assembled from n different allelesis: [n(n+1)/2].

Assumptions of Hardy Weinberg: 1) diploidsexual population 2) infinite size, 3) random mating, 4)no selection, migration or mutation. This is a Null Model; obviouslysome of these assumptions will not hold in real biological situations.The theorem is useful for comparison to real-world situations where deviationsfrom expectation may point to the action of certain evolutionary forces(e.g., mutation selection, genetic drift, nonrandom mating, etc.). Usea Punnet square to determine genotype frequencies: f(AA) = p2,f(Aa) = 2pq, f(aa) = q2 and p2+ 2pq + q2 = 1 Learn this: One generation of randommating restores Hardy Weinberg equilibrium. H-W equilibriumis when the genotype frequencies are in the proportions expected basedon the allele frequencies as determined by the relation p2 +2pq + q2. This is derived more thoroughly in table 5.1, andaccompanying text, pg. 94.

Example: consider a sample of 100 individuals withthe following genotype frequencies:

Observed Genotype

Frequencies

Allele count

Allele frequency

Expected genotytpe frequencies

under H-W

BB0.71142 Bp = 156/200 = 0.78p2 = (.78)2 = 0.61
Bb0.1414 B, 14 b2pq = 2(.78)(.22) = 0.34
bb0.1530 bq = 44/200 = 0.22q2 = (.22)2 = 0.05

Observed are different from expected, thus some forcemust be at work to change frequencies.

NATURAL SELECTION

Selection occurs because different genotypesexhibit differential survivorship and/or reproduction. If we consider acontinuously distributed trait (e.g., wing length, weight) with a stronggenetic basis, the response to selection can be characterized by wherein the distribution the "most fit" (greatest survivorship&reproduction)individuals lie. If after selection one extreme is most fit thisis directional selection; if the intermediate phenotypesare the most fit this is stabilizing selection; if both extremesare the most fit this is disruptive selection.

R. A. Fisher proposed a simple bookkeeping, or populationgenetics, approach for one locus with two alleles: we have AA, Aa and aain frequencies p2, 2pq, q2 . Define liias the genotype-specific probability of survivorship, mii as the genotype-specificfecundity. We build a model that will predict the frequencies of allelesthat will be put into the gamete pool given some starting frequenciesat the preceding zygote stage;

GenotypesZygote-----> ----->Adult-----> ----->Gametes
AAp2lAA p2mAA lAA p2
Aa2pqlAa 2pqmAa lAa 2pq
aaq2 laa q2maa laa q2

The gamete column is what determines the frequenciesof A and a that will be put into the gamete pool for mating to build thenext generation's genotypes. We can simplify by referring to the fitnessof a genotype as wii = mii lii . Thesefitness values will determine the contribution of that genotype to thenext generation. Thus the frequency of A allele in the next generationpt+1 (sometimes referred to as p') would be the contributionsfrom those genotypes carrying the A allele divided by all alleles contributedby all genotypes:

pt+1 = (wAA p2+ wAa pq)/(wAA p2 + wAa 2pq+ waa q2). Or for thea allele,

qt+1 = (waa q2 +wAa pq)/(wAA p2 + wAa 2pq +waa q2). Note thatthe heterozygotes are not 2pq but pq because in each case they are onlybeing considered for the one allele in question. If we scale all wii'ssuch that the largest = 1.0 we refer to these as the relative fitnessesof the genotypes. A worked example where p = .4, q = .6 and wAA= 1.0 wAa = 0.8 waa = 0.6:

Genotype frequencies are p2 = 0.16, 2pq= 0.48, q2 =0.36, thus:

pt+1 = ((.16 x 1.0) + (.24 x .8))/((.16x 1.0) + (.48 x .8) + (.36 x .6)) = .463; so q = .537 and thus f(AA)t+1= .215, f(Aa)t+1 = .497 and f(aa)t+1 = .288. Noteboth allele frequencies and genotype frequencies have changed (compareto what we saw with inbreeding). This can be continued with the new allelefrequencies and so on. When will the selection process stop? when Dp= 0, i.e., when pt+1 = pt . In some situations thiswill stop only when one allele is selected out of the population (p = 1.0).

Now we can consider various regimes of selection(s = selection coefficient, (1-s) is fitness):

AAAaaa
I111 - sselection against recessive
II1 - s1 - s1selection against dominant
III11 - hs1 - sincomplete dominance (0<h<1)
IV1 - s11 - tselection for heterozygotes

INTRODUCTION TO POPULATION GENETICS (1)

Substitute the fitnesses (wii) in conditionI above into the expression Dp= pt+1 - pt and prove for yourself that the equationson page 101 (eqn. 5.5) is related to the expression for pt+1shown above. First three are directional in that selection stops only whenallele is eliminated. In I the elimination process slows down because asq becomes small the a alleles are usually in heterozygote state and thereis no phenotypic variance. In II selection is slow at first because withq small most genotypes are AA so there is low phenotypic variance; as selectioneliminates A alleles q increases and the frequency of the favored genotype(aa) increases so selection accelerates. III is like the worked examplerun to fixation/loss. IV is known as balancing selection due tooverdominance (heterozygotes are "more" than either hom*ozygote).Both alleles maintained in population by selection. This is an exampleof a polymorphic equilibrium (fixation/loss is also an equilibriumcondition but it is not polymorphic). The frequencies of the alleles atequilibrium will be:

pequil = t/(s + t); qequil= s/(s+t).

Classic example = sickle cell anemia. A=normalallele; S=sickle allele. S should be eliminated because sickle cell anemialowers fitness. S is maintained where malarial agent (Plasmodium falciparum)exists because AS heterozygotes are resistant to malaria. Note that S alleleis very low frequency where there is no malaria (the selective coefficientof S is different because the environment is different). See figure5.8, pg. 120; table 5.9, pg. 119.

Another way that genetic variation can be maintainedis through multiple niche polymorphism (polymorphism maintainedby environmental heterogeneity in selection coefficients). If differentgenotypes are favored in different niches, patches or habitats, both allelescan be maintained.

AAAaaa
habitat 11.00.80.5
habitat 20.50.81.0

Heterozygotes will have the highest averagefitness although they are not the most fit in either habitat (see figure5.12, pg. 124). The same dynamics would apply to temporal heterogeneity(spring and fall; winter and summer) assuming that selection did not eliminateone allele during the first period of selection. Classic example of temporalheterogeneity: third chromosome inversions of Drosophila pseudoobscurastudied by T. Dobzhansky. Different chromosomal arrangements ("Standard"and "Chiricahua") show reciprocal frequency changes during theyear.

Yet another way to maintain variation by selectionis through frequency dependent selection.

If an allele's fitness is not constant but increasesas it gets rare this will drive the allele back to higher frequency. Seefigure 5.9, pg. 121. Example: allele may give a new or distinct phenotypethat predators ignore because they search for food using a "searchimage" (e.g., I like the green ones).

Most (by no means all) evolutionary biologists believethat selection plays a major role in shaping organic diversity, but itis often difficult to "see" selection. One reason is that selectioncoefficients can be quite small (1-s ~1) so the response to selection issmall. When selection coefficients are large Dpcan be large, but the problem here is that with directional selection fixationis reached in a few generations and we still can't "see" selectionunless we are lucky enough to catch a population in the middle of the periodof rapid change.

What affects the rate of change under selection?Recall that Dp = pt+1- pt

Dp = [(wAAp2 + wAa pq)/(wAA p2 + wAa 2pq + waaq2)] - p . With some simple algebra we can rearrange this

equation to: Dp= (pq[p(wAA - wAa) + q(wAa - waa)])/(wAAp2 + wAa 2pq + waa q2)

Note that Dpwill be proportional to the value of pq. This value (pq) will be largestwhen p=q=0.5 or, in English, when the variance in allele frequencyis greatest. This is a simplified version of the main point of the fundamentaltheorem of natural selection modestly presented by R. A. Fisher.

INTRODUCTION TO POPULATION GENETICS (2)

It states that the rate of evolution is proportionalto the genetic variance of the population. In the above example wehave not explicitly defined the fitnesses wiis or the dominance relationshipsand these can have a major effect on Dpas written above.

Another important observation for looking at thisDp equation and pluggingin some values is that selection always increases the mean fitness ofthe population. For example with p=0.4, q=0.6 and wAA=1,wAa=0.8 and waa=0.6, the mean fitness (w'bar') =0.76. After one generation of selection p' = 0.463 and q' = 0.537. Recalculatingw'bar' we get wbart+1 = 0.78, which is greater than 0.76. Whenwill this process stop? At fixation (or equilibrium with overdominance).

This treatment of the algebra of natural selectionillustrates what selection alone can do to allele and genotype frequencies.In the next lectures we will consider other evolutionary forces (mutationgene flow, genetic drift), how they act alone, and eventually, how theyinteract with each of the other evolutionary forces.

INTRODUCTION TO POPULATION GENETICS (2024)

FAQs

INTRODUCTION TO POPULATION GENETICS? ›

Population genetics is concerned with genetic differences within and across populations, and the dynamics of how populations evolve as a result of the propagation of genetic mutations occurring within the germlines of individuals.

What is the introduction to the concepts of population genetics? ›

Introduction

Population genetics is defined as the sub-area of biology that studies the distribution and change in frequency of alleles. The population genetics is also the basis of evolution, and it has been established as a science; its main founders were JBS Haldane, Sir Ronald Fisher, and Sewall Wright.

What is population genetics in genetics? ›

Population genetics is the study of genetic variation within and among populations and the evolutionary factors that explain this variation. Its foundation is the Hardy - Weinberg law, which is maintained as long as population size is large, mating is at random, and mutation, selection and migration are negligible.

What is the genetic definition of a population? ›

In population genetic theory, the concept of a population is often defined as a group with a common gene pool from which individuals choose mates with whom they reproduce (King et al., 2014), often assumed not to experience immigration or emigration (e.g., in some models, a population is characterized by a set of ...

What best describes population genetics? ›

Population genetics is a subfield of genetics that deals with genetic differences within and among populations, and is a part of evolutionary biology. Studies in this branch of biology examine such phenomena as adaptation, speciation, and population structure.

Why study population genetics? ›

Population genetics gives us an opportunity to step back and observe patterns of genetic change over time. By comparing populations to each other (and to themselves), we can begin to see how outside factors might spark the evolution of a trait.

What is the simple introduction to genetics? ›

Genetics is the study of genes and tries to explain what they are and how they work. Genes are how living organisms inherit features or traits from their ancestors; for example, children usually look like their parents because they have inherited their parents' genes.

What is the formula for population genetics? ›

p2 + 2pq + q2 = 1

Derivation: Take a gene with two alleles; call them A1 and A2. (Dominance doesn't matter for our purposes; this works equally well with codominance or incomplete dominance.) In a population, some members will have the A1 A1 genotype, some will have the A1 A2 genotype, and some will have A2 A2.

What is the population genetic structure? ›

At a broad level, population structure is the existence of differing levels of genetic relatedness among some subgroups within a sample. This may arise for a variety of reasons, but a common cause is that samples have been drawn from geographically isolated groups or different locales across a geographic continuum.

How does population genetics define evolution? ›

In population genetics, scientists define the term evolution as a change in the allele's frequency in a population. Using the ABO blood type system as an example, the frequency of one of the alleles, IA, is the number of copies of that allele divided by all the copies of the ABO gene in the population.

What is the main cause of genetic variation in a population? ›

Sources of Genetic Variation

Gene duplication, mutation, or other processes can produce new genes and alleles and increase genetic variation. New genetic variation can be created within generations in a population, so a population with rapid reproduction rates will probably have high genetic variation.

What are mutations in population genetics? ›

Mutations are changes to an organism's DNA that create diversity within a population by introducing new alleles. Some mutations are harmful and are quickly eliminated from the population by natural selection; harmful mutations prevent organisms from reaching sexual maturity and reproducing.

What are all the genes in a population called? ›

The collection of all the genes and the various alternate or allelic forms of those genes within a population is called its gene pool.

What is population genetics with an example? ›

For example, in a population of fifty people where all the blood types are represented, there may be more IA alleles than i alleles. Population genetics is the study of how selective forces change a population through changes in allele and genotypic frequencies.

What is the aim of population genetics? ›

Population genetics seeks to understand how and why the frequencies of alleles and genotypes change over time within and between populations. It is the branch of biology that provides the deepest and clearest understanding of how evolutionary change occurs.

What is the population genetic method? ›

The prevailing approach involves generating a plethora of single nucleotide polymorphism (SNP) markers through whole genome sequencing, followed by comprehensive studies on population genetic evolution, phylogeny, germplasm resource identification, molecular marker development, DNA fingerprinting, and other genomic ...

What is the introduction and concept of population? ›

A population is defined as a group of individuals of the same species living and interbreeding within a given area. Members of a population often rely on the same resources, are subject to similar environmental constraints, and depend on the availability of other members to persist over time.

What is the introduction of population genomics? ›

Population genomics utilizes comprehensive DNA sequence analysis on a wide scale to reveal patterns in genetic variation. It provides valuable information that can be used to better understand ancestry, health, evolution, and more.

What are the main concepts of genetics? ›

A child inherits two sets of genes—one from each parent. A trait may not be observable, but its gene can be passed to the next generation. Each person has 2 copies of every gene—one copy from mom and a second copy from dad. These copies may come in different variations, known as alleles, that express different traits.

What is the introduction of population study? ›

Population study is an interdisciplinary field of scientific study that uses various statistical methods and models to analyse, determine, address, and predict population challenges and trends from data collected through various data collection methods such as population census, registration method, sampling, and some ...

References

Top Articles
Latest Posts
Article information

Author: Nathanael Baumbach

Last Updated:

Views: 6226

Rating: 4.4 / 5 (75 voted)

Reviews: 90% of readers found this page helpful

Author information

Name: Nathanael Baumbach

Birthday: 1998-12-02

Address: Apt. 829 751 Glover View, West Orlando, IN 22436

Phone: +901025288581

Job: Internal IT Coordinator

Hobby: Gunsmithing, Motor sports, Flying, Skiing, Hooping, Lego building, Ice skating

Introduction: My name is Nathanael Baumbach, I am a fantastic, nice, victorious, brave, healthy, cute, glorious person who loves writing and wants to share my knowledge and understanding with you.