Skip to content

Classification And Evolution

Part of Module 4: Biodiversity, evolution and disease.

This topic explains how biologists organise living things and how evolutionary theory accounts for both the similarities and differences between them. Classification is best treated as evidence-based and revisable, not as a fixed naming exercise. Evolution is the unifying mechanism that explains the pattern of biodiversity — and it continues to operate in ways that matter directly to human medicine and agriculture.

Learning Objectives

ID Specification-aligned objective Main teaching sections
4.2.2-lo-1 Explain how organisms are classified and compare major classification systems. Classification And Taxonomy, The Five-Kingdom System And The Three-Domain Model
4.2.2-lo-2 Explain phylogeny and evaluate the evidence used to support evolutionary change. Phylogeny, Evidence For Evolution
4.2.2-lo-3 Explain how variation and adaptation arise within populations. Variation, Adaptations
4.2.2-lo-4 Explain natural selection as a mechanism for evolution. Natural Selection

Classification And Taxonomy

A species is a group of organisms that can interbreed to produce fertile, living offspring. Classification organises species into groups based on similarities, and its purposes extend well beyond naming. It allows scientists to identify new species, study evolutionary relationships, understand shared traits and adaptations, and communicate across national and language boundaries about the same organism.

The taxonomic hierarchy runs from the broadest to the most specific rank: domain, kingdom, phylum, class, order, family, genus, species. The broadest groups contain the most species, while the most specific groups contain fewer, more closely related organisms.

Binomial Nomenclature

The binomial naming system gives each species a standardised two-part Latin name. The name is italicised in print and underlined when written by hand. The first part is the genus, written with an upper-case initial letter; the second part is the specific epithet (species name), written entirely in lower case. For example, the binomial name for humans is Homo sapiens and for domestic dogs is Canis familiaris. This system avoids the confusion that arises from regional common names, allowing scientists globally to refer unambiguously to the same organism.

The Five-Kingdom System And The Three-Domain Model

The five-kingdom system classifies all life into Prokaryotae, Plantae, Animalia, Protoctista, and Fungi. The key distinguishing features are:

  • Prokaryotae: prokaryotic and unicellular; cell walls often made of peptidoglycan; no membrane-bound organelles; reproduction by binary fission; nutrition autotrophic, heterotrophic, or parasitic; glycogen as storage polysaccharide.
  • Plantae: eukaryotic and multicellular; cellulose cell walls; chloroplasts present; autotrophic (photosynthetic); starch as storage polysaccharide; reproduction using seeds or spores.
  • Animalia: eukaryotic and multicellular; no cell walls; heterotrophic; glycogen as storage; variable reproduction.
  • Protoctista: eukaryotic; unicellular or simple multicellular; variable nutrition and reproduction; sometimes with chloroplasts.
  • Fungi: eukaryotic; unicellular or multicellular bodies composed of hyphae; cell walls of chitin; saprophytic nutrition, absorbing dissolved organic nutrients from dead matter; reproduction by spores; glycogen as storage.

The three-domain model supersedes the five-kingdom system by splitting the former kingdom Prokaryotae into two separate domains based on molecular and genetic evidence. The three domains are:

  • Bacteria: contain only the kingdom Eubacteria; found in all environments; prokaryotic and unicellular; distinct membrane lipids; peptidoglycan (murein) cell walls; unique RNA polymerase.
  • Archaea: contain only the kingdom Archaebacteria; typically found in extreme environments such as hot springs and salt lakes; prokaryotic and unicellular; have histone proteins so gene and protein synthesis resembles Eukarya more than Bacteria; no peptidoglycan in cell walls; cell membranes contain fatty acids bound to glycerol by ether linkages; more complex RNA polymerase than Bacteria.
  • Eukarya: contain the four eukaryotic kingdoms — Animalia, Plantae, Fungi, and Protoctista; all have nuclei and membrane-bound organelles.

The three-domain model better reflects true evolutionary relationships because molecular evidence reveals that Archaea and Bacteria are as different from each other at the molecular level as either is from eukaryotes. Classification systems change because evidence improves, not because biology itself changes arbitrarily — this is a general principle of how taxonomy evolves.

Phylogeny

Phylogeny is the study of evolutionary relationships between organisms and their shared ancestors. Phylogenetic classification groups organisms according to evolutionary lineage rather than by convenient observable features, and it reveals how closely related different species are.

Phylogenetic trees map these relationships. Branching points (nodes) represent common ancestors; the relative position of branching points indicates how distantly related species are from their shared ancestors; the proximity of species at the branch tips indicates closeness of evolutionary relationship. The oldest species appear at the base of the tree and the most recently evolved at the tips.

Phylogenetic classification has two advantages over older artificial classification. It produces a continuous tree that does not force organisms into groups where they do not fit, and there is no overlap between the groups it produces.

Artificial Versus Phylogenetic Classification

Artificial (traditional) classification grouped organisms by differences useful at the time — colour, size, number of limbs, leaf shape — relying only on visible morphological features. This approach can misrepresent true evolutionary relationships. The classic example is sharks and dolphins: both have streamlined bodies and dorsal fins, superficially similar external features. However, sharks are fish and dolphins are mammals that belong to entirely different evolutionary lineages. Their physical similarities reflect independent adaptation to the same aquatic environment through convergent evolution, not shared ancestry.

Modern taxonomy uses multiple lines of evidence: molecular comparisons of DNA and amino acid sequences (for example, cytochrome c); embryonic development comparisons; anatomical examination including fossil material; and behavioural analyses. New technologies continue to refine these investigations, and reclassifications occur as the evidence base improves.

Evidence For Evolution

Both Darwin and Wallace independently proposed the theory of evolution by natural selection, suggesting that organisms best suited to their environment are more likely to survive, reproduce, and pass on advantageous characteristics to their offspring.

The Fossil Record

Palaeontology studies life's history as preserved in rock layers. The fossil record shows that simple bacteria and algae appear in the oldest rocks, with progressively more complex organisms in more recent layers. Plant fossils appear before the fossils of animals that feed on them. Anatomical similarities between fossil species and their modern relatives demonstrate shared evolutionary ancestry.

The fossil record is incomplete for several reasons: many organisms decompose before they can fossilise; fossilisation requires specific conditions and is inherently uncommon; erosion and geological processes destroy formed fossils over time; many fossils have not yet been discovered; and soft-bodied organisms are unlikely to fossilise, leaving systematic gaps in the record. Despite these limitations, the record provides compelling directional evidence for evolutionary change over geological time.

Comparative Anatomy

Comparative anatomy examines anatomical structures across species to find evidence for shared ancestry. Homologous structures are physical features that have a similar underlying bone or tissue structure but may serve different functions in different species. Organisms sharing homologous structures likely evolved from a common ancestor that possessed the ancestral form of that structure, which has since been modified by natural selection for different functions. This is called divergent evolution.

The pentadactyl limb is the canonical example. The limbs of mammals, birds, reptiles, and amphibians all share the same underlying bone arrangement — one proximal bone, two bones in the next segment, then several smaller bones — despite being used for running, flying, and swimming in very different ways.

Analogous structures, by contrast, serve similar functions but arise from entirely different evolutionary origins. Dolphins and sharks both have dorsal fins that aid swimming, but those fins evolved independently from different ancestral structures. This is convergent evolution, and it illustrates why morphological similarity alone is insufficient evidence for close evolutionary relationship.

Comparative Biochemistry

Comparative biochemistry uses molecular sequences to infer evolutionary relationships. Several molecules are particularly informative. Cytochrome c is a highly conserved protein involved in cellular respiration; because its sequence changes slowly, slight differences between species can indicate the time since their evolutionary divergence. Ribosomal RNA is integral to protein synthesis and changes very slowly, making it useful for revealing connections between species that diverged long ago. DNA sequences — nuclear, mitochondrial, or chloroplast — are more similar between closely related species. Messenger RNA base sequences reflect DNA composition and can be used to assess DNA diversity. Amino acid sequences in proteins are determined by DNA, so closely related species have more similar amino acid sequences.

The hypothesis of neutral evolution states that most variability in molecular structure does not affect function. These neutral changes accumulate at a fairly regular rate because they are not screened by natural selection. Comparing rates of neutral substitution between species allows scientists to estimate when two lineages diverged — forming a molecular clock. Generally, a greater number of molecular differences indicates a more ancient divergence.

Variation

Variation refers to differences among individuals. It is the raw material upon which natural selection acts, so understanding its causes and types is fundamental to understanding evolution.

Causes Of Variation

Genetic variation arises from mutations (changes to DNA that may be heritable), meiosis (independent assortment of chromosomes and crossing over between chromatids produce gametes with new combinations of alleles), random fertilisation (which combines alleles from two parents), and random mating. Environmental variation is caused by conditions such as light, nutrient and food availability, temperature, rainfall, soil conditions, and pH. Many characteristics are influenced by both genetics and environment simultaneously; the combined effect can be difficult to disentangle.

Polygenes are different genes at different loci that each contribute a small effect to a single aspect of phenotype. Their individual contributions are too small to observe separately, but their combined action produces continuous variation in characteristics such as height and skin colour.

Continuous And Discontinuous Variation

Continuous variation shows a range of values between two extremes with no distinct categories, producing a spectrum of phenotypes. It is typically controlled by multiple genes and influenced by environmental factors; when plotted, it gives a normal distribution (bell-shaped) curve. Human height and milk yield in cows are examples.

Discontinuous variation features clear, distinct categories with no intermediates. It is typically controlled by one or a few genes with little environmental influence. Human ABO blood groups (A, B, AB, or O) are a classic example; each individual belongs unambiguously to one category. When plotted, discontinuous variation produces a bar chart of separate categories rather than a continuous curve.

Intraspecific And Interspecific Variation

Intraspecific variation is variation within a single species. Robins, for example, vary in body mass from approximately 16 g to 22 g. Interspecific variation is variation between different species. The bee hummingbird weighs roughly 1.6 g, while an ostrich can weigh up to 160 kg — a difference that illustrates the enormous magnitude of interspecific variation. Intraspecific variation is the raw material for evolution within a lineage; interspecific variation reveals the outcome of divergence over evolutionary time.

Quantifying Variation

The mean provides an average value for a sample; comparing means between samples identifies variation between populations. Data that show continuous variation typically follow a normal distribution, with most values clustered symmetrically around the mean and fewer values at the extremes.

Standard deviation measures how spread out values are around the mean. A small standard deviation indicates that values are clustered closely around the mean; a large standard deviation indicates that values are widely spread. The formula is:

SD = √(Σ(x − x̄)² / (n − 1))

where x is each measured value, x̄ is the mean, and n is the number of values. This formula will be provided in examinations.

Student's T-Test

The Student's t-test is used to determine whether there is a statistically significant difference between the means of two independent, normally distributed datasets with approximately equal variance. The steps are:

  1. State the null hypothesis: there is no significant difference between the two means.
  2. Calculate the t statistic using the provided formula: t = (x̄₁ − x̄₂) / √(σ₁²/n₁ + σ₂²/n₂).
  3. Calculate degrees of freedom: df = n₁ + n₂ − 2.
  4. Compare the calculated t value to a critical value at p = 0.05 for the appropriate degrees of freedom.
  5. If t exceeds the critical value, reject the null hypothesis and conclude that the difference is statistically significant; if not, accept the null hypothesis.

A t value above the critical value indicates that the observed difference between means is unlikely to be due to chance.

Spearman's Rank Correlation Coefficient

Spearman's rank correlation coefficient (ρ, rho) measures the strength and direction of association between two continuous variables that are not normally distributed. The steps are:

  1. Rank both variables from smallest to largest; assign average ranks to ties.
  2. Calculate the difference in ranks (d) for each pair of values.
  3. Square each rank difference (d²) and sum them (Σd²).
  4. Apply the formula: ρ = 1 − (6Σd² / n(n² − 1)), where n is the number of data pairs.
  5. Compare ρ to a critical value at the 5% significance level using the provided table.

A ρ value near +1 indicates a strong positive correlation; near −1 indicates a strong negative correlation; near 0 indicates no correlation. If ρ exceeds the critical value, the correlation is statistically significant and unlikely to be due to chance.

Adaptations

Adaptations are inherited characteristics that enhance an organism's ability to survive and reproduce in its specific environment. They are the direct product of natural selection acting on variation.

Anatomical adaptations are physical structural features, both external and internal. Examples include body coverings such as fur, feathers, and scales that protect organisms and regulate temperature; camouflage colouration that reduces detection by predators; mimicry of dangerous or unpalatable species; and tooth shapes adapted to specific diets.

Behavioural adaptations are actions that increase chances of survival and reproduction. Innate (instinctive) behaviours include defensive responses such as playing dead (opossums) or freezing (rabbits) when predators are nearby; courtship displays such as the male scorpion's mating dance; and seasonal behaviours such as migration (to access resources year-round) and hibernation (to conserve energy during resource-scarce periods). Learned behaviours, acquired through experience, include tool use — sea otters using rocks to open shellfish.

Physiological adaptations are internal biochemical processes. Examples include venom production (used by snakes to immobilise prey and by some plants to deter herbivores), antibiotic production by bacteria to compete with other microorganisms, and water storage (desert frogs can survive over a year without surface water by storing water in their bodies).

Convergent Evolution

Analogous structures are features that serve similar functions but have evolved independently in different lineages. Dolphins and sharks both evolved dorsal fins suited to swimming, but from entirely different ancestral structures. Convergent evolution occurs when unrelated species develop similar traits in response to similar environmental selection pressures. It is one reason why morphological similarity alone cannot be trusted as evidence of shared ancestry — modern molecular evidence is needed to resolve true relationships.

Natural Selection

All organisms face selection pressures — environmental factors that affect survival and reproduction. These include predation, competition for resources, disease, and climate change. Fitness describes an organism's ability to survive and reproduce; organisms with higher fitness are more likely to pass their alleles to the next generation.

The process of natural selection can be summarised as follows:

  1. Variation exists within the population due to different alleles and genetic combinations.
  2. Further genetic variation arises from random mutations.
  3. A selection pressure operates in the environment.
  4. Individuals whose alleles confer traits suited to surviving the selection pressure are more likely to reproduce.
  5. These advantageous alleles are inherited by offspring.
  6. Over generations, advantageous allele frequencies increase in the population, and the population becomes better adapted to its environment.

This mechanism results in populations becoming more closely matched to their environment over time. Genetic diversity — the range of alleles in the population — is the essential prerequisite; without heritable variation, selection cannot produce change.

Modern Examples Of Natural Selection

Natural selection is ongoing and produces results in human time frames.

Antibiotic-resistant bacteria demonstrate natural selection clearly. Some bacteria acquire random mutations that confer resistance to a specific antibiotic. When antibiotics are administered, susceptible bacteria die, but resistant individuals survive and reproduce, passing resistance alleles to offspring. Over generations, the proportion of resistant bacteria increases until the antibiotic is no longer effective. Resistance genes on plasmids can also be transferred directly between bacterial cells by conjugation, accelerating the spread. MRSA (methicillin-resistant Staphylococcus aureus) illustrates this at a clinical scale.

Pesticide resistance in insects follows the same logic. Random mutations produce some individuals resistant to a pesticide. When pesticide is applied, susceptible insects die but resistant individuals reproduce, passing resistance alleles on. Over successive generations the pest population evolves toward resistance, eventually making the pesticide ineffective. Understanding this process is critical for sustainable agriculture, because unchecked resistance development leads to increased pesticide use, environmental damage, and crop losses.

Reproductive isolation is the mechanism by which natural selection in separated populations can eventually produce new species. When populations cannot exchange genes, each accumulates different allele frequencies under different selection pressures. Given sufficient time and divergence, they may become so different that interbreeding between them is no longer possible — they have speciated.

Common Confusions

"Survival of the fittest" in biology means greatest reproductive success, not simply being the strongest or largest individual. The "fittest" organism is the one best suited to its particular environment at that time, which may favour camouflage, resistance to disease, or reproductive rate rather than physical size.

Similar features in different organisms do not automatically indicate close evolutionary relationship. Sharks and dolphins share external body form through convergent evolution driven by similar aquatic environments, not because they share a recent common ancestor. Classification decisions must be grounded in multiple lines of evidence, especially molecular data.

Classification systems are periodically revised as molecular and genetic evidence accumulates. This reflects improving scientific understanding, not biological change or scientific inconsistency.

Key Terms

  • Species: a group of organisms that can interbreed to produce fertile, living offspring.
  • Taxonomy: the science of classifying organisms into groups based on shared characteristics.
  • Binomial nomenclature: the two-part scientific naming system; genus (upper case) followed by species (lower case), italicised in print.
  • Phylogeny: the study of evolutionary relationships and shared ancestry between organisms.
  • Phylogenetic tree: a branching diagram showing evolutionary relationships derived from shared ancestry.
  • Homologous structure: a feature with a similar underlying structure in different species, indicating common ancestry but possibly serving different functions.
  • Analogous structure: a feature that serves a similar function in different species but evolved independently from different ancestral structures.
  • Convergent evolution: the independent evolution of similar features in unrelated species in response to similar selection pressures.
  • Divergent evolution: the accumulation of differences between related populations or species as they adapt to different environments.
  • Cytochrome c: a highly conserved respiratory protein used in comparative biochemistry to infer evolutionary relationships.
  • Molecular clock: the use of neutral mutation rates to estimate the time of divergence between species.
  • Intraspecific variation: variation occurring within a single species.
  • Interspecific variation: variation occurring between different species.
  • Continuous variation: variation showing a spectrum of values; typically polygenic and environmentally influenced; gives a normal distribution.
  • Discontinuous variation: variation falling into distinct categories; typically controlled by one or few genes; little environmental influence.
  • Polygene: one of several genes at different loci that each contribute a small effect to a single characteristic, producing continuous variation.
  • Standard deviation: a measure of the spread of values around a mean; calculated as √(Σ(x − x̄)² / (n − 1)).
  • Student's t-test: a statistical test for significant difference between the means of two independent normally distributed datasets.
  • Spearman's rank correlation coefficient (ρ): a statistical measure of the strength and direction of association between two variables that are not normally distributed.
  • Adaptation: an inherited characteristic that increases an organism's fitness in its environment.
  • Anatomical adaptation: a physical structural feature that aids survival or reproduction.
  • Behavioural adaptation: an action or behavioural pattern that aids survival or reproduction.
  • Physiological adaptation: an internal biochemical or metabolic process that aids survival or reproduction.
  • Natural selection: the process by which individuals with advantageous heritable traits leave more offspring, increasing the frequency of those traits in the population over generations.
  • Selection pressure: an environmental factor that influences survival and reproduction, driving natural selection.
  • Fitness: an organism's ability to survive and reproduce in its environment.
  • Reproductive isolation: the separation of populations so they no longer exchange genes, enabling independent divergence and eventually speciation.
  • Speciation: the formation of new species through the accumulation of genetic differences in reproductively isolated populations.
  • Allele frequency: the proportion of a particular allele in a population's gene pool.

Connected Pages