Interface Focus
You have accessResearch article

What limits the morphological disparity of clades?

Published:https://doi.org/10.1098/rsfs.2015.0042

    Abstract

    The morphological disparity of species within major clades shows a variety of trajectory patterns through evolutionary time. However, there is a significant tendency for groups to reach their maximum disparity relatively early in their histories, even while their species richness or diversity is comparatively low. This pattern of early high-disparity suggests that there are internal constraints (e.g. developmental pleiotropy) or external restrictions (e.g. ecological competition) upon the variety of morphologies that can subsequently evolve. It has also been demonstrated that the rate of evolution of new character states decreases in most clades through time (character saturation), as does the rate of origination of novel bodyplans and higher taxa. Here, we tested whether there was a simple relationship between the level or rate of character state exhaustion and the shape of a clade's disparity profile: specifically, its centre of gravity (CG). In a sample of 93 extinct major clades, most showed some degree of exhaustion, but all continued to evolve new states up until their extinction. Projection of states/steps curves suggested that clades realized an average of 60% of their inferred maximum numbers of states. Despite a weak but significant correlation between overall levels of homoplasy and the CG of clade disparity profiles, there were no significant relationships between any of our indices of exhaustion curve shape and the clade disparity CG. Clades showing early high-disparity were no more likely to have early character saturation than those with maximum disparity late in their evolution.

    1. Introduction

    Much like the species and individuals that constitute them, all clades have an origin and all must ultimately suffer extinction. Their intervening histories, however, can follow a variety of complex trajectories. The study of these patterns is central to the study of macroevolution, with questions centring on whether there is a typical pattern [17], whether the fortunes of clades are positively or negatively correlated [813] and whether there are particular responses to marked environmental changes [14]. Clade evolution is commonly studied by plotting diversity (numbers of constituent species, genera or higher taxa) through time, which can highlight periods of elevated diversification, extinction and turnover, as well as potential interactions between groups [1520]. All Phanerozoic diversity curves affirm some form of increasing trajectory, variously modified by physical and biological factors [21]. Diversity change within individual clades can be modelled using relatively simple birth/death processes with constant parameters [22], which predict symmetrical clade shapes—waxing and waning diversity through time—as a null. More complex and asymmetrical patterns result from models in which parameters are varied through time [6,23,24]. Gould et al. [25] summarized the evolutionary trajectories for extinct clades using a simple measure of their centre of gravity (CG), with a symmetrical clade trajectory having a CG of 0.5. Empirical studies revealed a tendency towards bottom-heaviness (CG < 0.5), with clades typically reaching their highest diversity relatively early in their evolution.

    1.1. What is disparity?

    In addition to assessments of diversity, it is increasingly common to investigate the morphological variety or disparity of clades through time (see summaries in Erwin [26] and Wagner [6]). All indices of disparity are relative and depend upon the nature of variables used to quantify form and the manner in which these variables are summarized [27]. Most use some form of morphospace; a multidimensional space-filling plot in which the distances between taxa are proportional to the measured morphological differences between them [28]. These may themselves be visualized using data reduction and ordination techniques (e.g. principal components or coordinates) to summarize variation in the original set of morphological variables within a smaller number of abstracted axes. Several indices of disparity assess the distribution of taxa in such morphospaces: for example, by adding the ranges or variances on successive axes (a boxing approach), using convex hulls or determining the mean distance between all pairs of taxa. Indices can then be used to compare the disparity of constituent subclades, or to track the morphospace occupation of one or more groups through time, thereby building up a disparity profile [6,2832]. When multiple disparity indices are used to capture variation in the same aspects of form, these often show a high degree of congruence. However, conflicting patterns are possible, depending upon the variance structure of the data [2832].

    Counterintuitively, diversity and disparity appear to be fundamentally decoupled [6]. Some periods or clades contain modest numbers of species that are nevertheless highly distinct morphologically, whereas others contain much greater numbers of morphologically very conservative species. More broadly, some of the most speciose groups (e.g. beetles and insects more generally) have some of the most constrained bodyplans; indeed, there are suggestions that a constrained and entrenched bodyplan might even be conducive to higher diversity [33]. Because clades evolve by lineage branching, we would expect a progressive exploration of morphospace even via a random walk. Once occupied, however, random extinction processes will tend to winnow out the space, but are less likely to leave large regions entirely vacant. All other things being equal, therefore, clades might be expected to have top-heavy disparity profiles through time, although driven evolutionary trends and selective extinction patterns may easily combine to yield a diversity of profile shapes. Empirical investigations for major clades over the past 25 years also show many different patterns, but the most common counterintuitively entails comparatively high-disparity relatively early in the clade's history (see also simulations by Foote [24,34,35]). Many groups therefore appear to explore the range of available ‘design’ options quite quickly, with subsequent evolution principally serving to increase diversity, possibly by the progressively fine subdivision of niche space [7,26,36].

    1.2. Why might clades show early high-disparity?

    One possible explanation for early high-disparity is that there are constraints and restrictions upon the available morphospace, thereby limiting the potential for expansion [3740]. Once filled, the space can only be subdivided or vacated unless the constraints are removed or a clade evolves so as to circumvent them [41]. Such limits can be broadly classified in four categories: geometric, ontogenetic, physical and environmental.

    Geometric constraints are those that can be predicted for any form in any context (many shapes are geometrically impossible), and are not limited to biological structures. Additional limits are imposed by particular generative processes [42] such that ontogeny can sometimes also be modelled geometrically. In such cases, it may be possible to delimit a morphospace theoretically [43], subsequently plotting real specimens within this. The best-known example is the shell of molluscs [44]. All forms—from simple cones (e.g. belemnites, patelloid limpets and hyolithids) to planispiral coils (e.g. many ammonites and bivalves) and translated coils (e.g. most gastropods)—can be modelled with reference to three or fewer variables that describe growth patterns, defining the theoretical morphospace. Forms outside of this are geometrically and ontogenetically impossible; typically, because the shell cannot grow through itself. However, many regions of the theoretical space are never occupied [42]. Actualized morphologies are limited to a relatively small fraction of the space, despite the half-billion-year history of molluscs, during which time groups have repeatedly re-radiated in the wake of mass extinctions, and within which there is rampant convergence in gross form [4547]. Additional limits must apply, therefore. There are more ontogenetic constraints upon form than those predictable geometrically. Organisms develop by the complex interplay of mutually inductive systems and feedback loops, themselves underpinned by cascades of genetic control: not all developmental trajectories and morphologies are possible [48]. Further limits to the evolution of disparity are physical, but understanding these requires additional knowledge of biological context. Form is limited by the properties of biological materials, but the performance of such materials depends upon the function of the structures that they compose, and the context and environment in which they are deployed. For example, the physical constraints upon walking [4952] and swimming [53,54] vertebrates differ from an engineering standpoint. Environmental restrictions [6] can therefore be both physical and biological, and might be broadly defined as all those factors that determine the availability of ecospace or niche space. A lineage can only evolve to realize a particular morphology if there are selective advantages; not only to the endpoint, but also to all intermediate forms along that evolutionary trajectory. The physical and biological environments are also dynamic and coupled systems.

    1.3. Can we detect the operation of limits on disparity from levels and patterns of homoplasy?

    If a clade has evolved to explore the limits of its morphospace, then its constituent lineages (variously prevented from exploring novel morphologies) might be more constrained or restricted to revisit previously occupied regions. This could be realized as increased levels of character state reversal and convergence. Overall levels of homoplasy might therefore be expected to be higher in constrained clades than in those free to colonize new regions of their morphospace. Most indices of homoplasy are influenced by dataset dimensions [55], but the homoplasy excess ratio (HER) of Archie [56] is a relatively unbiased ensemble metric that can be compared across clades [57]. The commonly reported ensemble consistency index is simply the ratio of the minimum possible number of steps (i.e. the number of derived states; MINL) to the number of steps observed on the optimal tree(s) (L). By comparison, the HER includes a term for the mean number of steps in optimal trees inferred from randomized data (MEANNS); specifically, a large number of matrices in which the assignment of states has been randomized within characters but across taxa. The HER is then given by

    Display Formula

    Nonetheless, overall levels of homoplasy may be less informative than the trajectory with which homoplastic changes are accrued in transitioning from the root to the terminals of a phylogeny. Wagner [58] noted that the rate of novel character state evolution usually decreased over the lifetime of a clade [59,60], with some groups approaching an asymptote and therefore character state exhaustion. If the disparity profile of clades were shaped by such exhaustion patterns, then we might expect clades reaching the bounds of their morphospaces early in their evolution (early high-disparity and low CG) to approach an earlier asymptote in numbers of realized states (character state saturation). We therefore test for such relationships here.

    2. Methods

    2.1. Indices of disparity

    A comprehensive discussion of methods, in addition to a presentation of discrete character morphospaces, disparity profiles and summary statistics is given in Hughes et al. [7]. A brief summary is provided here. Our 93 discrete character matrices for metazoan clades were all sampled uniformly with respect to higher taxonomy, or were edited (by generating composite taxa) in order to standardize coverage [7]. For each clade, intertaxon distance matrices were calculated using the generalized Euclidean distance metric of Wills [32]. These matrices were ordinated in R (v. 3.0.1) [61] using principal coordinates analysis [36] and implementing Caillez's [62] correction for negative eigenvectors. Stratigraphic ranges for each constituent taxon were assigned to stages as defined in the International Stratigraphic Chart 2009 [63,64], whereas stratigraphic range data were sourced from Sepkoski Online [65], the Fossil Record 2 [66] and the Paleobiology Database [67]. Our index of disparity was calculated as the sum of variances on all principal coordinate axes (a maximum of n – 1 coordinates, where n is the number of sampled taxa), with the trajectory of disparity through subsequent time bins yielding a profile through time. A centre of gravity metric [25,68] in absolute time (CGm) was calculated for each profile as

    Display Formula
    where di is the disparity at the ith stratigraphic interval and ti the temporal midpoint in absolute time (Myr) of the ith stratigraphic interval. This was subsequently scaled between the ages of the oldest (toldest) and youngest (tyoungest) intervals to yield an index of observed CG (CGscaled) between 0 and 1.
    Display Formula

    A clade with uniform disparity through time is highly unlikely to have a CGscaled of 0.50. Rather, the null CGscaled is determined by the durations of the time bins constituting the profile. The observed CGscaled was therefore scaled relative to the inherent CG (CGi) for a clade of constant disparity spanning the same intervals. A bootstrap resampling procedure was used to generate a distribution of 1000 differences between CGscaled and CGi. Clades for which more than 97.5% of these replicates lay either above or below the CG inherent in the timescale (p < 0.05) were deemed to be significantly top or bottom heavy, respectively [23]. Observed CGscaled was ultimately expressed relative to CGi as a baseline, hereafter, simply CG.

    We also implemented tests for early high and late high-disparity; specifically using a bootstrapping approach to determine if the disparity observed in the first or last two stages could be distinguished from the maximum level attained by the group. The 93 study clades were thereby classified as showing early or late high-disparity, and we tested for differences in our indices of homoplasy and character state saturation in these categories using Mann–Whitney U-tests.

    A simple index of the extent to which a clade was constrained within its morphospace was derived by expressing the maximum intertaxon Euclidean distance within any time bin as a fraction of the maximum distance across all time bins. Clades closest to this maximum might also be expected to show higher levels of overall homoplasy and state saturation [7].

    Finally, we also derived an index of the degree to which a clade migrated throughout its morphospace during the course of its evolution, because constant levels of disparity through time need not necessarily imply the static occupation of morphospace. We modified the Dmorpho index of Huang et al. [69] to provide an index (Dcentroid) of the degree to which the centroid of sampled taxa migrates throughout the morphospace throughout the history of the clade. For each time bin, the mean scores of the sampled taxa on each principal coordinate axis were calculated, defining the group centroid. The difference between the positions of the centroid in successive time bins on a given axis was standardized by dividing it by the full range of values realized by all taxa in all time bins. These standardized univariate distances were then used to calculate Euclidean distances between the centroid positions in successive time bins (univariate standardized distances were squared, summed and the total square rooted). Finally, the distances between all successive time bins were summed (n – 1 comparisons for n time bins; the first time bin being a reference). Where time-series data were missing, the centroid was assumed to have remained stationary from the preceding bin.

    2.2. Phylogenies and indices of homoplasy

    A single outgroup taxon was used to infer ancestral character states at the base of each focal ingroup clade. Phylogenies were inferred in the program 'Tree analysis using New Technology' (TNT) [70] using Xmult level = 10, which performs a random sectorial search combined with ratchet, drift and tree fusing, followed by TBR branch swapping. The resulting most parsimonious trees (MPTs) sometimes differed from those in the source publications, especially where the taxon sample had been reduced. In cases where multiple MPTs were obtained, we selected the tree most congruent with that presented in the original publication. The character exhaustion analysis required fully resolved (dichotomously branching) trees, so polytomies were resolved stratigraphically (branching in a pectinate sequence matching their order of first fossil occurrences). It has been demonstrated that the precise trees used in character exhaustion analyses have relatively minor effects upon the results [58]. Moreover, using incorrect MPTs introduces a conservative bias because they minimize the number of steps required to achieve the observed number of character states; longer trees (even if more accurate) necessarily imply greater exhaustion by implying that greater ‘sampling’ of character space fails to yield additional novel states. Overall homoplasy levels were assessed using the HER of Archie [56], an index that is relatively insensitive to differences in data matrix dimensions. Five hundred randomly permuted matrices were used in each case, each subjected to TNT searches as above.

    Character exhaustion analyses were performed using the method and scripts of Wagner [58] (figure 1). Character states for ancestral nodes were reconstructed using Fitch parsimony [71], and all nodes were numbered. A traversal of the tree from the root to the terminal branches was used to tally a cumulative total of character change steps and novel states. Working from the basal node, branches were added in order of their stratigraphic age (as given by the age of the oldest fossil representative of the clade that the branch leads to), then by their nodal proximity to the root, and finally according to the smallest numbers of novel states evolving along them. As fossil data are unavailable for unsampled internal nodes, many of the internal branches could not be ordered by stratigraphic age and so were ranked according to the last two criteria. This does leave ties. For example, consider six taxa that appear in the same stratigraphic interval with hypothesized relationships ((A,(B,C)),(D,(E,F)). The basal node necessarily precedes the (A,(B,C)) and (D,(E,F)) nodes, and those two nodes necessarily precede the (B,C) and (E,F) nodes, respectively. However, neither the (A,(B,C)) nor (D,(E,F)) sister nodes necessarily precede each other [72], and the ‘cousin’ nodes (B,C) and (E,F) cannot be ordered relative to each other either. Therefore, such sister-taxon and ‘Xth cousin’ ties were resolved randomly, but with second cousin nodes preceding third cousin nodes. This ordering strategy is the most exact possible without recourse to stratigraphic data of higher resolution to subdivide branches. Such data are unavailable for the vast majority of our sampled clades. In addition, it is not uncommon for multiple fossil taxa to have their first occurrences at the same locality, resulting in ties, regardless of the temporal resolution available. Another approach would be to use arbitrary evolutionary models to calibrate branch lengths [7375], and to assign character changes between known occurrences [76]. However, such models will bias results towards favouring character exhaustion. Longer branches with more novel character states will be pulled closer to the root, causing novel states to appear earlier in evolutionary time. This will be more pronounced if rate-variation among characters is permitted, because characters with a greater number of novel states will evolve at a faster rate, thereby concentrating the novel state changes on branches with deeper divergence times. In addition, it has been shown that different branch scaling methods can markedly influence the evolutionary inferences derived from trees [77]. Our approach is therefore a conservative one, insofar as it is more likely to defer the appearance of novel character states until later in our character exhaustion curves (inferred exhaustion will be less marked) and is not contingent upon arbitrary models of character evolution.

    Figure 1.

    Figure 1. Generating character saturation curves. (a) Step 1: ancestral states are reconstructed on a phylogeny in order to determine character transitions along each branch. Horizontal bars on branches indicate character state changes. Asterisks denote homoplastic changes (steps) that are not also new states. Branches are numbered within squares, and pairs of numbers above these indicate number of steps and number of novel states respectively. (b) Step 2: branches are ordered by stratigraphic occurrence, proximity to the root and number of new states. (c) Step 3: the number of steps and new states along each branch in the resulting sequence (denoted by the values in boxes) are calculated, along with running totals. (d) Step 4: the cumulative total of new states is plotted against the cumulative total of steps to generate a saturation curve. The dotted line indicates the trajectory (gradient of 1.0) for the hypothetical situation where there is no homoplasy, and all steps are novel states. (Online version in colour.)

    For each branch in the ranked sequence, the total number of character state changes (steps) and the total number of novel character states (states) was calculated and added to the cumulative total. Plotting the cumulative number of states against the cumulative number of steps yielded a states/steps curve for each of the 93 clades.

    All subsequent analyses were implemented in R (v. 3.0.1) [61]. The shape of each states/steps curve was quantified in two ways (figure 2). First, we recorded the fraction of total observed steps at which an arbitrary threshold (50%) of the maximum number of observed states was reached. Second, we calculated the CG for each states/steps curve (in an analogous manner to the CG for disparity profiles) and scaled this relative to the number of steps in the clade. The most convex curves with the highest initial gradients (i.e. those more quickly approaching an asymptote) yielded the lowest values for both indices. We also estimated the overall degree of saturation at clade extinction by fitting Michaelis–Menten nonlinear regression curves [7880] to the data based on the assumption that the number of character states would eventually reach an asymptote (i.e. that the character space was finite). We then expressed the maximum number of observed states as a fraction of the inferred maximum. Low values in this context indicated clades that were further from saturation at their extinction.

    Figure 2.

    Figure 2. Quantifying character saturation from state/steps curves. (a) The 50% threshold is defined as the number of steps taken to reach 50% of the total number of character states (a) divided by the total number of observed steps (a + b). (b) The fraction of maximum states is defined as the total number of observed character states (c) divided by the estimated maximum number of possible character states (d) from the asymptote of the Michaelis–Menten curve. (Online version in colour.)

    Finally, each dataset was fitted to Wagner's idealized models of character evolution [58]. Log-likelihood values were used to assess whether a null model of a step-independent (linear) model of character evolution could be rejected in favour of step-dependent models indicating exhaustion.

    3. Results

    All summary statistics are given in table 1. Of our 93 sampled clades, only two realized the maximum intertaxon Euclidean distance for the entire morphospace within a single time bin. Most appeared relatively free to evolve within the morphospatial bounds, with a mean maximum observed distance (as a fraction of the maximum possible) of 0.712. HERs had a mean of 0.470 with a fairly typical distribution [55,56]. States/steps curves exhibited a range of shapes although most were asymptotic and reached a slope less than 1 (figure 3), indicating that some degree of character state saturation occurred in most groups. Of the 68 clades tested for fit with Wagner's models, the null model of a linear increase in new character states was rejected in 60 cases. Although nearly all clades showed a decrease in the rate at which new states appeared after about 30% of the maximum number of steps, 12 maintained a much reduced but constant rate of addition of states over the remainder of their evolutionary history (e.g. cinctans; figure 3c). Some groups, such as Aplodontoidea (mammalia; figure 3f), had stepped patterns, indicating that the origin of novel states was concentrated in a relatively small number of branches equidistant from the root. This is similar to the pattern recently documented within post-Palaeozoic echinoids [177]. The mean fraction of steps at which 50% of states were realized was 0.307, with values ranging between 0.103 (the most convex curve with fastest saturation) and 0.625 (the most nearly linear curve with the least saturation). Michaelis–Menten curve fits all inferred asymptotes in excess of the realized maximum at extinction; observed maxima varied between 0.067 and 0.896 of the inferred, with a mean of 0.583. These two indices of state saturation were strongly negatively and highly significantly correlated (rs = −0.873, p < 0.001; those clades taking longest to reach 50% of the realized maximum tended to be those in which the realized maximum was the smallest fraction of the inferred, because the empirical curves were truncated by extinction at the steepest gradients). CG indices for the empirical curves showed a narrow range of values as expected (0.571–0.704), but correlated highly significantly with both the empirical 50% thresholds (rs = 0.631, p < 0.001) and the realized fraction of inferred states (rs = −0.578, p < 0.001).

    Figure 3.

    Figure 3. Example Michaelis–Menten functions fitted to state/steps data for different extinct animal clades. See text for explanation of how the fraction of estimated maximum number of states was calculated. Points indicate cumulative totals as each branch is added. (a) Orthograptidae [171]. (b) Asaphina [172]. (c) Cinctans [173]. (d) Bothremydidae [174]. (e) Plesiosauria [175]. (f) Aplodontoidea [176]. (Online version in colour.)

    Table 1.Summary metrics for the 93 clades in the dataset. Ext: N = does not terminate coincident with a mass extinction boundary; Y = does terminate coincident with a mass extinction boundary. HER: homoplasy excess ratio. T50%: 50% threshold for character states. SCG: Saturation CG. Fchar: fraction of total character states relative to the estimated maximum from Michaelis–Menten asymptotes. CG: Disparity profile centre of gravity. CDev: Summed centroid deviance. Euc: Maximum Euclidean distance between taxa in any given time bin as a fraction of the maximum across all time bins. W: Top, significantly top heavy; Bot, significantly bottom heavy; N, CG neither top nor bottom heavy. ESat: Y, disparity in the first two stages not significantly different from maximum; N, disparity in the first two stages significantly different from maximum. LSat: Y, disparity in the last two stages not significantly different from maximum; N, disparity in the last two stages significantly different from maximum. Clades that realize the maximum inter-taxon Euclidean distance are highlighted in italic.

    authorcladeextinctHERT50%SCGFcharCGCdevEucWESatLSat
    Anderson et al. [81]AcanthodiiN0.7960.2920.6350.5090.4461.4920.837TopNY
    Sigurdsen & Bolt [82]AmphibamidaeN0.2400.6070.6580.2740.3161.7550.601NYY
    Hill et al. [83]AnkylosauriaY0.3770.2090.5850.7120.6876.0100.750NYY
    Fröbisch [84]AnomodontiaN0.6020.2540.6020.6720.5112.6180.765TopNY
    Hopkins [85]AplodontoideaN0.6260.1360.5930.8200.1853.3070.764NNY
    Dupret et al. [86]ArthrodiraN0.5650.1470.5710.7740.5663.4250.715BotNN
    Fortey & Chatteron [87]AsaphinaY0.5020.4380.6930.2550.6111.7530.803NNY
    Lieberman & Kloc [88]AsteropyginaeY0.1940.2330.6050.7180.6412.4340.713TopNY
    Alvarez et al. [89]AthyrididaN0.2360.3020.6270.7230.5262.9100.690TopNY
    Milner et al. [90]BaphetoideaN0.0950.2660.5990.6390.4510.6850.909NNY
    Benedetto [91]BillingsellidinaN0.3930.3080.6200.5750.7342.9680.700NYN
    Bodenbender & Fisher [92]BlastoideaN0.3110.1530.5880.8340.6094.0580.662TopNY
    Foote [93]BlastozoansY0.6230.1150.5850.8960.4773.1490.685TopNY
    Wang et al. [94]BorophaginaeN0.7100.4470.6450.3440.5021.6900.698BotYN
    Gaffney et al. [95]BothremydidaeN0.5870.3350.6330.5440.4712.0270.512NNN
    Schoch & Milner [96]BranchiosauridaeN0.5320.3010.6010.5930.2691.7520.792NYY
    Mihlbachler & Deméré [97]BrontotheriidaeN0.5490.1030.6140.8600.4152.6580.516NYN
    Jimenez-Sanchez et al. [98]Bryozoa (unnamed clade)N0.2340.2860.5950.6450.4273.2470.669NYN
    Sampson et al. [99]ChasmosaurinaeY0.6650.2830.6120.6040.4580.6440.533BotNY
    Smith & Zamora [100]CinctansN0.4700.3530.6070.4400.4771.2010.637NYY
    Carlson & Fitzgerald [101]CryptonelloideaN0.2770.3310.6200.5680.3993.1540.587BotNN
    Novas et al. [102]DeinonychosauriaY0.6070.3340.6030.5410.6355.5660.720TopNN
    Wenwei et al. [103]DimeropygidaeY0.5390.3710.6100.5510.5280.7220.697NYY
    Clement & Long [104]DipterimorphaN0.4170.2680.5840.6810.2253.8810.753BotYN
    Foote [105]DisparidaN0.2760.1460.5750.8030.4902.8200.506BotYN
    Cotton & Fortey [106]EodiscinaN0.2910.1290.5760.8650.3752.1180.594BotYN
    Maletz et al. [107]EugraptoloidaN0.8610.2600.6310.5670.5632.0930.646NNY
    Bloch et al. [108]EuprimateformsN0.4160.3580.5940.6130.3482.8970.623BotYN
    Tetlie & Cuggy [109]EurypterinaN0.5530.3230.6390.5250.5093.6770.787NYY
    Foote [105]FlexibiliaY0.3390.2130.6120.7550.5062.8550.493NNY
    Zhu & Gai [110]GaleaspidaN0.6590.2790.5820.6010.5494.3570.729NYY
    Korn [111]GoniatitaceaeN0.7370.5000.6280.2700.5693.5190.866NNN
    Gebauer [112]GorgonopsiaY0.6970.4220.6350.4240.5140.6940.991NYY
    Prieto-Marquez [113]HadrosauroideaY0.6390.2680.6040.6560.7000.4070.708TopNN
    Wang [114]HesperocyoninaeN0.6360.4810.7040.4040.5203.4670.668NNY
    Polly [115]HyaenodontidaeN0.4540.3700.6220.5010.5401.6270.797NNY
    Motani [116]IchthyopterygiaN0.6990.3720.6140.4150.3595.6250.770BotYN
    Trinajstic & Dennis-Bryan [117]IncisoscutoideaN0.4480.2610.6020.7030.4981.4380.689NYY
    Sundberg [118]Kochaspid TrilobitesN0.1220.3130.6500.5610.4851.2250.660NYN
    Adrain et al. [119]KoneprusiinaeN0.3990.2400.6340.6870.7554.5191.000NYY
    Klembara et al. [120]LabyrinthodontiaN0.3010.2140.5760.7060.3682.4920.658NNN
    Anderson et al. [121]LepospondyliN0.3410.1960.6110.8160.48410.2450.620NNN
    Pollitt et al. [122]LichoideaY0.3340.1950.5920.7490.5556.2890.752BotYN
    Yates & Warren [123]LimnarchiaN0.3440.2240.5990.7410.4374.9570.640NYN
    Hoffmann [124]LytoceratoideaY0.8350.5000.6550.4280.4682.2871.000BotYN
    Damiani [125]MastodonsauroideaN0.3420.3020.5750.5330.3942.5920.726BotYN
    Young & De Andrade [126]MetriorhynchoideaN0.8690.3320.6390.4890.4515.4180.780NNN
    Polly et al. [127]MiacoideaN0.3520.3970.6480.3980.6382.4520.606NYY
    Ruta & Coates [128]MicrosauriaN0.6080.3040.6090.5800.5716.8510.482BotYY
    Lee et al. [129]MissisquoiidaeN0.0960.2140.5980.7420.3792.8280.761NNY
    Bell Jr. & Polcyn [130]MosasauridaeY0.4800.2520.6050.7140.5091.3690.712NNN
    Kielan-Jaworowska & Hurum [131]MultituberculataN0.4650.3100.6280.5400.5203.5350.774BotYN
    Pol et al. [132]NotosuchiaN0.4350.3530.6210.4820.3553.9680.500NNY
    Lieberman [133]OlenellinaN0.1470.2760.6250.6660.5070.6980.732NYN
    Lieberman [134]OlenelloideaN0.0830.2650.6040.7280.4811.4110.658NNN
    Bajpai et al. [135]OmomyoideaN0.2220.1900.5920.7900.4978.3930.446TopNN
    McDonald et al. [136]OrnithopodaY0.6910.1760.5980.7640.6201.0760.992TopNY
    Mitchell [137]OrthograptidaeY1.0000.5290.6450.0670.6281.3380.626NYY
    Sansom [138]OsteostraciN0.5520.2660.5990.6820.4992.8940.520TopNN
    Longrich et al. [139]PachycephalosauriaY0.4800.4420.6250.4690.6311.7910.655BotNN
    Prokop & Ren [140]PalaeodictyopteraN0.0770.3140.6250.5210.3512.3160.861BotYN
    Jin & Popov [141]ParastrophinidaeN0.4140.3210.6120.6520.1184.5770.565NYY
    Stocker [142]ParasuchiaY0.3740.3290.6180.4890.4960.3690.761NYN
    Lopez-Arbarello & Zavattieri [143]PerleidiformesY0.3460.2130.6040.7300.4711.4220.791TopYN
    Smith & Pol [144]PlateosauriaN0.4260.4500.6280.7610.6722.3130.771NNY
    Anderson et al. [81]PlacodermiY0.6380.2710.5840.5280.5602.5150.650NNN
    Ketchum & Benson [145]PlesiosauriaY0.3750.2640.6060.7090.5093.6840.762NYY
    Smith & Dyke [146]PliosauroideaY0.6160.2740.5960.6930.5063.0010.743NYY
    Cisneros & Ruta [147]ProcolophonidaeY0.5810.3550.6480.5170.4862.3870.650NYY
    Huguet et al. [148]ProtomyrmeleontidaeY0.0460.3050.6290.5850.4352.0510.851BotYN
    Nel et al. [149]ProtanisopteraN0.4780.3130.5760.5630.4511.7320.949NYN
    Egi et al. [150]ProviverrinaeN0.2140.2970.6340.5570.1844.3030.773TopYY
    Parker & Irmis [151]PseudopalatinaeY0.5720.3330.6190.4710.6411.1040.811NYY
    Pernègre & Elliott [152]PteraspidiformesN0.3360.2480.5860.6440.4711.4920.709NNY
    et al. [153]PterosauriaY0.5450.3150.6160.5150.5292.6260.679NYN
    Poyato-Ariza & Wenz [154]PycnodontiformesN0.2790.3080.6270.5680.4871.8780.590NYY
    Brusatte et al. [155]RauisuchiaN0.5170.3880.6190.5010.5083.3100.753NNN
    Bates et al. [156]RetiolitidaeN0.5780.3330.6230.5170.5572.7650.704NNN
    Cerdeno [157]RhinocerotidaeN0.3270.1310.5860.8310.5172.3790.772NYN
    Hone & Benton [158]RhyncosauriaY0.7640.4110.5870.3650.5682.8770.880NYY
    Allain & Aquesbi [159]SauropodaY0.5380.3800.6060.4780.5393.8500.774BotYY
    Maidment [160]StegosauriaN0.6520.6250.6730.1020.3381.4220.602BotNN
    Carlson & Fitzgerald [101]StringocephaloideaN0.3520.2790.6170.4360.4732.5620.739BotNN
    Schoch [161]StereospondyliN0.4740.4090.6410.6590.3544.7550.650BotNN
    Lamsdell et al. [162]StylonurinaN0.5410.2590.6120.6730.4705.9660.629NNY
    Klug [163]SynechodontiformesN0.6410.2880.6270.6450.6171.5610.200TopNN
    Gaudin [164]TardigradaN0.4660.2010.5890.6910.6416.9170.669NNY
    Wu et al. [165]ThalattosauriaY0.5600.4140.6220.3760.5330.9670.951TopNN
    Wilson & Märss [166]ThelodontiN0.3870.2490.6110.6070.5873.8980.729TopYN
    Hu et al. [167]TheropodaY0.4220.3000.6110.5670.5792.1250.971NNN
    Chatterton et al. [168]ToernquistiidaeY0.1410.3080.5930.5930.4620.7930.653TopNY
    Brusatte et al. [169]TyrannosauroideaY0.7780.3720.6450.4050.6333.2670.950BotNN
    Anderson & Seldon [170]XiphosuraN0.8960.5750.5900.0760.2664.9340.803BotNN

    There was a weak but significant negative correlation between overall homoplasy levels and the disparity CG (rs = 0.227, p < 0.029): clades with a lower CG (earlier higher disparity) had greater homoplasy (lower HER) on average (figure 4). However, we found no significant relationships between disparity CG and any of our indices of saturation curve shape (rs = 0.008, p = 0.941 for the 50% threshold; rs = 0.091, p = 0.388 for the saturation curve CG; rs = −0.039, p = 0.708 for the Michaelis–Menten estimate of the realized fraction of inferred states). Limiting the analysis to wholly extinct clades that did not terminate coincident with a mass extinction boundary resulted in weaker correlations for all indices of character saturation except the 50% threshold point (table 2). Finally, analysis of the CG of clades terminating at mass extinction boundaries yielded similar results for indices of character exhaustion but showed no correlation with HER values. Maximum Euclidean distance within a time bin correlated negatively with the Michaelis–Menten estimates of the realized fraction of inferred states (rs = −0.229; p = 0.027), indicating that character saturation may be greater in clades that reach their morphospatial bounds. However, no correlation was found between character saturation metrics and the amount of centroid deviation (50% threshold: rs = −0.173, p = 0.097; saturation CG: rs = −0.183, p = 0.079 fraction of inferred states rs = 0.198, p = 0.057) implying that clades that migrate through the morphospace are as likely to show saturation as those that statically occupy a defined region. The morphospace of clades that show early disparity and similar saturation values (figure 5) reveals that some clades continue to evolve new character states as they migrate through the morphospace (e.g. disparid crinoids), whereas others remain fixed and occupied space within existing bounds (e.g. lichoid trilobites). Whether a clade showed early or late high-disparity also had no effect on the degree of character exhaustion within that clade (table 3).

    Figure 4.

    Figure 4. Disparity profile centre of gravity (CG) plotted against homoplasy excess ratio (HER) and estimates of character saturation. (a) Disparity CG versus HER. (b) Disparity CG versus 50% threshold. (c) Disparity CG versus saturation curve CG. (d) Disparity CG versus fraction of the Michaelis-Menten estimate of the maximum number of character states realized at extinction. rs and p-values are from Spearman's rank correlation coefficient tests.

    Figure 5.

    Figure 5. Differing patterns of morphospace occupation along the first two principal coordinate axes in clades showing early high-disparity. CG: disparity profile centre of gravity. Fchar: fraction of total realized character states relative to the maximum estimated from Michaelis–Menten asymptotes. Euc: maximum Euclidean distance between taxa in any given time bin as a fraction of the maximum across all time bins. (a) Disparid crinoids from Foote [178] (CG = 0.490, FChar = 0.803, Euc = 0.506) showing migration through the morphospace. PCo 1 = 23.6% total variance, PCo 2 = 12.0% total variance. (b) Lichoid trilobites from Pollitt et al. [179] (CG = 0.555, FChar = 0.749, Euc = 0.752) showing more static occupation of the morphospace. PCo 1 = 26.2% total variance, PCo 2 = 13.4% total variance. (Online version in colour.)

    Table 2.p-values from Spearman rank tests for homoplasy excess ratio (HER) and three proxies of character exhaustion (fraction of the total number of steps at which 50% of states are realized, CG of the saturation curve, and the fraction of the estimated number of states (inferred from Michaelis–Menten curve) that are observed) correlated with disparity profile CG. Values are calculated for the entire dataset of 93 clades, the subset of 55 clades not becoming extinct coincident with a mass extinction boundary and that have no extant survivors, and the subset of 31 clades that terminate at a mass extinction boundary.

    HER50% thresholdsaturation curve CGobserved maximum states/estimated maximum states
    disparity CG (entire dataset, n = 93)rs = 0.227p = 0.029rs = 0.008p = 0.941rs = 0.091p = 0.388rs = 0.039p = 0.708
    disparity CG (clade extinction not coincident with mass extinction, n = 55)rs = 0.285p = 0.035rs = −0.107p = 0.436rs = 0.010p = 0.940rs = 0.037p = 0.786
    disparity CG (clade extinction coincident with mass extinction n = 31)rs = 0.085p = 0.649rs = 0.145p = 0.438rs = 0.099p = 0.597rs = 0.094p = 0.614

    Table 3.Summary statistics from Mann–Whitney U tests of differences between median homoplasy excess ratio (HER) and three character saturation metrics (fraction of the total number of steps at which 50% of states are realized, CG of the saturation curve, and the fraction of the estimated number of states (inferred from Michaelis–Menten curve) that are observed) when bi-partitioned by disparity profile shape. Bottom heavy versus top heavy: clades grouped based on a CG value significantly higher or lower than mean randomized values (with other clades omitted). Early maximum disparity: clades partitioned according to whether or not they show disparity in the first two stages that is significantly different from the maximum. Late maximum disparity: clades partitioned according to whether or not they show disparity in the last two stages that is significantly different from the maximum.

    HER50% thresholdsaturation curve CGobserved maximum states/estimated maximum states
    significantly bottom heavy versus significantly top heavyW= 216p= 0.588W= 288p= 0.012W= 219.5p= 0.520W= 110p= 0.019
    early maximum disparityW= 1277.5p= 0.131W= 989p= 0.482W= 1077p= 0.979W= 1189.5p= 0.407
    late maximum disparityW= 997.5p= 0.535W= 1095p= 0.899W= 1005.5p= 0.580W= 1051p= 0.838

    4. Discussion

    The significant but weak correlation between disparity CG and overall levels of homoplasy demonstrates that clades with higher disparity earlier in their histories are more likely to show higher levels of character state reversal and convergence. While this implies the operation of some constraint or restriction (sensu Wagner [6]), the small size of the effect (R2 = 0.030 if modelled linearly) suggests that some other factor or factors are much more important. The absence of significant correlation between disparity CG and any of our proxies for states/steps curve shape indicates that disparity is not shaped in any straightforward way by progressive exhaustion of the character space. Patterns of disparity through time cannot therefore be deduced straightforwardly from patterns of homoplasy increase throughout the lifetime of clades, and are only weakly influenced by overall homoplasy levels. Many clades continue to evolve new character states with no associated increase in their disparity, whereas others achieve their highest levels of disparity through homoplastic character change. Several clades (including crustaceans and priapulid worms [180,181]) occupy a similarly sized morphospace envelope throughout much of their evolution (similar disparity), but nevertheless migrate through the overall morphospace. Other clades (e.g. angiosperms, Jurassic ammonoids [182]) quickly colonize many of the morphospatial extremes (reaching maximum disparity) but subdivide the envelope progressively through time and continue to evolve new states. The major axes of our empirical morphospaces are likely to be defined by the principal patterns of covariation between character states, and it is these patterns that largely determine Euclidean eccentricity from the global centroids. Similarly, the most eccentric (extreme) morphologies may embody sets of character states that have individually evolved earlier in the history of the clade, but never before in combination. Upon its first evolution, a new state need not necessarily move a lineage to a particularly eccentric position in the morphospace, neither will it necessarily result in the expansion of the morphospace occupied by contemporaneous taxa, particularly where the space is contracting on other fronts.

    In most of our sampled clades, new character states continued to evolve long after maximum disparity had been reached. Major groups often share a conserved morphological template or bodyplan (Bauplan), usually defined by character changes at the clade's base. This implies that some characters are relatively invariant or become ‘fixed’, whereas other characters continue to evolve new states. Neither conventional morphospace analyses nor our states/steps curves distinguished between characters on the basis of their evolutionary or developmental depth. State changes might therefore range from fundamental shifts in body symmetry and organization (more typical of those delimiting phyla), down to subtle changes in bristle patterns at the other (perhaps more typical of species), yet all contribute equally. Furthermore, as cladistic matrices are created with the express purpose of inferring relationships, the characters within such matrices will primarily be synapomorphies of subclades. Deep synapomorphies (which may be shared by all members of the focal clade) will usually be absent, because they contain no useful phylogenetic information. For example, characters such as the presence and absence of limbs or a notochord will not be included in a dataset of vertebrates that all share these derived features. As a result, cladistic datasets are likely to under-represent characters subjected to strong intrinsic constraints, which have already become ‘fixed’ within the clade of interest. For this and other reasons, conventional discrete character morphospaces—and the estimates of disparity derived from them—may not be best suited for recognizing the changes of deepest developmental and evolutionary significance. Morphospaces that take account of the developmental depth of characters have long been called for [3], and some moves have been made towards realizing these for particular clades [48,183186].

    Several authors have distinguished between intrinsic and extrinsic limits to disparity [7], with intrinsic factors being those that operate within the individuals and lineages that constitute a clade (broadly equating to geometric and developmental constraints) and extrinsic or ecological factors being those imposed from the outside (biological and physical restrictions) [6,26]. The precise limits on the evolution of disparity are probably unique to each clade, and comprise some combination of factors. Determining the relative importance of these is not straightforward, and direct tests are impossible with the present data. There are some strongly suggestive patterns, however.

    4.1. Intrinsic developmental constraints

    As ontogeny becomes more complex, and genetic and other mechanisms become progressively more interdependent, increasing pleiotropy and functional linkage may result in developmental programmes that are more difficult to modify and subsequently evolve [187,188]. While some aspects of bodyplan organization may be strongly adaptive and maintained by stabilizing selection, other aspects may be largely contingent but locked down by the difficulty of effecting change in developmental programmes. The seven cervical vertebrae of mammals furnish the best-known example. Nearly all mammals—including the long-necked giraffes, gerenuks and alpacas—have just seven neck vertebrae. Other vertebrate groups retain the ability to modify this number, and invariably evolve longer necks with greater numbers of vertebrae; up to 25 in birds, 19 in sauropods [189] and 75 in the extinct plesiosaurs [190]. Two extant groups of mammals depart from the mammalian ground plan of seven; sloths have six (Choloepus) or eight or nine (Bradypus), whereas manatees (Trichechus) have six. All achieve this by homeotic frameshifts of the thoracic expression pattern (the development of ribs, etc.) relative to the underlying somites [191]. Such shifts in other mammals may be linked to highly deleterious, pleiotropic side effects, not least problems with the innervation, musculature and blood supply of the forelimbs and elevated rates of juvenile cancer [192]. Sloths and manatees appear to avoid these effects by low rates of metabolism and overall activity [178,192,193]. The pentadactyl limb of tetrapods is another example of a design that was apparently much more labile early in its evolution. Early labrynthodont tetrapods had higher numbers of digits: eight in the forelimbs of Acanthostega, seven in the hindlimbs of Ichthyostega, six in Tulerpeton. Modern lissamphibians—despite their ground plan of five digits—often develop greater numbers with no ill effects: ostensibly because limb patterning in aquatic larvae occurs prior to the phylotypic stage of development, during which time inductive interactions and interdependencies are concentrated. Many amniote groups, by contrast, have reduced digit numbers as adults (e.g. horses, non-avian dinosaurs, birds [179]), but few lineages have attained higher numbers, often evolving a variety of digit-like structures rather than extra digits per se [194,195]. Ichthyosaurs furnish the best-known exception: opthalmosaurians added digits anterior to digit one and posterior to digit five [196], whereas non-opthalmosaurians may have achieved polydactyly by interdigital or postaxial phalangeal bifurcation [197]. In most amniote groups, however, polydactyly is associated with a range of deleterious pleiotroipic effects [198200], because limb development coincides with the phylotypic stage. Variation in this particular set of characters appears to be effectively locked down, therefore.

    4.2. Extrinsic physical and biological (ecological) restrictions

    In general, levels of clade disparity are often much less depleted by mass extinction events than levels of diversity. This is because numerous lineages can be lost from a morphospace while still maintaining a broad distribution of survivors [201]. Indeed, even where extinction selectively removes large subclades, disparity levels may remain high provided that the surviving clades occupy peripheral regions of the morphospace [202]. Where increases in levels of disparity coincide with marked and episodic changes in the physical or biological environment, it may be reasonable to infer that extrinsic, ecological constraints have been removed. Such shifts may occur in the immediate wake of mass extinctions, although in such cases, it may be difficult to distinguish the removal of biological constraints—for example, the extinction of competing or incumbent clades—from the physical environmental shifts that precipitate these biological changes. However, several of the largest and most conspicuous adaptive radiations have classically been understood in ecological terms. Crown group mammals evolved numerous new body forms (broadly equating to modern orders, and with many striking parallels between Eutheria and Metatheria in different settings) after the K/Pg mass extinction. This occurred not only in the aftermath of the extinction of the non-avian dinosaurs, but also coincident with the final demise of eutriconodont, spalacotheroid and multituberculate mammals [203]. Similarly, articulate brachiopods rapidly increased their disparity in the wake of the end Permian mass extinction; a pattern consistent with rebound after the removal of highly structured guilds and the freeing up of ecospace [204]. Comparable post-extinction rebounds have been observed for crinoid and blastozoan echinoderms [27], as well as ammonoids [205] through multiple events. Similarly, rapid increases in disparity may occur when a clade is first able to colonize a fundamentally new region of ecospace. Even bodyplans that are assembled piecemeal over many tens of millions of years may reach a critical threshold, thereby suddenly circumventing previous restrictions [41].

    5. Conclusion

    In addition to studying the phylogeny and diversity of clades throughout their evolution [21,25,30], it is increasingly common to examine the manner in which groups explore theoretical or empirical morphospaces through time [43,184], as well as their resulting temporal patterns of morphological disparity change. Disparity and diversity are fundamentally decoupled [29], and a variety of trajectories have been observed empirically. The most common pattern, however, is for disparity to peak relatively early in the history of a clade, and certainly before its peak in diversity [7]. Putative limits on disparity may either be intrinsic (e.g. developmental [48,193]) or extrinsic (e.g. ecological [27,204,205]), but both imply constraints on available morphospace that might be reflected in the rate of evolution of novel morphology throughout the lifetime of a clade. The majority of clades studied do indeed show a significant decrease in the rate of appearance of novel character states over time. However, despite a weak correlation between overall levels of homoplasy (as measured by the HER) and the CG of clade disparity profiles (greater homoplasy implies earlier high-disparity) we found no more detailed relationships between the shapes of character saturation curves and disparity profiles. Many clades continue to evolve new character states while disparity levels remain constant, which can variously be achieved by wholesale migration through the morphospace or by subdividing it. Similarly, disparity may be increased or maximized by predominantly homoplastic state changes. The anecdotally large number of clades showing the expansion of hitherto restricted morphospaces in the aftermath of mass extinctions (or upon transitioning into fundamentally new habitats) suggests that many of the limitations may be ecological. However, given the variation shown in both character saturation and morphospace occupation, limits on disparity almost certainly result from a complex interplay of clade-specific intrinsic and extrinsic factors, suggesting that a simple universal explanation for early high-disparity is unlikely.

    Authors' contributions

    J.W.O. co-wrote the paper, collected data, implemented analyses of homoplasy and character exhaustion and drafted figures. M.H. analysed disparity trajectories, collected data, wrote scripts and drafted figures. P.J.W. scripted the states/steps analyses and contributed to writing. S.G. analysed disparity trajectories. M.A.W. conceived the study, co-wrote the paper and drafted figures.

    Competing interests

    We have no competing interests.

    Funding

    This research was supported by grants from the John Templeton Foundation (43915), and from The Leverhulme Trust (F/00351/Z) awarded to M.A.W.

    Acknowledgements

    The authors thank Melanie Hopkins and an anonymous reviewer for constructive comments on an earlier draft of this paper. The authors were are also grateful to Alex Jeffries for instructive discussion and help in implementing Michaelis–Menten curves.

    Footnotes

    One contribution of 12 to a theme issue ‘Are there limits to evolution?

    Published by the Royal Society. All rights reserved.

    References