Introduction

The presence of heritable variation is a basic prerequisite for evolutionary response to natural selection on a trait. Understanding how this variation changes when populations are brought from nature to captivity is important for two primary reasons. First, it is difficult, except in rare circumstances (eg, Milner et al, 2000), to obtain the information needed for quantitative genetic studies on free-living populations. Thus, studies are commonly made on populations in captivity and then extrapolated to make inferences about evolutionary forces in nature; the circumstances under which this extrapolation may be valid have been examined extensively (see Weigensberg and Roff, 1996 for review).

A second reason for examining changes across the nature-captivity transition, which has not been considered previously, is that threatened and endangered species are increasingly recommended for captive breeding (Balmford et al, 1996). Because such populations may remain in captivity for extended periods, and may become the only source for population recovery in the future, it is of inherent interest to understand how genetic variation might change upon captivity, since such variation, among other things, determines a population's ability to persist upon reintroduction (Bürger and Lynch, 1995).

Genetic variation available for response to selection has been measured traditionally as narrow-sense heritability, h2, the ratio of additive genetic variance σA2 to total phenotypic variance (σP2), itself composed of additive genetic, nonadditive genetic (σI2), and environmental (σE2) components: σA2/[ σA2 + σI2 + σE2] (Falconer and MacKay, 1996). Decomposed in this way, it is perhaps unsurprising that extensive study has failed to find consistent patterns of change from nature to captivity in this complex measure. Heritability may increase upon captivity simply because environmental components are reduced in more controlled conditions (eg, Simons and Roff, 1994) and may decrease in nature due to seasonal or annual environmental variation (eg, Sgró and Hoffmann, 1998). Similarly, it may remain stable if both environmental and genetic variance change proportionally (eg, Blanckenhorn, 2002).

A less traditional but perhaps better measure for assessing changing capacity to respond to selection across a variety of traits may instead be evolvability, the coefficient of additive genetic variation, , which depends only on additive genetic variance, standardized by the trait mean, X̄ (Houle, 1992). At least three nonexclusive processes may influence changes in evolvability from nature to captivity: drift, selection and gene-by-environment interaction. Classic infinitesimal theory holds that σA2 should change in proportion to gene diversity, GD (equivalent to expected heterozygosity; Falconer and MacKay, 1996), such that no change in σA2 due to drift is expected unless the number of individuals used to found the captive population is very small, and/or the population remains small in subsequent generations. It is on the basis of this theory that it is often assumed that no more than 20 founders (N) are needed for a captive-bred population, since they are expected to sample on average 1-1/(2N), or 97.5% of wild GD, and thus also nearly all wild additive genetic variance (Lacy, 1994). If additive assumptions do not hold, however, and σI2 is large, sampling-induced changes in allele frequencies may inflate σA2, causing increases in evolvability (Willis and Orr, 1993 and references therein).

A second process affecting changes in evolvability in nature vs captivity may be selection. Hedrick (1986) has argued that environmental heterogeneity is responsible for maintaining most polymorphic loci in natural populations, such that higher σA2 is expected in the wild, where it is maintained by heterogeneous selection. On the other hand, if populations are under mostly balancing selection in nature, σA2 may be expected to increase in the permissive conditions of captivity, as selection-induced linkage disequilibrium dissipates and/or mutations accumulate which would have otherwise been selectively removed (Prout and Barker, 1989). However, both balancing and directional selection in captivity (‘domestication’) may also cause rapid and large losses of σA2 (Rodríguez-Clark, 2002).

Finally, gene-by-environment interactions have been hypothesized to play an important role in determining immediate changes in σA2 from nature to captivity. Specifically, it has been suggested that greater σA2 may be expressed in more ‘stressful’ conditions, as would be characteristic of nature, allowing for rapid response to selection challenges there. Although the mechanisms underlying this stress-dependence hypothesis remain obscure, the findings of numerous studies appear consistent with this view. Nonadditive genetic variation has similarly been hypothesized to be at higher levels in the stressful conditions of nature (reviewed in Hoffmann and Parsons, 1991; Bijlsma and Loeschcke, 1997).

General conclusions about changes in evolvability from nature to captivity have been hampered in past studies by a focus on a few species, on only one trait in any given study, or on heritability exclusively as the parameter of interest. In the present paper, I address these issues by combining minimum estimates of heritability in nature and in captivity with estimates of phenotypic variability in both environments, to estimate changes in additive genetic variation and evolvability in five traits, in a well-characterized but underutilized experimental insect species.

Methods

The study organism

The basic data collected consist of three categories of observation – wing length, fecundity and colour – which are of particular interest in the context of the natural history of the large milkweed bug. Oncopeltus fasciatus experiences great environmental variation across its natural range, which covers much of North America and the Caribbean. A seed predator specializing on Asclepias (milkweed) species, O. fasciatus is highly dependent on this patchy and ephemeral resource for reproduction. It is not thought to overwinter in the northern parts of its range, with these populations instead re-establishing every year with migrants from continuously reproducing southern ones, leading to likely panmixia within large geographic regions. This conclusion is supported by studies that have found quantitative and molecular genetic differentiation within the species only at the scale of thousands of kilometres (Leslie and Dingle, 1983; Leslie, 1990). Low temperatures, food limitation and short day lengths all induce flight restlessness and reproductive diapause in northern populations, suggesting that some late-autumn northern residents may escape frosts by flying south (Rodríguez-Clark, 2002 and references therein). Wing length, the first trait examined in this study, appears to be an important character in this migratory syndrome, as it is genetically correlated with flight behaviour in northern populations (Dingle and Evans, 1987).

Another category of trait examined, fecundity, is also a part of this migratory syndrome. Unlike many insect species, female O. fasciatus will produce large numbers of (infertile) eggs even if virgin, which is thought to allow females to take advantage of patchily distributed mates and food as quickly as possible. Once females arrive at a suitable host, they deposit clutches of 30 eggs per day (up to 2000 or more total) in protected crevices on the plant (Rodríguez-Clark, 2002 and references therein).

Colour comprises the final category of trait examined. As with many other milkweed specialists, O. fasciatus sequesters the plant's poisonous cardiac glycosides and has a contrasting black and coloured aposematic pattern effective against both vertebrate and invertebrate predators (Evans, 1988). The gregariousness of this species through the adult stage may be a further adaptation enhancing the aposematic effect. Colouring of hard body parts and the hemelytra (forewing) may vary considerably, from nearly white through yellow and orange to deep red. This study represents the first to explore the genetic basis of this variation.

Field collections

This experiment spanned three generations: Generation 0 (wild caught), 1 (lab-reared) and 2 (lab-reared), each of which contained 50 families. Individuals were collected between 18 and 27 September 1998, from 29 locations along a 350 km transect within the north-eastern United States (Figure 1).

Figure 1
figure 1

Locations across the north-eastern USA from which 50 founding individuals were collected. Points indicate sampling locations with sex ratio of founders used from that location (males.females). Asterisks indicate number of possible sibling pairs captured.

The transect was constructed in order to sample within-subpopulation variation widely while ensuring unrelatedness among founders, as is recommended for captive populations of endangered species (Lacy, 1994), taking into account the species' long-distance dispersal and likely panmixia within broad geographic regions. Distance between sampling locations was 15 km, as permitted by the patchy population distribution. Within each location, individuals or clusters of bugs were collected from 2 to 10 common milkweed plants (Asclepias syriaca), between 100 m–1 km apart. Migrating adults tend to stay in a suitable patch once they find one, and hatched broods usually stay together on a single plant at least until the third instar, with subsequent moves of usually less than a meter (Dingle, 1981). Thus, bugs found together on a plant or in a patch of plants were conservatively considered to be a ‘sibship’.

Very recently eclosed adults or 5th instars (just prior to sexual maturity) were preferentially collected, so that most development occurred under natural conditions, but females were still virgin. Of the hundred wild individuals used to form Generation 0, at collection, 25 were recently eclosed adults, 60 were 5th instar, 11 were 4th instar, three were 3rd instar and two were 2nd instar.

Presumed sibships were maintained as groups upon capture, with males and females separate. Individuals are easily sexed at both the 5th instar (when females have two dark spots on their posterior segments, while males have one) and as adults, when females have a caudal median point on the posterior margin of the 4th abdominal sternite, as well as a triangular cleft pygidium, unlike the more rounded male structure (Rodríguez-Clark, 2002).

Culture techniques

In general, bugs were maintained in captive conditions as close as possible to those typical in the wild during the season of maximal reproduction: northern latitude midsummer. Groups were maintained in 9.5 × 9.5 × 9.5 cm re-usable polystyrene boxes (Pioneer Packaging, model 29-C), each with a sheer fabric aeration window in the removable lid, which was secured with a rubber band. In Generation 0, groups larger than 10 were split so that density was never higher than 10 individuals/box, giving each bug at least 82 cm3, well in excess of the 35 cm3 minimum necessary to avoid density-dependent growth effects (Koerper and Jorgensen, 1984).

In Generations 1 and 2, once eggs were laid and weighed (see below), sibships were divided among four boxes, with 10 eggs placed in each box using a sable paintbrush, to provide an excess of individuals for the next generation. Labelled boxes were assigned random locations within the rearing chamber. Upon reaching the 5th instar, males and females in each family were sorted and redistributed among their four boxes, maintaining sexes separate and densities at no more than 10 bugs/box. A band of liquid Teflon (Northern Products Inc., Woonsocket, RI, USA) was painted on the inside rim of each box as an additional safeguard against escapes by smaller instars. Each box was provided with ad libitum deionized water in a disposable 37 ml vial with a cotton wick, secured to the base of the box with nontoxic poster gum.

Individuals were reared on ripe milkweed seeds from a variety of wild sources. Ripened pods were collected from the vicinity of Princeton, New Jersey, in the autumns of 1997 and 1998, and seeds were removed either by hand or according to methods described by Feir et al (1982). To randomize effects of seed age on nutrition, these fresh seeds were mixed periodically with >1-year-old seed purchased commercially (Valley Seed Service, Fresno, CA, USA), and older seeds obtained from Missouri and Minnesota. Seeds were stored in airtight containers with indicating silica gel to prevent mould growth.

O. fasciatus from north-eastern populations fed ad libitum are indistinguishable in size and survivorship from those given 12 seeds per bug from egg to eclosion (Dingle, 1992), equivalent to 0.7 g for 10 bugs. To provide well in excess of this amount, 1.5 g seed was scattered on the bottom of each box. Rearing boxes were kept at 28±1°C with humidity maintained at 50–70%, on a schedule of 16 h light 8 h dark. These conditions maintain optimal egg production and minimal probability of diapause, with 27 days between hatch and adult metamorphosis, and 10 additional days to first reproduction (Groeters and Dingle, 1988). Boxes were kept on clear Plexiglas trays on wire shelving, to maximize light penetration to all groups.

Husbandry

The 50 mating pairs in Generation 0 were created by selecting one or two each of mature males and females captured at each sampling location, all from different presumed sibships where possible (never using individuals of the same sex from the same sibship). These 100 individuals were then paired randomly, avoiding possible sib matings. Two adult offspring (one of each sex) from each pair were again chosen arbitrarily to form Generation 1, and were paired randomly, avoiding sib matings, to give rise to Generation 2. If offspring reached maturity prior to separation, females found in a box with males were discarded. All pairs were made at least in quadruplicate to guard against family loss, with one replicate selected randomly to contribute to the next generation.

Pairs were placed in plastic Petri dishes, 9 cm diameter × 2.5 cm height, with a 14.8 ml disposable water vial with a cotton wick and 20 fresh seeds (0.12 g). The mouth of each dish was covered with 40x32-denier wrinkled cotton gauze secured with a rubber band and the dish lid. Dishes were kept entirely dark, inside cardboard trays with cardboard coverings. Under these circumstances, O. fasciatus from northern populations deposit eggs almost exclusively through wrinkled areas in the gauze onto the dish lid, allowing eggs to be easily removed (H Dingle, D Feir, personal communication). This set-up provided a total laying area of 64 cm2 per female (more than the 40 cm2 minimum required for optimal egg production; Koerper and Jorgensen, 1984).

Measurement of traits

Just prior to creating mating pairs, wing length and colour traits were assessed on the right-hand side of all individuals, at the same age (30–35 days posthatch) and at least 2 days posteclosion, when the cuticle hardens and final adult colour and dimensions have stabilized. Individuals were anaesthetized with carbon dioxide on a fritted glass microscope stage, and placed on a custom-designed brass cradle, using dental wax to hold wings flat (Figure 2a). Bugs were photographed using a Polaroid Digital Microscope Camera with a glass-etched 2 mm scale bar adjusted to be exactly at the focal plane. For each picture, X-Y coordinates were recorded for both ends of the scale bar as well as for the wing tip and the proximal posterior wing process, using Scion Image software (2000) on a PC (Figure 2b). Wing length (WL) was taken as the distance between these two points in mm. Repeatability for this trait was 99% (40 individuals measured 10 times each; F=638, P<0.0001).

Figure 2
figure 2

(a) Cradle used to immobilize live bugs for microscope photography. (b) Morphological traits measured (WL=wing length; PC=pronotum colour; WC=wing colour).

Colours on parts of the pronotum (PC) and forewing (WC) (Figure 2b) were assessed by comparing individuals to a colour bar created freshly from standard colour chips each generation. Although colour in this species varies nearly continuously, the range was divided into a 5-unit scale in order to have enough classes to treat the data as continuous, but few enough to minimize classification error (1=light yellow #1215; 2=yellow-orange #1225; 3=orange #116, 2X; 4=red-orange #130, 2X; 5=red # 165, 2X; Letraset Ltd., 1998). Repeatability was 94% for wing colour (50 individuals measured 10 times each, F=142, P<0.0001), and 66% for pronotum colour (F=20, P<0.0001). A Meiji TechnoRZ zoom stereo microscope (10 × magnification) and V-Lux 1000 light source set to maximum were used throughout. Values for colours and wing length were recorded for both parents and eight randomly selected offspring per family (four each males and females).

For individual females, the rate of egg production following mating has been found to be highly correlated with the lifetime quantity of eggs produced, as well as average clutch size and total fecundity by weight (Leslie, 1990). Total mass of eggs laid during two time spans was thus recorded for each wild female, using a modification of Palmer and Dingle (1986) and Groeters and Dingle (1996): (1) from pair creation to 5–6 days afterward, and (2) from that point to 9–12 days afterward. All eggs were weighed on glassine paper using a Sartorious R200D electronic semi-microbalance. Masses were converted into mg/day production rates (‘early fecundity’, EF and ‘later fecundity’, LF) by dividing by the number of full days elapsed between mating and first weighing or between the first and second weighing dates, respectively.

Egg mass was considered to be a function of the maternal generation, following Palmer and Dingle (1986), in recognition of the overwhelming likelihood of maternal effects in this and other egg characters (Lynch and Walsh, 1998). Sperm storage in females prevents the use of more than one male per female, so male influence on the trait could not be separately assessed. Egg mass was recorded for each dam and for four (Generation 1) or eight (Generation 2) of her female offspring at the same age.

Statistical analyses

Heritability in captivity, hC2 = σAC2PC2, is estimated by b ( M O c · M P c ) , the regression of mid-offspring on mid-parent values, or 2 * b ( M D c · D c ) , twice the regression of mid-daughter on dam values, with both generations reared in captivity (Generations 1 and 2) (Falconer and MacKay, 1996). VPC, an estimate of the total phenotypic variance (σPC2), was obtained from the total variance within dams or sires for single-sex estimates, or pooled across sexes for two-sex estimates. Additive genetic variance in captivity (σAC2) was estimated by VAC = hC2 * VPC. Residual variation in captivity (σRC2, nonadditive genetic and environmental variance combined) was estimated as VRC = VPCVAC, and the coefficient of additive genetic variation (CVA) was estimated as defined in the Introduction.

Two consecutive generations were not available in nature, so estimation in this environment is less straightforward. Coyne and Beecham (1987), Prout and Barker (1989), and Riska et al (1989) together develop the relationships among three different estimates of heritability in nature, hN2. The first of these, hNV2, assumes constancy of σA2 across environments, and is given by hNV2 = VAC/VPN, in which VPN estimates σPN2, the total phenotypic variance in parents from nature (Generation 0). When σA2 is lower in captivity than in nature, hNV2 will be an underestimate of the true heritability in nature.

Using the fact that the regression of captive offspring on parents from nature β ( M O c · M P N ) is defined as γσACσANPN2 (Lande in Coyne and Beecham, 1987), with some rearrangement, the second estimate of heritability in nature, γ2hN2, is given by

in which b ( M O c · M P N ) 2 is the squared estimate of β ( M O c · M P N ) , and γ is the additive genetic correlation between the trait in nature and in captivity. γ is high when the rank ordering of genotype performances in both environments is the same, and low when it is not, when norms of reaction cross (Coyne and Beecham, 1987), so that this estimate is biased when γ < 1.

The final estimate of heritability in nature, h2 is defined as β ( M O c · M P N ) and is estimated by b ( M O c · M P N ) . It contains both sources of bias: it assumes both that γ = 1 and that σA2 is constant across environments.

Based on the relationships among these three estimates of heritability in nature, Riska et al (1989) develop a parameter κ, which is defined as

and is estimated in this study by k = hNV2/h2. Because of the conditions in which these estimates are biased, when k<1, it follows that hNV2<h22hN2 and that additive genetic variance is greater in nature than in captivity – and furthermore, that γ2hN2 is the best approximation of the lower bound of heritability in nature. Conversely, when k>1, hNV2>h22hN2, and k provides no information about the relationship between additive variance in the two environments, nor which of the estimates of heritability in nature is the best (though ;γ2hN2, as the smallest, is most conservative).

Quantitative genetic parameters based on family relationships may be unreliable if there are significant gender differences in the means or variances of traits under consideration, or if data do not fit the assumptions of tests performed (Falconer and MacKay, 1996). Thus, prior to conducting the above analyses, all traits were examined for normality of residuals, with sexes both apart and combined, as well as for nonrandom associations between means and variances, and were transformed as needed (see Results). Differences in mean across sexes and generations in colours and length were examined using a 2-way ANOVA, while differences across generations in egg mass were examined with a Kruskal–Wallis test. Differences in variance were tested using Levene's test (SAS Institute Inc., 2001). Where the sexes differed systematically in mean, this was corrected by adding to each member of the smaller sex the average difference between female and male means (Norry et al, 1997). Following transformation, all analyses were conducted on both sexes separately to look for sex-specific effects, before combining sexes for final analyses if no differences were found.

Because of the complex, composite, nature of many of the statistics estimated in this study, parametric standard errors may be either unreliable, cumbersome to calculate, and/or simply unavailable (Lynch and Walsh, 1998, p. 560). The distributions of all statistics were thus estimated empirically using the bootstrap, following Riska et al (1989) and Manly (1997). In all, 50 family units (keeping parents with their associated offspring) were sampled independently with replacement from Generations 0–1 and 1–2 for 2000 replicates. All statistics were calculated for each replicate; bias-corrected parameter estimates were produced for the original data set following Efron and Tibshirani (1986), while the standard deviations of the empirical distributions were used to estimate standard errors of all statistics (Manly, 1997).

Hypothesis testing is similarly complicated when composite statistics are the subject of comparison. Although individual regression coefficients could be tested for difference from 0 and from each other using standard F and homogeneity-of-slopes methods, direct comparison of estimates from nature and captivity (when the two are estimated using different methods or are themselves combinations of other estimates) is cumbersome or not possible with traditional large-sample approximations (Lynch and Walsh, 1998, pp. 561, 807). Hence, cross-environment differences were tested using randomization, using the ratio of the larger to smaller estimate as the test statistic for both variance components and the additive coefficient of variation (Manly, 1997). For each comparison, the test statistic was calculated for the original data, then parents were randomized with respect to offspring, leaving each in their respective generations, and the test statistic was recalculated for each of 5000 replicates. This ‘restricted randomization’ tests for changes in variance across environments while holding mean effects constant (that is, correcting for any overall mean difference in offspring across generations; Petraitis et al, 2001). The probability of the observed test statistic was then taken as its location to the nearest percentile in the empirical distribution of 5000 test statistics. All analyses were conducted on a PC using Statistical Analysis System procedures (SAS Institute Inc., 2001).

Results

Trait distributions

The distributions of wing colour (WC) and pronotum colour (PC) were remarkably constant across different rearing environments, although the pronotum was on average 1 unit darker than the wing, and less variable (Table 1). There was a slight and significant trend toward females being darker than males in pronotum colour (F=10.8, P<0.001), and a similar nonsignificant trend in wing colour, so values for males were corrected by adding the average difference between males and females to each male value. The phenotypic correlation between WC and PC was moderate (Spearman's rank coefficient=0.56, P<0.0001), such that individuals with darker wings tended to have darker pronota; however, neither was correlated with other traits considered.

Table 1 Summary statistics for five traits in wild-caught and laboratory reared O. fasciatus (original measurement scales)

Average wing lengths (WL) in both environments (Table 1) were comparable to those recorded in previous studies (Groeters and Dingle, 1996). As with colour traits, WL differed consistently between sexes, with females larger (F=1326, P<0.0001) and more variable (F=8.5, P=0.004). This correlation of mean and variance held across generations as well, so wing lengths were ln-transformed, and the mean difference between female and male lengths was then added to each male length. WL was not phenotypically correlated with any other trait in the study.

In contrast to morphological traits, distributions of egg production rates (early fecundity, EF and later fecundity, LF) changed greatly across rearing environments (Table 1). Mean production rates increased significantly each generation (H = 203, P<0.0001 for EF, 362.5 P<0.0001 for LF), and variability increased with mean production (F = 35.2 for EF, 59.8 for LF, P<0.0001). Simple ln-transformation did not remove the dependence of variance on mean in these traits, so EF and LF were both transformed using Kleczkowski's method (Lynch and Walsh, 1998, p. 300). The phenotypic correlation between the two measures was moderate (Spearman's rank coefficient=0.54, P<0.0001), and neither was phenotypically correlated with any other trait.

Heritabilities, variance components and errors

Heritabilities were significantly different from zero in captivity for all three morphological traits, and in nature for the two colour traits and for early fecundity (Table 2). Although differences in single-sex estimates indicate the possibility of slight maternal effects for wing colour in captivity, differences were not significant, so midparent–midoffspring estimates were used in all subsequent analyses. Similarly, in estimates from nature, there was a trend toward a paternal effect on pronotum colour (PC) and a maternal effect on wing length (WL), but, in the absence of significant differences, sexes were combined in further analyses.

Table 2 Heritability analysis for five traits in O. fasciatus in nature and captivity

Analysis of changes in heritability in wing colour (WC) across environments is the most straightforward of all traits; all three estimates of hN2 were high (0.90–0.92) and virtually identical to each other, as well as to the estimate of hC2 (0.92) (Table 2). kWC thus was 1, indicating that γ ≈1, and that VA for WC remained constant between nature and captivity. Pronotum colour (PC), despite its phenotypic correlation with WC, displayed a slightly different pattern. All three estimates of hN2 (0.76–1.00) exceeded hC2 (0.59), indicating a decline in heritability upon captivity. k was just slightly larger than 1, indicating that despite the decline in heritability, additive variance remained essentially unchanged. Although in these circumstances, the data do not indicate which of the three estimates in nature is most accurate, the estimate of γ2hN2 was used in later analyses, as it is the most conservative.

In the case of wing length (WL), the three estimates for heritability in nature range from considerably more to considerably less (0.55–0.02) than that in captivity (0.34), and k is 6.05 (much larger than 1), indicating that no information is available from this level of analysis regarding changes in heritability or additive genetic variance from nature to captivity in this trait.

However, both fecundity traits (EF and LF) were associated with a k below 1 (0.29 and 0.21, respectively), indicating a decline in additive genetic variance in captivity in both cases (Table 2). Although Riska et al (1989) assert that in cases where k< 1, γ2hN2 is the best estimator of hN2, the low estimate of VA in captivity for both of these traits in this case makes the estimator unstable, and h2 was used as a (more conservative) lower bound of heritability instead. There was a trend toward decline in heritability in EF, as it changed from significantly above zero to no different from zero.

A more detailed picture of changes in variance components can be gleaned by combining the above heritability estimates with estimates of total phenotypic variance (VP) for the traits in question (Table 3). Patterns in VP were clearer than those in heritability, due to relatively smaller standard errors: VP remained constant for WC, increased significantly for PC and WL, and decreased significantly for EF and LF (Table 4). No significant changes were found in heritability for any trait using randomization tests, in the case of PC because the estimate used for lower bound of heritability in nature, γ2hN2, is associated with wide standard errors. Despite their disparate changes across traits when considered singly, estimates of VP and h2 combine to reveal a striking pattern: CVA was relatively stable in WC, PC and WL (with very small but sometimes significant increases or decreases, around 1–2%), but declined significantly in EF and LF, between 5 and 7%. These changes appear to be independent of initial evolvabilities, which though highest in wing colour, are comparable (0–12%) in all other traits, regardless of trait type.

Table 3 Information about additive genetic variance (VA) and residual variance (VR) that may be derived from known changes in heritability (h2) and phenotypic variance (VP) from nature to captivity
Table 4 Heritabilities, variance components, and evolvabilities for five traits in O. fasciatus using bootstrap bias-corrected estimates (standard errors in parentheseis)

Hypothesis testing on variance components in this study presents something of a challenge in terms of multiple comparisons. If all traits and statistics considered were independent, Table 4 would contain a total of 5 traits *5 statistics = 25 independent comparisons. Manly (1997) suggests that in randomization tests, one method of guarding against Type 1 errors is to divide the desired significance level (typically 0.05) by the number of independent comparisons made, which would leave P0.002 as ‘significant’ in this case. However, he further points out that any correlations in the data will render this overly conservative. Indeed, the statistics in these 25 comparisons are not independent; within a trait, VA is positively correlated with CVA and h2, and VA and VR are constrained to sum to VP. Furthermore, some traits in this study are correlated phenotypically, though presence or absence of phenotypic correlations may not provide any information about genetic correlations (Roff, 1995). For example, Palmer and Dingle (1986) found a strong and significant negative genetic correlation between fecundity and wing length in O. fasciatus, despite finding a weaker and positive phenotypic correlation. Unfortunately, the number of families available in this study do not provide sufficient power for examining genetic correlations. Other authors have suggested that heterogeneity in phenotypic variability among traits within populations indicates that they may be treated statistically as independent estimates of shifts across environments (Bryant and Meffert, 1998). Combining these considerations, I take probabilities of 0.01 or less to be ‘significant’ for randomization

Discussion

Previous studies have compared heritabilities in nature and in captivity in order to examine the question of the validity of extrapolating more easily obtained laboratory results to natural populations of interest. In the present study, I have used techniques developed in this context to examine an assumption often made in forming captive populations for conservation purposes, that a small number of founders can capture a large proportion of the genetic variation of wild populations. Although my estimates in nature are less than ideal, since they rely on data derived in part from captive individuals, if anything they will be conservative, as they are minima which may be biased downward if additive genetic correlations across environments are not perfect.

In the context of the first question, no significant changes in heritability were observed in any trait in the transition from nature to captivity. However, this cannot be taken as straightforward support for previous claims of a predictable relationship between heritability in nature and captivity (eg, Bitner-Mathé and Klaczko, 1998): the lack of significant differences appears to be due not to small changes in the location of estimates, but to the large errors around those estimates. This is particularly so in the fecundity traits examined, and underscores the shortcomings of studies on single traits in isolation. Had only wing colour been considered, the conclusion that estimates in captivity can readily be extrapolated to nature might seem reasonable, as estimates in both environments are very similar, with small errors. However, when the additional traits are considered, this pattern is much less clear, in keeping with findings summarized across a large number of single-trait studies (Weigensberg and Roff, 1996).

Indeed, large errors in heritability estimates suggest that the other measure of heritable variation used in this study, evolvability, may be a valuable additional metric in comparisons of genetic variance. Although significant heritable variation was detected in a life history and two morphological traits in nature and in three morphological traits in captivity, comparison of these values with other studies emphasizes the variability in heritability in species even within the same environment, across different studies. While in agreement with a previous study finding moderate heritability for early fecundity in captivity (0.44 in the present study vs 0.25–0.31 in Groeters and Dingle, 1988) and a low minimum heritability for wing length in nature (0.06 compared with 0.10–0.20 in Groeters and Dingle, 1996), the present estimate of heritability of wing length in captivity (0.34) is considerably lower than the range of values uncovered in previous work (0.6–0.87: Palmer and Dingle, 1986; 0.45–0.53: Dingle et al, 1988; 0.80: Groeters and Dingle, 1996).

The comparisons of evolvability across environments in this study provide additional evidence that the lack of significant change observed in heritability from nature to captivity is likely to stem from large errors, rather than from true stability in variance components. Although evolvability remained unchanged in the three morphological traits examined, shifting 1–2% or less, it declined 5–8%, and significantly, in the fecundity traits. This large and significant decline is striking for a number of reasons. First, changes are inherently more difficult to detect in statistics with the decreased precision imposed by single-sex estimation. Second, estimates of VAN and CVAN for these traits are lower bounds of these parameters, such that if anything, true declines in captivity were greater than observed. Finally, the ratio test used to detect declines in evolvability loses power when means between groups differ (Manly, 1997, p. 103), as they did in the case of both EF and LF (Table 1). These same factors also make it less surprising that a change of less than 1% in evolvability in wing length, which has fewer of these limitations, was significant.

The significant decline in evolvability for EF and LF may be the result of several factors, as they differed from the other traits examined in several respects. Although generation 0 completed its development in nature, these two traits, unlike others, were not measured in fully natural conditions: pairs were mated and eggs were collected in a captive context. However, this would tend to make the estimates from different generations more similar rather than more different. Second, selection may have been another factor depleting additive genetic variation in these traits. Every effort was made to minimize selection for improved captive performance by ensuring that every family was represented in every generation and also randomly selecting individuals within families to be used as parents (Falconer and MacKay, 1996). However, unlike other traits examined, individuals on the low end (or at 0) on the fecundity scale could not, by definition, be represented in the next generation, so any genetic factors associated with poor reproductive performance in captivity were necessarily lost. Finally, although care was taken to sample wild individuals widely and yet within a single subpopulation, it is conceivable that founders came from more than one subpopulation. However, the inflation of genetic variance possible from such mixing would be expected to influence natural and captive estimates equally, and thus is not likely to be a factor contributing to the declines observed in evolvability for fecundity.

Instead, other factors seem more likely to be the cause of the significant declines in evolvability in EF and LF. ‘Domesticating’ selection reducing additive genetic variance for fecundity could have been generated in the first generation of captivity by gene-by-environment interactions, such that variants were lost who produce well in nature but poorly in captivity. Such interactions can cause changes in additive genetic variance even in the absence of selection, when genetic variants themselves are not lost, but their effects change. Extensive interactions of fecundity with photoperiod (Groeters and Dingle, 1987), temperature (Groeters and Dingle, 1988), and flight activity (Slansky, 1980) have been observed in this species and in closely related ones (Kasule, 1991), consistent with the sharp change in fecundity measures across the two environments seen here. However, whether observed declines may also be due to lower levels of ‘stress’ in captivity remains unclear (Bijlsma and Loeschcke, 1997).

It is tempting to conclude that the stability of evolvability in WC, PC and WL across the nature–captivity transition vs its decline in EF and LF represents a fundamental difference between morphological and life-history traits in general. Such a difference could result from the consistent directional selection traits closely correlated with fitness are assumed to experience in nature (Mousseau and Roff, 1987) or from enhanced developmental noise in life-history traits, which may be under the influence of a greater number of loci (Houle, 1992 and references therein), both of which might increase the opportunity for gene-by-environment interactions. However, with so small a sample size (just two ‘life-history’ traits and three ‘morphological’ traits, themselves phenotypically correlated), such conclusions would be premature.

Regardless of the reasons behind the difference, an important result is that, though selection or gene-by-environment interactions may generate significant changes in evolvability in some circumstances, this does not always have to be the case. The conservation implications of stability in evolvability for some traits in the first generation following capture are not trivial: in line with classic theory, it appears possible, at least for some traits, to capture the full compliment of potentially adaptive genetic variation from nature, with even a limited number of founders. If gene-by-environment interactions or selective losses were the rule, fundamental genetic constraints on all traits at the moment of sampling would have to be added to the already considerable list of limitations on the usefulness of captive breeding as a conservation tool (Kleiman et al, 1996).

However, it is sobering to note that the 50 pairs of founders in this study is still greater than the 10 pairs currently considered adequate in conservation contexts (eg, Clark et al, 1997). Given recent advances in applying techniques used in agricultural production to estimate variance components in nondomestic species with complex pedigrees (eg, Milner et al, 2000) it would be of great interest to apply the above methods to endangered species currently in captivity in which wild founders are still alive, to determine whether evolvability can remain stable in nonmodel organisms, even when founder numbers are smaller and drift might be expected to have a more pronounced effect. Such methods also might help identify which among many traits would be useful to monitor in such populations over successive generations. It seems likely that such efforts would be of greater value than simply monitoring heritability, which has been advocated in the past (Cheverud et al, 1994; Lynch, 1996).