An Overview of
PESTICIDE IMPACT ASSESSMENT SYSTEMS
(
a.k.a. "Pesticide Risk Indicators")
based on
Indexing or Ranking Pesticides by Environmental Impact

 

 

 

 

Background Paper Prepared for the
Organisation of Economic Cooperation and Development (OECD)
Workshop on Pesticide Risk Indicators
21-23 April, 1997
Copenhagen, Denmark
(revised May 19, 1997)

 

Lois Levitan

Department of Fruit and Vegetable Science
Cornell University
Ithaca, New York, USA

E-mail: LCL3@cornell.edu

 

 

Support for this research has been provided by NRICGP/USDA Grant No. 95-37313-1940 and by the Department of Fruit and Vegetable Science, Cornell University, Ithaca, NY USA.

OUTLINE

Section

Page

  1. Introduction

2

  • Objectives and Limitations of Ranking Pesticides by Environmental Impacts
  • 3

  • Typology of Pesticide Ranking Systems for Different Purposes
  • 17

  • Generic Methods for Ranking Pesticides
  • 34

  • Pesticide Impact /Risk Ranking Systems: How They Work; What They Show
  • Decision Aids for Farmers and Other Land Managers

      • The Environmental Impact Quotient 61
      • Stemilt Growers Responsible Choice Point Summary 68
      • PestDecide 72
      • Ipest 79
      • CLM Environmental Yardstick for Pesticides 85

    Policy Tools

    Indexing National Trends in Agricultural Pesticide Risk:

      • Consumers Union 90
      • USDA ERS Chronic and Acute Risk Indicators 93
      • SYNOPS 95

    Focusing Attention on Most Hazardous Pesticides:

      • UC Berkeley Environmental Health Policy Program 98

    Applying Human Health Risk Assessments to Pesticide Labels:

      • World Health Organization Classification 103
      • US EPA Classification by Acute Human Health Hazard 107

    Screening for Hazardous Chemicals:

      • Toxic Substances Chemical Scoring System 109
      • CHEMS-1: Chemical Hazard Evaluation for Management Strategies 113

    Measuring Adoption of Integrated Pest Management:

      • USDA ERS Appraisal of IPM Practice on US Cropland 118

    • WWF/Consumers Union BioIntensive IPM Continuum 121

    59

    1. Future Directions in Pesticide Impact Assessment

    125

  • References
  • 127

     

    INTRODUCTION

    Many groups of individuals and types of institutions--including farmers and other land managers, consumers and consumer groups, food retailers and agribusinesses, regulatory agencies and regulatory "watchdogs"--have a stake in better understanding the non-target impacts of pesticides used in agriculture, landscaping, materials preservation, and elsewhere in modern society. In the past, much of the attention on pesticides focused narrowly on monitoring costs to producers and efficacy in controlling target pests. When non-target impacts were considered, the quantity of pesticides applied was generally used as the only indicator of risk. However, especially as new classes of more potent chemicals have been developed which require far lower dosages than older types of pesticides, it has become increasingly apparent that pesticide weight is not a sufficient proxy for risk. Thus a diverse research community is working to develop methods for more accurately estimating the impacts of pest control products and methods on one or more environmental indicators.

    This paper is an overview of pesticide impact assessment systems which compare the characteristics and effects of different pest controls and generate an index or ranking of pest control options, or which compare pesticide risks over time or in different places. These type of assessment tools are sometimes called "pesticide risk indicators." The paper begins by defining and critically examining this EIA methodology. A typology is suggested for distinguishing among three categories of assessment systems: (1) decision aids for farmers/growers and other land managers, (2) research and policy tools for use by governments, industry or academia, and (3)"ecolabeling" systems designed to influence consumer opinion and market behavior. Systems are differentiated by objective, arena of activity, target audience, and by how an economic component figures into the assessment. Methods for constructing and calculating the rankings are described in the fourth section. The fifth section illustrates the procedures and specific uses for a number of farmer decision aids and policy tools. The paper concludes by looking to the future, at trends in the development and demand for pesticide ranking and indexing systems.

     

    OBJECTIVES AND LIMITATIONS OF RANKING PESTICIDES BY ENVIRONMENTAL IMPACTS

     

    What Is Environmental Impact Assessment (EIA) of Pest Controls?

    Environmental impact assessments (EIA) are measures or estimates of the consequences of an action--in this case the application of pest control products or practices--on one or more environmental parameters. EIAs may simply be methods for identifying changes in the environment, or they may also evaluate the magnitude and significance of these changes. EIA methodologies include:

    The last of these approaches--indexing systems--draw on, synthesize and compare information collected by other EIA methodologies and thus become tools for decision-making and policy formation. Many types of environmental variables can be evaluated with the indexing methodology, not only those which can be sampled, monitored or mathematically modeled. Indexing systems can thus make the leap from assessing test endpoints to also assessing decision endpoints. To illustrate the difference using an example from pesticide impacts on honey bees: Measures of pesticide toxicity to bees can provide information about pesticide lethality (LDx HB), or the effective dose at which certain behaviors (such as nectar-collecting activity) change (EDNECTAR-COLLECTING). However, a beekeeper is less interested in these test endpoints than in knowing how a pesticide application will affect hive survival or, perhaps, crop pollination. Therefore, the decision points of interest to beekeepers are how the impact on honey bee colonies might be reduced by using a different pest control, a lower dosage or a different time of application.

    Pesticide indexing and ranking systems can help answer such questions by comparing alternatives. These methods can also contribute to policy analysis by monitoring trends in pesticide risk and evaluating the success of risk reduction initiatives. Especially when there are significant economic, safety and environmental ramifications from an assessment, it is vitally important that it be based on meaningful environmental criteria and indicators. Natural questions to then ask are: Which environmental parameters should be included in an assessment as descriptors of the environment? What indicators should be used to register effects on the selected environmental variables? and How should these impacts be measured, scored and interpreted?

    The images which come to mind upon hearing the term "environmental impact" are different for each of us, depending upon our vantage and how we value the components of the environment. In some contexts the environment is considered distinct from public health and other social impacts, but in this paper all non-target effects are included under the rubric of environmental impacts. A complete set of enviro-social parameters for assessing consequences of pest control activities includes:

    The choice of assessment parameters from this vast array of potential indicators is not trivial, as illustrated by the radically different results from several pesticide ranking systems, shown in Figure 1. The rank order of pesticides depends in part upon the components of the analysis--the pesticides considered, the variables assessed, the choice of specific measurable endpoints as the indicators of impacts on these variables; the mathematical structure of the model, including relative weighting of variables and scoring of results; the method for filling data gaps; and whether usage data are factored into the equation (i.e., a ranking by hazard or by hazard potential/risk). A similar comparison by Pease et al. (1996) found only one insecticide--permethrin--among the ten most hazardous pesticides, as rated by both the Environmental Impact Quotient (EIQ) and by a US National Oceanic and Atmospheric Administration model (Kovach et al. 1992). Results differed because of the factors just listed. More specifically, the NOAA focus is on estuarine species while the EIQ has a terrestrial emphasis. Thus dimethoate and parathion get high hazard ratings from the EIQ because both are acutely toxic to birds, but a relatively low NOAA ranking because that model does not consider impacts to avian species. Also, a number of the fruit and vegetable pesticides surveyed by the EIQ were not ranked by NOAA because they are little used in California, where that model was applied. Conversely, the wood preservative pentachlorophenol and persistent insecticide toxaphene are not included in the EIQ dataset, but receive high hazard ratings from the NOAA model because of pentachlorophenol's high bioconcentration factor (an environmental parameter not included in the EIQ) and toxaphene's toxicity to fish and invertebrates.

     

    Figure 1 Comparison of"most hazardous" pesticides as ranked by three assessment systems. Only 2,4-D, trifluralin and dimethoate are on more than one list

    Note: These rankings are not weighted by pesticide release, usage, environmental concentration, exposure or typical dosage and thus should not be interpreted as presenting the greatest danger or risk.

    Pesticides from List of Top 30 Chemicals, ranked by Shannon et al. Screening System, Not Weighted by Usage (1997)

     

    Highest-Hazard Pesticides, ranked by the UC Environmental Health Policy Program (Pease et al. 1996)

     

    Highest Environmental Impact Quotient (EIQ) values for pesticides (Kovach et al. 1992)

             

    Rank

    Pesticide

     

    Rank

    Pesticide

     

    Rank

    Pesticide

    1

    terbufos

     

    1

    methomyl

     

    1

    disulfoton

    2

    trifluralin

     

    2

    aldicarb

     

    2

    parathion

    3

    hexachlorobenzene

     

    3

    carbofuran

     

    3

    propoxur

    4

    anthracene

     

    4

    2, 4-D (+ salts)

     

    4

    oxydemeton-methyl

    5

    chlorothalonil

     

    5

    mevinphos

     

    5

    fenamiphos

    6

    2, 4-D

     

    6

    dimethoate

     

    6

    dimethoate

    7

    1,3-dichloropropene

     

    7

    trifluralin

     

    7

    paraquat

     

    The question of whether and how to weight and integrate impacts on different organisms and environmental components is also not trivial. The answer can determine how policy-makers and the public come to view different products. The lists of "most dangerous" compounds--which are the succinct results of pesticide impact assessments--are remembered long after the basis for drawing the conclusions is forgotten. Despite these issues regarding the application and interpretation of ranking systems, many constituencies are demanding some type of handle for comparing the environmental impacts of different pest control products and methods. Thus researchers from a range of disciplines are attempting to construct meaningful and reliable evaluative tools.

    Box 1 What is the Relation Between "Pesticide Impact Assessment Systems, Models or Tools" and "Pesticide Risk Indicators"?

    The decision aids created by ranking pesticides by environmental impact are herein generally referred to as "impact assessment systems, models or tools." However, others often refer to them as "pesticide risk indicators."

    In part the difference in terminology simply depends on what one is accustomed to using, but it may also reflect a difference in disciplinary orientation and methodological approach. The products generated may also be somewhat different, and put to different uses. Systems ecologists and other systems analysts think in terms of describing and quantifying energy and resource flows in complex systems, with a primary objective of identifying causes and effects of change in systems, and a penchant for understanding interactive effects. Pesticide impact assessments drawing from this disciplinary heritage--particularly those used as farmer decision tools--often retain a systems perspective and resultant complexity, as well as the related terminology. On the other hand, policy makers looking for a more terse analysis may have a predilection for "indicators" of trend highlights, which do not pretend or attempt to holistically model systems.

    In this paper the term "indicator" generally refers to the component parts of assessment systems rather than to the totality of the assessment. These indicators are the measurable endpoints that provide information about an effect or impact on the environment. For example, the half life of a pesticide in the soil is an indicator of how long the pesticide persists in the environment. An indicator of pesticide risk to birds may be a function of the pesticide’s acute toxicity to birds, its foliar persistence and its impact on food sources for birds (e.g.: on insects, worms, seeds). I.e.: Risk IndicatorBIRDS = f(Acute Toxicity, Foliar Persistence, Food Source). Risk to birds may be calculated from the LD50 BIRDS, Half Life LEAVES and LD50 WORMS which are, respectively, the measureable endpoints that indicate the impact on these toxicity and exposure variables. Yet another example of an indicator is the Groundwater Ubiquity Score, calculated as GUS = log 10(t1/2) x [4-log10 (KOC)] (Gustafson 1989). This algebraic model is used by itself as an indicator of pesticide leaching potential and is also used in combination with other indicators of different environmental impacts to create multiattribute impact assessments. Composite or integrated assessment systems are generally composed of a set of such risk or impact indicators.

    Another semantic distinction is that in this paper the term impacts is generally used, rather than either hazards or risk, to describe consequences of pest control activities. The rationale is that the word hazard is supposed to convey a type of harm, e.g.: "lethal hazard to amphibians," while risk is supposed to convey a probability of harm. By incorporating pesticide usage and exposure indicators, indexing systems typically convey more information than merely a listing of hazards, but they are generally constrained by their structure and available input data to be less than a probablistic assessment of risk. The use of the term impacts is intended to capture this approximation of "hazard potential" and risk characterization.

     

    Why Assess Environmental Impacts of Pesticides?

    I am a grape grower in Napa Valley. A group of us are trying to promote sustainable agriculture in our area. What indicators should we use to assess sustainability?

    One of the primary objectives for developing methods for assessing environmental impacts of agriculture is to be able to respond to questions such as this from farmers and other land managers. Decision makers in the field need information and methods for choosing pest control practices which have the least negative impacts on the environment, and on human health and safety. Policy makers then need to make broad brush appraisals of the impacts of such choices.

    More broadly, the impetus to develop and use pesticide impact ranking systems is to facilitate the shift to more benign pest control products and practices, and thus reduce pesticide load on the environment. It is an uphill battle: Total worldwide pesticide use is at a record high--4.7 billion pounds in 1995--and continues to grow. In the US, pesticide use reached an all time high of more than 1.2 billion pounds in 1995. Despite the fact that many newer pesticides are used at very low volume per unit area, this is more than double the amount applied 35 years ago when Rachel Carson wrote Silent Spring (1962). The biggest users remain in North America (29% of sales in 1996) and Western Europe (26% of sales), followed by all of Asia with 25% (PANUPS 1997). However, the largest increase in sales is in Latin America, accounting for one third of global sales growth and more than 10% of use. International cooperation in pesticide impacts assessment is therefore essential, because some of the most hazardous chemicals that have been banned from some countries continue to be used elsewhere, often in tropical climates where risks may be greater due to social as well as biophysical factors such as higher temperatures and greater skin hydration (PANUPS 1995).

    Simply projecting risk from pesticide usage data is not sufficient, and is even less meaningful now that highly potent chemicals are used at very low volumes. Risk-weighted indexes of usage are a more powerful measure and avoid the shell game of shifting risk from less potent chemicals used at higher volumes to more potent chemicals used at lower volumes. However, the analytic task of pesticide impacts assessment has become more complex as the number of chemical classes of pesticides has quintupled from approximately 25 in the 1970s to about 130 today, and the modes of pesticide activity affecting the environment have also diversified (Wauchope et al. 1994).

     

    Complexities and Challenges in Developing Pesticide Environmental Impact Ranking Systems

    About 1% of the 8 million chemicals listed in the Chemical Abstract Services Registry are commercially produced. Although fewer than 1% of these are used for pest control (Davis 1994), the compilation and interpretation of hazard data for 500-1000 pesticidal chemicals (the higher number reflects newly identified microbial and biochemical pesticides that have been added to the synthetic chemical arsenal of pesticides) is a daunting task. It is, however, a necessary prerequisite for development of reliable and meaningful comparative and summary assessments of pesticide impacts on the environment. Data limitations and complexities are just one of the challenges in developing pesticide environmental impact ranking systems. Other issues are the identification and integration of suitable environmental indicators; estimation of situation-specific variabilities; the lack of a common currency for comparing disparate types of impacts; and the bias against considering future impacts.

     

    Choosing Environmental Indicators and Deciding How to Integrate Them

    No one species or group of biota reacts most sensitively to all pesticides, and thus is useful as a surrogate for all others in toxicity testing. With other environmental perturbations as well, we cannot rely on a single indicator species or abiotic effect to tell all we need to know about impacts of management decisions. Thus there is a tension in the development of assessment systems between the advantages of ranking pesticides based upon impacts in one environmental dimension versus advantages of composite assessments of impacts in multiple environmental dimensions. The drawback to the former is that no single parameter can fully describe an environmental impact, and thus conclusions can be misleading unless system objectives and limitations are made explicit. On the other hand, the challenge of multi-parameter optimizations are in specifying test endpoints of significance and then integrating results--either into a composite picture of environmental impacts or by prioritizing the most critical impacts in a given situation. To grasp the conceptual challenge this poses, think about how you would weigh impacts on human beings in relation to impacts on other biota, especially if they were dissimilar in magnitude and type.

    There is also no one set of social or environmental indicators that is most appropriate to use in assessing impacts under all sets of circumstances. Even if a system developer were to decide upon a protocol for weighting, valuing and integrating impacts in the context of one assessment scenario, these issues would re-emerge in a different configuration when the question again arises on a different scale, for a different target audience, or with different situation-specific conditions. To illustrate: the types of data required for a farmer to choose a least impact but efficacious pest control may not be the same as the data required for a national assessment of agriculture practices. IPM farmers want to avoid pesticides that harm parasites and predators of the pests in their fields, but these producers might be misled by a decision model based on national data for pesticide impacts on beneficial organisms. The only such data included in the US EPA Ecological Effects dataset, for example, are acute toxicity of pesticides to worker honey bees (Atkins et al.; US EPA 1996). Even were toxicity dose responses comparable for honey bees and other beneficials, the significance of effects on these groups of organisms is likely to be quite different. When honey bees are repelled from a field by pyrethroid pesticides, for example, they survive and move on to another nectar source. However, if beneficial parasites and predators are repelled from a field, they are then not available to work in that field as biological control agents. As shown by this example, the honey bee acute toxicity data may not be sufficent or appropriate to use in a micro scale assessment for advising farmers about pesticide effects in their agroecosystems, but may be valuable as a proxy for other ecological effects in a macro scale analysis of pesticide risk trends. The design of an assessment system must, therefore, be appropriate to the objectives of the audience served.

    Some assessment systems include variables that do not really assess environmental impact. These include the variable "availability of alternatives" which could bias comparative assessments of pesticides toward dangerous pesticides with no conventionally-known alternatives. Production cost factors are also sometimes included in assessment equations alongside environmental factors.

     

    Dealing with Situation-Specific Variability

    Environmental impacts result from interactions among inherent pesticide properties and situation-specific factors, including particulars of the site, weather conditions, the diurnal and seasonal timing of application, and the application method (packaging, equipment, location, and degree of care). Interactive effects are highly variable, so that it is difficult to make reliable, application-specific predictions of ecological effects. The challenges of incorporating this variability have been addressed in various ways, depending in part on the decision-making context and designated user group. The complexity of indexing systems is generally a function of methods used to estimate exposure: the simplest systems generally consider only toxicity (and to one group of organisms) or use toxic release or production figures as a surrogate for exposure. More complex systems also evaluate one or more exposure pathways (Davis 1994).

     

    Finding a Common Currency

    Assessment indexes are well-suited for comparing relative impacts of similar management options, such as toxic impacts of different pesticides. They may be less successful comparing impacts that do not share a "common currency" of accounting units. Two such examples are (1) comparing toxic impacts of herbicides with soil compaction resulting from tilling to control weeds, or (2) comparing pesticide impacts from regional food production with impacts of resource consumption and pollution resulting from the transport of organically-produced food from a distant agricultural region. We are accustomed to using money as a common currency for trade (both for trade in ideas--as we are talking about here--as well as trading of goods), but money is inadequate for describing non-market costs such as the loss of an individual life, loss of biodiversity, disruption of an ecosystem, future costs of current soil erosion, or loss of non-replaceable resources. Although methods (such as contingent valuation, apportioning remediation costs, using travel or avoidance costs, etc.) have been developed for assigning a monetary value to non-market goods----they generally fail to capture the full dimensions of the richness and value of non-market attributes.

     

    Data Limitations

    Data are required at all stages of environmental assessment of agriculture. Because many environmental impacts are generated on different temporal and spatial scales than they are experienced, the complete set of data for assessing these impacts cannot be collected on-farm. This important factor distinguishes environmental assessments from farm production cost assessments.

    However, toxicological and ecological effects datasets of pesticides are incomplete. Of the 150 commonly used pesticides evaluated by Pease et al. 1996, data for invertebrates were available for 44 pesticides and persistence (field half-life), for 106 pesticides. In addition, some of the existing ecological effects data are inappropriate to use for assessing relative impacts because standardized testing protocols were not used and so the data are not comparable (Levitan et al. 1995). Moreover there are very limited data and no standardized datasets on impacts of new biopesticides, such as microbial and fungal pesticides. These gaps and inconsistencies in data have been less of an impediment for pesticide screening systems and case-by-case evaluations (as is done for pesticide registration), but complete and reliable datasets are critical for comparing relative impacts or creating a rank order of pesticides.

    Most available data on pesticide environmental impacts result from single-species toxicity tests mandated for pesticide registration. Ecotoxicologists have questioned the predictive value of such tests, noting that interactive effects of pesticide inputs at the community and ecosystem levels can be different than inferred from single-species effects, and that the higher level impacts can be of greater long-term environmental significance (Cairns 1986, 1991; Karr 1992). Unfortunately, no standardized datasets exist for pesticide impacts at these higher levels of ecological organization---and data availability strongly influences which parameters and types of impacts can be included in a comparative assessment system.

    In addition to limitations associated with testing single species of organisms, most ecotoxicological data are also limited because the test pesticides are generally applied in single doses of individual active ingredients, whereas biota in the environment are constantly exposed to chemical mixtures that change spatially and over time (Yang 1994). Acute impacts may result from exposure to pesticide mixtures, but long-term impacts may also result from mixtures of pesticides with other chemicals in the environment. Also datasets for impacts at standard pesticide dosages may not suffice for assessing risk from aberrantly high doses, such as those following a spill or toxic release, or at the low dosages that are only recently becoming sufficiently suspect to cause concern. Existing datasets also may not be sensitive to impacts on particularly susceptible populations--the unborn, the very young and rapidly growing, pregnant females, people on medications, and biota (human and other) whose immune systems are compromised. Evidence is accumulating which show that mixes of certain chemicals at individually-low doses can have impacts that may be magnitudes of order greater than simple additive individual low-dosage effects (McLachlan and Arnold 1996). Cumulative impacts from repeated or extended exposures can also be different than impacts from single, larger exposures. Bioconcentration of fat soluble compounds has long been known to be a cause of cumulative impacts. However, cumulative effects also result from weakened immune systems and from allergic and other auto-immune responses triggered by chemical sensitivity. These cumulative effects involve short-lived and water soluble compounds as well as persistent culprits. Little is known about such cumulative and interactive effects, especially at low or fluctuating levels and in mixtures, and particularly in terrestrial systems. Yang (1994) concludes that:

    While these comments are intended to apply to human subjects, these principles and concerns can probably be extrapolated to non-human biota, some populations of which may be more vulnerable to such risks because of limited mobility and physiological factors. We should have to assume (and indeed anecdotal evidence is being compiled which supports this point) that not only do we learn about possible impacts on human beings from mammalian studies, but also that the sublethal disorders noted in human populations may well have parallel effects on other non-target organisms.

    Many ecologists have suggested that environmental assessments consider indicators of impacts on organisms in all trophic positions and for all key roles in basic ecological processes. This would include the role of invertebrates and microorganisms in decomposition and humus formation, and the role of herbivores in interactions with vegetation (De Snoo et al. 1994). The problem with this idealized vision is of course that there is no dataset covering this array of ecological indicators. Moreover, for many of these indicators there are no protocols for collecting data and/or no accepted correlations between the direction and magnitude of presumed cause and effect. This results in an interesting tension between the knowledge among biophysical scientists of what would ideally constitute an array of test endpoints for a holistic assessment of environmental impacts, and the paucity of data to make such evaluations possible.

     

    Bias Against Future as Compared to Present Impacts

    Long-term and cumulative impacts are more difficult to comprehend and quantify than short-term impacts. As a result, less data are available and less weight tends to be given to these impacts in environmental assessments. We also tend not to consider impacts associated with future events, such as the future leaking of improperly stored pesticides, changes in soil pH leading to release of previously bound chemicals, narrowing of the gene pool, or increased pest resistance to pesticides (Riha et al. 1997). In general, there is a bias against variables for which data are difficult to collect or for which results are uncertain.

    Adverse ecological effects encompass a wide range of disturbances ranging from mortality in an individual organism to a loss in ecosystem function. Thus the ecological risk assessment process must be flexible while providing a logical and scientific structure to accommodate a broad array of stressors and ecological components (Norton et al 1992).

    The remainder of this paper takes a pragmatic look at how assessments of pesticide impact can be structured to meet specific objectives, within the limits of current knowledge and technical information.

    TYPOLOGY OF PESTICIDE RANKING SYSTEMS for different purposes: How Indicators Designed for Different Purposes Work and What They Tell Us

     

    Assessment Tools for Different Purposes: Two Proposed Typologies

    Pesticide impact and risk measurement systems are being designed to meet at least three types of objectives, to be:

    Not only are there multiple societal values involved in estimating potential hazards of pesticide use, but also methods, scale of analysis, and results differ depending upon target audiences and intended uses of the indicator. Some assessment tools wear more than one hat, but most are not equally suited for multiple roles. This section and the one which follows lay out some of the principal and idealized characteristics and distinctions among the types of assessments on the basis of the following factors:

    Another handle or typology for discriminating among assessment tools is by whether systems are based on impact or behavioral criteria and indicators. Indicators of impacts include many pesticide test endpoints, such as single species toxicity test results (e.g.: the LD50), estimates of exposure, measures of residues on food and in the environment, sublethal effects (e.g.: impacts on behavior, reproduction, genetic diversity), secondary impacts on habitat and food sources, and impacts at higher levels of ecological organization (e.g.: impacts on species richness, biomass, ecosystem productivity). As were detailed earlier, assessments based upon impact criteria have been constrained by data limitations and inconsistencies.

    In contrast to indicators of impacts, examples of behavioral indicators include: Was X crop produced by organic methods? Does this farm practice Integrated Pest Management (IPM)? Which/ How many/ What types of IPM techniques? Were biochemical or synthetic chemical pesticides used to control target pests? Assessments based on this type of indicator are not similarly constrained by gaps in the data because data points are observational. There are really two types of indicators of behavior: those based on current behaviors and those based on future or promised behaviors. Examples of the latter include the promise that fewer pesticides will be used by producers in coming years, or promises of better health safeguards for farmworkers. Assessments based on promised behaviors perform some very useful functions: They are used as ecolabeling criteria where the objective of the system is to use incentives to develop compacts between producers and accreditors in order to shift behaviors. They may also be useful in developing policy tools which project future risk trends.

    Systems based on behavioral indicators are a creative means for circumventing some of the difficulties of multiattribute optimization, and the problem of gaps in impact data. They do so by assuming a positive relationship between certain sets of behaviors and impacts. More specifically, behavioral assessments are generally predicated on the assumption that IPM and organic methods are more benign in their environmental impacts than post World War II "conventional" chemical pest control strategies. This is why people "buy organic" and why the IPM research community is interested in developing IPM accreditation standards and IPM ecolabels. However, assessments based on impacts data are essential for undergirding, supporting and challenging these implicit assumptions about the impacts of certain behaviors.

     

    Decision Aids for Farmers/Growers and Other Land Managers

    In the same way that the term "environmental impacts" is used in this paper as a shorthand for the broad array of impacts that are sometimes separately listed as public health, ecological, economic or social, the phrase "farmer decision tools" is used here as a shorthand to refer to all site-based, field-scale decision aids intended to be used by property managers. This group does include farmers, but also includes homeowners and others (e.g.: lawn maintenance companies, golf course managers, foresters, etc.) who must deal with pests of buildings and grounds. In other words this typology and these assessment tools, are not limited to agricultural applications.

    The objective of this type of assessment tool is to inform people who make pest management choices about potential environmental consequences of their decisions. Farmer decision models are generally structured as a comparison of pest control options, based upon some subset of key enviro-social indicators. As decision aids, the models are most reliable when tailored to meet situation-specific conditions for a given field or farm. However, the manipulation of situation-specific data (e.g.: soil type, local climate, application timing and equipment) can be cumbersome and overwhelming, so that succinct and user-friendly measurement tools may find more widespread use. Some of the seemingly more simple models make use of variables which impute situation-specific characteristics, rather than actually measuring and utilizing situation-specific data.

    Because of the complexity of determining which pest control management practice is "best" under a particular set of situation-specific conditions, the development of reliable decision aids for farmers presents a formidable challenge. The results (recommendations) of most such assessments should be linked with the decision making process because the ranking of pest control options may differ depending upon the situation-specific conditions. Thus farm-scale decision tools may best be presented in a workbook format or as a computerized expert system. These formats, unlike standalone rankings of pesticides by risk, permit "if-then-else" routines that can be responsive to situation-specific variability.

    The typical unit of analysis in producer decision tools is either a discrete pest control application or practice, or else the compendium of decisions made during one or more production seasons. Together these decisions constitute a pest control strategy. Results are often expressed as a listing of unitless impact points for an array of pest control products. Results may also be expressed as a similar impact index of pest controls, but specifically as these are used to combat a given target pest (see figures accompanying the PestDecide© farmer decision tool for an example). The reason for this level of specificity is that pesticides are applied at different rates and times, with different expected efficacy, to combat different target pests. Non-target impacts may differ accordingly.

    Farmer decision tools typically identify pest controls by trade product names--rather than only by the common name of the pesticide active ingredient--because efficacy and non-target impacts also differ with formulation. Unfortunately, input data are rarely specific to formulated products and, therefore, the impacts of formulations are at best adjusted by the concentration of active ingredient and the type of product (e.g.: granular, liquid, vapor). However, impact data do not generally reflect the actual characteristics of the trade product, as affected by the mixture of adjuvants and other inert ingredients in the formulation. The full season time horizon that is considered by a number of farm-scale assessments provides a broader and more realistic picture of environmental impacts than a snapshot evaluation of a single pest control application. Some examples: The application of a lower impact (and thus seemingly preferable) pest control at one point in the season may necessitate additional subsequent applications of various pest controls later in the season, causing a larger cumulative impact. Repeated applications of certain lower impact pesticides may lead to more rapid build-up of pest resistance, and thus ultimately to a renewed reliance on more hazardous pesticides. Or the buildup of weed seeds following a period without herbicide use could necessitate more herbicide use later in the season or in subsequent seasons. These interactive and cumulative factors would be missed by an assessment limited to the impacts from a single pest control application. Some of the full season assessments are crafted to do double duty as ecolabels, with accreditation offered to the yields of production scenarios that meet threshold criteria (or which receive a certain number of impact points).

    The value of assessment tools for farm managers depends on their ability to provide valid information pertinent to specific pest management decisions. The problematic of these decision aids is to integrate the many factors contributing to each management decision--a farmer’s personal concerns for applicator safety and the agroenvironment, altruistic or regulated interests in protecting human health and the broader environment, business interests in production costs and in securing a market niche. An assessment model would be hard pressed to deal with all of these factors, but it is not unusual for farmer decision tools to include production cost estimates for each pest management choice or scenario. In contrast, assessments designed as policy tools may instead weigh economic costs to society at large, whereas ecolabels tend to leave the economic assessment to the consumer (or intermediaries) in the marketplace.

     

    Policy and Analytic Tools for Use by Governments, Industry, Academia

    In this typology a wide range of assessment systems used for many distinct purposes fall under the aegis of policy tools. The types and objectives of policy tools include:

    The types of decision-makers targeted by policy tools are as different as this list of objectives. They include IPM research and extension teams and their funders working in academia, extension, and budget offices of government and foundations, as well as policy-makers at all levels of government and in the environmental research and activist communities. The arena of activity for policy tools could be said to be "at the roundtable," in contrast to the farmer’s field or the marketplace. The unit of analysis for assessing the success of IPM programs may be a single production unit (i.e.: a farm) or the set of farms in a region which produce similar commodities. For other policy tools, the unit of analysis may be the total quantities of a pest control product used in a state, a nation, or internationally. Thus the situation-specific variability and details of importance in making pest management decisions at the field scale often wash out at this larger scale. Because results of the analysis will not differ, for the most part, on the basis of situation-specific responses from producers, risk indexes used for policy purposes may be published separately from the decision-making process, as tables in a scientific paper or as a pesticide label.

    The development of policy tools to meet many of the objectives listed is perhaps an easier row to hoe than developing holistic pest control decision aids for farmers, because policy objectives are often more narrowly defined. For example, there is worldwide concern about decline in amphibian populations. If herpetologists were to synthesize existing data and create a pesticide classification scheme based upon the hazard to frogs, this assessment would be an important pesticide policy tool with regard to amphibians. It would, however, be but a small addition to the array of information needed for a holistic farm-scale decision tool. Similarly, farmers require significantly more detail about impacts of pest control products than are generally considered in policy assessments. Impact data developed for a broader scale of policy analysis--for nations, industrialized countries, etc.--are frequently too generic to be suitable for farmers to use in making site-specific decisions about controlling a particular target pest.

    Another distinction is that most national datasets lump all forms of a pesticide active ingredient and all the formulated products in which it is found, whereas the ideal farm-scale decision tool will be sensitive to the different toxicities of formulated products (due to adjuvants and different formulation types) and to factors which mitigate exposure to non-target organisms (including packaging and application equipment). A major difference between the objectives of farm-scale decision tools and most assessments for policy purposes is that the former are generally constructed as comparisons among available options for control of specific pests--since a choice must be made from among the finite group of alternatives. Risk assessments for policy purposes are rarely constrained in the same way. Thus in a ranked list of hazardous pesticides all of the options for a specific task may be among the most or least hazardous.

    Although a number of assessment tools have been created for the policy and research arenas, systems which holistically measure and express global (or other large scale) pesticide risk and trends in risk reduction are still at basic stages of development. Procedurally, most such ranking systems weight toxicity factors by pesticide usage figures to create a set of most hazardous, highly used pesticides. For the most part, risk to just one or a very few enviro-social indicators is assessed (most usually these are acute and/or chronic lethality to human beings, and water pollution). Data which would enable a more comprehensive assessment of non-target impacts are uneven and unavailable for many other important test endpoints, and for many pest control chemicals and alternative products. Were data available, methods for integrating and weighting multiple criteria are still imperfectly developed for these purposes, as they are also for farm-scale decision aids. Beyond these challenges, additional questions remain regarding development of pesticide risk reduction yardsticks at a national or international scale:

    To address this last point, there is an opportunity--if not an obligation--to consider economic cost in a broader social context in policy tools than they are typically considered in farm-scale decision tools. For the latter, system developers have an obligation of sorts to consider production costs borne by the farmer, as well as the environmental and public health effects on society. However, economic costs also manifest as medical expenses, diminution of the quality of life, remediation and restoration costs, etc. Thus even when these costs are not explicitly considered by an assessment model, analysts can tacitly recognize these multiple dimensions of economic cost by labeling "production costs" as such, rather than giving them the title "economic costs" which allows the narrower meaning to "own" the more encompassing term.

    It is also important to consider how economic costs are incorporated into environmental impact assessments. Several of the most widely used farm decision tools include production costs among the variables determining the pesticide ranking. If the ranking is interpreted as an environmental assessment, however, then it incorrectly implies that the magnitude and severity of environmental impacts varies with the producer’s cost for pesticide use. Since this is not the case, it is important to assess such costs separately from environmental impacts. One such method is shown in Figure 2. When the economic costs of environmental protection are high (i.e., if farm costs for producing without pesticides are burdensome), there should be a policy debate about whether and how to shift that economic burden from the farmer (or the consumer) to the broader society. However, these costs should not be allowed to unduly influence the assessment of environmental impact and potential risk.

    Figure 2 Integrating an economic dimension into an environmental impact assessment

     

     

    Ecolabeling Systems to Influence Consumer Opinions and Purchases

    "Ecolabels" (also called "green labels") bring environmental impact assessment to the marketplace by encouraging the production and purchase of goods that meet a set of environmentally-sensitive criteria. This idea has mushroomed in recent years. There has been a surge of interest in IPM accreditation for labeling purposes, but ecolabels of many varieties are also affixed to manufactured goods as well as to agricultural and forest products. The term "ecolabel" is generic for this market mechanism; it is not specific to any one set of environmental standards or any one certification program. Ecolabels are the "front end" for many different assessment criteria and assessment systems.

     

    Ecolabel Criteria

    Reduced pesticide risk is only one of a number of environmentally-sensitive criteria reflected by ecolabels. The "organic" or "bio" ecolabel is probably the most common label based upon pest control criteria, but many other labels which reflect environmentally-sensitive criteria--such as recycling, local production, no rainforest destruction or cruelty to animals--have become familiar in the marketplace. Many of the emerging ecolabeling programs are built on incentives based on promises of behavioral change in the future: For some specified duration of time, products or companies may be granted a third party ecolabel on the basis of having agreed to change and improve on past environmental performance. Promises may include some percentage reduction in pesticide use, an effort to use less of potentially dangerous pesticides, or a willingness to better protect worker safety.

    An idealized objective of a number of programs is that their ecolabels reflect a product "life cycle assessment" of all enviro-social impacts from "cradle to grave"--or from the extraction of basic inputs to the eventual disposal of waste goods. More realistically, however, most ecolabels reflect a subset of enviro-social criteria of importance to those affixing the label. For example, some of the criteria for ecolabels on manufactured goods are that recycled materials or energy-conserving processes are used in production; that the product is less caustic than alternatives; that inputs derived from animals are not used in production; that products are not tested on animals; or that old-growth forest and rainforest resources are not destroyed in the production process. Criteria underlying ecolabels on agricultural products may include:

    Some products carry the stamp or label of more than one certification program, each with criteria in a different enviro-social domain. Some ecolabels incorporate social criteria and human rights standards, such as fair trade practices and adequate recompense for labor. The emergence of ecolabeling as a factor in the market is causing concern in some circles about its potential as a barrier to free trade (Dawkins).

    Common characteristics of most ecolabeling programs are that:

    Although the consumer is the decision maker who decides whether to buy the labeled product, the defining characteristic of ecolabeling is that label standards are set before the product reaches the marketplace by an accrediting body that decides whether to confer the "green" designation. Consumers are able to weigh the information conveyed by the ecolabel logo as they also balance their need for the product against its cost. However, the consumer has neither the onus nor the opportunity to independently assess the factors going into the designation. They cannot modify the decision about whether or not a product is labeled. Consumer generally remain unaware of the decision criteria behind the visible logo. The results of the assessment are thus conveyed via a product logo or label that is always viewed separately from the assessment process.

     

    Who Confers the Ecolabel? Who Participates in the Program? And Why?

    The accrediting body may be a government or international agency; a third party non-governmental organization (NGO) or private firm; or a self-regulating group of producers and/or product distributors. Increasingly, coalitions of different interest groups are developing labeling criteria cooperatively. This broader base of support imbues projects with internal "checks and balances" and thus increases their exposure and legitimacy in the public eye. Some labels rest on the transformation of government regulations (or international treaty regulations) into a marketing tool. The US "organic" designation will eventually be backed by USDA National Organic Standards, for example; and the "dolphin-safe" label on tuna fish reflects the standards of the Inter-American Tropical Tuna Commission (Lefferts and Heinicke 1996). Government agencies and NGOs support ecolabeling because it is seen as voluntary, rather than regulatory, and thus a less heavy-handed mechanism for inducing a shift in behavior.

    A mix of personal benefits and altruism motivate consumers to buy labeled products. While some ecolabels are similar to health and safety labels in flagging products that may cause personal harm or directly improve consumer well-being, most ecolabels only promise benefits for the environment and the greater good. For example, consumers do not benefit directly by using recycled paper products in the same way that they believe they derive health benefits by heeding warnings from the Surgeon General or by eating food without pesticide residues. In fact, retail- and industry-sponsored food-labeling programs are often adamant in claiming that advantages of the ecolabel accrue only to the environment, and that ecolabeled food is not safer for consumers or superior to the other food products they sell.

    Producers and those in the marketing chain participate in ecolabeling programs to fulfill some combination of altruistic and economic objectives. In situations where consumers or distributors are demanding eco-sensitive products, producers may be motivated to join the program to secure a market niche. In addition, some ecolabels confer a price advantage to producers as a financial incentive for following relatively environmentally-benign production practices, or as recompense for greater producer risk and/or higher costs of production. There is a creative tension between programs set up to capture the higher price niche market, and those which are attempting to set an industry standard, without price differentials. The Center for Agriculture and the Environment (CLM) in the Netherlands supports an ecolabel based upon their Pesticide Yardstick. Label standards aim at a level of environmentally-sensitive practices currently in use by 20% of regional farmers. The intent is to set standards at a level attainable by a small enough percentage of farmers so that the market incentive of higher prices is retained, but with a sufficiently small price differential not to deter consumers. Participating farmers are informed that label criteria may change and become more stringent as mainstream practices become more environmentally-sensitive (Joost Reus 1996). Organic labels have often commanded a price premium. In the US, the premium generally ranges from 25 to 100 percent of the conventional market price. The Midwest Organic Alliance found that producers growing organic soybeans received three times the price paid to growers of non-organic beans (Farm Aid News 12/21/95). Several clothing manufacturers and distributors (e.g.: Patagonia, the Gap Inc., Seventh Generation, the Espirit Ecollection) are featuring lines of organic and/or pesticide-reduced cotton clothing. The strong name recognition and consumer loyalty to these companies (gained by some companies by virtue of their stated commitment to environmental protection) gives them some leverage with consumers as they parlay ecolabeled products at premium prices. In effect, consumers of certain ecolabels are asked to be "eco-pioneers" by paying the higher costs of more labor intensive and smaller batch processing of these products until they become mainstream fare and can be sold at standard prices.

    Ancillary to the issue of price premiums is the question of who should pay for them. While it may be fair for consumers to pay a premium if their product choice is motivated by the chance of gaining an extra margin of personal safety, is it also fair for environmentally-conscious and conscientious consumers (and risk-taking altruistic producers) to shoulder the additional cost when their buying habits are motivated by broader enviro-social concerns? Conversely, is it equitable for the global society to be burdened by the costs of environmental remediation necessitated by more hazardous production practices? Ecolabeling focuses attention on the potential of market mechanisms to shift these costs to appropriately targeted producers and consumers. It also calls attention to the interface of market mechanisms with policy initiatives--such as the application of surcharges for purchasing environmentally dangerous or degrading products--and raises questions about which approach is more efficient, effective and tenable. Also at issue with ecolabel programs (albeit perhaps more of a concern with manufactured goods than most farm products) is that there may be significant costs of contracting with the accrediting group. If these costs are borne by producers, then the benefits of the program may be biased towards larger firms which can afford the accreditation procedure. Government and not-for-profit labeling schemes can perhaps circumvent this bias by waiving costs for small-scale producers.

     

    How Do Ecolabeling Systems Work?

    Agricultural ecolabels are a "yes or no" composite assessment of farm produce and/or farm practices. The bag of carrots in the supermarket either displays the IPM logo or it does not. The summary judgment about whether or not to confer a label is generally arrived at by one of two methods (which are further elaborated in the section "Generic Methods for Ranking."):

    With the checklist, the product must meet a set (or some predetermined percentage) of independent criteria: L and R and T and P... The checklist allows labeling criteria to include any number of disparate factors that would be difficult to tally in a common currency of either point values or money. For example, the criteria for the "checklist" label could be: "This product was grown locally, within 100 miles of point of sale (L), using recommended crop rotations (R), minimum tillage (T), and without use of pesticides that have been found as residues in county water supplies (P)." As these labeling criteria indicate, this approach allows the label to flexibly adapt to situation-specific conditions and local area standards. Some checklist systems consider certain criteria more important than others, and thus weight criteria with a point value. The product is labeled if it meets a threshold percentage of weighted criteria points: Index value > bLL and bRR and bTT and bPP. Note that only the weighting factors (bi) are assigned point values. In calculating the weighted checklist, criteria variables have values of either 0 or 1, depending on whether or not the criteria are met. This is unlike the algebraic equation method described next in which both weights (bi) and criteria variables (xi) are assigned point values.

    A number of ecolabeling systems which serve double duty as farmer decision aids confer the label when the Index Value < x1 + x2 + x3 + x4. For these algebraic systems, a common currency must be used to evaluate each xi. Systems which use this approach include the Stemilt Responsible Choice for fruit from Washington State in the US, PestDecide©, and the CLM Dutch Yardstick, all of which are described in detail in the later section "Pesticide Impact /Risk Ranking Systems." The structure of an ecolabel accreditation system, however, may not be completely compatible with the objectives of a farmer decision system. To draw an example from PestDecide©, a very creative decision tool for tree fruit farmers in NSW Australia: Of 200 possible index points in the system, 30% or 60 points can be assigned for production cost and pesticide efficacy factors. While these may be of interest to growers, they do not affect environmental or public health risk from pesticide use. Should they, therefore, affect the ecolabel accreditation? Conversely, 20% of the 200 PestDecide© index points are based on the time and location of a pest control application (e.g. whether the pesticide is applied to soil or to fruit). While these may indeed be factors of interest to consumers concerned about pesticide residue, they are not likely to provide useful information to the farmer who is using the index as a basis for discriminating among pest control options. The reason is that farmers are looking for information applicable to the control of a specific target pest, at a certain time in the production cycle. If it is a soil pest, then all of the pest control options are likely to be applied to the soil; if it is a fruit pest, then all of the pest controls will probably be applied to the growing fruit. Since the points assigned for these factors are likely to be the same for all pest control options for a specific target pest, they do not provide useful and discriminating information for farmers about relative hazards of different pest controls. In a similar vein, the Stemilt system gives points for pesticide efficacy, which is of interest to growers, but does not contribute to the environmental protection or consumer safety values of an ecolabel. The developers of these systems might argue that the best product to buy is the one that is best for the farmer to use, based on a composite of factors. This too is a valid point.

    Other measurement systems have potential application in policy as well as ecolabeling arenas. Methods for assessing adoption of IPM, for example, can be adapted to the farm scale for accrediting IPM producers or to the national scale for estimating trends in IPM adoption (Vandeman et al. 1994; Benbrook et al. 1996). Likewise the Environmental Working Group’s study of fruits and vegetables most likely to have pesticide residues has been published in the popular media as a type of "commodity ecolabel," but is also having reverberations in regulatory and policy-setting arenas (Wiles et al. 1995).

     

    Generic methods for ranking pesticides

    This is the "how to" section. Methods are described for calculating results for the various types of pesticide impact assessments discussed in the previous section. "Results" may be in the format of continuous, numerical scores or may be categorical groupings (such as high, moderate, low or no impact) which describe the extent of impact, hazard or risk. In some assessments, the categories are translated into the colors at a stop light: "red" indicating a high impact or risk; "yellow," a moderate impact and the need for caution; and "green" for "go ahead"--indicating there is little or no impact from the practice. Some systems assign scores to these categories, and the scores serve as the "common currency" that is weighted and summed to create a composite assessment rating for a pest control practice. Numerical scores can thus be derived from categories of impact, may be derived directly from toxicity tests (such as an LD50 value), or may be a ratio of environmental concentration to an effective concentration that causes a measurable impact. Whatever the format of the results--whether numerical or categorical--they are generally arrived at via one of three methods:

    Because indexing is still an imperfectly-developed methodology for fully describing the enviro-social impacts of pesticides, and also because the demand for tools to measure pesticide impacts is coming fast and furiously from numerous constituencies, the field is dynamic--new techniques are emerging, being refined and synthesized. Thus this "how to" primer does not cover all possible permutations and combinations of methods. Its broad brush is intended to help readers detect and dissect the inner workings of assessment tools that use an indexing approach. Hopefully this will enable readers to better evaluate which enviro-social factors are considered--and how--by any particular assessment system. By reading this more abstract discussion of methods in conjunction with the descriptions of specific assessment tools presented later, both may be more comprehensible. This section also continues in the trajectory of the previous section by highlighting some of the technical challenges and limitations of this methodology, particularly with regard to scoring, weighting and integrating multiattribute criteria. An awareness of these "sticky points" is key to improving upon hazard indexing tools.

     

    Logical Chain of Decision Rules

    One method for deriving a composite environmental impact index rating for pesticides is to use a set of decision rules. This approach is somewhat akin to following a flow chart or using a dichotomous identification key. The logical chain of decision rules creates a knowledge-based system that can solve complex problems. It is a series of algorithms that can be written on paper as a decision tree, or is amenable to conversion to an interactive, computer based "expert" system. In either case, knowledge is represented in a series of if-then-else rules (Plant and Stone 1991). Answers to a first tier of questions direct the decision-maker to a next tier of questions, which will not be the same for all respondents. Or the response to an algorithm generates a score, while a different response generates a different score (Figure 3).

    Figure 3 Logical chain of decision rules

     

    Advantages to this approach are that it can utilize qualitative as well as quantitative information from different sources. Unlike algebraic models, decision tree models do not necessarily depend upon a complete array of comparable data for each pest control evaluated. With decision rules, detailed input data may be required only when deemed necessary by responses to previous questions. Decision trees can therefore utilize descriptive and anecdotal accounts of impacts. For example, there is mushrooming interest, but limited information, about potential endocrine-disrupting impacts of pest controls. Certainly no rigorously compiled datasets exist which contain results from controlled tests which all follow the same protocols for all pest controls. Because of the many possible modes of action of endocrine disruption, and the likelihood that effects are synergistic at low levels of long term exposure, such protocols may be difficult to develop and such tests may be impossible to conduct. Therefore, it is not facile to include this important environmental variable into an algebraic model, but neither is it intellectually or perhaps morally honest to ignore these sublethal impacts in deriving new assessments of pesticide environmental impacts. Existing information could be incorporated into a decision tree assessment model via a decision rule of the format: "If evidence exists which suggests that x pest control may have endocrine disrupting potential, then this pest control is to be classified as a potential endocrine disrupter and treated as..." Pest controls in the suspect category could then be assigned a certain negative hazard score. The amount and type of supporting evidence required could be specified, but need not all be from the same source or in the same form (the O’Bryan and Ross 1988 screening model includes criteria of this type).

    Decision trees, used alone or in conjunction with algebraic equations, can also flexibly treat situation-specific conditions, such as those which affect exposure potential. A person knowledgeable about field conditions (e.g.: the farmer, the local farm advisor or agricultural researcher) can input situation-specific data, so that the system uses appropriate decision rules for the situation. This type of interactive assessment system can generate better estimates for exposure and other situation-specific variables than can a non-interactive system. Again drawing on honey bees for an example: The impact of a pest control on honey bees is a function of toxicity of the pesticide to the bee and the likelihood of exposure. Honey bees fly only in daylight, however, so that exposure to pesticides is minimal if application is at night. An additional complicating factor is that some very toxic pesticides repel bees before killing them, so the bees are not effectively exposed. Conversely, some less acutely toxic pesticides do not kill the adult worker bees and nor are they repellent, so the chemicals are carried from the field back to the hives where they are stored and fed to the brood. These pesticides can, therefore, affect hive survival and the honey bee population more profoundly than would be expected from a rank order of pesticides determined solely from acute dermal toxicity to adult bees. It is well nigh impossible to capture the complexity of these ecotoxicological effects within the constraints of an algebraic model.

    A logical chain of decision rules can segment these interactive effects into a set of ‘if-then-else’ algorithms which assign appropriate value to the xHONEY BEE EXPOSURE variable. These decision rule-generated values could be linked to an algebraic model by a workbook exercise or simple computerized routine. The flexibility of the decision tree approach makes it well-suited for farm-scale decision tools and other impact assessments which should be sensitive to situation-specific conditions. However, this format creates an unbreakable link between the process of decision-making and the results of the assessment, making it somewhat cumbersome and perhaps unsuitable for the policy arena, where a terse summary of impacts may be preferred.

     

    Algebraic Equations a la Plant Breeding Selection Indexes

    A number of pesticide ranking systems are based on simple algebraic equations, using a format similar to plant breeding selection indexes (Cotterill and Dean 1990) and multiattribute indicators used in the social sciences (Putnam 1993). A generalized form of these equations is:

    Environmental Impact Index ValueCOMPOSITE = f(b1x1 + b2x2 + b3x3... + bixi).

    In algebraic systems, each x is the score assigned to an environmental variable or indicator. Such variables include pesticide physico-chemical properties (e.g.: solubility), test endpoints (e.g.: LD50 to a test organism), and categorical assessments of a hazard, impact or risk (e.g.: high, moderate, low or no impact on aquatic organisms). The b coefficients are weights reflecting the perceived or measured relative importance of the trait to the assessment system. B coefficients can also be used to standardize units or scales of measurement. In their simplest form, ranking systems may be based on only a single environmental parameter or indicator of risk, so that Index ValueCOMPOSITE = f(b1x1).

    Each xi can represent a single test measure, or can be a function of several variables. For example, an xi estimate of potential pesticide risk to birds could be based solely upon acute lethality (LD50) to adults of one species, or could be a composite of acute lethality (LD50) scores for several species (t, for toxicity), the pesticide’s effects on bird reproduction (r), its persistence in the environment (p), and drift potential (d). In this latter multiattribute assessment of impacts on birds, the first two factors are both indicators of toxicity to birds and the last two are indicators of potential exposure to birds. The model could be structured such that f(xi) is an additive function equal to (tj + rj + pj + dj); a multiplicative function, equal to (tj x rj x pj x dj); or the toxicity scores could be summed and multiplied by the sum of scores estimating bird exposure: (tj + rj) x (pj x dj). These are several of the simpler functional forms used to estimate impacts on an environmental parameter, but the conceptual possibilities are almost limitless.

    Ratios are another commonly used algebraic format:

    Environmental Impact Index Value = f(yi) ÷ f(xj).

    Generally with these models, the numerator (yi) is the concentration of the chemical that is measured or predicted to be found in the environment, while the denominator (xj) is the measurable endpoint for a toxic concentration (e.g., the EC50 or the dosage at which 50% of the organisms of a species cease to function effectively). Exposure potential is therefore treated as an integral part of the Index Value for each variable.

    The validity of most algebraic models depends upon complete sets of quantitative input data for all pest controls evaluated by the system. Data gaps have been a limiting factor in the number and types of environmental variables that have been included in this type of assessment model. Algebraic models may be preferred over the logical chain of decision rules when the objective of an assessment system is to generate a tabular ranking of pest controls that can be viewed and utilized separately from the evaluative process. These models are more suitable when the parameters considered are measurable indicators of (relatively) inherent properties, such as LD50 toxicity values. They are less suitable for assessments which are intended to reflect situation-specific conditions and to be sensitive to such variability.

    In 1975 the entomologist Robert Metcalf published what is perhaps the first algebraic assessment model of pesticide impact: Environmental Impact = H + (B + F + HB)/3 + P. The objective of this model was to assess which of the insecticides in common use in the mid-1970s were most suitable for Integrated Pest Management (Metcalf 1982). The assumption was that those which lasted longest in the environment (P) and which were most toxic to human beings (H), birds (B), fish (F) and honey bees (HB), were not suitable. This same basic premise underlies another algebraic model, the more recently developed Environmental Impact Quotient (EIQ) (Kovach et al. 1992). The total EIQ value is the sum of three equations which assess impacts to farmworkers, consumers and non-human biota:

    EIQ = 1/3 {EIQFARMWORKER [C (DT x 5) + (DT x P)] + EIQCONSUMER [(C x (S + P) / 2 x SY) + L]+ EIQNON-HUMAN BIOTA [(F x R) + (D x ((S + P) / 2) x 3) + (Z x P x 3) + (B x P x 5)]}

    The complexities inherent to deriving a single composite index value, as well as other problems with the structure and input data, limit the utility and reliability of both of these pioneering algebraic models for assessing pesticide impact. Later in this section, improved methods for scoring and structuring algebraic models will be shown, but first an alternative approach, suitable for meeting some assessment objectives, is suggested.

     

    The "Checklist" as an Alternative to a Composite Index Value

    Some multiattribute assessment systems avoid the complications inherent to deriving a single composite index value. These assessments do not attempt to integrate the scores, rank positions or hazard categories for all of the environmental variables considered by the system. Rather the evaluation of each variable stands alone, to be checked off on a list of criteria taken into account by the assessment. Thus no overall rank order or "best choice" among pesticide options is produced. The "checklist" can be used when a composite assessment is not needed. It has been the method of choice for many ecolabeling and other accreditation systems, especially when disparate criteria are used to judge products and/or processes. It has also been used for chemical screening tools where the objective is to prioritize research and regulatory action for chemicals that are flagged as potentially hazardous in any parameter (see, for example, O’Bryan and Ross 1988). There is less need in such systems to generate a composite rank order of pesticides. A generic notation for this type of system is:

    |Index Value1 = f(b1x1)|Index Value2 = f(b2x2) |Index Value3 = f(b3x3)|...|Index Valuei = f(bixi)|.

    Index values for the set of variables can be presented in a table or spreadsheet array (Figure 4), or ranked by one parameter with "warning flags" to indicate potential hazards in other parameters. For example the Planetor system (1995) is organized primarily as an economic analysis of alternative farm management scenarios. Scenarios with potentially hazardous ramifications in any of several environmental parameters--such as erosion potential or the use of a highly toxic pesticide--are flagged. The checklist method is one solution to the difficult conceptual problem of ranking pest controls and products in more than one dimension--such as public health hazard as well as consumption of non-renewable resources, or environmental impacts and also costs of production.

    Figure 4 An array of index values for a set of variables, with no composite index value calculated

     

    Environmental Indicators

     

    1

    2

    3

    i

    Pesticide A

    Index value A1

    Index value A2

    Index value A3

    Index value Ai

    Pesticide B

    Index value B1

    Index value B2

    Index value B3

    Index value Bi

    Pesticide C

    Index value C1

    Index value C2

    Index value C3

    Index value Ci

     

    Calculating and Scoring Pest Control Impacts Indicators

     

    Weighting of Variables

    Weighting is an algebraic way of expressing the relative importance of the variables considered in an assessment. Weights can reflect either the greater/lesser importance of certain variables to the ecosystem (e.g.: impacts to aquatic organisms may be weighted heavily when farm fields are near bodies of surface water) or to the evaluation of the system (e.g.: human life may be valued more than impacts to aquatic organisms).

    The most powerful means of assigning weight to a variable in an impact assessment model, however, is to include it in the algebraic equation. No matter what the apparent claims or uses to which an assessment is put, the output of an indexing system reflects no more than the sum of its parts--which are the environmental variables or indicators included in the model. Beyond this truism, a second important point to recognize about the weighting of variables is that all multiattribute assessments which generate a single index score have, effectively, assigned weights of relative importance to each environmental parameter. When no b coefficient is explicitly expressed, implicitly the value of bi= 1. Thus in models without any b coefficients, all x variables have equal weight. Although the "checklist" method (described above) should obviate the problem of assigning relative importance weights to disparate enviro-social parameters, practitioners need guard against the temptation to mentally create a composite index value while scanning across a spreadsheet of scores for independent indicators.

    The EIQ (Kovach et al. 1992) and PestDecide© (Penrose et al. 1995b) models both use weighted b coefficients. However, variables are sometimes also inadvertently weighted by other means--such as using different units or scales of measurement for different variables, or applying inconsistent standards in setting criteria for classification into hazard categories or for assigning scores to the hazard categories. Examples are: using acute mortality (LD50) as the test endpoint for one group of organisms while using effective concentration (EC50) for a temporary disorder as the input data for another group of biota. Or comparing toxicity data from field tests measuring mortality at recommended rates of application--and thus implicitly including a field exposure factor--with toxicity input data from standardized laboratory dosage, unmodified by potential field exposure. Unequal weighting occurs if both sets of toxicity data are multiplied by an indicator of exposure, so that the exposure factor is doubly weighted for the first group.

    Weighting factors can be assigned by the system developers, by a regulating or accrediting body, or by stakeholders in a production system (e.g.: producers and/or consumers). Some systems--particularly those which use a workbook procedure for calculating impact ratings--may enable end-users (i.e.: farmers, farm advisors, accrediting groups) to adjust or modify weighting factors to better reflect local conditions or personal considerations. For example, potential impacts from pesticide drift would logically be weighted quite differently when assessing greenhouse production than open field conditions.

    Weighting is sometimes criticized because it involves "value judgments." It should not be inferred, however, that these judgments are necessarily prejudicial or illogical. To illustrate, the choice and weighting of factors in a ecolabeling system designed to affect consumer purchasing may logically be quite different than in a system intended as a decision aid for farmers. The former might emphasize a set of global enviro-social concerns, whereas field-scale assessments may instead focus on the local agro-ecosystem, including environmental impacts from pesticides on soil microorganisms and the beneficial arthropods at work in a particular production scenario. As in most matters, there is no one objective reality in environmental impact assessment. System developers have a responsibility first, to acknowledge this and secondly, to insure that the value judgments implicit in their systems reflect the expert judgments of scientists and other stakeholders.

    Rigorous decision-making procedures have been developed which formalize and justify parameter selection and weighting of multiple criteria in assessment models. Theories underlying these procedures draw from optimization and risk analysis, game theory, social judgment theory, and multiattribute utility theory (see for example Saaty, 1977; Watson and Buede, 1987; Spires, 1991; Keeney and Raiffa, 1993). These methods have not been stringently applied to many of the pesticide impact assessment indexes--nor are they necessarily to be recommended--because consensus on the relative importance of different variables or modules may not be the primary objective of the weighting system.

     

    Dealing with Data Gaps

    Any assessment of pesticide environmental impacts will undoubtedly confront the problem of data gaps and also, in many cases, the opposite situation of more than one datum for an indicator. The protocols chosen to deal with these situations can influence both the scores for individual pest controls and the rank order of pesticides by risk potential. Missing data for some variables may be available from published Quantitative Structure Activity Relationship (QSAR) values. QSARs are multiple regression equations, similar to economic production functions, which estimate environmental effects of chemicals on the basis of known effects from chemicals with similar chemical structures and activity. There is a large literature debating their reliability and proffering alternative QSAR equations for different chemical groups and effects (for a sampling, see various issues of the journals Chemosphere, Environmental Science and Technology and The Science of the Total Environment). In certain situations data gaps can be filled by a procedure that is less formal than using QSARs, but similar in concept: Where gaps result from exemptions from testing requirements because certain pest controls are presumed to have no negative impact, then scores reflecting this assumption can be used to fill these data gaps. Alternatively, some assessment models fill data gaps with the mean scores for similar class of chemicals (this has been as broad a group as "herbicides" but can also be a more narrowly defined group of chemicals, with similar mode of action or similar molecular structure). Caution should be exercised in imputing values to fill data gaps because biological activity of a chemical may be due to small structural details, rather than the primary structure of the chemical--on which the QSAR is generally based.

    Several methods are available for choosing from among, or utilizing, multiple data points. If data quality are thought to differ, then the most reliable data from the most appropriate tests should be chosen. If multiple data of similar quality are available, then the most conservative data should be used. This is often called the "most-sensitive species" approach (see discussion of the Ipest in Section 5). Alternatively, multiple data points can be averaged by taking the geometric mean (Watkin and Stelljes 1993). These authors also give EPA extrapolation factors to use when data are available for related species, or related test endpoints. The uncertainty factor increases with increased taxonomic difference. A factor of 5 is used between species in a genus; 10 within families; 20 within an order, with adjustments for greater knowledge on a case-by-case basis. To adjust for acute to chronic exposure a factor of 10 is used; also to adjust for a most sensitive or endangered species. Note that these extrapolation factors provide a buffer for uncertainty, but are not appropriate to use in filling data gaps.

     

    Assigning Scores to xi

    In deriving an index value for potential pesticide risk, environmental endpoint data are first compiled in the units of the measurable parameter (e.g.: LD50s are in units of mg kg-1; pesticide soil half-life in units of hours, days or weeks; and solubility in ppb). These raw data do not convey meaningful information about potential hazard or impact, however, unless presented in a context that provides some interpretation of significance (e.g.: At what solubility is there a high risk of pesticide leaching to groundwater? How does the lethal dose correspond to the dosage applied?). Moreover, the disparate units of measure cannot be combined into a composite index value, if that is required. Thus, for indexing purposes, environmental and public health data are generally transformed either to numerical scores or to hazard categories (which may also be scored). The following discussion highlights points to consider when developing, reviewing or applying scoring systems. Much of it is relevant both to scoring each xi, as well as to deriving a composite index value.

    Figure 5 shows the relationship between raw data, numerical index values and descriptive hazard categories for one variable--lethal toxicity to rodents. The figure is intended to be a springboard for looking at the implications of using different numbers of hazard categories and ranges of scores. For illustrative purposes, the US EPA’s toxicity categories (by criteria, class, and Human Hazard Signal Word for pesticide labels from the US Code of Federal Regulations) are shown side-by-side with an additional set of descriptive terms and four sets of possible hazard scores, all of which have been fabricated for this example. For the purpose of pesticide labeling, no additional information would be conveyed by further transforming the descriptive classification into numerical scores. However, this additional and problematic step must be taken if the objective of the system is to create a composite environmental impact rank order of pesticides.

    Figure 5 Scoring hazard categories

    US EPA criteria for classifying pesticides into Toxicity Categories to meet pesticide labeling requirements (40 CFR 156.10h) are shown in Column A. Categories are numbered (Col. B) and associated with a warning label (Col. C). The decision rule for assigning test values to hazard categories should be read as follows: If acute oral LD50 is less than or equal to 50 mg kg-1, then use the EPA Human Hazard Signal Word "Danger" on the pesticide label, and classify the pesticide in Toxicity Category I. Columns D-H are fabricated to illustrate how several scoring functions would assign scores (numerical or descriptive) to test values.

    Acute Oral LD50 (mg kg-1)

    Toxicity Category

    Human Hazard Signal Word

    Hazard Class

    Possible Score Values

    Column a

    b

    c

    d

    e

    f

    g

    h

    ≤ 50

    I

    Danger

    high impact

    4

    8

    5

    100

    50 to ≤ 500

    II

    Warning

    moderate impact

    3

    4

    3

    10

    500 to ≤ 5000

    III

    Caution

    low impact

    2

       

    1

    >5000

    IV

    Caution

    no apparent impact (benign)

    1

    1

    0

    0

     

     

    As shown in Figure 5, criteria are set (Column A) to evaluate the raw impact data so that the array of pesticides can be grouped into hazard categories and described (Columns C and D), and/or transformed to numerical scores (as in Columns E-H). Column A shows the doses of pesticide (mg kg-1) that are considered to have high, moderate, low or no lethal effect on 50% of test organisms. These are the criteria or decision rules for assigning pesticides to hazard categories and/or assigning scores. Column B gives the "numerical name of the category." Several assessment models have made the mistake of confounding these numerical names or category labels with quantitative scores which are manipulated mathematically. The EPA use of Roman numerals diminishes this temptation (Column B), but the effect is shown by the sequential scores in Column E. As a result of using this arithmetic progression, the range of scores in Column E is limited to a 4-fold difference between the most hazardous chemicals and the most benign methods of pest control. This is far less than the >100-fold range of raw scores (Column A).

    Moreover, the numerical intervals between categories do not correlate with measured differences in toxic effect between benign and dangerous pest controls. The result is a compressed index, which does not reflect real differences among pesticides. A second ramification of inadvertently using category labels as scores is that benign impacts receive a positive score greater than zero. The scores for benign effects thus contribute to the hazard index value even though the impact does not add to hazard potential. The distortions caused by rating neutral effects as 1 rather than 0 are compounded in systems where exposure and toxicity terms are multiplied to derive an index value. At issue is the anomaly that a high dose of a relatively harmless input, which would not be hazardous at any reasonable exposure, can receive a rating comparable to a highly toxic input used at a lower dose, at which it might still be problematic (i.e.: 1 x 4 = 4 x 1).

    Column C gives the US EPA human hazard signal word and Column D is simply another way to verbally describe these categories, using words that are more generally applicable to hazard indicators. Columns E-H are possible ways to score these hazard categories. The four sets of scores in Figure 5 are assigned to the same set of impact categories, yet mathematically they carry different weight. The problems with the scores in Column E are discussed above, and the scores in Column F also rate neutral effects with a positive number. However these scores have an 8:1 ratio between maximum and minimum effects. Is this a sufficient ratio to reflect the range of impacts? It is far less than the range of raw scores (Column A), but may be preferable to the 100:1 range of scores in Column H, which is closer to the range of raw test values. While this range may permit greater sensitivity to differences among chemicals, the 100:1 range of scores would produce composite index values of an unwieldy magnitude. The composite index value could also easily be swayed by a single high scoring variable.

    Ideally, the range of scores should parallel the range in magnitude of impacts. An important factor to consider is that the range of impacts may not have a linear correlation with the range of test results. The magnitude of impacts may instead have a threshold dose-response relationship. Unfortunately, relationships between indicator values and meaningful breakpoints in biological, ecological or other impacts are rarely known or are ignored by assessment procedures, and arbitrary scoring criteria are common. Note, for example, that even the criteria for establishing hazard categories (Column A) are based upon magnitudes of order, not on biophysical or ecological response to threshold doses.

    A "criteria and indicator matrix" can provide a useful framework for compiling and organizing relevant information about each environmental indicator proposed for inclusion in the assessment system. Each row represents an environmental variable. Columns contain information about the variable, including the criteria for each hazard category, and the score for that category, if it is to be scored. Additional columns contain information about sources of data for the variable, and citations for the sources of expert judgment used in setting scoring and category thresholds (Levitan and Merwin, in progress).

     

    Scoring functions

    It is sobering to note that even when the same environmental variable is considered (e.g.: pesticide persistence), and the same indicator is used to measure impact (e.g.: soil half life), different scoring methods can generate radically different results which may affect the ordinal hazard ranking of pesticides. This is illustrated by the Environmental Health Policy Program (University of California at Berkeley) comparison of three methods for scoring their proposed pesticide impact assessment model (Pease et al. 1996 is described in more detail in Section 5). The rank order of pesticides generated by categorical, non-continuous scoring functions is compared with rank orders derived from linear and non-linear continuous scoring functions. The five most hazardous pesticides, as calculated by each scoring method, are listed in Figure 6. The disparity in results is unnerving! Which rank order is correct?

    Figure 6a Ranking of the five most hazardous pesticides.

    Categorical or "step" functions are used to score the UC Berkeley Environmental Health Policy Program Ranking System. Adjacent columns show the rank order for the same pesticides, using the same assessment system, scored by linear and non-linear functions (Pease et al. 1996).

    Pesticide Name

    Step Function

    Non-Linear Function

    Linear Function

    methomyl

    1

    1

    22

    aldicarb

    2

    3

    28

    carbofuran

    3

    4

    31

    mevinphos

    4

    5

    9

    2,4-D (+ salts)

    5

    36

    41

     

    b. Ranking of the five most hazardous pesticides, using linear function for scoring the UC Berkeley Health Policy Program Ranking System

    <

    Pesticide Name

    Linear Function

    Step Function

    Non-Linear Function

    sodium hypochlorite

    1

    51

    9

    metam sodium

    2

    19

    30

    diclofop-methyl

    3

    67

    22

    propargite

    4

    20

    10

    methamidophos