<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1742-5573-3-8</ui>
   <ji>1742-5573</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Accuracy of commercial geocoding: assessment and implications</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Whitsel</snm>
               <mi>A</mi>
               <fnm>Eric</fnm>
               <insr iid="I1"/>
               <email>ewhitsel@email.unc.edu</email>
            </au>
            <au id="A2">
               <snm>Quibrera</snm>
               <fnm>P Miguel</fnm>
               <insr iid="I2"/>
               <email>mqm@email.unc.edu</email>
            </au>
            <au id="A3">
               <snm>Smith</snm>
               <mi>L</mi>
               <fnm>Richard</fnm>
               <insr iid="I3"/>
               <email>rls@email.unc.edu</email>
            </au>
            <au id="A4">
               <snm>Catellier</snm>
               <mi>J</mi>
               <fnm>Diane</fnm>
               <insr iid="I4"/>
               <email>diane_catellier@mail.cscc.unc.edu</email>
            </au>
            <au id="A5">
               <snm>Liao</snm>
               <fnm>Duanping</fnm>
               <insr iid="I5"/>
               <email>dliao@psu.edu</email>
            </au>
            <au id="A6">
               <snm>Henley</snm>
               <mi>C</mi>
               <fnm>Amanda</fnm>
               <insr iid="I6"/>
               <email>ahenley@refstaff.lib.unc.edu</email>
            </au>
            <au id="A7">
               <snm>Heiss</snm>
               <fnm>Gerardo</fnm>
               <insr iid="I2"/>
               <email>gerardo_heiss@unc.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Departments of Epidemiology and Medicine, University of North Carolina, Cardiovascular Disease Program, Bank of America Center Suite 306, 137 East Franklin Street, Chapel Hill, NC 27514, USA</p>
            </ins>
            <ins id="I2">
               <p>Department of Epidemiology, University of North Carolina, Cardiovascular Disease Program, Bank of America Center Suite 306, 137 East Franklin Street, Chapel Hill, NC 27514, USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Statistics and Operations Research, University of North Carolina, 201 Smith Building 128, Chapel Hill, NC 27599, USA</p>
            </ins>
            <ins id="I4">
               <p>Department of Biostatistics, University of North Carolina, Collaborative Studies Coordinating Center, 137 East Franklin Street, Chapel Hill, NC 27514, USA</p>
            </ins>
            <ins id="I5">
               <p>Department of Health Evaluation Sciences, Pennsylvania State University College of Medicine, 600 Centerview Drive Suite 2200, A210, Hershey, PA 17033, USA</p>
            </ins>
            <ins id="I6">
               <p>Walter Royal Davis Library, University of North Carolina, Reference Department, Geographic Information Services, Chapel Hill, NC 27599, USA</p>
            </ins>
         </insg>
         <source>Epidemiologic Perspectives &amp; Innovations</source>
         <issn>1742-5573</issn>
         <pubdate>2006</pubdate>
         <volume>3</volume>
         <issue>1</issue>
         <fpage>8</fpage>
         <url>http://www.epi-perspectives.com/content/3/1/8</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16857050</pubid>
               <pubid idtype="doi">10.1186/1742-5573-3-8</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>03</day>
               <month>11</month>
               <year>2005</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>20</day>
               <month>7</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>20</day>
               <month>7</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Whitsel et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Published studies of geocoding accuracy often focus on a single geographic area, address source or vendor, do not adjust accuracy measures for address characteristics, and do not examine effects of inaccuracy on exposure measures. We addressed these issues in a Women's Health Initiative ancillary study, the Environmental Epidemiology of Arrhythmogenesis in WHI.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Addresses in 49 U.S. states (n = 3,615) with established coordinates were geocoded by four vendors (A-D). There were important differences among vendors in address match rate (98%; 82%; 81%; 30%), concordance between established and vendor-assigned census tracts (85%; 88%; 87%; 98%) and distance between established and vendor-assigned coordinates (mean <b><it>&#961; </it></b>[meters]: 1809; 748; 704; 228). Mean <b><it>&#961; </it></b>was lowest among street-matched, complete, zip-coded, unedited and urban addresses, and addresses with North American Datum of 1983 or World Geodetic System of 1984 coordinates. In mixed models restricted to vendors with minimally acceptable match rates (A-C) and adjusted for address characteristics, within-address correlation, and among-vendor heteroscedasticity of <b><it>&#961;</it></b>, differences in mean <b><it>&#961; </it></b>were small for street-type matches (280; 268; 275), i.e. likely to bias results relying on them about equally for most applications. In contrast, differences between centroid-type matches were substantial in some vendor contrasts, but not others (5497; 4303; 4210) p<sub>interaction </sub>&lt; 10<sup>-4</sup>, i.e. more likely to bias results differently in many applications. The adjusted odds of an address match was higher for vendor A versus C (odds ratio = 66, 95% confidence interval: 47, 93), but not B versus C (OR = 1.1, 95% CI: 0.9, 1.3). That of census tract concordance was no higher for vendor A versus C (OR = 1.0, 95% CI: 0.9, 1.2) or B versus C (OR = 1.1, 95% CI: 0.9, 1.3). Misclassification of a related exposure measure &#8211; distance to the nearest highway &#8211; increased with mean <b><it>&#961; </it></b>and in the absence of confounding, non-differential misclassification of this distance biased its hypothetical association with coronary heart disease mortality toward the null.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Geocoding error depends on measures used to evaluate it, address characteristics and vendor. Vendor selection presents a trade-off between potential for missing data and error in estimating spatially defined attributes. Informed selection is needed to control the trade-off and adjust analyses for its effects.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Various sources of measurement error have substantial implications for the accuracy of epidemiologic estimates. Exposure measurement error, for example, may arise when geographic information systems are trusted without recognizing the limitations of processes that rely on them. One such process is address matching, the automated pairing of coordinates (latitudes; longitudes) and statistical tabulation areas (e.g. census tracts) with street addresses, typically using TIGER/Line or other street data files <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. The process &#8211; which is also known as geocoding &#8211; has been described in detail <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. Geocoding usually involves matching addresses to specific street segments then positioning the addresses along the segments assuming an even distribution of street numbers within them. Although this form of geocoding involves linear interpolation and assumptions that can be inappropriate, its inaccuracy may be overlooked in large, population-based studies of associations between spatially interpolated environmental exposures, relevant health outcomes, and their contextual, socioeconomic effect modifiers. Nevertheless, geocoding accuracy is critical when such studies focus on exposure mechanisms that operate over short distances <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>.</p>
         <p>Although error in assignment of latitudes, longitudes, and census tracts has the potential to bias both estimation of location-specific exposures and socioeconomic contexts within which they occur <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>, recent studies have reported mean positional errors in commercially geocoded address coordinates between fifty and 300 meters <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. This is a distance over which long-term average ambient air pollution concentrations, meteorological measures and their monitor-to-monitor temporal correlations are relatively constant <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. However, concentrations of traffic-related emissions rapidly fall to ambient levels within comparable distances from street center-lines <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Moreover, positional error may be relevant in an even wider range of studies if the previously reported range of distances (50 &#8211; 300 m) is an underestimate. Lack of adjustment for potentially important address characteristics suggests that this is a distinct possibility. Population density in the area surrounding an address, for example, is so strongly and inversely associated with positional error that reported distances may be biased by even small differences in the ratio of rural to urban and suburban address matches <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. Positional error also varies markedly with match type, i.e. whether vendors match individual addresses to specific streets or to centers of statistical tabulation areas (centroids) <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, yet to date, most studies have not accounted for these factors.</p>
         <p>Published studies of positional error have several additional features that are pertinent in this context. Many restricted their focus to a single geographic setting, address source or geocoding vendor, while those focusing on multiple vendors did not account for among-vendor heteroscedasticity or within-address correlation of positional error <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. Others ignored potential for verification bias <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> and with a notable exception, none examined effects of positional error on exposure measures <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Collectively, these observations suggest that the next generation of studies in this area should be designed with generalizability, validity and utility in mind.</p>
         <p>To this end, we established three study objectives: (i) to compare multiple geocoding vendors using an identical sample of addresses with known coordinates selected from a broad range of data sources and geographic areas, (ii) to estimate geocoding accuracy and account for address characteristics that affect it using appropriate statistical procedures, and (iii) to estimate effects of observed inaccuracy on individual- and contextual-level exposure measures. We conducted this study to inform research emanating from two studies. The first, <it>The Environmental Epidemiology of Arrhythmogenesis in WHI </it><abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, is an ancillary study of electrocardiographic mechanisms linking air pollution and cardiovascular disease in 68,133 U.S. women aged 50&#8211;79 years at baseline in the <it>Women's Health Initiative </it>(<it>WHI</it>) clinical trial <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. The second, the <it>Atherosclerosis Risk in Communities </it>(<it>ARIC</it>) study, is a prospective study of cardiovascular disease in 15,792 U.S. men and women aged 45&#8211;64 years at baseline <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. This Institutional Review Board-approved ancillary study complied with all applicable regulations governing human subjects research (University of North Carolina Medical IRB# 03-EPID-12).</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Assembling and cleaning addresses</p>
            </st>
            <p>We screened seven, publicly available electronic data sources for addresses in areas of the contiguous U.S. containing the 75 WHI and four ARIC exam sites <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. Addresses were eligible for inclusion in this study if they were unique, associated with an established latitude, longitude, street (or route or post office box), city and state; and valid in U.S. Census year 2000. Screening identified 3,615 such addresses: 2,522 of U.S. Environmental Protection Agency (EPA) Air Quality System monitors in the 48 contiguous United States and District of Columbia; 1,050 of WHI clinical trial participants in five counties containing the majority of WHI participants residing in North Carolina (Durham; Forsyth; Guilford; Orange; Wake); and 43 of U.S. National Geodetic Survey (NGS) stations in the four ARIC communities (Forsyth County, NC; Washington County, MD; the city of Jackson, MS; eight suburbs of Minneapolis, MN). We cleaned the addresses (minor edits) when they did not conform to U.S. Postal Service standards <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. We also used web-based utilities <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp> to investigate and correct address information (major edits) when it conflicted with that in accompanying field notes (EPA addresses only). If neither condition was met, we did not edit the addresses and flagged them as "unedited". The locations and characteristics of the addresses are described in Figure <figr fid="F1">1</figr> and Table <tblr tid="T1">1</tblr>.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Location of the 3,615 addresses</p>
               </caption>
               <text>
                  <p><b>Location of the 3,615 addresses</b>. EPA = United States Environmental Protection Agency Air Quality System monitors. NGS = United States National Geodetic Survey stations. WHI = Women's Health Initiative clinical trial participant residential parcels.</p>
               </text>
               <graphic file="1742-5573-3-8-1"/>
            </fig>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Characteristics of the 3,615 addresses</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Characteristic</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Stratum or Units</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>n (%) or mean (standard deviation)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Address Source</p>
                     </c>
                     <c ca="center">
                        <p>EPA</p>
                     </c>
                     <c ca="center">
                        <p>2,522 (70)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>WHI</p>
                     </c>
                     <c ca="center">
                        <p>1,050 (29)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>NGS</p>
                     </c>
                     <c ca="center">
                        <p>43 (1)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Address Type<sup>a</sup></p>
                     </c>
                     <c ca="center">
                        <p>Complete</p>
                     </c>
                     <c ca="center">
                        <p>2,808 (78)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>No Street Number</p>
                     </c>
                     <c ca="center">
                        <p>460 (13)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Intersection</p>
                     </c>
                     <c ca="center">
                        <p>347 (10)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Zip Code</p>
                     </c>
                     <c ca="center">
                        <p>Absent</p>
                     </c>
                     <c ca="center">
                        <p>2,359 (65)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Present</p>
                     </c>
                     <c ca="center">
                        <p>1,256 (35)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Edit</p>
                     </c>
                     <c ca="center">
                        <p>Unedited</p>
                     </c>
                     <c ca="center">
                        <p>1,533 (42)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Minor</p>
                     </c>
                     <c ca="center">
                        <p>1,392 (39)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Major</p>
                     </c>
                     <c ca="center">
                        <p>690 (19)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Density<sup>b</sup></p>
                     </c>
                     <c ca="center">
                        <p>persons/km<sup>2</sup></p>
                     </c>
                     <c ca="center">
                        <p>1,066 (2,645)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Original Datum<sup>c</sup></p>
                     </c>
                     <c ca="center">
                        <p>NAD83 or WGS84</p>
                     </c>
                     <c ca="center">
                        <p>1,615 (45)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Unknown</p>
                     </c>
                     <c ca="center">
                        <p>1,274 (35)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>NAD27</p>
                     </c>
                     <c ca="center">
                        <p>726 (20)</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a</sup>Complete = street number, name, city and state present; No Street Number = street name, city and state present; Intersection = crossing street names, city and state present. <sup>b</sup>33<sup>rd </sup>and 67<sup>th </sup>percentiles = 221 and 920 persons/km<sup>2</sup>. <sup>c</sup>Of associated coordinates: NAD83 and NAD27 = North American Datum of 1983 and 1927; WGS84 = World Geodetic System of 1984.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Spatial data quality</p>
            </st>
            <p>Coordinates in decimal degrees with at least six significant digits after the decimal point accompanied all addresses. EPA coordinates were established according to a federal accuracy standard of &lt; 25 m <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, NGS coordinates, according to a federal standard &lt; 10 m <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, and WHI coordinates, by applying a spatial routine that determines center points of residential land parcels on digital maps (adapted from O'Rourke <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>). The median accuracy of the latter method approximates that of high resolution aerial photography, 8 to 15 m depending on population density <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. These coordinates and their associated block group, tract, and county identifiers (U.S. Census 2000 Federal Information Processing Standards [FIPS] codes) served as the criterion standards against which the accuracy of vendor-assigned geocodes was measured.</p>
         </sec>
         <sec>
            <st>
               <p>Geocoding addresses and estimating accuracy</p>
            </st>
            <p>We submitted the addresses to four well-known vendors (A-D) frequently contracted by epidemiologists for geocoding and related services or products (Table <tblr tid="T2">2</tblr>). We label the vendors generically in this paper to mask their identity, a practice consistent with our current data use agreements and previously implemented in similar contexts <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B7">7</abbr><abbr bid="B20">20</abbr></abbrgrp>. To examine whether editing introduced error, we also submitted unedited versions of the edited EPA addresses to one of the vendors. We estimated the accuracy of geocodes assigned by the vendors using three previously defined measures: (i) the address match rate (%), i.e. percentage of all addresses to which a given vendor assigned a latitude, longitude and FIPS code; (ii) the concordance (%) between vendor-assigned and criterion standard FIPS codes; and (iii) the distance in meters between vendor-assigned and criterion standard coordinates, as measured using the Haversine spherical Earth formula (<b><it>&#961;</it></b>) <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. We based the measures on analyses of spatial data that we transformed, when necessary, to a standard geographic coordinate system using ArcGIS<sup>&#174; </sup>9.0.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Characteristics of the four vendors</p>
               </caption>
               <tblbdy cols="10">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Vendor</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>CASS</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Street Offset</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Corner Inset</b>
                        </p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>
                           <b>Street Data Files</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Scheduled Data File Updates</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Original Datum<sup><b>a</b></sup></b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Manual Address Cleaning</b>
                           <sup>
                              <b>b</b>
                           </sup>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>TIGER</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>USPS</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Other</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="10">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>Yes</p>
                     </c>
                     <c ca="center">
                        <p>40 ft</p>
                     </c>
                     <c ca="center">
                        <p>Yes</p>
                     </c>
                     <c ca="center">
                        <p>2002</p>
                     </c>
                     <c ca="center">
                        <p>2004</p>
                     </c>
                     <c ca="center">
                        <p>Yes</p>
                     </c>
                     <c ca="center">
                        <p>4&#215;/yr</p>
                     </c>
                     <c ca="center">
                        <p>WGS84</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>5 ft</p>
                     </c>
                     <c ca="center">
                        <p>Yes</p>
                     </c>
                     <c ca="center">
                        <p>2002</p>
                     </c>
                     <c ca="center">
                        <p>2004</p>
                     </c>
                     <c ca="center">
                        <p>Yes</p>
                     </c>
                     <c ca="center">
                        <p>4&#215;/yr</p>
                     </c>
                     <c ca="center">
                        <p>NAD83</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>C</p>
                     </c>
                     <c ca="center">
                        <p>Yes</p>
                     </c>
                     <c ca="center">
                        <p>50 ft</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>2002</p>
                     </c>
                     <c ca="center">
                        <p>2004</p>
                     </c>
                     <c ca="center">
                        <p>Yes</p>
                     </c>
                     <c ca="center">
                        <p>6&#215;/yr</p>
                     </c>
                     <c ca="center">
                        <p>NAD83</p>
                     </c>
                     <c ca="center">
                        <p>Yes</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>D</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>0 ft</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>2002</p>
                     </c>
                     <c ca="center">
                        <p>2003</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>2&#215;/yr</p>
                     </c>
                     <c ca="center">
                        <p>NAD83</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a</sup>Of assigned coordinates: NAD83 = North American Datum of 1983. WGS84 = World Geodetic System of 1984. <sup>b</sup>After initial processing by geocoding software. CASS = Address standardization certified by the United States Postal Service National Customer Support Center Certification Program, Coding Accuracy Support System. TIGER = Topologically Integrated Geographic Encoding and Referencing (TIGER/Line<sup>&#174;</sup>) file. USPS = United States Postal Service files e.g. the city-state, ZIP+4<sup>&#174; </sup>and ZIPMove products.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Analysis of variance</p>
            </st>
            <p>We used analysis of variance (ANOVA) to quantify the variation in <b><it>&#961; </it></b>(log-transformed to satisfy the assumption of Gaussian errors) among vendors, before and after controlling for characteristics that affect geocoding accuracy: address source (EPA; WHI; NGS), address type (complete; no street number; intersection), zip code (present; absent), editing (unedited; minor; major), population density of the associated census tract (persons/km<sup>2</sup>), and original coordinate datum (North American Datum of 1983 [NAD83] or World Geodetic System of 1984 [WGS84]; North American Datum of 1927 [NAD27]; unknown). In this context, "no street number" includes rural route and post office box addresses. After testing for effect modification (significance of the interaction between vendor and match type), we stratified ANOVA models. We computed adjusted, least-square means among vendors using weights that were proportional to the observed distribution of covariates in our dataset. We back-transformed predicted values to the original scale as follows: <graphic file="1742-5573-3-8-i1.gif"/>, where <graphic file="1742-5573-3-8-i2.gif"/> and <graphic file="1742-5573-3-8-i3.gif"/> were the vendor-specific least square means and variances of log <b><it>&#961;</it></b>, the latter estimated from the residuals. We used logistic regression to estimate the odds ratios and 95% confidence intervals (OR, 95% CI) for address match and census tract concordance among vendors, before and after adjustment for the same address characteristics used in the ANOVA models. We arbitrarily chose vendor C as a basis for comparison in these logistic models.</p>
         </sec>
         <sec>
            <st>
               <p>Within-address dependence and among-vendor heteroscedasticity of <it>&#961;</it></p>
            </st>
            <p>Recognizing that the above analyses failed to account for the observed dependence of coordinates assigned to the same address by different vendors and the heterogeneity of variances across vendors (among centroid-type matches), we repeated analyses using mixed effects models. This modeling framework allowed simultaneous specification of the within-address dependence and among-vendor heteroscedasticity of <b><it>&#961;</it></b>. Assuming values of <b><it>&#961; </it></b>provided by different vendors were equally correlated, we used a compound symmetric (exchangeable) covariance structure. We were not interested in testing hypotheses concerning the variances and covariances of the within-address covariance matrix. We simply considered them as nuisance parameters needing to be controlled. We also considered the addresses as a random sample of a larger defined population, and the sample of vendors as fixed. Inferences therefore pertain to the four vendors.</p>
         </sec>
         <sec>
            <st>
               <p>Application</p>
            </st>
            <p>We examined the effects of geocoding error over the observed range of <b><it>&#961; </it></b>in a 5% random sample of street-type address matches (n = 2,608) and a census of centroid-type address matches (n = 2,671) from <it>The Environmental Epidemiology of Arrhythmogenesis in WHI</it>, 1999&#8211;2002 <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Briefly, we displaced the coordinates associated with each address at random over a uniform distribution of <b><it>&#952; </it></b>(range, 0&#8211;360&#176;) and lognormal distributions of <b><it>&#961; </it></b>with means and standard deviations approximating the range of values observed in this context. We used ArcGIS<sup>&#174; </sup>9.0 to assign the original and displaced coordinates to year 2000 U.S. Census tracts and to estimate the distance between the coordinates and the nearest interstate, U.S., or state highway or major traffic thoroughfare at that time. Consistent with prior literature, we dichotomized this distance at 100 meters to create a simple proxy for traffic-related air pollution exposure <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B37">37</abbr></abbrgrp>. Then we examined the effect of displacement on this proxy, exposure misclassification rates and census tract concordance. We completed all analyses using the SAS, Version 9.1 software package.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>Door-to-door return times and geocoding costs were generally reasonable across vendors: range, 2&#8211;5 business days and $16&#8211;$25 per 1,000 addresses. However, analyses of the edited address database revealed large differences among vendors A-D in address match rate (98%; 82%; 81%; 30%), census tract concordance (85%; 88%; 87%; 98%) and mean <b><it>&#961; </it></b>(1809; 748; 704; 228 m) (Table <tblr tid="T3">3</tblr> and Figure <figr fid="F2">2</figr>). Address match rate and census tract concordance were relatively high and mean <b><it>&#961;</it></b>, relatively low among WHI, complete, zip-coded, unedited, and urban or suburban addresses; addresses with NAD83 or WGS84 criterion standard coordinates; and street-type matches (Table <tblr tid="T4">4</tblr>).</p>
         <tbl id="T3">
            <title>
               <p>Table 3</p>
            </title>
            <caption>
               <p>Accuracy of geocodes assigned by the four vendors</p>
            </caption>
            <tblbdy cols="8">
               <r>
                  <c ca="left">
                     <p>
                        <b>Vendor</b>
                     </p>
                  </c>
                  <c cspan="3" ca="center">
                     <p>
                        <b>Match Rate</b>
                     </p>
                  </c>
                  <c cspan="3" ca="center">
                     <p>
                        <b>Concordance</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>
                           <it>&#961;</it>
                        </b>
                        <sup>
                           <b>c</b>
                        </sup>
                     </p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Overall</b>
                        <sup>
                           <b>a</b>
                        </sup>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Street</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Centroid</b>
                        <sup>
                           <b>b</b>
                        </sup>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Block Group</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Tract</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>County</b>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c cspan="8">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>A</p>
                  </c>
                  <c ca="center">
                     <p>98%</p>
                  </c>
                  <c ca="center">
                     <p>79%</p>
                  </c>
                  <c ca="center">
                     <p>20%</p>
                  </c>
                  <c ca="center">
                     <p>77%</p>
                  </c>
                  <c ca="center">
                     <p>85%</p>
                  </c>
                  <c ca="center">
                     <p>99%</p>
                  </c>
                  <c ca="center">
                     <p>1809 (8790)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>B</p>
                  </c>
                  <c ca="center">
                     <p>82%</p>
                  </c>
                  <c ca="center">
                     <p>78%</p>
                  </c>
                  <c ca="center">
                     <p>4%</p>
                  </c>
                  <c ca="center">
                     <p>83%</p>
                  </c>
                  <c ca="center">
                     <p>88%</p>
                  </c>
                  <c ca="center">
                     <p>99%</p>
                  </c>
                  <c ca="center">
                     <p>748 (4611)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>C</p>
                  </c>
                  <c ca="center">
                     <p>81%</p>
                  </c>
                  <c ca="center">
                     <p>77%</p>
                  </c>
                  <c ca="center">
                     <p>4%</p>
                  </c>
                  <c ca="center">
                     <p>81%</p>
                  </c>
                  <c ca="center">
                     <p>87%</p>
                  </c>
                  <c ca="center">
                     <p>99%</p>
                  </c>
                  <c ca="center">
                     <p>704 (4418)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>D</p>
                  </c>
                  <c ca="center">
                     <p>30%</p>
                  </c>
                  <c ca="center">
                     <p>30%</p>
                  </c>
                  <c ca="center">
                     <p>0%</p>
                  </c>
                  <c ca="center">
                     <p>97%</p>
                  </c>
                  <c ca="center">
                     <p>98%</p>
                  </c>
                  <c ca="center">
                     <p>100%</p>
                  </c>
                  <c ca="center">
                     <p>228 (884)</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p><sup>a</sup>Due to rounding, may differ from the sum of street- and centroid-type match rates.</p>
               <p><sup>b</sup>Geographic or delivery-weighted center of a statistical tabulation area, e.g. U.S. Census tract. <sup>c</sup>Spherical distance in meters between criterion standard and vendor-assigned coordinates (mean [standard deviation]).</p>
            </tblfn>
         </tbl>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Distribution of the spherical distance in meters (<it>&#961;</it>) between criterion standard and vendor-assigned coordinates, by vendor</p>
            </caption>
            <text>
               <p><b>Distribution of the spherical distance in meters (<it>&#961;</it>) between criterion standard and vendor-assigned coordinates, by vendor</b>. Column I: Scatterplots in which <b>X</b>s and center points represent vendor-assigned and criterion standard coordinates, respectively. Columns II and III: Normalized frequency histograms before (II) and after (III) log-transformation. Columns I and II exclude outlying values to allow equal cross-vendor scaling of axes in meters. n = sample size. sd = standard deviation.</p>
            </text>
            <graphic file="1742-5573-3-8-2"/>
         </fig>
         <tbl id="T4">
            <title>
               <p>Table 4</p>
            </title>
            <caption>
               <p>Overall match rate, census tract concordance and <it>&#961;</it><sup><b>a</b></sup>, by address and match characteristics</p>
            </caption>
            <tblbdy cols="5">
               <r>
                  <c ca="left">
                     <p>
                        <b>Characteristic</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Stratum</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Match Rate</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Census Tract Concordance</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>
                           <it>&#961;</it>
                        </b>
                        <sup>
                           <b>a</b>
                        </sup>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="5">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Address Source</p>
                  </c>
                  <c ca="center">
                     <p>EPA</p>
                  </c>
                  <c ca="center">
                     <p>62%</p>
                  </c>
                  <c ca="center">
                     <p>47%</p>
                  </c>
                  <c ca="center">
                     <p>1,619 (7,904)</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>NGS</p>
                  </c>
                  <c ca="center">
                     <p>88%</p>
                  </c>
                  <c ca="center">
                     <p>72%</p>
                  </c>
                  <c ca="center">
                     <p>1,125 (3,711)</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>WHI</p>
                  </c>
                  <c ca="center">
                     <p>98%</p>
                  </c>
                  <c ca="center">
                     <p>97%</p>
                  </c>
                  <c ca="center">
                     <p>159 (409)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Address Type</p>
                  </c>
                  <c ca="center">
                     <p>No Street Number</p>
                  </c>
                  <c ca="center">
                     <p>28%</p>
                  </c>
                  <c ca="center">
                     <p>8%</p>
                  </c>
                  <c ca="center">
                     <p>5,111 (6,150)</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>Intersection</p>
                  </c>
                  <c ca="center">
                     <p>60%</p>
                  </c>
                  <c ca="center">
                     <p>43%</p>
                  </c>
                  <c ca="center">
                     <p>1,259 (6,270)</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>Complete</p>
                  </c>
                  <c ca="center">
                     <p>82%</p>
                  </c>
                  <c ca="center">
                     <p>73%</p>
                  </c>
                  <c ca="center">
                     <p>793 (6,063)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Zip Code</p>
                  </c>
                  <c ca="center">
                     <p>Absent</p>
                  </c>
                  <c ca="center">
                     <p>60%</p>
                  </c>
                  <c ca="center">
                     <p>45%</p>
                  </c>
                  <c ca="center">
                     <p>1,609 (8,205)</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>Present</p>
                  </c>
                  <c ca="center">
                     <p>96%</p>
                  </c>
                  <c ca="center">
                     <p>92%</p>
                  </c>
                  <c ca="center">
                     <p>376 (1,634)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Edit</p>
                  </c>
                  <c ca="center">
                     <p>Major</p>
                  </c>
                  <c ca="center">
                     <p>59%</p>
                  </c>
                  <c ca="center">
                     <p>45%</p>
                  </c>
                  <c ca="center">
                     <p>2,622 (10,029)</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>Minor</p>
                  </c>
                  <c ca="center">
                     <p>70%</p>
                  </c>
                  <c ca="center">
                     <p>58%</p>
                  </c>
                  <c ca="center">
                     <p>828 (3,833)</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>Unedited</p>
                  </c>
                  <c ca="center">
                     <p>81%</p>
                  </c>
                  <c ca="center">
                     <p>73%</p>
                  </c>
                  <c ca="center">
                     <p>688 (5,877)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Density<sup>b </sup>(persons/km<sup>2</sup>)</p>
                  </c>
                  <c ca="center">
                     <p>Rural, 0&#8211;221</p>
                  </c>
                  <c ca="center">
                     <p>65%</p>
                  </c>
                  <c ca="center">
                     <p>54%</p>
                  </c>
                  <c ca="center">
                     <p>2,069 (8,280)</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>Suburban, 222&#8211;920</p>
                  </c>
                  <c ca="center">
                     <p>79%</p>
                  </c>
                  <c ca="center">
                     <p>71%</p>
                  </c>
                  <c ca="center">
                     <p>566 (6,172)</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>Urban, &#8805; 920</p>
                  </c>
                  <c ca="center">
                     <p>74%</p>
                  </c>
                  <c ca="center">
                     <p>60%</p>
                  </c>
                  <c ca="center">
                     <p>485 (2,319)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Datum<sup>c</sup></p>
                  </c>
                  <c ca="center">
                     <p>Unknown</p>
                  </c>
                  <c ca="center">
                     <p>60%</p>
                  </c>
                  <c ca="center">
                     <p>43%</p>
                  </c>
                  <c ca="center">
                     <p>1,600 (8,612)</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>NAD27</p>
                  </c>
                  <c ca="center">
                     <p>64%</p>
                  </c>
                  <c ca="center">
                     <p>51%</p>
                  </c>
                  <c ca="center">
                     <p>1,475 (6,619)</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>NAD83 or WGS84</p>
                  </c>
                  <c ca="center">
                     <p>87%</p>
                  </c>
                  <c ca="center">
                     <p>81%</p>
                  </c>
                  <c ca="center">
                     <p>590 (3,961)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Match Type</p>
                  </c>
                  <c ca="center">
                     <p>Centroid</p>
                  </c>
                  <c ca="center">
                     <p>100%</p>
                  </c>
                  <c ca="center">
                     <p>34%</p>
                  </c>
                  <c ca="center">
                     <p>5,331 (9,207)</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>Street</p>
                  </c>
                  <c ca="center">
                     <p>100%</p>
                  </c>
                  <c ca="center">
                     <p>90%</p>
                  </c>
                  <c ca="center">
                     <p>607 (5,577)</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p><sup>a</sup>Spherical distance in meters between criterion standard and vendor-assigned coordinates (mean [standard deviation]). <sup>b</sup>Stratified at the 33<sup>rd </sup>and 67<sup>th </sup>percentiles. <sup>c</sup>Original datum of coordinates. NAD27 and NAD83 = North American Datum of 1927 and 1983. WGS84 = World Geodetic System of 1984.</p>
            </tblfn>
         </tbl>
         <p>In analyses restricted to vendors with minimally acceptable match rates (A-C), among-vendor differences in mean <b><it>&#961; </it></b>were small for street-type matches (293; 287; 288 m). In contrast, differences between centroid-type matches were substantial in some vendor contrasts, but not others (6375; 4854; 5524 m), p for interaction &lt; 10<sup>-4</sup>. Adjustment for address characteristics, within-address correlation and heteroscedasticity of <b><it>&#961; </it></b>reduced the mean and standard deviation of <b><it>&#961; </it></b>(Table <tblr tid="T5">5</tblr>). The pattern of adjusted mean <b><it>&#961; </it></b>among vendors reflected that of the adjusted odds of an address match: it was higher for vendor A versus C (OR = 66, 95% CI: 47, 93), but not B versus C (OR = 1.1, 95% CI: 0.9, 1.3). The adjusted odds of census tract concordance were, by comparison, no higher for vendor A versus C (OR = 1.0, 95% CI: 0.9, 1.2) or B versus C (OR = 1.1, 95% CI: 0.9, 1.3) (Table <tblr tid="T6">6</tblr>).</p>
         <tbl id="T5">
            <title>
               <p>Table 5</p>
            </title>
            <caption>
               <p>Spherical distance in meters (<it>&#961;</it>) between criterion standard and vendor-assigned coordinates (mean [standard deviation]), by match type and vendor</p>
            </caption>
            <tblbdy cols="6">
               <r>
                  <c ca="left">
                     <p>
                        <b>Match Type</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Vendor</b>
                     </p>
                  </c>
                  <c cspan="4" ca="center">
                     <p>
                        <b>
                           <it>&#961;</it>
                        </b>
                     </p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Unadjusted</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Adjusted</b>
                        <sup>
                           <b>a</b>
                        </sup>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Within</b>
                        <sup><b>a</b>,<b>b</b></sup>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Hetero</b>
                        <sup>
                           <b>a-c</b>
                        </sup>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Street</p>
                  </c>
                  <c ca="center">
                     <p>A</p>
                  </c>
                  <c ca="center">
                     <p>293 (564)</p>
                  </c>
                  <c ca="center">
                     <p>272 (476)</p>
                  </c>
                  <c ca="center">
                     <p>280 (492)</p>
                  </c>
                  <c ca="center">
                     <p>NA</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>B</p>
                  </c>
                  <c ca="center">
                     <p>287 (545)</p>
                  </c>
                  <c ca="center">
                     <p>262 (438)</p>
                  </c>
                  <c ca="center">
                     <p>268 (447)</p>
                  </c>
                  <c ca="center">
                     <p>NA</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>C</p>
                  </c>
                  <c ca="center">
                     <p>288 (551)</p>
                  </c>
                  <c ca="center">
                     <p>266 (456)</p>
                  </c>
                  <c ca="center">
                     <p>275 (471)</p>
                  </c>
                  <c ca="center">
                     <p>NA</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Centroid</p>
                  </c>
                  <c ca="center">
                     <p>A</p>
                  </c>
                  <c ca="center">
                     <p>6,375 (10,437)</p>
                  </c>
                  <c ca="center">
                     <p>6,194 (9,473)</p>
                  </c>
                  <c ca="center">
                     <p>5,630 (8,576)</p>
                  </c>
                  <c ca="center">
                     <p>5,497 (8,345)</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>B</p>
                  </c>
                  <c ca="center">
                     <p>4,854 (27,279)</p>
                  </c>
                  <c ca="center">
                     <p>3,663 (15,948)</p>
                  </c>
                  <c ca="center">
                     <p>4,230 (18,730)</p>
                  </c>
                  <c ca="center">
                     <p>4,303 (19,185)</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>C</p>
                  </c>
                  <c ca="center">
                     <p>5,524 (34,703)</p>
                  </c>
                  <c ca="center">
                     <p>3,298 (13,068)</p>
                  </c>
                  <c ca="center">
                     <p>3,900 (15,943)</p>
                  </c>
                  <c ca="center">
                     <p>4,210 (17,638)</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p><sup>a</sup>For address source, type, zip code, edit, population density (persons/km<sup>2</sup>) and datum.</p>
               <p><sup>b</sup>Also adjusted for within-address correlation of <b><it>&#961;</it></b>. <sup>c</sup>Additionally adjusted for among-vendor heteroscedasticity of <b><it>&#961; </it></b>(see methods). NA = not applicable.</p>
            </tblfn>
         </tbl>
         <tbl id="T6">
            <title>
               <p>Table 6</p>
            </title>
            <caption>
               <p>Odds ratios (95% confidence intervals) for overall address match and census tract concordance, by vendor</p>
            </caption>
            <tblbdy cols="5">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="2" ca="center">
                     <p>
                        <b>Overall Address Match</b>
                     </p>
                  </c>
                  <c cspan="2" ca="center">
                     <p>
                        <b>Census Tract Concordance</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <b>Vendor</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Unadjusted</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Adjusted</b>
                        <sup>
                           <b>a</b>
                        </sup>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Unadjusted</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Adjusted</b>
                        <sup>
                           <b>b</b>
                        </sup>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="5">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>A</p>
                  </c>
                  <c ca="center">
                     <p>12 (9, 15)</p>
                  </c>
                  <c ca="center">
                     <p>66 (47, 93)</p>
                  </c>
                  <c ca="center">
                     <p>0.8 (0.7, 0.9)</p>
                  </c>
                  <c ca="center">
                     <p>1.0 (0.9, 1.2)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>B</p>
                  </c>
                  <c ca="center">
                     <p>1.1 (0.9, 1.2)</p>
                  </c>
                  <c ca="center">
                     <p>1.1 (0.9, 1.3)</p>
                  </c>
                  <c ca="center">
                     <p>1.1 (0.9, 1.2)</p>
                  </c>
                  <c ca="center">
                     <p>1.1 (0.9, 1.3)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>C</p>
                  </c>
                  <c ca="center">
                     <p>1.0</p>
                  </c>
                  <c ca="center">
                     <p>1.0</p>
                  </c>
                  <c ca="center">
                     <p>1.0</p>
                  </c>
                  <c ca="center">
                     <p>1.0</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p><sup>a</sup>Adjusted for address source, type, zip code, edit, population density, and datum. <sup>b</sup>Also adjusted for match type.</p>
            </tblfn>
         </tbl>
         <p>Further restricting analyses to records successfully geocoded by all vendors A-C attenuated mean <b><it>&#961; </it></b>and its pattern of differences among them. Match rate and census tract concordance were much lower, and mean <b><it>&#961;</it></b>, much higher in analyses of the unedited versus edited EPA addresses (data not shown).</p>
         <p>The percent of street-type address matches &lt; 100 meters away from the nearest highway was relatively constant across mean <b><it>&#961; </it></b>(Table <tblr tid="T7">7</tblr>). This apparent absence of misclassification was related to counter-balancing effects of approximately equal false positive and false negative rates at values of mean <b><it>&#961; </it></b>between 150 and 600 meters. Together, they accounted for a 14% increase in the total error rate over the same range. This increase was accompanied by a 20% decrease in census tract concordance.</p>
         <tbl id="T7">
            <title>
               <p>Table 7</p>
            </title>
            <caption>
               <p>Effect of mean <it>&#961;</it><sup>a </sup>on classification of distance to the nearest highway<sup>b</sup>, exposure misclassification rates<sup>c </sup>and census tract concordance<sup>d</sup></p>
            </caption>
            <tblbdy cols="7">
               <r>
                  <c ca="left">
                     <p>
                        <b>Match</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Mean</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Distance</b>
                     </p>
                  </c>
                  <c cspan="3" ca="center">
                     <p>
                        <b>Misclassification Rates</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Census Tract</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <b>Type</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>
                           <it>&#961;</it>
                        </b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>&lt; 100 m</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>False +</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>False &#8211;</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Total</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Concordance</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="7">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Street</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>27%</p>
                  </c>
                  <c ca="center">
                     <p>0%</p>
                  </c>
                  <c ca="center">
                     <p>0%</p>
                  </c>
                  <c ca="center">
                     <p>0%</p>
                  </c>
                  <c ca="center">
                     <p>100%</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>150</p>
                  </c>
                  <c ca="center">
                     <p>29%</p>
                  </c>
                  <c ca="center">
                     <p>8%</p>
                  </c>
                  <c ca="center">
                     <p>6%</p>
                  </c>
                  <c ca="center">
                     <p>15%</p>
                  </c>
                  <c ca="center">
                     <p>90%</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>300</p>
                  </c>
                  <c ca="center">
                     <p>26%</p>
                  </c>
                  <c ca="center">
                     <p>11%</p>
                  </c>
                  <c ca="center">
                     <p>11%</p>
                  </c>
                  <c ca="center">
                     <p>22%</p>
                  </c>
                  <c ca="center">
                     <p>82%</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>600</p>
                  </c>
                  <c ca="center">
                     <p>27%</p>
                  </c>
                  <c ca="center">
                     <p>15%</p>
                  </c>
                  <c ca="center">
                     <p>14%</p>
                  </c>
                  <c ca="center">
                     <p>29%</p>
                  </c>
                  <c ca="center">
                     <p>70%</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Centroid</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>32%</p>
                  </c>
                  <c ca="center">
                     <p>0%</p>
                  </c>
                  <c ca="center">
                     <p>0%</p>
                  </c>
                  <c ca="center">
                     <p>0%</p>
                  </c>
                  <c ca="center">
                     <p>100%</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>2,500</p>
                  </c>
                  <c ca="center">
                     <p>19%</p>
                  </c>
                  <c ca="center">
                     <p>9%</p>
                  </c>
                  <c ca="center">
                     <p>22%</p>
                  </c>
                  <c ca="center">
                     <p>31%</p>
                  </c>
                  <c ca="center">
                     <p>66%</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>5,000</p>
                  </c>
                  <c ca="center">
                     <p>16%</p>
                  </c>
                  <c ca="center">
                     <p>9%</p>
                  </c>
                  <c ca="center">
                     <p>25%</p>
                  </c>
                  <c ca="center">
                     <p>33%</p>
                  </c>
                  <c ca="center">
                     <p>55%</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>10,000</p>
                  </c>
                  <c ca="center">
                     <p>14%</p>
                  </c>
                  <c ca="center">
                     <p>8%</p>
                  </c>
                  <c ca="center">
                     <p>26%</p>
                  </c>
                  <c ca="center">
                     <p>34%</p>
                  </c>
                  <c ca="center">
                     <p>42%</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p><sup>a</sup>Spherical distance in meters between criterion standard and vendor-assigned coordinates. Standard deviation of <b><it>&#961; </it></b>= 500 and 15,000 meters for street- and centroid-type matches, respectively. <sup>b</sup>Interstate, U.S., or state highway or major traffic thoroughfare. <sup>c</sup>False + indicates misclassification of the unexposed (&#8805; 100 m) as exposed (&lt; 100 m). False &#8211; indicates misclassification of the exposed as unexposed. The sum of false + and &#8211; error rates may not equal the total error rate due to rounding. <sup>d</sup>Percent of census tracts matching those in the datasets without positional error (<b><it>&#961; </it></b>= 0). Based on a 5% random sample of street-type address matches (n = 2,608) and a census of centroid-type address matches (n = 2,671) in The Environmental Epidemiology of Arrhythmogenesis in WHI, 1999&#8211;2002.</p>
            </tblfn>
         </tbl>
         <p>In contrast, the percent of centroid-type address matches classified as &lt; 100 meters away from the nearest highway was approximately two-fold higher at zero versus non-zero values of mean <b><it>&#961; </it></b>(Table <tblr tid="T7">7</tblr>). This finding was related to the two- to three-fold excess of false negative versus false positive rates at values of mean <b><it>&#961; </it></b>between 2,500 and 10,000 meters. The total error rate increased by 3% and census tract concordance decreased by 24% over the same range.</p>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Persistent concerns about the potential effects of inaccurate geocoding on spatially interpolated environmental exposures, exposure-outcome associations, and their contextual effect modifiers have stimulated interest in the positional error of commercially geocoded address coordinates. However, studies of the topic have often reported average positional errors in the range of fifty to 300 meters <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. Although these reports have reduced such concerns, few studies have focused on multiple geographic areas, address sources and vendors; adjusted accuracy measures for important address and methodological characteristics; and estimated the influence of inaccuracy on individual- and contextual-level exposure measures. The generalizability, validity and utility of these estimates is therefore unclear.</p>
         <p>We addressed this issue in a Women's Health Initiative ancillary study, <it>the Environmental Epidemiology of Arrhythmogenesis in WHI</it>, by submitting addresses selected from a broad range of data sources and geographic areas to four well-known vendors often contracted by epidemiologists for geocoding and related services or products (at the time of submission, they had been in business for a combined total of > 35 years, employed > 650 persons, and reported > $50 million of annual sales <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>). We then examined differences between vendors in address match rate, census tract concordance and mean <b><it>&#961;</it></b>.</p>
         <p>We found that geocoding error depends on measures used to evaluate it and vendor. More specifically, vendors matching lower proportions of addresses geocoded them with higher spatial accuracy, i.e. higher census tract concordance and lower mean <b><it>&#961;</it></b>. We also found that that geocoding error depends on address characteristics. Mean <b><it>&#961;</it></b>, for example, was relatively high among EPA, incomplete, unzip-coded, edited and rural addresses; addresses with NAD27 criterion standard coordinates; and in particular, centroid-type address matches. After stratifying by match type, then adjusting for the remaining address characteristics and other methodological factors, mean <b><it>&#961; </it></b>remained twenty times higher among vendor A's centroid- versus street-type address matches. The adjusted odds of an address match also remained more than sixty times higher for vendor A than either B or C. Lastly, by randomly displacing address coordinates over the range of mean <b><it>&#961; </it></b>observed in this context, we found that traffic-related pollution exposure misclassification rates increased and census tract concordance decreased with corresponding increases in mean <b><it>&#961;</it></b>.</p>
         <p>Considered together, these findings suggest that vendor selection presents a trade-off between potential for missing data and error in estimating spatially defined attributes such as environmental exposure and socioeconomic context. They also indicate that the trade-off can be quite unbalanced. Vendor D, for example, matched an unacceptably low proportion of addresses, but geocoded them with a singularly high level of spatial accuracy. Moreover, the observed association between missing data and positional error across vendors suggests that while vendors may be targeting different points along the trade-off spectrum, they tend to retain observations that are likely to have positional errors. Deleting these observations would of course translate into reduced potential for bias due to individual- and contextual-level exposure measurement error, but it remains unclear whether vendors can increase data accuracy without compromising its availability.</p>
         <p>Although these findings may have greater generalizability, validity and utility than those previously reported, our criterion standards may have been imperfect. Interpretation must therefore recognize potential for bias due to the elusiveness of a definitive criterion standard. Indeed, match rate and concordance may have been overestimated and mean <b><it>&#961;</it></b>, underestimated because using imperfect criterion standards tends to artificially inflate accuracy <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
         <p>Since errors in accuracy measures vary with errors in imperfect criterion standards, we therefore edited addresses when they failed to conform to U.S. postal standards or conflicted with field notes. Editing was intended to reduce misspelled, misspaced or inappropriately abbreviated state, street suffix or secondary unit designators like "apartment" <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Though well-intentioned, editing may have introduced error instead of reducing it. Mindful of this possibility, we submitted both the unedited and edited versions of EPA addresses for geocoding. We found that, on average, match rate and census tract concordance were much higher and mean <b><it>&#961;</it></b>, much lower in analyses of the edited versus unedited versions of the database. This finding confirmed that, on average, editing tended to correct addresses and thereby reduce error in accuracy measures, but as a precaution, we also adjusted measures of accuracy for edit type.</p>
         <p>Even after editing addresses, our criterion standards may have contained erroneous coordinates of EPA monitors, NGS stations and WHI participants. Such errors have been identified, for example, within EPA databases of environmental hazards in South Carolina <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Although theses errors vary across data sources, among states and over time, their potential existence in this context is no less a concern. The EPA implemented its Locational Data Policy in 1991 in response to concerns of this sort. It stipulated adoption of uniform methods, use of global positioning systems and collection of monitor coordinates according to a Federal Interagency Coordinating Committee on Digital Cartography accuracy standard of 25 meters <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Five years later, the EPA also launched its Locational Data Improvement Project as a vehicle for further improvement in the accuracy of its databases <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. Moreover, the NGS adheres to a stricter, 1998 Federal Geographic Data Committee standard of less than ten meters <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> &#8211; a distance identical to that between parcel center points and true residential locations in urban settings and somewhat less than that in rural areas <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. We also adjusted measures of accuracy for differences among address sources despite these reassurances.</p>
         <p>Interpretation of the findings reported here must also consider the challenges inherent in disentangling the general effect of vendor and the specific effect of a given geocoding method. Street offset &#8211; the perpendicular distance between vendor-assigned coordinates and the corresponding street centerline &#8211; serves as an illustrative example. Although researchers are often troubled by vendors' underlying assumption that this distance is equal for all addresses, a different study design would have been required to discriminate effects of vendor and offset because as a default, vendors A-D used distinct offsets between zero and fifty feet. However, a repeated-measures design &#8211; one in which the same addresses would have been geocoded repeatedly by the same vendors using different offsets &#8211; was not feasible: the option of changing defaults was not uniformly available among vendors A-D. Even if it had been, prior reports suggesting that the contribution of offset to geocoding accuracy is rather modest within the narrow range of defaults observed in this context are reassuring <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B16">16</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>With these caveats in mind, we conclude that informed selection of geocoding practices and approaches to data analysis involves estimating potential for, balancing the trade-off between, and when appropriate, adjusting for the effects of missing data and error in spatially defined attributes. We suggest beginning this process by submitting (masked) addresses associated with high quality criterion standard coordinates in a given study area to geocoding vendors, estimating the accuracy of vendor-assigned coordinates, and selecting vendors that balance the tradeoff between missing data and error in ways that best meet study needs. If edited and unedited forms of the same address are included in the geocoded data set, address cleaning procedures &#8211; which should (but may not) be standardized &#8211; can be simultaneously evaluated.</p>
         <p>Comparing the limitations of methods commonly used to analyze incomplete data with those used to adjust for positional or exposure measurement error may help prioritize individual study needs in advance <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>. Basic algebra, for instance, can be used to adjust associations for exposure measurement error <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Consider the cell counts observed in a hypothetical case-control study of the association between distance to the nearest highway and coronary heart disease mortality (Table <tblr tid="T8">8</tblr>). The sensitivity (se) and specificity (sp) of the 100 m distance classification at mean <b><it>&#961; </it></b>= 150 m can be calculated from the corresponding false negative (fn) and false positive (fp) rates in Table <tblr tid="T7">7</tblr>:</p>
         <tbl id="T8">
            <title>
               <p>Table 8</p>
            </title>
            <caption>
               <p>Cell counts from a hypothetical case-control study of the association between distance to the nearest highway and coronary heart disease mortality</p>
            </caption>
            <tblbdy cols="3">
               <r>
                  <c ca="left">
                     <p>
                        <b>Distance</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Case</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Non-Case</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="3">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>&lt; 100 m</p>
                  </c>
                  <c ca="center">
                     <p>a* = 88</p>
                  </c>
                  <c ca="center">
                     <p>b* = 108</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>&#8805; 100 m</p>
                  </c>
                  <c ca="center">
                     <p>c* = 137</p>
                  </c>
                  <c ca="center">
                     <p>d* = 294</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>OR* = (a* &#215; d*) &#247; (b* &#215; c*) = 1.8</p>
            </tblfn>
         </tbl>
         <p>se = 1 - fn = 1 - 0.06 = 0.94</p>
         <p>sp = 1 - fp = 1 - 0.08 = 0.92</p>
         <p>Under non-differential misclassification, the corrected cell counts are</p>
         <p>a = (a* - 0.08 &#215; (a* + c*)) &#247; (0.94 + 0.92 - 1) = 81.40</p>
         <p>b = (b* - 0.08 &#215; (b* + d*)) &#247; (0.94 + 0.92 - 1) = 88.19</p>
         <p>c = (a* + c*) - a = 143.61</p>
         <p>d = (b* + d*) - b = 313.81</p>
         <p>and in the absence of confounding, the corrected odds ratio is</p>
         <p>OR = (a &#215; d) &#247; (b &#215; c) = (81.40 &#215; 313.81) &#247; (88.19 &#215; 143.61) = 2.0</p>
         <p>This odds ratio is more extreme than its uncorrected counterpart, OR* (Table <tblr tid="T8">8</tblr>), which is biased toward the null. Its corrected probability distribution can be estimated using Monte Carlo simulation <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>.</p>
         <p>However, the magnitude of exposure measurement error in a continuous variable such as distance to the nearest highway may not vary directly with the magnitude of a given exposure-outcome association. When it is independent of disease status, the resulting misclassification of commonly used exposure categories (e.g. distance &lt; or &#8805; 100 meters) may be differential and vary in unanticipated ways. Seemingly appropriate adjustments may also be inaccurate even when this type of misclassification is non-differential <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. Such adjustments must therefore be applied with caution.</p>
         <p>Nonetheless, uninformed selection of geocoding practices and data analysis appears to be a less desirable alternative, particularly in studies of exposure mechanisms operating within short distances. The positional errors reported here suggest that "short" should be defined as less than 280 meters for potentially geocodable addresses matched at the street level and less than 5.5 kilometers for those matched at the centroid level by well-known vendors with minimally acceptable match rates. Critical distances, though, may be substantially lower given the non-negligible misclassification rates we observed when mean <b><it>&#961; </it></b>was approximately one-half as large as these values. More accurate geocoding methods that involve global positioning or parcel matching can be used to reduce potential for bias in studies requiring such high levels of spatial resolution <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B16">16</abbr></abbrgrp>. Use of the latter method is expected to grow over time as high quality, parcel-level databases become more uniformly available across larger study areas.</p>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>ARIC Atherosclerosis Risk in Communities</p>
         <p>CASS Coding Accuracy Support System</p>
         <p>EPA Environmental Protection Agency</p>
         <p>FIPS Federal Information Processing Standards</p>
         <p>NAD27 and NAD83 North American Datum of 1927 and 1983</p>
         <p>NGS National Geodetic Survey</p>
         <p>TIGER Topologically Integrated Geographic Encoding and Referencing</p>
         <p>USPS United States Postal System</p>
         <p>WHI Women's Health Initiative</p>
         <p>WGS84 World Geodetic System of 1984</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The author(s) declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>EAW conceived of the study, designed it, and drafted the manuscript. PMQ assembled and analyzed the data, and helped draft the manuscript. RLS directed the statistical analysis and helped draft the manuscript. DJC helped direct the statistical analysis and draft the manuscript. DL helped design the study and draft the manuscript. ACH directed handling of geographic data and helped draft the manuscript. GH helped design the study and draft the manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The National Institute of Environmental Health Sciences funded this ancillary study (5-R01-ES012238). The National Heart, Lung and Blood Institute, U.S. Department of Health and Human Services funded the WHI program. The authors published their preliminary findings as an abstract <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and acknowledge the contributions of WHI Investigators in the:</p>
            <p><ul>Program Office</ul> (National Heart, Lung, and Blood Institute, Bethesda, Maryland) Barbara Alving, Jacques Rossouw, Shari Ludlam, Linda Pottern, Joan McGowan, Leslie Ford, and Nancy Geller.</p>
            <p><ul>Clinical Coordinating Center</ul> (Fred Hutchinson Cancer Research Center, Seattle, WA) Ross Prentice, Garnet Anderson, Andrea LaCroix, Charles L. Kooperberg, Ruth E. Patterson, Anne McTiernan; (Wake Forest University School of Medicine, Winston-Salem, NC) Sally Shumaker; (Medical Research Labs, Highland Heights, KY) Evan Stein; (University of California at San Francisco, San Francisco, CA) Steven Cummings.</p>
            <p><ul>Clinical Centers</ul> (Albert Einstein College of Medicine, Bronx, NY) Sylvia Wassertheil-Smoller; (Baylor College of Medicine, Houston, TX) Jennifer Hays; (Brigham and Women's Hospital, Harvard Medical School, Boston, MA) JoAnn Manson; (Brown University, Providence, RI) Annlouise R. Assaf; (Emory University, Atlanta, GA) Lawrence Phillips; (Fred Hutchinson Cancer Research Center, Seattle, WA) Shirley Beresford; (George Washington University Medical Center, Washington, DC) Judith Hsia; (Harbor-UCLA Research and Education Institute, Torrance, CA) Rowan Chlebowski; (Kaiser Permanente Center for Health Research, Portland, OR) Evelyn Whitlock; (Kaiser Permanente Division of Research, Oakland, CA) Bette Caan; (Medical College of Wisconsin, Milwaukee, WI) Jane Morley Kotchen; (MedStar Research Institute/Howard University, Washington, DC) Barbara V. Howard; (Northwestern University, Chicago/Evanston, IL) Linda Van Horn; (Rush Medical Center, Chicago, IL) Henry Black; (Stanford Prevention Research Center, Stanford, CA) Marcia L. Stefanick; (State University of New York at Stony Brook, Stony Brook, NY) Dorothy Lane; (The Ohio State University, Columbus, OH) Rebecca Jackson; (University of Alabama at Birmingham, Birmingham, AL) Cora E. Lewis; (University of Arizona, Tucson/Phoenix, AZ) Tamsen Bassford; (University at Buffalo, Buffalo, NY) Jean Wactawski-Wende; (University of California at Davis, Sacramento, CA) John Robbins; (University of California at Irvine, CA) F. Allan Hubbell; (University of California at Los Angeles, Los Angeles, CA) Howard Judd; (University of California at San Diego, LaJolla/Chula Vista, CA) Robert D. Langer; (University of Cincinnati, Cincinnati, OH) Margery Gass; (University of Florida, Gainesville/Jacksonville, FL) Marian Limacher; (University of Hawaii, Honolulu, HI) David Curb; (University of Iowa, Iowa City/Davenport, IA) Robert Wallace; (University of Massachusetts/Fallon Clinic, Worcester, MA) Judith Ockene; (University of Medicine and Dentistry of New Jersey, Newark, NJ) Norman Lasser; (University of Miami, Miami, FL) Mary Jo O'Sullivan; (University of Minnesota, Minneapolis, MN) Karen Margolis; (University of Nevada, Reno, NV) Robert Brunner; (University of North Carolina, Chapel Hill, NC) Gerardo Heiss; (University of Pittsburgh, Pittsburgh, PA) Lewis Kuller; (University of Tennessee, Memphis, TN) Karen C. Johnson; (University of Texas Health Science Center, San Antonio, TX) Robert Brzyski; (University of Wisconsin, Madison, WI) Gloria E. Sarto; (Wake Forest University School of Medicine, Winston-Salem, NC) Denise Bonds; (Wayne State University School of Medicine/Hutzel Hospital, Detroit, MI) Susan Hendrix.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Topologically Integrated GeographicEncoding and Referencing (TIGER) system</p>
            </title>
            <aug>
               <au>
                  <cnm>U.S. Census Bureau</cnm>
               </au>
            </aug>
            <url>http://www.census.gov/geo/www/tiger/index.html</url>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Geocoding in cancer research: A review</p>
            </title>
            <aug>
               <au>
                  <snm>Rushton</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Armstrong</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Gittler</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Greene</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>Pavlick</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>West</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Zimmerman</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>Am J Prev Med</source>
            <pubdate>2006</pubdate>
            <volume>30</volume>
            <issue>2S</issue>
            <fpage>S16</fpage>
            <lpage>S24</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.amepre.2005.09.011</pubid>
                  <pubid idtype="pmpid" link="fulltext">16458786</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Geographic information systems: Their use in environmental epidemiologic research</p>
            </title>
            <aug>
               <au>
                  <snm>Vine</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Degnan</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hanchette</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Environ Health Perspect</source>
            <pubdate>1997</pubdate>
            <volume>106</volume>
            <issue>6</issue>
            <fpage>598</fpage>
            <lpage>605</lpage>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Using geographic information systems for exposure assessment in environmental epidemiology studies</p>
            </title>
            <aug>
               <au>
                  <snm>Nuckols</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Ward</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Jarup</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Environ Health Perspect</source>
            <pubdate>2004</pubdate>
            <volume>112</volume>
            <issue>9</issue>
            <fpage>1007</fpage>
            <lpage>1015</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1247194</pubid>
                  <pubid idtype="pmpid" link="fulltext">15198921</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>On the wrong side of the tracts? Evaluating the accuracy of geocoding in public health research</p>
            </title>
            <aug>
               <au>
                  <snm>Krieger</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Waterman</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lemieux</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Zierler</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hogan</snm>
                  <fnm>JW</fnm>
               </au>
            </aug>
            <source>Am J Public Health</source>
            <pubdate>2001</pubdate>
            <volume>91</volume>
            <issue>7</issue>
            <fpage>1114</fpage>
            <lpage>1116</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1446703</pubid>
                  <pubid idtype="pmpid" link="fulltext">11441740</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Development and evaluation of a framework for assessing the efficiency and accuracy of street address geocoding strategies</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>PhD Thesis</source>
            <publisher>State University of New York at Albany, Rockefeller College of Public Affairs and Policy</publisher>
            <pubdate>1996</pubdate>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Positional accuracy of two methods of geocoding</p>
            </title>
            <aug>
               <au>
                  <snm>Ward</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Nuckols</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Giglierano</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bonner</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Wolter</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Airola</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mix</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Colt</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Hartge</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Epidemiology</source>
            <pubdate>2005</pubdate>
            <volume>16</volume>
            <issue>4</issue>
            <fpage>542</fpage>
            <lpage>547</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/01.ede.0000165364.54925.f3</pubid>
                  <pubid idtype="pmpid" link="fulltext">15951673</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Evaluation of uncertainties associated with geocoding techniques</p>
            </title>
            <aug>
               <au>
                  <snm>Karimi</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Durcik</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rasdorf</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Computer-aided Civil and Infrastructure Engineering</source>
            <pubdate>2004</pubdate>
            <volume>19</volume>
            <issue>3</issue>
            <fpage>170</fpage>
            <lpage>185</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/j.1467-8667.2004.00346.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Positional accuracy of geocoded addresses in epidemiologic research</p>
            </title>
            <aug>
               <au>
                  <snm>Bonner</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nie</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rogerson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Vena</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Freudenheim</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Epidemiology</source>
            <pubdate>2003</pubdate>
            <volume>14</volume>
            <issue>4</issue>
            <fpage>408</fpage>
            <lpage>412</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12843763</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Locational uncertainty in georeferencing public health datasets</p>
            </title>
            <aug>
               <au>
                  <snm>Dearwent</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Jacobs</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Halbert</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>J Expo Anal Environ Epidemiol</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <issue>4</issue>
            <fpage>329</fpage>
            <lpage>334</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.jea.7500173</pubid>
                  <pubid idtype="pmpid" link="fulltext">11571612</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>On the accuracy of TIGER-type geocoded address data in relation to cadastral and census areal units</p>
            </title>
            <aug>
               <au>
                  <snm>Ratcliffe</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>Int J Geographical Information Science</source>
            <pubdate>2001</pubdate>
            <volume>15</volume>
            <issue>5</issue>
            <fpage>473</fpage>
            <lpage>485</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1080/13658810110047221</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Guidelines on procedures for constructing air pollution isopleth profiles and population exposure analysis</p>
            </title>
            <aug>
               <au>
                  <cnm>U.S. Environmental Protection Agency</cnm>
               </au>
            </aug>
            <source>EPA-450/2-77-024a</source>
            <publisher>Research Triangle Park, NC</publisher>
            <pubdate>1977</pubdate>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Monitor-to-monitor temporal correlation of air pollution in the contiguous US</p>
            </title>
            <aug>
               <au>
                  <snm>Ito</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>De</snm>
                  <fnm>Leon S</fnm>
               </au>
               <au>
                  <snm>Thurston</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>N&#225;das</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lippmann</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Expo Anal Environ Epidemiol</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <issue>2</issue>
            <fpage>172</fpage>
            <lpage>184</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.jea.7500386</pubid>
                  <pubid idtype="pmpid" link="fulltext">15199379</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Monitor-to-monitor temporal correlation of air pollution and weather variables in the North-Central U.S</p>
            </title>
            <aug>
               <au>
                  <snm>Ito</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Thurston</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>N&#225;das</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lippmann</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Expo Anal Environ Epidemiol</source>
            <pubdate>2001</pubdate>
            <volume>15</volume>
            <issue>2</issue>
            <fpage>172</fpage>
            <lpage>184</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1038/sj.jea.7500386</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Concentration and size distribution of ultrafine particles near a major highway</p>
            </title>
            <aug>
               <au>
                  <snm>Zhu</snm>
                  <fnm>YF</fnm>
               </au>
               <au>
                  <snm>Hinds</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sioutas</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>J Air Waste Manag Assoc</source>
            <pubdate>2002</pubdate>
            <volume>52</volume>
            <issue>9</issue>
            <fpage>1032</fpage>
            <lpage>1042</lpage>
            <xrefbib>
               <pubid idtype="pmpid">12269664</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Positional error in automated geocoding of residential addresses</p>
            </title>
            <aug>
               <au>
                  <snm>Cayo</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Talbot</snm>
                  <fnm>TO</fnm>
               </au>
            </aug>
            <source>International J Health Geographics</source>
            <pubdate>2003</pubdate>
            <volume>2</volume>
            <issue>10</issue>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Post office box addresses: a challenge for geographic information system-based studies</p>
            </title>
            <aug>
               <au>
                  <snm>Hurley</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Saunders</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Nivas</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hertz</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Reynolds</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Epidemiology</source>
            <pubdate>2003</pubdate>
            <volume>14</volume>
            <issue>4</issue>
            <fpage>386</fpage>
            <lpage>391</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12843760</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Accuracy of commercial geocoding in a Women's Health Initiative ancillary study: The Environmental Epidemiology of Arrhythmogenesis in WHI [Abstract]</p>
            </title>
            <aug>
               <au>
                  <snm>Whitsel</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Quibrera</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Catellier</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Liao</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Henley</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Heiss</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Circulation</source>
            <volume>111</volume>
            <issue>14</issue>
            <fpage>237</fpage>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Using geographic information systems to assess individual historical exposure to air pollution from traffic and house heating in Stockholm</p>
            </title>
            <aug>
               <au>
                  <snm>Bellander</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Berglind</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Gustavsson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Jonson</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nyberg</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Pershagen</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Jarup</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Environ Health Perspect</source>
            <pubdate>2001</pubdate>
            <volume>109</volume>
            <issue>6</issue>
            <fpage>633</fpage>
            <lpage>639</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1240347</pubid>
                  <pubid idtype="pmpid" link="fulltext">11445519</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Accuracy and repeatability of commercial geocoding</p>
            </title>
            <aug>
               <au>
                  <snm>Whitsel</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Rose</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Wood</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Henley</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Liao</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Heiss</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Am J Epidemiol</source>
            <pubdate>2004</pubdate>
            <volume>160</volume>
            <issue>10</issue>
            <fpage>1023</fpage>
            <lpage>1029</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/aje/kwh310</pubid>
                  <pubid idtype="pmpid" link="fulltext">15522859</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Biases in the assessment of diagnostic tests</p>
            </title>
            <aug>
               <au>
                  <snm>Begg</snm>
                  <fnm>CB</fnm>
               </au>
            </aug>
            <source>Stat Med</source>
            <pubdate>1987</pubdate>
            <volume>6</volume>
            <fpage>411</fpage>
            <lpage>423</lpage>
            <xrefbib>
               <pubid idtype="pmpid">3114858</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Environmental Epidemiology of Arrhythmogenesis in WHI</p>
            </title>
            <aug>
               <au>
                  <snm>Whitsel</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Heiss</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Catellier</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Liao</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Peuquet</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Prineas</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>GL</fnm>
               </au>
            </aug>
            <url>http://crisp.cit.nih.gov/crisp/CRISP_LIB.getdoc?textkey=6599396&amp;p_grant_num=1R01ES012238-01&amp;p_query=&amp;ticket=6776514&amp;p_audit_session_id=30381838&amp;p_keywords=</url>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Design of the Women's Health Initiative clinical trial and observational study</p>
            </title>
            <aug>
               <au>
                  <cnm>The WHI Study Group</cnm>
               </au>
            </aug>
            <source>Control Clin Trials</source>
            <pubdate>1998</pubdate>
            <volume>19</volume>
            <issue>1</issue>
            <fpage>61</fpage>
            <lpage>109</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0197-2456(97)00078-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">9492970</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The Atherosclerosis Risk in Communities (ARIC) Study: design and objectives</p>
            </title>
            <aug>
               <au>
                  <cnm>ARIC investigators</cnm>
               </au>
            </aug>
            <source>Am J Epidemiol</source>
            <pubdate>1989</pubdate>
            <volume>129</volume>
            <issue>4</issue>
            <fpage>687</fpage>
            <lpage>702</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2646917</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Technology Transfer Network. Air Quality System</p>
            </title>
            <aug>
               <au>
                  <cnm>U.S. Environmental Protection Agency</cnm>
               </au>
            </aug>
            <url>http://www.epa.gov/ttn/airs/airsaqs/detaildata/downloadaqsdata.htm</url>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Links to North Carolina county geographic information systems (GIS) websites</p>
            </title>
            <url>http://www.unc.edu/~ewhitsel/NCGISlinks2.html</url>
         </bibl>
         <bibl id="B27">
            <title>
               <p>NGS datasheet page</p>
            </title>
            <aug>
               <au>
                  <cnm>National Geodetic Survey</cnm>
               </au>
            </aug>
            <url>http://www.ngs.noaa.gov/cgi-bin/datasheet.prl</url>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Official USPS Abbreviations</p>
            </title>
            <aug>
               <au>
                  <cnm>U.S. Postal Service</cnm>
               </au>
            </aug>
            <url>http://www.usps.com/ncsc/lookups/usps_abbreviations.html</url>
         </bibl>
         <bibl id="B29">
            <title>
               <p>American Fact Finder</p>
            </title>
            <aug>
               <au>
                  <cnm>U.S. Census Bureau</cnm>
               </au>
            </aug>
            <url>http://factfinder.census.gov/servlet/AGSGeoAddressServlet?_lang=en&amp;_programYear=50&amp;_treeId=420</url>
         </bibl>
         <bibl id="B30">
            <title>
               <p>EnviroMapper</p>
            </title>
            <aug>
               <au>
                  <cnm>U.S. Environmental Protection Agency</cnm>
               </au>
            </aug>
            <url>http://www.epa.gov/enviro/html/em/index2.html</url>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Mapquest</p>
            </title>
            <url>http://www.mapquest.com</url>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Google Maps</p>
            </title>
            <url>http://maps.google.com</url>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Locational data</p>
            </title>
            <aug>
               <au>
                  <cnm>U.S. Environmental Protection Agency</cnm>
               </au>
            </aug>
            <source>Information Resources Management Policy Manual. EPA directive 2100</source>
            <pubdate>1991</pubdate>
            <url>http://www.epa.gov/irmpoli8/archived/polman/chaptr13.htm</url>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Geospatial positioning accuracy standards. Part 2: Standards for geodetic networks</p>
            </title>
            <aug>
               <au>
                  <cnm>Federal Geographic Data Committee</cnm>
               </au>
            </aug>
            <source>FGDC-STD-007.2-1998</source>
            <url>http://www.fgdc.gov/standards/standards_publications/index_html</url>
         </bibl>
         <bibl id="B35">
            <aug>
               <au>
                  <snm>O'Rourke</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Computational Geometry in C</source>
            <publisher>Cambridge: Cambridge University Press</publisher>
            <edition>2</edition>
            <pubdate>1998</pubdate>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Heart rate variability, ambient particulate matter and socioeconomic context: The Environmental Epidemiology of Arrhythmogenesis in WHI [Abstract]</p>
            </title>
            <aug>
               <au>
                  <snm>Whitsel</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Liao</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Prineas</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Peuquet</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Quibrera</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Catellier</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Heiss</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>RL</fnm>
               </au>
            </aug>
            <source>Circulation</source>
            <pubdate>2006</pubdate>
            <volume>113</volume>
            <issue>8</issue>
            <fpage>338</fpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16415376</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Estimation of long-term average exposure to outdoor air pollution for a cohort study on mortality</p>
            </title>
            <aug>
               <au>
                  <snm>Hoek</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Fischer</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Van Den Brandt</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Goldbohm</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Brunekreef</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>J Expo Anal Environ Epidemiol</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <issue>6</issue>
            <fpage>459</fpage>
            <lpage>469</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.jea.7500189</pubid>
                  <pubid idtype="pmpid" link="fulltext">11791163</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Small business solutions. Company profile reports</p>
            </title>
            <aug>
               <au>
                  <cnm>Dun and Bradstreet</cnm>
               </au>
            </aug>
            <url>http://www.dnb.com/us</url>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Spatial accuracy of the EPA's environmental hazards databases and their use in environmental equity analyses</p>
            </title>
            <aug>
               <au>
                  <snm>Scott</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cutter</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Menzel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ji</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Appl Geog Studies</source>
            <pubdate>1997</pubdate>
            <volume>1</volume>
            <issue>1</issue>
            <fpage>45</fpage>
            <lpage>61</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/(SICI)1520-6319(199721)1:1&lt;45::AID-AGS5>3.0.CO;2-V</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Locational Data Improvement Project (LDIP)</p>
            </title>
            <aug>
               <au>
                  <cnm>U.S. Environmental Protection Agency</cnm>
               </au>
            </aug>
            <url>http://www.epa.gov/enviro/html/locational/ldip</url>
         </bibl>
         <bibl id="B41">
            <title>
               <p>What do we do with missing data? Some options for analysis of incomplete data</p>
            </title>
            <aug>
               <au>
                  <snm>Raghunathan</snm>
                  <fnm>TE</fnm>
               </au>
            </aug>
            <source>Annu Rev Public Health</source>
            <pubdate>2004</pubdate>
            <volume>25</volume>
            <fpage>99</fpage>
            <lpage>117</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.publhealth.25.102802.124410</pubid>
                  <pubid idtype="pmpid">15015914</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Spatial statistics in the presence of location error with an application to remote sensing of the environment</p>
            </title>
            <aug>
               <au>
                  <snm>Cressie</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kornak</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Stat Sci</source>
            <pubdate>2003</pubdate>
            <volume>18</volume>
            <issue>4</issue>
            <fpage>436</fpage>
            <lpage>456</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1214/ss/1081443228</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Differential misclassification arising from nondifferential errors in exposure measurement</p>
            </title>
            <aug>
               <au>
                  <snm>Flegal</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Keyl</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Nieto</snm>
                  <fnm>FJ</fnm>
               </au>
            </aug>
            <source>Am J Epidemiol</source>
            <pubdate>1991</pubdate>
            <volume>134</volume>
            <issue>10</issue>
            <fpage>1233</fpage>
            <lpage>1244</lpage>
            <xrefbib>
               <pubid idtype="pmpid">1746532</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Basic methods for sensitivity analysis and external adjustment</p>
            </title>
            <aug>
               <au>
                  <snm>Greenland</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Modern epidemiology</source>
            <publisher>Philadelphia: Lippincott Williams and Wilkins</publisher>
            <editor>Rothman KJ, Greenland S</editor>
            <edition>second</edition>
            <pubdate>1998</pubdate>
            <fpage>343</fpage>
            <lpage>357</lpage>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Quantifying and reporting uncertainty from systematic errors</p>
            </title>
            <aug>
               <au>
                  <snm>Phillips</snm>
                  <fnm>CV</fnm>
               </au>
            </aug>
            <source>Epidemiology</source>
            <pubdate>2003</pubdate>
            <volume>14</volume>
            <issue>4</issue>
            <fpage>459</fpage>
            <lpage>466</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12843772</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
