Home | Archives | Announcements | About the Journal | Submission Information | Contact Us


Printer Friendly Version

Volume 4, December 7, 2004

www.psljournal.com/archives/all/kain.html

 

 

A CHALLENGE TO THE ADMISSIBILITY OF FIREARMS AND TOOLMARK IDENTIFICATIONS: AMICUS BRIEF PREPARED ON BEHALF OF THE DEFENDANT IN UNITED STATES V. KAIN, CRIM. NO. 03-573-1 (E.D. PA. 2004) [1]

 

 

INTRODUCTION

 

The following amicus brief was prepared in connection with a Daubert challenge in federal court to the admission of a firearms and toolmark examiner’s testimony that cuts in a fence and grate were made by the defendant’s bolt cutters, to the exclusion of all other bolt cutters in the world.  Before the brief could be filed with the court, the government offered the defendant a plea bargain that was too good to refuse.

 

The expert testimony in this case was typical of that offered by firearms and toolmark examiners.  The goal of the forensic science discipline of firearms and toolmark identification is to identity particular tools, such as a bolt cutter or the barrel of a particular gun, as the unique source of marks on crime scene evidence, such as a fence or a fired bullet.[2]  Although numerous convictions are based on this type of testimony[3], courts have yet to recognize that adequate statistical and empirical foundations have not been developed for these identifications.  The following brief explains the systemic scientific problems that should make firearms and toolmark identifications inadmissible in court.[4]

 

AMICUS BRIEF PREPARED ON BEHALF OF THE DEFENDANT IN UNITED STATES V. KAIN, CRIM. NO. 03-573-1 (E.D. PA. 2004)

 

 STATEMENT OF INTERESTS OF AMICUS CURIAE

 

Amicus Adina Schwartz, J.D., Ph.D. (Philosophy), is an Associate Professor in the Department of Law, Police Science and Criminal Justice Administration at John Jay College of Criminal Justice and in the Ph.D. Program in Criminal Justice of The Graduate School and University Center, City University of New York (CUNY).  John Jay College is the only liberal arts college in the United States devoted to criminal justice, and the CUNY Criminal Justice Ph.D. Program is the only Criminal Justice Ph.D. program in the country that has a forensic science track.

 

Amicus regularly teaches evidence law to undergraduates at John Jay College, and has twice taught a course, entitled “Science, Experts and Evidence in the Criminal Justice System,” in the CUNY Criminal Justice Ph.D. Program.  Beginning in Spring 2005, she will be teaching evidence law in John Jay College’s newly created M.S. Program in Forensic Computing.  As someone who teaches many current and future law enforcement agents and significant numbers of future forensic scientists, she submits this brief in the belief that high standards for the admission of scientific evidence are needed to motivate forensic scientists and law enforcement agents to do the careful scholarly and investigative work of which they are capable.  As a scholar who writes on evidence law, forensic identification, and philosophy of science issues, she submits this brief in the belief that this case presents this Court with an important opportunity to apply the Daubert-Kumho standard to exclude unreliable forensic identification testimony.

 

I. THIS COURT’S RELIABILITY INQUIRY SHOULD FOCUS ON THE REASONING UNDERLYING THE EXPERT’S IDENTIFICATION OF THE BOLT CUTTERS ALLEGEDLY FOUND IN THE DEFENDANT’S CAR AS THE PAIR, TO THE EXCLUSION OF ALL OTHERS, THAT CUT  THE GRATE, CHAIN LINK, AND PIECES OF CHAIN LINK FENCE.

 

A. The Need to Consider the Expert’s Specific Reasoning. Instead of mechanically applying the specific factors listed in Daubert, trial judges are to perfom the Daubert-Kumho reliability inquiry by evaluating the reasoning underlying an expert’s testimony. See, e.g., Daubert v. Merrell Dow Pharmaceuticals, Inc., 509 U.S. 579, 593 (1993); Kumho Tire Company, Ltd. v. Carmichael, 526 U.S. 137, 151 (1999).  As the Advisory Committee explained when Fed. R. Evid. 702 was amended in response to Daubert and its progeny, “The trial judge in all cases of proffered expert testimony must find that it is properly grounded, well-reasoned, and not speculative before it can be admitted.  The expert’s testimony must be grounded in an accepted body of learning or experience in the expert’s field, and the expert must explain how the conclusion is so grounded. ” (citation omitted). Advisory Committee’s Note to Amendment to Fed. R.Evid. 702 (2000).

 

Kumho explains that judicial gatekeeping must focus on the specific reasoning employed by the particular expert in a case, as opposed to the reasoning generally employed by experts in the field.  526 U.S. at 153-54.  The focus must also be on the particular stage of the expert’s reasoning whose reliability is suspect, whether it be the proffered “testimony’s factual basis, data, principles, methods, or their application [to reach specific conclusions].” Id. at 149, 154.  See also Barry Scheck, DNA and Daubert, 15 Cardozo L.Rev. 1959, 1959 n.3 (1994) (warning that under Daubert, “judges have to resist the temptation to reach simplistic conclusions about ‘DNA testing’ in general and focus instead on the scientific merits of each application of DNA technology”).

 

There is no contest in this case that the bolt cutters allegedly found in the defendant’s car can properly be admitted to show that they could have made the cuts in the metal grate, chain link, and chain link fence.  The issue before this Court is whether there is a reliable foundation for the prosecution expert’s conclusion that these particular bolt cutters, and no others, were the source of the cuts.  See Ramirez v. State, 810 So.2d 836, 851-52 (Fla. 2001) (“Ramirez III”) (“We hold that while the knife that was recovered in Ramirez’s constructive possession may be admitted as conventional evidence of guilt, testimony based on [the prosecution expert’s] knife mark identification procedure … is … unreliable and inadmissible.”).

 

B. The Expert’s Testimony. The government’s firearms and toolmark expert[5], Joseph J. Masson, used the bolt cutters allegedly found in the defendant’s car, introduced as Government Exhibit 31 (“G.E. 31”) and referred to in Mr. Masson’s Laboratory Report (Government Exhibit 15 (“G.E. 15”)) as Exhibit 19, to make test cuts in lead.  A comparison microscope was then used to compare the test toolmarks in the lead with the evidence toolmarks found on the metal grate, chain link, and pieces of chain link fence, respectively referred to in the Laboratory Report as Exhibits 3, 4, and 5.  Mr. Masson testified that “I make my identification on similarities, not dissimilarities.”  Tr. 41.  On the basis of his microscopic comparison of the similarities between the test and evidence toolmarks, Mr. Masson concluded that the G.E. 31 boltcutters were the pair, to the exclusion of all others in the world, that cut some of the ends of the grate in Exhibit 3, the two pieces of chain link in Exhibit 4, and  “a number of representative samples of the pieces of chain link fence” in Exhibit 5.  Tr.54, 57, 59, 60, 63-64; G.E. 15,  pp.1-2.

 

According to Mr. Masson, the class characteristics of the G.E. 31 bolt cutters were “slight[ly] dissimilar” to those of the bolt cutters identified as G.E. 33 and referred to in the Laboratory Report as Exhibit 15.  Tr. 48.  Mr. Masson took test cuts from the G.E.33 bolt cutter, and eliminated that boltcutter as the source of the cuts “by testing it the same way [he] tested” the G.E. 31 bolt cutter.  Tr. 49, 58.  There were “[s]ufficient microscopic matching striations to identify it [the G.E. 31 bolt cutter] and to eliminate the other one.”  Tr. 60.[6]

 

The Daubert-Kumho standard requires that “each stage of the expert’s testimony be reliable.”  Heller v. Shaw Industries, Inc., 167 F.3rd 146, 155 (3rd Cir. 1999).  See also In re Paoli R.R. Yard PCB Litig., 35 F.3d 717, 745 (3rd Cir. 1994) (stating that “any step that renders the analysis unreliable … renders the expert’s testimony inadmissible.”).  Mr. Masson’s testimony should be excluded because of the unreliability of the procedures he employed for (1) making test toolmarks, (2) eliminating the G. E. 33 bolt cutters as the source of the evidence toolmarks, and (3) concluding that the similarities between the test and evidence toolmarks were so great that the G.E. 31 bolt cutters were the unique source of the evidence toolmarks, to the exclusion of all other bolt cutters in the world.   

 

C. The Invalid Comparison of Toolmarks in Lead and in Harder Materials.  In making firearms identifications, Mr. Masson compares test and evidence toolmarks on the same type of ammunition component (e.g., test fired and evidence bullets of the same caliber and make). Tr. 26.  By contrast, his toolmark identifications are based on comparisons between test toolmarks and evidence toolmarks that have been made in different media.  Tr. 30.  Despite admitting that he is “not a metallurgist [and does] not know the consistency of metal and how it’s made and all” (Tr. 28),  Mr. Masson testified that the rationale for the use of different media in toolmark (though not  firearms) identification is that lead is more impressionable than the harder material in which evidence toolmarks are found.  “Lead is a softer material and it leaves the tool marks from the blades, or screw driver or chisel, and it picks them up more distinctly, where a harder material would – might look a little differently.”   Tr. 30.

 

This testimony implies that there will be differences in the toolmarks that the same tool leaves in lead and in harder material.  Accordingly, it was incumbent on Mr. Masson to explain why comparing test toolmarks in lead with evidence toolmarks in harder material was not equivalent to comparing apples and oranges.  See General Elec. Co. v. Joiner, 522 U.S. 136, 146 (1997) (explaining that an expert’s testimony may fail to pass the Daubert test if “there is simply too great an analytical gap between the data and the opinion proffered”).  Since Mr. Masson did nothing to allay the doubts about the validity of comparing toolmarks in different materials that the firearms and toolmarks literature as well as common sense  raise (consider, for example, the differences between the marks a knife makes in butter and in steak), the Daubert-Kumho standard precludes the admission of the conclusions that he based on such comparisons.  See C. Champod, D. Baldwin, F. Taroni, and J.S. Buckleton, Firearms and Toolmark Identification: The Bayesian Approach, 35(3) AFTE J. 307, 314 (Summer 2003) (stating that because lead rod “is far too good a medium,” it was a serious error for a study (Shirley J. Butcher & P.D. Pugh, A Study of Marks Made by Bolt Cutters, 15  J. Forens. Sci. Soc. 115 (1975)) to use data about bolt cutter marks in lead rod to draw conclusions about “the more common marks [that bolt cutters make] in hardened steel”); J. Hall, Consecutive Cuts by Bolt Cutters and Their Effect on Identification, 24(3 ) AFTE J. 260 (July 1992) (successive marks that individual boltcutters cut into lead were more similar to each other than successive marks that the same bolt cutters cut into shackles comprised of harder materials).

 

D. The Questionable Procedure for Excluding the G.E. 33 Bolt Cutters.  Mr. Masson’s testimony that the G.E.33 and G.E.31 bolt cutters had different class characteristics conflicts with his testimony that he made test cuts with both sets of bolt cutters and used microscopic comparisons to exclude G.E.33, but identify G.E. 31 as the unique source of the evidence toolmarks.  Tr. 48-49, 58, 60.  A tool can be the source of an evidence toolmark only if the class characteristics of the tool and the evidence toolmark match.  See, e.g., Alfred Biasotti & John Murdock, The Scientific Basis of Firearms and Toolmark Identification (“The Scientific Basis”) in 3 DAVID L. FAIGMAN ET AL., MODERN SCIENTIFIC EVIDENCE 502 (2002); Bruce Moran, Firearms Examiner Expert Witness Testimony, 32(3) AFTE J. 231, 237-39 (Summer 2000).  Hence, unless they first find that the class characteristics of a suspect tool and an evidence toolmark agree, firearms and toolmark examiners neither make test toolmarks with the suspect tool nor compare such test marks with evidence marks under a comparison microscope.  See Biasotti & Murdock, supra; Moran, supra, at 239 (“The firearms examiner relies on the evaluation of these [microscopic, individualized] markings to distinguish a barrel as having fired a bullet to the exclusion of all other barrels with the same rifling class characteristics.” (emphasis added)); Jerry Miller, An Introduction to the Forensic Examination of Toolmarks, 33(3) AFTE J. 233, 241 (Summer 2001) (stating that class characteristics “can be used … to eliminate a tool from having been used.”).

 

Mr. Masson’s identification of G.E.31 as the source of the evidence toolmarks was justified only if its class characteristics matched those of the evidence toolmarks.  If, as Mr. Masson testified, the class characteristics of the G.E. 33 and G.E.31 bolt cutters differed, G.E. 33 should have been excluded on the basis of class characteristics alone.  In accord with the basic principles of toolmark and firearms identifications, Mr. Masson had no reason to make test cuts and microscopic comparisons with both bolt cutters unless, contrary to his testimony, the class characteristics of both were identical and matched the evidence toolmarks.  Since, as the Advisory Committee has explained, the Daubert-Kumho standard requires that expert testimony be well-reasoned, Mr. Masson’s self-contradictory account of his procedure for excluding the G.E. 33 bolt cutters precludes the admission of his testimony.  Advisory Committee’s Note to  Amendment to Fed. R.Evid. 702 (2000).

 

E. Mr. Masson’s Failure to Base His Identity Conclusions on Objective Criteria. The preceding doubts about Mr. Masson’s reasoning pale beside the more fundamental issue of whether he had a reliable basis for concluding that the similarities between the test and evidence toolmarks were so great that the G.E. 31 bolt cutters must be the source of the evidence toolmarks, to the exclusion of all other bolt cutters in the world.  In his testimony, Mr. Masson refused to articulate any objective criteria for how many or what kinds of striae must match in order to determine that two toolmarks must have been made by the same tool.  He testified that his subjective judgment was his sole basis for concluding that the resemblances between the test and evidence toolmarks were so great that the G.E. 31 bolt cutters, to the exclusion of all others in the world, must have made the cuts in the grate, chain link, and pieces of chain link fence.  Tr. 68.   The basis for his identifications was “pattern recognition, training and experience that comes up with this.”  Tr. 91.

 

Like Mr. Masson, many, though by no means all, toolmark examiners do not rely on any objective criteria as to how many and what kinds of matches between striae are necessary to justify identity conclusions.  Instead, they make purely subjective identity determinations, and claim that their identifications are correct because of their experience and training.  While acknowledging that “it is something of a stereotype to visualize the distinguished, greying individual on the stand saying, ‘my opinion is based on my many years of experience in the field,’” prominent forensic scientists Christophe Champod and Ian W. Evett deplore this practice on the ground that it conflicts with the basic scientific value of transparency.  A Probabilistic Approach to Fingerprint Identification, 51 J. Forens. Identification 101, 106-107 (2001).  “[A]s a matter of principle, … the scientist should, as far as possible, support his/her opinion by reference to logical reasoning and an established corpus of scientific knowledge.”  Id.

 

As Champod and Evett recognize, the value of transparency is also implicit in the Daubert-Kumho reliability inquiry.  Id.  To avoid the well-known danger that juries will be awed by expert testimony, Federal Rule of Evidence 702 conditions admission on a trial court’s determination that such testimony “will assist the trier of fact to understand the evidence or determine a fact in issue. ”  See also Daubert v. Merrell Dow Pharmaceuticals, 509 U.S. at 595 (“Judge Weinstein has explained: ‘Expert evidence can be both powerful and quite misleading because of the difficulty in evaluating it.’” (citation omitted)); Adina Schwartz, A“Dogma of Empiricism” Revisited: Daubert v. Merrell Dow Pharmaceuticals, Inc. and the Need to Resurrect the Philosophical Insight of Frye v. United States," 10 Harv. J. L. & Tech. 149, 196-98 (1997) (“A‘Dogma of Empiricism’) (explaining the relations between the Frye and Daubert standards and the fear that jurors will be awed by scientific expert testimony).  By insisting that his identity conclusions were based solely on his own subjective judgments, Mr. Masson in effect refused to explain how he knew that the resemblances between the test toolmarks made with the G.E.31 bolt cutters and the toolmarks cut into the grate, chain link fence and chain link were so great that no other bolt cutter in the world could possibly have made the cuts.  Since the value of transparency is basic to science and to the Daubert inquiry, Mr. Masson’s testimony should be excluded on the ground that it is obscure rather than transparent.  See Champod & Evett, supra, at 107.  The proffered testimony will do nothing to help the jury understand whether the G.E. 31 bolt cutters are or are not the only possible source of the cuts on the grate, chain link fence and chain link.

 

A preference for transparency over obscurity is also implicit in the distinction that Daubert-Kumho draws between the reliability of an expert’s testimony and the expert’s personal qualifications. By refusing to articulate any criteria for when the resemblances between toolmarks are so great that they must have been made by the same tool, Mr. Masson implied that the jury should accept his identification of G.E.31 because his experience and training make him capable of correctly (if ineffably) judging when the resemblances between toolmarks are sufficient to justify an identification. The Advisory Committee has recognized that the Daubert-Kumho inquiry would be wrongly reduced into an inquiry into experts’ qualifications if experts could thus substitute invocations of their experience for explanations of the basis for their conclusions.  “If the witness is relying solely or primarily on experience, then the witness must explain how that experience leads to the conclusion reached, why that experience is a sufficient basis for the opinion …. The trial court’s gatekeeping function requires more than simply ‘taking the expert’s word for it’. “ Advisory Committee’s Note to Amendment to Fed. R.Evid. 702 (2000) (citation omitted).  In accord with this, Her Honor stated, during the hearing, that even though she had no doubts as to Mr. Masson’s personal qualifications, this did not dispose of the question of whether firearms and toolmark identification is a reliable discipline.  Tr. 98.

 

See also Joiner, 522 U.S. at 146 (reasoning that “nothing in ... Daubert ... requires a district court to admit opinion evidence which is connected to existing data only by the ipse dixit of the expert”); Daubert v. Merrell Dow Pharmaceuticals, Inc., 43 F.3d 1311, 1316 (9th Cir. 1995) (“Daubert II)  (reasoning that the point of the Daubert standard is lost if “an expert’s self-serving assertion that his conclusions were ‘derived by the scientific method’ [is] deemed conclusive”); Ambrosini v. Labarraque, 101 F.3d 129, 143 (D.C. Cir. 1996) (Henderson, J., dissenting), cert. denied, 520 U.S. 1205 (1997) (warning that “if such conclusory statements [as the expert's statement that he employed “the traditional methodology of experts in the field”] must be accepted at face value, ... the Daubert standard becomes meaningless").  Ignoring this body of law, the government’s direct examination of Mr. Masson during the hearing and the brief that the government filed before the hearing both dwelt on Mr. Masson’s credentials, but made only the slightest attempt to explain the scientific principles underlying firearms and toolmark identification.

 

The Florida Supreme Court’s recent decision in Ramirez III, supra, also argues against admitting the identification testimony in this case.  In Ramirez III, as in this case, the toolmarks at issue were striated toolmarks.  Similarly to Mr. Masson, the prosecution experts in Ramirez III testified that a comparison of the striae on the cast of Ramirez’s knife with the striae on the cast of the victim’s cartilage enabled them to identify the knife as the murder weapon, to the exclusion of all others, even though they employed no objective criteria for how many or what kinds of striae must match in order to establish identity.  The Ramirez III experts averred that an individual toolmark examiner’s subjective judgment, gained through experience and training, suffices for determining whether toolmarks are so similar that they must  have come from the same tool. 

 

As defense counsel explained during the hearing, the relevance of Ramirez III to this Court’s reliability inquiry is not diminished by the fact that Florida is a Frye state. While ostensibly adhering to Frye in excluding the toolmark experts’ testimony (810 So.2d at 843 & 843 n.8), the Ramirez III Court in fact used the factors specifically listed in Daubert as surrogates for the Frye general acceptance test.  810 So.2d at 849-51.  See also, The Judicial Response to Firearms and Toolmark Identification Expert Evidence, in 3 FAIGMAN ET AL., supra, at 489 n.30 (2002); David W. Barnes, General Acceptance Versus Scientific Soundness, 31 Fla. St. U. L. Rev. 303, 305 (Winter 2004) (stating that in Ramirez III, the Florida Supreme Court simultaneously rejected the federal rule and elaborated an approach remarkably similar to that rule, requiring judges to evaluate the scientific basis for novel expert testimony”).

 

As will be argued in Section IV below, amicus agrees with the Florida Supreme Court that the specific Daubert factors preclude the admission of toolmark identifications that are not based on objective criteria, such as those proffered by Mr. Masson and by the prosecution experts in Ramirez III.  Before considering the specific Daubert factors, however, amicus wishes to inform this Court of the basic principles and pitfalls of toolmark and firearms identification and of the cogent arguments that prominent toolmark and firearms examiners have advanced, since the 1930’s, to show that objective, statistically-based identification criteria are needed.  It is hoped that this will aid this Court to reach a scientifically informed decision on the admissibility of the toolmark expert’s testimony in this case.  See Joiner, 522 U.S. at 147-48 (Breyer, J., concurring) (stating that Daubert’s gatekeeping “requirement will sometimes ask judges to make subtle and sophisticated determinations about scientific methodology and its relation to the conclusions an expert witness seeks to offer--particularly when a case arises in an area where the science itself is tentative or uncertain.…”).

 

II.AN UNDERSTANDING OF THE BASIC CHARACTERISTICS OF TOOLMARKS IMPLIES THAT OBJECTIVE, STATISTICALLY-BASED IDENTIFICATION CRITERIA ARE NEEDED.

A. The Distinctions between Class, Subclass and Individual Characteristics of Toolmarks.  The distinctions between class, subclass and individual characteristics of toolmarks must be grasped in order to understand the problems with the conclusions of identity that firearms and toolmark examiners draw.  As a result of the distinctive designed features of different types of tools, different types of toolmarks, or, in other words, toolmarks with different class characteristics, result when different types of tools are used or applied to materials.  For example, the intentionally manufactured differences between steak and butter knife blades result in different types of marks when the two types of knives are inserted in butter.  See Biasotti &  Murdock, The Scientific Basis, supra, at 496 n.3.

 

Mr. Masson failed to inform this Court that manufacturing processes may also produce subclasses within a type of tool.  The tools in each subclass share similarities in appearance, size, or surface finish that are not shared by other tools of the same type. The toolmarks produced by tools of a particular subclass have similarities, or subclass characteristics, that distinguish them from the toolmarks produced by other tools of the type.  For example, a study found subclass characteristics among the toolmarks produced by the ram of one, but not another, brand of desk stapler. Id. at 500-501; John E. Murdock, The Individuality of Toolmarks Produced by Desk Staplers (“Desk Staplers), 6 AFTE J. 23 (1974).

 

The forensic science discipline of toolmark identification is premised on the existence of individual characteristics that, by contrast to class and subclass characteristics, are unique to the toolmarks each individual tool produces.  The individual characteristics of a toolmark correspond to random imperfections or irregularities on tool surfaces produced by the manufacturing process and/or subsequent use, corrosion or damage.  If the same class characteristics are found on evidence and test toolmarks (for example, the same rifling impressions on a bullet test fired by a gun barrel and an evidence bullet recovered from a crime scene), a toolmark examiner uses a comparison microscope to compare the toolmarks’ individual characteristics (for example, microscopic striations within rifling impressions). The object is to determine whether the individual characteristics are so similar that one and the same tool (for example, a particular gun barrel) must have produced both the test and the evidence toolmarks.

 

Contrary to Mr. Masson’s testimony that he had never “seen or heard of two different tools creating the same exact tool markings” (Tr. 33, 38-39, 73), a substantial literature argues that only some manufacturing processes make each tool capable of producing toolmarks with individual characteristics from the moment of manufacture.  Other manufacturing processes result in batches of tools that are so similar that their toolmarks have the same subclass characteristics, and may or may not also have individual characteristics.  See, e.g., Murdock, Desk Staplers, supra; Ronald G. Nichols, Firearm and  Toolmark Identification Criteria, 42 J. Forensic Sci. 466, 470 (1997) (“Nichols I”); Biasotti & Murdock, The Scientific Basis,  supra, at 500-501; Alfred A. Biasotti & John Murdock, “Criteria for Identification” or “State of the Art” of Firearm and Toolmark Identification, 16  AFTE J. 16, 17 (1984) (“Criteria for Identification”); Bruce Moran, A Report on the AFTE Theory of Identification and Range of Conclusions for Tool Mark Identification and Resulting Approaches to Casework, 34 (2) AFTE J. 227, 227-28  (Spring 2002) (A Report); Kristen A. Tomasetti, Analysis of the Essential Aspects of Striated Toolmark Examination and the Methods for Identification, 34(3) AFTE J. 289, 295 (Summer 2002).

 

According to the scientific literature, the tools in the uniform batches produce toolmarks with individual characteristics only as they are used, damaged, or corroded.  Biasotti & Murdock, The Scientific Basis, supra, at 500-501.  Even if a tool is capable of  producing unique toolmarks from the time of manufacture, the individual characteristics of its toolmarks will change as the tool is used or as damage or corrosion occur. 1 PAUL C. GIANNELLI & EDWARD J. MWINKELRIED, SCIENTIFIC EVIDENCE 633 (3rd ed. 1999) (citing Flynn, Tool Mark Identification, 2 J. Forensic Sci. 95, 102 (1957) for the proposition that “the characteristics of a tool will change with use”).

 

B. Central Pitfalls in Toolmark Identification. The foregoing analysis of the distinctions between class, subclass and individual characteristics of toolmarks makes it possible to appreciate three central pitfalls that stand in the way of reliably identifying one and only one tool as the source of a particular toolmark(s).  Due to their recognition of these pitfalls, many prominent toolmark examiners do not share the prosecution expert’s complacency about relying on subjective judgments to make unique identifications.

 

1. The Individual Characteristics of Toolmarks Are Combinations of Non-Unique Marks. A first barrier in the way of reliably identifying the source of an evidence toolmark is that, just as, notwithstanding their uniqueness, parts of each individual’s fingerprints and nuclear DNA are the same as other people’s, the individual characteristics of toolmarks are comprised of non-unique marks.  In 1935, Gunther and Gunther used the analogy of oak leaves to illustrate this point.  “No two oak leaves may be exactly alike, but the exact counterpart of a small area of leaf can probably be found in other leaves .... It is probably true that no two firearms with the same class characteristics will produce the same signature, but it is likewise true that each element of a firearm’s signature may be found in the signatures of other firearms .... An individual peculiarity of a firearm can, therefore, be established by elements of identity which form a combination the coexistence of which is highly improbable in the signature of other firearms with the same class characteristics.”  JACK D. GUNTHER & C.O. GUNTHER, THE IDENTIFICATION OF FIREARMS 90-91 (1935). See also Biasotti and Murdock, “Criteria for Identification”, supra, at 17 (using the passage from GUNTHER & GUNTHER to explain why toolmark examiners “have come to expect to find small isolated areas of corresponding striae agreement when comparing toolmarks known to have been produced by different working surfaces.”) .

 

Empirical work has shown that a substantial percentage of the striae comprising the individual characteristic of one toolmark can match the striae comprising the individual characteristic of another toolmark.[7]  In assessing the expert testimony in this case, this Court should note that up to 29% of the striae were found to match on toolmarks that were made by different bolt cutters of the Record brand 930 centre-cut type.  Butcher & Pugh, supra, at 122-23.  Similarly, in 1942, Burd and Kirk found that up to 25% of the striae matched in comparisons of marks known to have been made by different tools.  D.Q. Burd & P.L. Kirk, Tool Marks— Factors Involved in Their Comparison and Use as Evidence, 32 J. Crim. L., Criminology & Police Sci. 679 (1942).  See also Eliot Springer, Toolmark Examinations, 40 J. Forensic Sci. 964, 965 (1995) (describing Burd and Kirk’s “important article”); Nichols I, supra, at 470 (describing Burd and Kirk’s “often cited study”).

 

Likewise, in 1955, Biasotti found that 15 to 20% of the striae on bullets fired from different .38 Special Smith & Wesson revolvers (i.e., known non-matches) matched.  A.A. Biasotti, Bullet Comparison, A Study of Fired Bullets Statistically Analyzed ( Unpublished Thesis, University of California, Berkeley 1955);  A.A. Biasotti, A Statistical Study of the Individual Characteristics of Fired Bullets, 4 J. Forensic Sci. 34 (1959) (summary of his 1955 thesis).  See also Springer, supra, at 965 (describing the Biasotti study); Biasotti, Principles of Evidence Evaluation, supra, at 431 (explaining that his study’s results “corresponded well” to Burd & Kirk’s results).  In 1997, Nichols claimed that “[t]o date, [the Biasotti study] stands as the most exhaustive statistical empirical study ever published.”  Nichols I, supra, at 467.

 

In the 1990’s, the development of the BATF’s computerized comparison system, IBIS (Integrated Ballistics Information System), enabled investigators to compare the tool marks on vast numbers of bullets and cartridge cases.  See Schwartz, Ballistics, supra, at 7-9.  Studies using the IBIS data base support the claim that there can be significant numbers and percentages of matching striae on pairs of bullets fired from different guns.  See, e.g., Jerry Miller & Michael McLean, Criteria for Identification of Toolmarks, 30 (1) AFTE J. 15 (Winter 1998); Jerry Miller, Criteria for Identification of Toolmarks Part II,  32 (2) AFTE J. 116 (2000) (“Miller II”).

 

Although Mr. Masson was unable to recall the paper on direct examination (Tr. 23), in 1997, he published a study that strongly suggests that toolmarks made by different tools may be much more similar to each other than firearms and toolmark examiners currently believe they can be.  The study found that as the IBIS data base grew for guns of a particular caliber, increasing similarities were discovered in the individual characteristics of the toolmarks on ammunition components known to have been fired by different guns of that caliber.  According to Mr. Masson, “a number of known non-matched testfires from different firearms ... were coming up near the top of the candidate list [for matches with the toolmarks on evidence ammunition components.]  When retrieving these known non-matches on the comparison screen, there were numerous two dimensional similarities.  When using a comparison microscope, these similarities are still present and it is difficult to eliminate comparisons even though we know they are from different firearms.” Joseph J. Masson, Confidence Level Variations in Firearms Identification through Computerized Technology, 29 (1) AFTE J. 42 (1997) (DD-1).

 

Masson urged examiners to avoid misidentifications by using the IBIS database to increase their knowledge of the possible extent of the similarities between non-matching toolmarks.  “In the past, best examples of known non-matched agreements were collected from casework and thus, surfaced sporadically.  Firearms examiners should take advantage of this current expanded database to fully familiarize themselves with the extent of similarities found in many non-identifications in order to hone their criteria for striae identification.”  Id. at 43.  However, as Mr. Masson acknowledged on cross-examination, there are no databases for bolt cutter toolmarks or toolmarks made by any other type of tool besides firearms.  Tr. 67-68.  By implying that computerized databases were needed to reveal the extensiveness of the possible similarities between toolmarks made by different firearms, Mr. Masson’s study strongly suggests that, in the absence of computerized databases, toolmark examiners are likely to underestimate the extent of the possible similarities between toolmarks made by different tools of the same type, including bolt cutters.  Accordingly, Mr. Masson and other toolmark examiners risk making misidentifications when they base their identity conclusions on their subjective sense, unaided by the use of computerized databases, of how similar two toolmarks can be and yet come from different tools of the same type. 

 

2. The Danger of Confusing Subclass with Individual Characteristics.  A difference between fingerprint, nuclear DNA and mitochondrial DNA (mtDNA) identification, on the one hand, and firearms and toolmark identification, on the other, makes firearms and toolmark identification especially difficult.  On the one hand, each individual’s fingerprints are unique. With the sole exception of identical twins, the same is true of each individual’s nuclear DNA sequence.  Since, by contrast with the nuclear DNA that one inherits both parents, mtDNA is, in theory, inherited only from one’s mother, even the most remote maternal cousins should share the same mtDNA.  See Adina Schwartz, Book Review, 3 Punishment and Society 446, 447 (2001) (reviewing BARRY SCHECK, PETER NEUFELD AND JIM DWYER, ACTUAL INNOCENCE: FIVE DAYS TO EXECUTION AND OTHER DISPATCHES FROM THE WRONGLY CONVICTED (2000)) (“Book Review”). 

 

By contrast to these well-established generalizations about the uniqueness of fingerprints and nuclear DNA and the sharing of mtDNA sequences in people descended from the same maternal line, we saw above that only some manufacturing processes produce individual tools whose surfaces are differentiated enough to produce toolmarks with different individual characteristics.  Other manufacturing processes result in batches of tools so similar that their toolmarks have the same subclass characteristics, and may or may not also have individual characteristics.  Compounding the absence of any straightforward rule, wear and tear on some tools will cause the subclass characteristics on their toolmarks to be completely replaced by individual characteristics.  In other tools, subclass characteristics may persist alongside individual characteristics.  See Schwartz, Ballistics, supra, at 3.

 

By failing even to recognize the existence of subclass characteristics, Mr. Masson ignored a major difficulty that bedevils firearms and toolmark identification, and is not analogous to any difficulty scientists face in making fingerprint, nuclear DNA or mtDNA identifications.  A particular tool may be wrongly identified as the source of an evidence toolmark if an examiner wrongly concludes that subclass characteristics on test and evidence toolmark(s) are individual characteristics. 

 

This confusion is possible because there are no rules for distinguishing subclass from individual characteristics.  To avoid confusing subclass characteristic shared by more than one tool with individual characteristics unique to one and only one tool, examiners can only rely on their personal familiarity with types of forming and finishing processes and their reflections in toolmarks. In accord with this, Biasotti and Murdock explain that “some machining processes are capable of reproducing remarkably similar surface characteristics (i.e., gross contour and/or fine striae, etc.) on the working surfaces of many consecutively produced  tools which if not recognized and properly evaluated could lead to a false identification.” “Criteria for Identification”,supra, at 17.  They go on to warn that “[t]he examiner must ... be familiar with the various forming and finishing processes in order to distinguish those ... surface characteristics that are truly individual from those surface characteristics that may characterize more than one tool.”  Id.  See also Nichols I, supra, at 470-72.

 

In ignoring the possibility of misidentifications resulting from the  confusion of subclass with individual characteristics, Mr. Masson failed to inform this Court of a danger that is real, not theoretical.  In the 1980’s, this type of confusion was discovered to have in fact resulted in misidentifications of striated toolmarks.  In response, members of the Association of Firearms and Toolmark  Examiners (“AFTE”) formed the Criteria for Identification Committee.  The term “subclass characteristics” was coined in 1989 and incorporated in the AFTE glossary definitions in 1992.  See Moran, A Report, supra, at 227-28 (relating this history and warning that “[c]aution should be exercised in distinguishing subclass characteristics from individual characteristics”).  See also Jerry Miller, An Examination of Two Consecutively Rifled Barrels and a Review of the Literature, 32(3) AFTE J. 259, 260 (Summer 2000) (describing a study that found that the matching characteristics on the toolmarks of bullets fired from ten consecutively manufactured gang broach barrels were so great that a false identification would have resulted if the matching characteristics had been incorrectly identified as individual, rather than subclass characteristics).

 

It is particularly relevant to this case that prominent firearms and toolmark examiner John Murdock has claimed that a bolt cutter’s tendency to produce toolmarks with individual or subclass characteristics can be expected to vary with the extent to which it has been used.  John E. Murdock, Some Suggested Court Questions to Test Criteria for Identification Qualifications, 24(1) AFTE J. 69, 73 (January 1992)  (“Court Questions”).  Although Hall, supra, at 264, states, to the contrary, that “no two bolt cutters manufactured will leave identical marks,” his statement is not supported by the two studies he cites.

 

One of the studies – Billy Hornsby, MCC Bolt Cutters, 21(3) AFTE J. 508 (1989) – is a one-page report on a visit to a bolt cutter manufacturing facility, the Matsuzaka Casting Company (MCC), in Tsu-shi, Mie-Ken, Japan.  Hornsby reported that, “Since I was unable to obtain consecutively made bolt cutters, I obtained tests from three pairs of bolt cutters that were made during the same production run.  Intercomparisons of these tests disclosed individual characteristics so different that there would be no possibility of misidentification.”  The only other basis for Hall’s denial that any bolt cutters can produce toolmarks with subclass, but not individual, characteristics, is the study by Butcher and Pugh, criticized in Section I C above.  

 

In assessing the support that these studies provide for Hall’s conclusion, this Court should take account of both the extremely small sample in the Hornsby study (only three bolt cutters) and Butcher and Pugh’s problematic procedure of using data on toolmarks in lead to reach conclusions about the marks bolt cutters will make in harder material.  See Champod, Baldwin, Taroni, and Buckleton, supra, at 314.  These problems pale beside the fact that Hornsby only studied MCC bolt cutters and Butcher and Pugh only studied “Record” brand cutters of the 930 centre-cut type manufactured by C& J. Hampton, Ltd. in Sheffield, England. 

 

Murdock’s well-regarded study of desk staplers shows that Hall’s extrapolation is far too broad.  Hornsby’s and Butcher and Pugh’s studies of particular brands and types of bolt cutters cannot provide an adequate foundation for Hall’s conclusion that no two bolt cutters of any brand or type will ever produce exactly the same toolmarks.  Murdock found that when newly manufactured, the surfaces of the rams of one brand of desk stapler were so similar that they produced toolmarks with subclass, but not individual, toolmarks.  By contrast, the process used to manufacture another brand of desk stapler resulted in rams whose unique working surfaces made them capable of leaving toolmarks with unique characteristics.  Murdock, supra; Biasotti & Murdock, The Scientific Basis, supra, at 501 (describing Murdock’s study); CRIME LABORATORY MANAGEMENT FORUM 177-78 (R.H. Fox & F.H. Wynbrandt eds. 1976) (favorably evaluating the Murdock study). 

 

Mr. Masson’s testimony about different types of bolt cutters makes the results of Murdock’s desk stapler study particularly relevant to this case.  According to Mr. Masson, the G.E.31 and 33 bolt cutters are “not your standard Stanley bolt cutter, which would be a high class, well made bolt cutter – if you got  a cheaper bolt cutter, they’re not made to take much abuse and they will – the blades will dull and chip almost immediately after using them.”  Tr. 35.  Together with Murdock’s finding of subclass characteristics in the toolmarks produced by some, but not other, brands of stapler rams, Mr. Masson’s acknowledgment of major differences among types of bolt cutters shows that even if some bolt cutters produce toolmarks with individual, but not subclass characteristics, this need not be true of all bolt cutters.  Indeed, the possibility of subclass characteristics would appear to be particularly high in this case because inexpensive manufacturing methods were presumably used to create the “cheaper bolt cutter[s]” in this case.  An article introduced by the government for the March 12 hearing warns that misidentifications of both firearms and other tools may result if subclass characteristics are confused with individual characteristics, and goes on to explain: “As tool manufacturers minimize the steps necessary to produce tools in an effort to become more efficient and economical, the possibility for tools produced with similar characteristics increases.” Stephanie J. Eckerman, A Study of Consecutively Manufactured Chisels, 34 (4) AFTE J. 379 (Fall 2002). 

 

For all these reasons, Mr. Masson’s identification of the G.E. 31 bolt cutter was unreliable because he failed to rule out the possibility of subclass characteristics.  To avoid misidentifications, “[t]he examiner must …, for any specific tool, be able to: (1) recognize the presence of subclass characteristics and (2) properly evaluate the significance of subclass toolmarks when they are present by determining whether or not they are influencing the nature of any evidence.”  Biasotti & Murdock, The Scientific Basis, supra, at 501.

 

3. The Individual Characteristics of Toolmarks Change with Time.  Firearms and toolmark identification is also difficult because, by contrast to an individual’s fingerprints and nuclear DNA, the individual characteristics of the marks made by a particular tool change with time.  Studies of the statistical foundations of fingerprint and mtDNA identification bear out the claim that temporal changes in the characteristics of individual tools are a major barrier to developing  a reliable method of firearms and toolmark identification.  According to prominent statistician Stephen Stigler, “it was only in 1890-95 with the work of Francis Galton that the use of fingerprints acquired a scientific basis.”  Stephen Stigler, Galton and Identification by Fingerprints, 140 Genetics 857 (1995).  Stigler praises Galton for recognizing that proving that “[a]n individual’s prints [are] persistent over time” was a crucial step in establishing that a single individual can be reliably identified as the source of a particular fingerprint(s).  Id. (italics omitted).  See also Paul C. Giannelli, The DNA Story: An Alternative View (Book Review), 88 J. Crim. L. & Criminology 380, 395 (1998) (stating that the fact that fingerprints do not change over time is one reason why “[f]ingerprints are considered the most reliable type of scientific evidence”).  By contrast, heteroplasmy, the existence of more than one mtDNA type in a single human being over the course of his or her lifetime, is a major problem in mtDNA identification.  See Schwartz, Book Review, supra, at 447. 

 

As seen above in Section A, changes in toolmarks occur because the surfaces of a tool change as the tool is used, and/or as damage or corrosion occur.  Giannelli and Imwinkelried state that “if the barrel of the firearm has changed significantly, due to erosion or corrosion, a positive identification may be impossible.” Supra, at 613.  They conclude that toolmark identification “has the same limitations as firearms identification: ‘The characteristics of a tool will change with use.’” Id. at 633 (quoting Flynn, supra, at 102).  Similarly, Mr. Masson agreed that “each time a tool is used, the individual characteristics of that tool may be altered.”  Tr. 74.

 

Biasotti’s well-regarded statistical empirical study, discussed in subsection 1 above, reveals the significant problems that temporal changes in the surfaces of tools and their associated toolmarks create for toolmark identification.  Biasotti found that “bullets from the same gun (i.e., known matches) gave only 21 to 38% matching lines [i.e., striae].”  Biasotti & Murdock, “Criteria for Identification, supra, at 20.  As Springer stated, this result implies that even between toolmarks created by the same tool, “there is no such thing as a perfect match!”  Supra, at 965.  In a 1997 review of the toolmark and firearms literature, Nichols claimed that this surprising result had held up over time.  According to Nichols,  one of the results of Biasotti’s study, which is “not particularly news to us now,” is “[t]hat the average percentage of matching lines in jacketed bullets fired from the same gun was 21-24%.”  Nichols I, supra, at 467.  

 

Similarly, Hall’s 1992 study of bolt cutters was premised on the fact that bolt cutters produce toolmarks whose individual characteristics change over time.  Hall reasoned that: “It is known that the condition of the cutting edges of the bolt cutters will change over time during the use of the bolt cutters.  The question which arises, however, is how long is it before the individual characteristics have changed sufficiently to prevent a positive identification?”  Hall, supra, at 261. 

 

In assessing Mr. Masson’s reasoning, it is crucial to note that his testimony that the G.E. 31 and 33 bolt cutters were “cheaper bolt cutters … whose blades will dull and chip almost immediately after using them” (Tr. 35), implies that the toolmarks the bolt cutters could produce were likely to have changed between the time the evidence and test toolmarks were made.  In turn, this implies that to justify the identification of G.E. 31 as the unique source of the evidence toolmarks, Mr. Masson needed to establish that the differences between the test and evidence toolmarks were small enough to be explained by changes in G.E. 31.  The identification would be mistaken if the differences between the test and evidence toolmarks were instead so great that they could only have been made by two different bolt cutters.  By testifying that “I make my identification on similarities, not dissimilarities” (Tr. 41), Mr. Masson implied that he had not even considered this issue.  Therefore, this Court should exclude his testimony on the ground that “there is simply too great an analytical gap between the data [about the test and evidence toolmarks in this case] and the opinion proffered” identifying G.E.31 as the unique source of the evidence toolmarks.  See Joiner, 522 U.S. at 146.

 

C. The Statistical Nature of Identity Determinations.  In sum, substantial resemblances between toolmarks produced by different tools may result from shared subclass characteristics or from similarities between the striae comprising the individual characteristics of the toolmarks.  At the same time, because the surfaces of tools change over time, even toolmarks made by the same tool do not perfectly match.  Springer, supra, at 965.  The similarities between toolmarks made by different tools and the differences between toolmarks made by the same tool imply that a statistical question must be answered to determine whether a particular tool was the source of the toolmark on an object recovered from a crime scene. What is the likelihood that the toolmarks made by a randomly selected tool of a given type would do as good a job as the toolmarks made by the suspected tool at matching the characteristics of the questioned toolmark?  See Biasotti, The Principles of Evidence Evaluation, supra, at 429-30; GUNTHER & GUNTHER, supra, at 90-91 (“An individual peculiarity of a firearm can, therefore, be established by elements of identity which form a combination the coexistence of which is highly improbable in the signature of other firearms with the same class characteristics.”); Biasotti & Murdock, “Criteria for Identification”, supra, at 21 (arguing that “conclusions of identity in firearms and toolmarks ... mean that there is no credible possibility that a gun barrel or tool other than the one identified was used to produce the toolmark in question”).  Cf. Brim v. State, 695 So.2d 268, 269 (Fla. 1997) (“Brim I”), 695 So.2d 268, 269 (Fla. 1997) (explaining that “the results obtained through this first step in the DNA testing process simply indicate that two DNA samples look the same.  A second statistical step is needed to give significance to a match.”); Murray v. State, 692 So.2d157, 162 (Fla. 1997) (“The fact that a match is found in the first step of the DNA testing process may be ‘meaningless’ without qualitative or quantitative estimates demonstrating the significance of the match.”).

 

At the hearing in this case, the Court was not informed of a major division among toolmark and firearms examiners.  Mr. Masson misleadingly suggested that all examiners resemble him in relying solely on subjective judgments of when the similarities between the striae of test and evidence toolmarks are so great that they must have been made by the same tool.  To the contrary, significant numbers of examiners base their identity conclusions on the objective CMS (consecutive matching striae) criterion that Biasotti and Murdock propounded in 1997 and developed on the basis of statistical empirical studies.  See, e.g., Biasotti & Murdock, The Scientific Basis, supra, at 511-16; Champod, Baldwin, Taroni, & Buckleton, supra, at 310-11.

 

The crucial difference, here, is that when toolmark examiners – such as the expert in this case – insist on relying on inarticulable, mind’s eye criteria to reach conclusions of identity, they evade the task of providing the requisite statistical and empirical foundations for identity claims.  In following a subjective approach, examiners implicitly admit that “we lack necessary statistical data which would permit us to formulate precise criteria for distinguishing between identity and nonidentity with a reasonable degree of certainty.”  Biasotti, The Principles of Evidence Evaluation, supra, at 430.  By contrast, as even critics admit, the CMS approach is a serious attempt to solve the problem of defining the amounts and types of resemblance between striae necessary to create a vanishingly small probability that the same tool did not produce the evidence and test toolmarks in a case.  See, e.g., Champod, Baldwin, Taroni, & Buckleton, supra, at 311-15; Stephen G. Bunch, Ph.D., Consecutive Matching Striation Criteria: A General Critique, 45 (5) J. Forens. Sci. 955, 957-62 (2000).

 

This contrast between the CMS and the traditional, subjective approach is obscured by the fact that, in accord with the AFTE Range of Conclusions, all firearms and toolmark examiners in the United States testify to only four conclusions.  As defense counsel pointed out in the hearing in this case, the only options for examiners are (1) identifying or (2) eliminating a particular tool as the source of the mark(s) found on an object, (3) concluding that the comparison of test and evidence toolmarks is inconclusive, or (4) concluding that the evidence toolmark is unsuitable for comparison.  Tr. 78-79.  See, e.g., Moran, A Report, supra, at 228-29; Biasotti & Murdock, The Scientific Basis, supra, at 506-507.  This range of conclusions is misleading because it is never possible to know, as the expert in this case claims he does, that a given tool is the source of a particular toolmark, to the exclusion of all other tools in the world.  See Champod, Baldwin, Taroni, & Buckleton, supra, at 310-11.

 

Although firearms and toolmark examiners who follow the CMS approach also testify in accord with the AFTE Range of Conclusions, the CMS approach contrasts with the subjective approach in being interpretable in a way that is compatible with the statistical nature of identity claims.  The proponents of CMS are best viewed as having used statistical empirical studies to formulate a cut-off point at which the likelihood that another tool of the same type will do as good a job at matching the evidence toolmark as the suspect tool is so exceedingly small that, for all practical purposes, the suspect tool can be identified as the unique source of the evidence mark.  Champod, Baldwin, Taroni, & Buckleton, supra, at 311-12; Moran, A Report, supra, at 233 (stating that “CMS is a probability model used for toolmark identification”).

 

The CMS criterion is based on Biasotti’s classic study of .38 Special Smith & Wesson revolvers, discussed in sections B(1) and (3) above.  Follow up studies used IBIS (the BATF’s computerized database for toolmarks on bullets and cartridge cases) to compare numbers of matching striae on ammunition components known to have been fired by the same gun and by different guns of  the same type.  See, e.g., Biasotti, A Statistical Study of the Individual Characteristics of Fired Bullets, supra; Miller & McLean, supra; Miller II, supra.  Other studies made similar comparisons of numbers of matching striae on toolmarks made by chisels and other tools besides firearms.  See, e.g., Ronald G. Nichols, Consecutive Matching Striations (CMS), 35(3) AFTE J. 298, 301-02 (Summer 2003); Biasotti & Murdock, The Scientific Basis, supra, at 514, 516-17, 516 n.56, 517 n.57.

 

The CMS criterion is based on these studies’ findings of significant differences between the numbers of consecutive matching striae, but not the percentages or total numbers of matching striae, on pairs of toolmarks known to have been made by the same and different tools.  See, e.g., Moran, A Report, supra, at 229-232; Biasotti & Murdock, The Scientific Basis, supra, at 516 & 516 n.56.  The criterion, which is intended to be applied to all firearms and all other types of tools, defines the threshold for identifying a particular tool as the source of a three-dimensional toolmark as a match between evidence and test toolmarks of one group of six consecutive matching striae or two different groups of at least three consecutive matching striae in the same relative position.  The threshold for two-dimensional toolmarks is one group of eight consecutive matching striae or two groups of at least five consecutive matching striae in the same relative position.  See Biasotti & Murdock, The Scientific Basis, supra, at  516.  However, since CMS requires examiners to compare numbers of striae on individual characteristics of toolmarks, misidentifications will result if, in applying the criterion, examiners mistakenly assume that subclass characteristics on test and evidence toolmarks are individual characteristics.  Id.  See also Miller II, supra, at 127.

 

This Court should be aware that increasing numbers of firearms and toolmark examiners rely on the CMS criterion to determine when the match between evidence and test toolmarks is so great that they must have been made by the same tool.  See, e.g., Moran, A Report, supra, at 229-32; Nichols, Consecutive Matching Striae, supra; Biasotti & Murdock, The Scientific Basis, supra, at 517 (stating that “approximately 300 members of the … AFTE … voluntarily participated in a four-hour workshop [on CMS] at their 1999 annual training seminar.”).  At the same time, CMS is not a definitive solution to the problems of firearms and toolmark identification.  Among the unresolved scientific issues is whether the CMS criterion can reliably lead to accurate identifications when different examiners sometimes find different numbers of striae on the same toolmark.  Another issue, of particular relevance here, is whether the CMS criterion, which was originally based on studies of .38 Special Smith & Wesson revolvers, can be appropriately applied to all types of tools.  See, e.g., Miller II, supra, at 116, 130 (stating that while the CMS criteria for two-dimensional and three-dimensional toolmarks were respectively met by only 2% and 0% of pairs of .25 ACP bullets known to have been fired from the same Raven pistol, 0% and 8% of pairs of .380 ACP bullets known to have been fired from the same Lorcin pistol, and less than 2% and 6.5% of pairs of 9mm bullets known to have been fired from the same Stallard pistol, 5% and 14.8% of pairs of .38 special  bullets known to have been fired from the same Smith & Wesson revolver respectively met the two- and three-dimensional CMS criteria); Ronald G. Nichols, Firearm and  Toolmark Identification Criteria: A Review of the Literature, Part II, 48 (2) J. Forensic Sci. 318 (March 2003) (“Nichols II”); Champod, Baldwin, Taroni, & Buckleton, supra, at 313-15; Bunch, supra, at 955, 957-62; Moran, Comments and Clarification of Responses from a Member of the AFTE 2001 Criteria for Identification of Toolmarks Discussion Panel, 35(1) AFTE J. 55 (Winter 2003).

 

This Court need not take sides on the scientific disputes about the CMS criterion in order to decide on the reliability and admissibility of Mr. Masson’s testimony.  However, the admissibility decision in this case should be grounded in an awareness that CMS is a serious attempt to develop the necessary statistical and empirical foundations for identity claims.  See, e.g., Miller & McLean, supra (“The consecutive marks criteria is a solid foundation from which to conduct more research.”).  By contrast, Mr. Masson and other adherents of the traditional, subjective approach evade the scientific questions when they insist that their ineffable, mind’s eye judgments are sufficient to determine when the resemblance between toolmarks are so great that they must have come from the same tool.  This Court should not countenance this evasion of necessary scientific work. 

 

III.  SUBJECTIVE IDENTITY DETERMINATIONS ARE NOT ERROR- FREE.

Nor should this Court be moved by the plea that “the benefit of the doubt should go to the traditional [subjective] methods” because “with methods such as professional certification and rigorous validation/proficiency testing, the traditional, subjective examination regime can strengthen its scientific grounding.”  Bunch, supra, at 962.  Even assuming that the Daubert-Kumho standard could be satisfied by a method that evades the basic scientific requirement of giving reasons for conclusions, nothing resembling rigorous proficiency testing has been done.  See, e.g., Champod, Baldwin, Taroni, & Buckleton, supra, at 315; Biasotti & Murdock, The Scientific Basis, supra, at 508-510.  In addition, results from the inadequate proficiency testing that has been done, together with theoretical arguments and the experience of prominent toolmark examiners, belie the claim that the traditional, subjective procedure results in so few mistakes that objective criteria are simply not needed. As Champod and his colleagues explain: “What would be required [to show that there is no need for objective identification criteria]?  First the examiners must often declare a match when the two marks have been made by the same firearm or tool.  Next they must NEVER do so when the two marks have been made by differing firearms.  How many proficiency tests are required to show that examiners NEVER declare a match when the marks are from differing tools?  The standard statistical answer is that an infinite number of tests are required.  Examination of CTS proficiency results would suggest that we are not quite there yet.”  Champod, Baldwin, Taroni, & Buckleton, supra, at 315.

 

A. Theorical Arguments for the Possibility of False Positives.  Above, we saw that the scientific literature argues that three central difficulties in identifying a tool as the unique source of a toolmark(s) make it necessary to develop objective, statistically-based identification criteria.  Two of the difficulties – (i) the danger of confusing subclass with individual characteristics of toolmarks and (ii) the fact that non-unique marks combine to form the individual characteristics of toolmarks –may cause examiners to overestimate the significance of matching portions of toolmarks.  Consequently, a comparison of striae may lead an examiner to identify a tool as the source of an evidence as well as a test toolmark, even though a mark made by a different tool would do at least as good a job at matching the evidence toolmark. 

 

A danger of false positive identifications also arises from the fact that the individual characteristics of toolmarks change with time. Hence, differences between an evidence toolmark and test toolmark do not necessarily rule out the suspect tool as the source of the evidence mark.  It follows that it will sometimes be correct for examiners to attribute differences between evidence and test toolmarks to changes in the surfaces of the suspect tool between the time the evidence and test toolmarks were made.  At other times, such an attribution will be wrong; the evidence and test toolmarks differ because the source of the evidence mark was a tool similar, but not identical, to the suspect tool.