Thursday, April 17, 2008

dna fingerprinting

DNA profiling is a molecular testing method used to uniquely identify people and other organisms. In many ways, it is similar to blood typing and fingerprinting, and it is sometimes called "DNA fingerprinting." Because every organism's DNA is unique, DNA can be examined to identify people who might be related to each other, to compare suspected criminals to DNA left at the scene of a crime, or even to identify certain strains of disease-causing bacteria.

Blood Typing and the Abo Groupings

Before the development of the molecular biology tools that make DNA testing possible, investigators identified people through blood typing. This method hails from 1900, when Karl Landsteiner first discovered that people inherited different blood types. Several decades later, researchers determined that the basis for those blood types was a set of proteins on the surface of red blood cells.

The main proteins on the surface of red blood cells used in blood typing come in two varieties: A and B. Every person inherits from their parents either the genes for the A protein, the B protein, both, or neither. Someone who inherits the A gene from one parent and neither gene from the other parent has blood type A. If a person inherits both genes, they are AB. A person who inherits neither is type O. Another protein group found on red blood cells is referred to collectively as the Rh factor. People either have the Rh factor or they do not, regardless of which of the A and B genes they inherited. To type a person's blood, antibodies against these various proteins (A, B, and Rh) are mixed with a blood sample. If the proteins are present, the blood cells will stick together and the sample will get cloudy.

Blood typing can be used to exclude the possibility that a blood sample came from a particular person, if the person's type does not match that of the sample. However, it cannot be used to claim that any particular person is the source of the sample, because there are so few blood types, and they are shared by so many people. About 45 percent of people in the United States are type O, and another 40 percent are type A. If four people were physically present at the scene of a murder, and the candlestick found nearby had type O blood spilled on it, chances are good that two of those individuals could be found guilty of the crime, based solely on the blood typing evidence. Most court cases, however, rely on more evidence than just blood or DNA typing, such as whose fingerprints are also found on the candlestick (see Statistics and the Prosecutor's Fallacy, below).

Dna Polymorphism Offers High Resolution

DNA is the molecule that contains all the genetic information of an individual. One person's DNA is made up of about three billion building blocks known as nucleotides or bases. Every organism in the world has a unique DNA sequence except for identical twins. Although identical twins accrue changes as they develop, they generally do not accumulate enough genetic differences for DNA typing to be useful. Portions of the DNA, called genes, encode proteins within the sequence of bases. Genes are separated by long stretches of noncoding DNA. Because these sequences do not have to code for functional proteins, they are free to accumulate more differences over time, and thus provide more variation than genes. Thus, they are more useful than gene sequences in distinguishing individuals.

Polymorphisms are differences between individuals that occur in DNA sequences which occupy the same locus in the chromosome. An individual will have only one sequence at a particular polymorphic locus in each chromosome, but if the population bears several to dozens of different possible sequences at the site in question, then the locus is considered "highly variable" within the population. DNA profiling determines which polymorphisms a person has at a small number of these highly variable loci. Because of this, DNA profiling can provide high resolution in distinguishing different individuals. The chances of one person having the same DNA profile as another are typically much less than the chances of winning a lottery.

Str Analysis

The technology of DNA profiling has advanced from its beginnings in the 1980s. Today, DNA profiling primarily examines "short tandem repeats," or STRs. STRs are repetitive DNA elements between two and six bases long that are repeated in tandem, like GATAGATAGATAGATA. These repeat sequences often exist in a chromosomal region called heterochromatin, a largely unused portion of DNA found in each chromosome.

Different STR sequences (also called genetic markers) occur at different loci. While their positions are fixed, the number of repeated units varies within the population, from four to forty depending on the STR. Therefore, one genetic marker may have between four and forty different variations, and each variation is referred to as an allele of that marker. Each person has at most two alleles of each marker, one inherited from each parent. The two alleles for a particular marker may be identical, if both parents had the same form.

The United States Federal Bureau of Investigation has designated thirteen of these sequences to use with STR analysis. These thirteen markers are all four-base repeats, and were chosen because multiple alleles of each exist throughout the population. The FBI system, called CODIS (Combined DNA Indexing System), has become the standard DNA profiling system in use today.

STR analysis begins with sample collection. Because of the often small samples involved and the legal weight that will be given to them, it is vital that the sample not be contaminated by other DNA. This may occur for instance if skin cells from the person collecting the sample are mixed with skin cells under the fingernails of a victim. Once the sample is collected, it must be kept secure at all times, to prevent any possibility of tampering.

In the laboratory, the DNA is isolated and purified, and then multiple copies of it are made using the polymerase chain reaction (PCR). Technicians can specify which DNA sequences to multiply, so that only the thirteen core STR sequences will be amplified (multiple copies produced), leaving the rest of the billions of irrelevant bases alone.

In order to specify which DNA to amplify, "primers" are used. The primers are DNA sequences that recognize a nonrepeated sequence in the genetic markers, and which are used by the DNA polymerase that does the actual copying. After the DNA has been copied, the new DNA molecules are separated by size, by gel electrophoresis. A fluorescent molecule previously attached to each primer will send a light signal to the machine that measures the length of the molecule, or allele.

Vntr Analysis

An early form of DNA profiling, rarely used today, is based on VNTRs, or "variable number of tandem repeats." VNTRs requires extensive sample processing: The DNA is chopped up with restriction enzymes, separated by size, and probes are applied to the fragmented DNA to view only the relevant DNA pieces. In the DNA of two different individuals, different spacing between two cut sites for the restriction enzymes gives a unique pattern of DNA size fragments, called "restriction fragment length polymorphisms," or RFLPs.

Making a Match

To understand how DNA profiling is used to identify a person, imagine a sample of blood collected at a crime scene that doesn't match the victim's blood, and is presumably from the unknown perpetrator. DNA from the blood is isolated and its set of STRs are analyzed. The results will be a list of the alleles found at each of the markers (for example, VWA-12, 13; TH01-6, 7, and so on), where the initial symbol is the abbreviation for the markers and the last two are the numbers of the alleles found in the sample for that marker. The full set of thirteen markers may or may not be analyzed in each case. When a suspect is identified, his or her DNA can be analyzed for these same markers. If the set of alleles are different, the investigators can be sure that the two DNAs came from different sources, and the suspect is not the source of the blood. Since the introduction of DNA profiling, an absence of matching DNA has been used to free dozens of wrongly convicted prisoners.

If the samples do match, the question becomes whether the blood is actually from the suspect, or from someone else with the same set of alleles. As with blood typing, this is a matter of statistics, and depends on how frequently each allele occurs in the population. This information has been tabulated and is kept on file in the FBI CODIS database. If two samples share a very rare allele, that increases the likelihood they came from the same source.

Matching multiple alleles increases the certainty they came from the same source. Since the thirteen STRs are inherited independently of each other, the likelihood that one person's DNA will include specific alleles of all thirteen STR sites is the product of the individual allele frequencies. For example, if each allele a person carries occurs in 25 percent of the population, then the probability that all thirteen alleles will occur in one individual is (0.25 × 0.25 × 0.25 × 0.25 × 0.25 × 0.25 × 0.25 × 0.25 × 0.25 × 0.25 × 0.25 × 0.25 × 0.25) or 1 in more than 67 million. This analysis can discriminate between millions of people, far better than is possible using the four blood groups. Since many alleles are even rarer than 25 percent, their presence in both samples further increases the probability that they came from the same source.

Statistics and the Prosecutor's Fallacy

Despite the persuasiveness of such figures, it is quite possible to misuse DNA evidence to incorrectly argue that an innocent suspect must be the perpetrator of the crime, or that a guilty suspect should go free. Both defense and prosecution attorneys can—accidentially or otherwise—misinterpret data to make a highly likely event seem improbable, or a highly unlikely event seem probable. Jurors can be confused because DNA testing reveals the probability that an innocent person's DNA profile matches the sample at the scene of the crime. Jurors must decide, however, what the probability is that a person is innocent, if his DNA matches that sample. The prosecutor's fallacy occurs when investigators focus on the existence of the match, rather than the possibility that the match could be a coincidence.

Let's assume the DNA profile found at the crime scene—and the matching DNA of the suspect—is expected to occur once in every million people. The correct statement of probability arising from these facts is, "If the suspect is innocent, there is a one-in-one-million chance of obtaining this DNA match." The fallacy is to reverse these clauses, and state, "If the DNA matches, there is a one in one million chance that the suspect is innocent." To understand the logical fallacy, imagine the statement, "If it's Tuesday, it must be a school day." The reverse is not true—there are other school days besides Tuesday.

Similarly, there are other ways of misusing statistics in DNA profiling. Let's assume the suspect in the above case is actually guilty. If the suspect hails from a city with a population of ten million, there are ten people in the city whose DNA matches the DNA at the crime scene. Therefore, his defense lawyers could argue there is a 90 percent chance that the suspect is innocent, because he is 1 out of 10 individuals with that same DNA profile. If the defense can convince the jury to ignore other incriminating evidence, such as the suspect's bloody glove left behind at the scene, then the attorney may introduce reasonable doubt. Only by considering DNA typing within the context of other evidence can the probability of a DNA match improve the integrity of the justice system.

Dna Profiling Comes of Age

Although DNA profiling was viewed with some skepticism when it first made its way into the courts, DNA typing is now used routinely, in and out of the courthouse. It is commonly used in rape and murder cases, where the assailant generally leaves behind some personal evidence such as hair, blood, or semen. In paternity tests, the child's DNA profile will be a combination of the profiles of both parents. DNA profiling has also been used to identify victims in disasters where large numbers of people died at once, such as in airplane crashes, large fires, or military conflicts.

DNA testing can also used in organisms other than humans. For instance, it has been used to type cattle in a cattle-stealing case. It can also be used to identify pathogenic strains of bacteria to track the outbreak of disease epidemics.

Bibliography

Bloom, Mark V., Greg A. Freyer, and David A. Micklos. Laboratory DNA Science: An Introduction to Recombinant DNA Techniques and Methods of Genome Analysis. Menlo Park, CA: Addison-Wesley, 1996.

Evert, Ian W., and Bruce S. Weir. Interpreting DNA Evidence: Statistical Genetics for Forensic Scientists. Sunderland, MA: Sinauer Associates, 1998.

Steward, Ian. "The Interrogator's Fallacy." Scientific American (September 1996): 172-175.

Internet Resources

"13 CODIS Core STR Loci with Chromosomal Positions." National Institute of Standards and Technology. http://www.cstl.nist.gov/biotech/strbase/images/codis.jpg.

The Biology Project. The University of Arizona. http://www.biology.arizona.edu/human_bio/activities/blackett2/gifs/sample2.gif.

FBI Core STR Markers.http://www.cstl.nist.gov/biotech/strbase/fbicore.htm.

The Innocence Project.http://www.innocenceproject.org/.

—Mary Beckman


No comments:

 
Cool Web Site Listings
Search Engine Optimization - AddMe