Lecture 16: Recombinant DNA, Cloning, & Editing

Flash and JavaScript are required for this feature.

Download the video from Internet Archive.

Description

In today's lecture, the focus shifts from pure genetics to molecular genetics, beginning with cloning, followed by polymerase chain reaction (PCR), and finally genome editing.

Instructor: Adam Martin

ADAM MARTIN: All right, so we're going to switch gears again today, and we're going to move off of kind of pure genetics and start to talk about molecular genetics. And I want to start with the concept of-- let's say you want to identify a piece of DNA, purify it, and propagate it so that you have it for future use. And so the process of doing this is known as cloning. And it's the process of, if you will, purifying and propagating a piece of DNA in an organism.

So sort of the goal for this lecture is for you to know how if you wanted to, let's say, identify a piece of DNA-- maybe you're interested in the piece of DNA that contains a given gene. How would you go about getting that DNA in a state that can be propagated sort of on and on? And how can you identify the piece of DNA you want?

And so one tool that we're going to use is we're going to use an organism bacteria as a tool. So I'll draw my sample bacteria cell here. Could be something like E. coli, just some bacterium. And you'll recall, when I talked about cells a few weeks ago, bacteria and prokaryotic cells have a chromosome called the nucleoid that's present inside. So that's the bacterial chromosome.

But bacteria can also have extra chromosomal pieces of DNA that are called plasmids that exist in their cytoplasm. These are plasmids. And these extra chromosomal pieces of DNA replicate independently of the chromosome. And they can persist in the bacterial cell and be passed on to the daughters of this bacterial cell as the bacterial cells divide.

So if we focus on an individual plasmid, what a plasmid would look like, often they have a cassette or a gene that confers antibiotic resistance. And that's often the reason that these bacteria are harboring these plasmids, because it gives them a sort of advantage if they're exposed to a certain antibiotic from a predator organism. So this would confer antibiotic resistance.

One common example is ampicillin resistance, which I'll abbreviate just amp with an R next to it. But these plasmids, for them to propagate from bacterial cell to bacterial cell, they also need to be able to undergo replication. So they also have what is known as an origin of replication, which is often abbreviated ori, which basically allows this plasmid to be replicated such that copies of the plasma are passed on to the subsequent generation of bacteria.

And we can take advantage of this plasmid system in bacteria, because we could take, let's say, a piece of foreign DNA-- and this foreign DNA could be of eukaryotic origin. We could take a piece of eukaryotic DNA and insert it inside of this plasmid. And we can basically use the plasma as a sort of platform or a vector to carry the piece of DNA that we might be interested in and to use the bacteria to replicate that DNA and pass it on to its descendants so that you essentially have a clone of bacteria, and you have a clone of this DNA in a given bacterial population.

So, again, this would have an origin and maybe an ampicillin resistance to it. So how would you determine whether or not bacteria have a plasmid in it? Can you think of an experiment you could do to determine whether the bacteria has this plasmid? Stephen?

AUDIENCE: You could add ampicillin, and it'll survive with the plasmid.

ADAM MARTIN: Exactly. So what Stephen suggested is that if he wanted to know whether or not this bacteria had the plasmid in it, he would add ampicillin to the media. And if the bacteria doesn't have the plasmid, it won't be able to grow. But if it has the plasma, it encodes this gene that confers ampicillin resistance, and it will thus be able to grow. So that's exactly right. So you're able to select for bacteria with a given plasmid by simply growing it on media that contains an antibiotic.

I now want to go through steps in cloning a piece of DNA. And we'll go through it sort of as a series of ordered steps so you can see how the process works. I'm going to start with a step to cut the DNA. After cutting the DNA, we'll then mix pieces together. Once we mix the pieces together, we'll do something known as a ligation, and I'll explain that to you in just a minute. And then, finally, we'll end with selecting the plasmids that have the piece of foreign DNA that we want. And this is known as recombinant DNA, because you've recombined a piece of DNA from one organism-- it could be a eukaryotic organism, like humans-- with a piece of DNA that's from a prokaryotic cell, a bacterium.

So we then have some sort of selection process. So we're going to go through this step-by-step. And we're going to start with cutting DNA, OK? So, cutting DNA. And it turns out, we've talked about-- what type of enzyme do you think would cut DNA? Just generally, not as specific as what's up on the slide. What type of enzyme would cleave DNA? Think about how enzymes are named.

AUDIENCE: DNase?

ADAM MARTIN: Yeah, so Stephen suggested DNase, which is a really good suggestion, right? So an enzyme that will cut DNA would be a DNase. Another word for that is a nuclease. It's some type of nuclease. And the type of nuclease we're going to talk about is going to be an endonuclease. We talked about exonucleases, which chew DNA from the ends in the context of DNA replication.

But what an endonuclease is, is it's a nuclease that's going to recognize a fragment of DNA and cleave it in the middle. So it doesn't require an end. It's going to chop it right in the middle. And there's a type of endonuclease, and these are called restriction endonucleases. They are nucleases that our natively present in a lot of different bacteria. And these restriction endonucleases essentially look through the DNA sequence, and they recognize a specific sequence of nucleotides and make a cut right at that sequence.

So I have a few examples to show you here. The first is this EcoR1 restriction endonuclease. EcoR1's from E. coli, and it recognizes the sequence GAATTC. So it recognizes this six-nucleotide . Sequence and it cleaves the double-stranded DNA on the top strand between the G and the A and on the bottom strand between the G and the A, OK? And you can see that the two cuts are staggered. So when this cut is made, it leaves the DNA with two ends, and they're sticky, because there's a five-prime overhang at each end.

So each end has this TTAA sequence. And these nucleotide bases can base pair with the complementary sequence. So this sequence could base pair with a sequence that has an end AATT. So these two ends that I've generated here could stick to each other, or there could be other ends that have the TTAA sequence that could stick to them.

So another example is this Kpn1 endonuclease. And this is from a different bacterium. But again, it cleaves the DNA on the two strands. And this time, it cleaves the top strand farther down the sequence. And that generates what's known as a three-prime overhang. But again, you have an overhang. So this is what is known as having a sticky end. Because again, these nucleotides are available to base pair with a complementary sequence.

The final type of restriction enzyme that I'll tell you about is EcoR5, which is a different enzyme from E. coli. And this generates a break, but this time, it cuts at the same position on both DNA strands. And that generates an end that's known as a blunt end, because there's no overhang, and there are no nucleotides that would sort of basically recognize a complementary sort of end like the sticky ends do, OK?

So these are several of many, many different types of restriction endonucleases that are present in a wide range of bacteria. So then, now that you have a tool that allows you to cut DNA, you could then cut DNA from two different sources. And I've outlined that here. The vector is what the plasmid DNA is commonly referred to. So we commonly refer to this prokaryotic part of the plasmid, the vector DNA, and the part that we're trying to insert that's the foreign DNA, the insert. That's just kind of the lingo in the field.

So here, I have a plasmid. It looks like a plasmid. It has an origin of replication. It has ampicillin resistance. And it has this EcoR1 site, which just means that this DNA sequence has a GAATTC, OK? So it's something that will be recognized by this restriction endonuclease. And then if you cut this enzyme with EcoR1, you start with the linear piece, right? So I, at 6:00 AM, started engineering this here.

So let's say I have my plasmid DNA, and I cut it at the EcoR1 site. Then I cleave it. If you cleave a circle, now you have a linear piece of DNA, OK? But it has sticky ends, right? And these sticky ends-- so pretend that this is a foreign piece of DNA. This is my eukaryotic DNA. Let's pretend it carries the gene elastin. And this has ends, too. And if they're EcoR1 ends, then they will be able to stick to the sticky ends of the plasmid.

And if you just get one sticking, now you have this piece of DNA which is two different fragments in tandem. But it's going to be moving around in space in the cytoplasm. And at some point, it might be recognized and stick to the other EcoR1 end. And then you have now a circular piece of DNA again, but now your circular piece of DNA has this piece of foreign DNA that's present inside the vector, which is the sort of poster tape here.

So I just wanted to show you that. So you can kind of-- when you're doing molecular biology, you have to imagine the sort of end stick sticking to each other and how they're going to sort of wrap around and connect for the final product. OK, so let's say you get DNA, and your DNA could be eukaryotic DNA from, let's say, humans or flies or whatever your favorite eukaryote is. And in the genome of that organism, there will be many restriction sites. But if you chop it up, you will get various fragments that have sticky ends for EcoR1 on both sides.

And then if you mixed the vector and the insert together, you have some probability of getting that insert to be incorporated into the vector. And then once this is present and ligated together, you can then put this into bacteria, and it will be replicated. So we focused on cutting, but if you mix these together, like I showed you at the tape, you're going to have sticky ends that come together and stick together.

And you'll eventually get a situation where you have your insert-- so you have your insert here that might have your gene of interest and your vector DNA here. But when these things initially stick together, you don't have a single molecule where everything is covalently attached, right? You just have these base pair interactions between the two overhangs as they stick to each other through base pair interactions.

So if we think about what's going on right here, you have, initially, if we're thinking about EcoR1, a sequence that is-- oh, sorry, this should be a C. This is the nucleotide sequence, but when it's sticking together, the top strand will have a free five-prime phosphate on this adenosine, and there will be no covalent bond between this adenosine and this guanosine. So that's what the top strand would look like. The bottom strand would look like this, where there are covalent bonds along the DNA backbone. Sorry. Had a mutation there.

And this is incorporated in a broader sequence. And this bottom strand will have a free five-prime phosphate here and a free three-prime hydroxyl here, but there's no covalent bond here. There's no covalent bond here, right? So, at this stage, you don't have a single piece of DNA. You have two pieces of DNA that are interacting with each other through base pair interactions. So, eventually, you want it to be a single piece of DNA. And so you have to perform a step that is known as-- sorry, my phosphate got in the way.

But you want to perform what's known as a ligation, where you take something that's just sticking together through nucleotide base pair interactions and you add a type of enzyme, which is called DNA ligase, to catalyze the formation of covalent bonds here and here. So DNA ligase is an enzyme that if you have a three five-prime phosphate here and a free three-prime hydroxyl, it'll catalyze the formation of a phosphodiester bond between these two bases and the DNA.

So this DNA ligase forms a phosphodiester bond. And you go from having no bond there to having a bond. So then you eventually would get this sequence now, where you have covalent bonds between each of the base pairs. And what you see here is you've recreated the EcoR1 site. So when you get two EcoR1 sticky ends sort of recognizing each other and sticking to each other and then you ligate them together, you recreate that nuclease site so that you can cleave it again with the EcoR1 enzyme.

So now, moving on. I'll move on right here. So the last step is that once we have pieces of DNA with this insert-- and let's say we're trying to find a piece of DNA from a eukaryotic organism. We might start with an animal. We have to extract its chromosomal DNA, digest it with a restriction enzyme, and then we digest the vector with the same restriction enzyme. And then we're going to make-- we're going to randomly insert these pieces of DNA into different vectors such that each bacteria who gets one of these plasmids will be replicating a different piece of DNA that's of eukaryotic origin.

And this is what is known as a DNA library. So this is making a DNA library. And a DNA library is essentially a collection of different pieces of DNA that are from some source, OK? But different bacterial clones will be replicating a different piece of that DNA. So you can see the challenge now is to find the needle in the haystack, right? You're trying to find that one piece of DNA which is the one you're interested in.

And I'll talk about several strategies that you can use to find the piece of DNA that you're interested in. I'll focus on selection. But first, I just want to differentiate between two different types of ways you could search for a piece of DNA. You could do a screen. And this is similar to what we talked about on Monday, where you look through a whole population of individuals, and you look for a given phenotype.

So in the case of flies, we talked about how Morgan's lab was looking for differences in eye color. And that was a screen, because they looked through a ton of normal flies to find the one they want. Another strategy would be to do what's called a selection, where you kill off everything that's not what you want by making the organism grow in some sort of selective condition. And then you only allow the organisms to grow that are the ones that you want. So this is called a selection.

And I'm going to illustrate several examples of selections, just so that you get an idea of how this works. So the first example I'll give is antibiotic resistance. And as Stephen so kindly pointed out earlier in the lecture, the way we can select for bacteria that have taken up this plasmid is to select the bacteria that grow in the presence of the antibiotic. So let's say you had a population of bacteria and let's say this started out being sensitive to an antibiotic. You could transform them with DNA from a strain that's resistant.

And maybe that resistant strain has a plasmid that has a gene that confers antibiotic resistance, in which case, if it's taken up by this sensitive bacteria, if you then grow it on a plate that has the antibiotic, you might get a colony or a clone of cells that has taken up the piece of DNA that you're interested in-- in this case, the piece of DNA that confers antibiotic resistance. Everyone see how that works? So you're selecting only the cells to grow here that have taken up this antibiotic resistance gene.

I'm going to use another example now from yeast, and it involves functional complementation. And I'm going to start with something that involves the biosynthesis of an essential amino acid. And then I'm going to go to a more interesting case, which is a case that involves an experiment that involved the identification of the master regulator of cell division in humans. But we'll start with just amino acid biosynthesis.

And there are mutants of yeast known as auxotrophs. And these are mutant yeast, or mutant microorganisms, that fail to produce an essential nutrient. So an auxotroph is a mutant that fails to make a nutrient that's essential. So fails to make a nutrient. And the opposite of an auxotroph is called a prototroph. A prototroph is essentially a normal-functioning microorganism that's able to produce all of the essential nutrients that it needs in order to grow and survive, OK? So this produces all nutrients.

And so let's say you had an auxotroph for leucine. So you had a strain that if you didn't provide leucine to it, it would fail to grow. So we'll start with a leucine auxotroph. And let's say you want to identify the gene that's defective in this leucine auxotroph.

You'd perform a similar strategy to this one, where you'd have your auxotroph that you're transforming, so your auxotroph down here. And you would transform that strain with DNA from what organism? If you're trying to identify the functional gene, what organism would you use to produce the DNA you're going to transform into that organism?

AUDIENCE: Prototroph?

ADAM MARTIN: Javier's exactly right. You'd use the prototroph, right? So in this case, you would use DNA from the prototroph, because the prototroph has a functional copy of that gene. You know it does, because it's able to grow without adding leucine. And then you could take the auxotroph mutants that's transformed with DNA from the prototroph, and you played it on media. And what should be a property of the media? Should leucine be present or absent? Carmen?

AUDIENCE: Absent.

ADAM MARTIN: It should be absent, exactly right. So you'd look on plates that lack leucine or minus for leucine. And you'd select for colonies that now are all of a sudden able to grow leucine. So you've restored the function of leucine biosynthesis in this clone, and you've made it into a prototroph again.

OK, this is what's known as functional complementation, because you're taking a cell that is defective in some function, and you're complementing it. You're complementing or rescuing the phenotype, OK? Now, even as a former yeast geneticist, I don't find amino acid biosynthesis and functional complementation in the context of leucine all that exciting. So I want to present one last example that involves an experiment that is going to involve the yeast cell cycle mutants.

And I'm going to tell you about the experiment that led to the cloning of the master regulator of cell division in humans. And it involves a yeast mutant, and specifically, a yeast cell cycle mutant. And these yeast cell cycle mutants are what are known as conditional mutants. They are isolated as conditional mutants, meaning that these mutants are able to grow under certain conditions, but not others. And specifically, the condition they used is temperature, so they're temperature-sensitive mutants.

The yeast cells can grow at 25 degrees, but not at 37 degrees. So this is known as a temperature-sensitive mutant, where you can propagate the mutant at one temperature, but then you can see if you raise the temperature, then it stops growing. And so you can see the mutant phenotype, because normal wild type functional yeast can grow at both temperatures. So this is a special type of mutant.

And I'm going to tell you about an experiment done by Paul Nurse, who's an excellent yeast geneticist. And what he did was he used these yeast cell cycle mutants to identify the human gene for what's now known as cyclin-dependent kinase, or CDK for short. This is the master regulator of cell division in organisms ranging from yeast all the way up to humans, OK? But he used yeast as a model system to identify this gene.

And the process was he took yeast cells-- and Paul Nurse worked on fission yeast cells, which are rod-shaped cells. And he identified yeast mutants. Yeast mutants. And he had a mutant in the CDK gene of yeast. He didn't really know it at the time. But the yeast CDK mutant-- what he knew was that this mutant was critically involved in the cell cycle in numerous types of yeast. So he knew this is an important gene.

And what he wanted to do was to identify if humans had an equivalent gene that could function in the same way. So if you just have this CDK mutant and do nothing to it, it will not grow at 37 degrees, OK? But what he did was to take a DNA library-- similar to what I showed you before, where you just chop up DNA from an organism. In this case, he's using a human DNA library. And he used a particular type of library, but I'm going to skip over that for now and come back to it later.

So he used a human DNA library. That's just a collection of pieces of DNA from a human source, OK? So he's taking human DNA, putting it into a yeast plasmid, and transforming yeast with that human DNA. And he's looking for a piece of DNA that's able to complement the CDK mutant, meaning the yeast cells would then be able to grow at the non-permissive temperature of 37 degrees.

So he then looks for, on a plate, colonies of yeast that are growing at the non-permissive temperature of 37 degrees. So if you didn't do anything with this mutant, if you didn't transform in the DNA, nothing would grow. But he identified pieces of human DNA that rescued the phenotype of this mutant, OK? And so these are yeast that have the human gene for CDK, and they now grow.

And this is a functional complementation experiment, because you're rescuing the growth of this yeast now not with a yeast gene, but with a human gene. And this human CDK gene is so conserved across eukaryotes that it's able to still function in a yeast cell, which is pretty amazing. So this just outlines the experiment here. At 25 degrees, these yeast mutants can grow and form colonies. And at that temperature, you can transform the yeast with different pieces of human DNA.

Most of the human DNA is not going to be what you want. You're looking for the needle in the haystack. So most of these are not going to grow at 37 degree. But you're looking for this guy here that gets the human CDK, and that restores growth to this mutant strain. So voila, you get a colony of cells that are growing. And boom, Paul Nurse wins a Nobel Prize and the rest of the yeast field, as well, or a number of people who are working on cell cycle mutants. This is one of the experiments that led to the 2001 Nobel Prize for a bunch of yeast cell cycle geneticists.

All right, so I've told you about how to find the needle in the haystack. And this was more common when we didn't know the genome sequence of an organism. But now I want to tell you how knowing the genome sequence of an organism would allow you to replicate and amplify a piece of DNA in vitro. So I'm going to tell you about an approach known as Polymerase Chain Reaction, or PCR.

And what PCR is, is it's an in vitro method. So it's an in vitro approach to essentially amplify DNA. And so let's say you have a piece of DNA-- it could be a piece of DNA in the genome-- and you know the sequence of this DNA. And it has base pairs between the two strands. So, normally, for DNA replication to occur, what do you need? What needs to happen here? Can a polymerase get in now? No? Why not? Miles?

AUDIENCE: The DNA's going to-- because they'll try and [INAUDIBLE] away [INAUDIBLE] from each other, so you have to [INAUDIBLE].

ADAM MARTIN: Yeah, you have to unwind it, right? So you have to denature it first. So if you do nature it, now you have two single-stranded pieces of DNA, right? Now what would a polymerase need to replicate that? Yeah, Jeremy?

AUDIENCE: A prime.

ADAM MARTIN: A primer. Exactly, right? And if you know the sequence, you can have a company synthesize a primer that's the exact sequence here and base pairs here. And I'll just draw the five-prime end of the primer right there. And now this primer has a free three-prime hydroxyl here. And if you added a polymerase, it would synthesize this bottom strand here. So this is known as the forward primer.

And on the other strand, you can design a primer that's complementary to these bases here. Again, the five-primer end is out. This would be known as the reverse primer. And then you could have the DNA polymerase synthesize the opposite strand.

All right, so the step here will be to first denature. So the first step would be to melt or denature the DNA, double-stranded DNA. So you denature the double-stranded DNA. This is commonly done above 90 degrees Celsius. And then the next step is once you have these single-stranded pieces of DNA, you can act you can have a primer present that anneals to the opposite strands. So you can have primer annealing.

And this is commonly done between 50 and 60 degrees Celsius. You have to cool it down so that the primer can now base pair, such that not everything is denatured. So you have to cool it down for these primers to recognize their cognate sequence and base pair with it. And then once you have the primer annealed to the template, then you can add DNA polymerase to synthesize a new strand. So DNA polymerize for new strand synthesis. And this is commonly done at around 70 degrees Celsius.

And then you can repeat this process over and over again. And at each step, you're going to double the amount of DNA that you have between these two primers. So let me just-- this is just a figure illustrating this. It's on the handout and online. Basically, you have your original double-stranded piece of DNA. You denature it and allow the primers to anneal. New strand synthesis.

Then you take these new pieces of double-stranded DNA, denature them. The primers anneal to those new strands, and now you get new strands. And you just keep doing this cycle over and over again, and you essentially amplify the piece of DNA that's between the two primer sequences. So this is often used in forensics, because you can have very little DNA, and just by adding primers, you can really amplify the number of pieces of DNA you have between these two fragments. So you go from having very little DNA to a lot of DNA.

OK, any questions about PCR? I'm going to move on to something that-- all right. I've really been focused on discovery up to this point. But I know that a number of you are engineers, and you probably want to engineer something. And so I've had to-- I'm going to tell you about a field that is moving so rapidly, I'm going to probably have to totally revamp my lecture for next year, OK? And I'm going to tell you about genome editing.

So the last part of this story, genome or DNA editing. And I'm going to tell you about a specific type of system called CRISPR-Cas9, which has been in the news a lot, and there's a lot of excitement about this approach in the context of editing the human genome and possibly curing genetic diseases. Who here has heard of CRISPR-Cas9? OK, good. That's good. Our media is doing its job.

So who knows what it is? OK, some of us know what it is. I just want to just give you a very superficial overview of what it is and why it's important. And I'm going to keep coming back to it during the course of the semester, because I think it raises a lot of ethical questions, and especially in the context of stem cells. I need you to know the foundation before we get into the really good stuff.

So, let's see. So we're going to engineer something. So we're going to talk about repairing DNA. And if we want to edit the genome, the way this is most often done is by making a double-stranded break, OK? So if you make a double-stranded break in a piece of DNA, it can be repaired one of two ways. One is by non-homologous end joining, where the two pieces of DNA are basically just shoved back together again. And this results, often, in mutations.

So if you're trying to fix something, unless you're just trying to break it, that's probably not what you want. But an alternative approach to DNA repair that organisms have is something called homology-directed repair. In this case, you can break a piece of DNA and add a piece of DNA that has a different sequence, but with homology near where the double-stranded break is. And in that case, you can replace the original sequence with what you provide as donor DNA.

So it gives you an ability to essentially change the DNA sequence at a given locus if you're able to cleave the DNA at a specific locus. So, first, I want to start with just a thought experiment, right? You're all thinking, OK, we need to cleave double-stranded DNA. And boy, I just gave you a perfect tool for that. I gave you all these restriction endonucleases, right? So what's the problem with those?

Well, let's think about the human genome. The human genome is 3 billion base pairs. And an EcoR1 site looks like this-- GAATTC. So it's six nucleotides long. And if you think of just a random sequence of 3 billion base pairs, you would get this sequence randomly one out of every four to the sixth times. So that's going to be one every 4,096 times.

So if you get this in random DNA 1 every roughly 4,000 times, if you use it to cleave the human genome, it's going to cleave hundreds of thousands of places in the human genome. So we need much more specificity if we want to select, let's say, a given gene that has a disease-causing allele and try to fix it. Because if we use a restriction endonuclease, we just chop up the whole genome, and that would be bad.

So specificity is the name of the game here. This is not specific, and we need a tool that's more specific. And that tool is going to be CRISPR-Cas9. And what CRISPR-Cas9 is, is it's essentially an RNA-guided endonuclease. So it's RNA guided, and it's an endonuclease.

Restriction enzymes, right, they have nothing to do with RNA. They don't use RNA to recognize the nucleotide sequence. It's just a protein, and the protein recognizes the nucleotide sequence. In CRISPR-Cas9, you have an endonuclease, which is Cas9. Let's bump this up.

So the endonuclease is the-- the Cas9 is the protein. That's the endonuclease. But its selection of a target depends on an RNA molecule that it's bound to, OK? So the specificity comes, at least in part, from what's known as a guide RNA, or single guide RNA. That's what's most often used in genome editing.

So this guide RNA basically allows this enzyme to find a specific sequence. And the guide RNA is 20 nucleotides, or looks for homology for a 20-nucleotide base pair sequence. So you can see, already, we're doing way better than the six base pair recognition motif. We have 20 nucleotides. And there are other components of the system which increase the specificity.

Then you have your Cas9 in blue, which is the endonuclease, your RNA, the guide RNA, in black, and the template here is in gray. And what you see is this RNA sort of exhibiting complementarity to this target sequence. And only if there's complementarity between the RNA and the target will this endonuclease get activated and cleave at this site. So the RNA is sort of serving like a guide dog for this endonuclease to guide it to a certain location to cleave.

So the idea, then, is if you want to edit the genome-- and why people are so excited about this these days is you now have a system that might allow you to generate a double-stranded break in one specific place in the genome. And if you can do that, then if you provide donor DNA that maybe has a different sequence-- if you consider a disease allele, right? Let's say you know there's a gene that when there's a certain allele causes an inherited form of a disease.

You could then take donor DNA from an unaffected individual and take cells from the affected individual and cut the locus that's problematic and get a repair of the defective allele using a normal allele of the gene. And that would essentially rescue the function of that gene if it were then reintroduced into the patient, OK? Do you see sort of roughly how this works?

So this is a very sort of broad and general sort of conceptual framework for how this might happen. Let's say you have an individual with a blood disorder-- let's say sickle cell anemia or beta thalassemia. Those are inherited blood disorders, which lead to anemia. You could remove cells, and what might be the best are the stem cells from a patient. And you could then take those stem cells and use CRISPR-Cas9 in vitro in cell culture to edit that individual's cells to repair the genetic defect.

And you could then reintroduce those to the patient, where if they're stem cells, they'd reoccupy the stem cell niche and produce functional blood cells that would then essentially cure the individual of the disease. This is how scientists are thinking about the use of the system nowadays. And this hasn't really been successful yet, but there are several clinical trials that are currently underway, where people are trying to show that this can be used to treat human genetic diseases.

So in the next year, you are going to hear more about this, almost undoubtedly, as we start to hear the results of some of these patients. There are concerns about this, as well. I don't want to overblow it. There are certainly concerns. We don't know this is going to work. I mean, people have been talking about this type of stuff since I was a student 20 years ago. But I feel like we're getting-- we're much more advanced now, and the tools are more advanced. And so I feel like we're kind of getting to the point where there's a much greater chance that this will be successful these days than it was 20 years ago.

I just want to point out where this system-- how it was discovered and where it came from. And I like this as an example. Much like for the fly genes that defined major signaling pathways, this is a discovery that came from fundamental research on, basically, the ecology of bacteria. So this CRISPR-Cas9 system essentially evolved in bacteria as a form of an arms race between bacteria and their predators, bacteriophage, which are viruses that infect bacteria.

So this is an arms race between bacteria and their vicious predators, bacteriophage. And what CRISPR is, where these enzymes and this system evolved from, is this is a form of an adaptive immune system for bacteria, which is pretty wild in and of itself. So CRISPR is an adaptive immune system for bacteria. If you haven't gotten your flu shot, you should get it. We'll talk about human immunity later in the semester.

But this is where bacterial immunity kind of-- I'm sneaking it in. So the way this works in bacteria-- what CRISPR stands for is Clusters of Regularly Interspaced Short Palindromic Repeats. So you can see already thank god they gave it an acronym. Otherwise, it wouldn't be getting nearly as much buzz, because no one can say that.

And so where this CRISPR came from is on bacterial chromosomes of many bacteria, there's these clusters of interspaced short palindromic repeats, and the repeats are interrupted by spacers. And what researchers discovered are these spacers have sequence similarity and identity to sequences that are from bacteriophage. So each of these colored sequences here has some type of complementarity to some type of bacteriophage.

And so when a phage infects bacteria, or some bacteria, what happens is that there's a system to recognize that foreign genetic element and take a piece of it and insert it in the genome. And that serves as a memory for the bacteria to remember that it got infected by that particular phage. And then, later on, what the bacteria does is it transcribes this region and forms these mature what are known as CRISPR RNAs, where you can see there's some sequence would recognize a foreign genetic element.

So, therefore, in the future, if this phage came around again, what would happen is one of these CRISPR RNAs would recognize the foreign genetic element through base pair complementarity, and it would know to cut it. And after the target is cut, it's then degraded by the bacterial cell. So this is a way for bacteria to remember what viruses have infected them and to have a defense mechanism against it. So it's a pretty cool system.

You know, what's also cool about this system is it's an adaptive immune system, similar to how we sort of recognize foreign pathogens. What's different about it is this is heritable. It's incorporated into the genome. And the more phage the descendants of this bacteria see, the more of these repeats you see. So this is a heritable immune system, which, unfortunately, we don't have.

So you should still get your flu shot. We'll talk about vaccination later on. Have a good few days, and I will see you on Friday.