Libraries
A big collection of clones.
|
What is a library? |
We know what a "library" is in real life - a room or building filled with a collection of books. In molecular biology we use the term in a slightly different way, to mean a collection of recombinant vectors containing different inserted sequences. You might, for example, clone random genomic DNA or cDNA fragments into a vector, with the hope of identifying a gene you are trying to clone through screening of the library. When random fragments are cloned, the library is sometimes called a "shotgun" approach, because one doesn't really need to aim too carefully to hit the target with a pellet. When a library is designed to contain many possible versions of a sequence, often by synthesis using degenerate oligonucleotides, it is often called a "combinatorial" library. | ||||||||||||
|
First let us discuss the vectors that can be used - for example lambda |
Before we get too involved in discussing the DNA insertions in a library, let us discuss the vectors that hold the foreign DNA. You could think of this as being the "bindings" and "covers" for the books in a real library - they carry the information by holding together the pages. Almost any recombinant vector can be used to make a library, but some are easier to work with than others. Phage lambda derivatives are easy to use for genomic and cDNA libraries, because the plaques are a stable reference stock for your clones. When you lift a membrane from the collection of plaques, you always leave some plaques behind that can be picked out and propagated.
What next? You now have a gt11 clone that expresses a fusion protein that reacts or cross-reacts with your antibody probe. It could be a good result, but then again it might not be the protein you are really looking for. You will need to sequence the inserted DNA to see what you've got, and it is often the case that you want to move the inserted DNA to a plasmid vector.
|
||||||||||||
|
Plasmid vectors |
Plasmids are a bit more problematic, because lifting a membrane from
a collection of colonies may strip away the entire colony, leaving you nothing to
grow later. One solution is to make an imprint of the live colonies on a piece of
autoclaved velvet, then print the colony pattern onto a reference plate, a process
called "replica plating,"
but this is an unpleasant task. Plasmids vectors are generally not used for "expression
systems" like gt11, and are typically used for DNA detection only (i.e. using
DNA probes to detect a specific clone rather than antibodies). In the diagram below,
the plate at left contains a collection of bacterial colonies that are first replicated
to velvet and printed on a clean reference plate (starting with step 1, going horizontally
to the right). After the plate is replicated, the remaining cells in the colony are
adsorbed to a piece of nitrocellulose (step 2), and the colonies are lysed directly
onto the nitrocellulose (or nylon). This imprint can then be probed with a DNA probe,
just as you might handle a Southern blot, and you may find that one of the colonies
corresponds to a positive hybridization result. ![]() Lucky thing that you remembered to replicate the plate before adsorbing the colonies to nitrocellulose - now you have a reference colony you can pick and grow! A second problem with standard laboratory plasmids is that it is difficult to clone large insertions into them, and this is particularly a problem if the library consists of genomic DNA fragments rather than cDNAs. Most plasmids do not grow well if they have large (10 kbp) insertions. While that is similar to the limit for gt10 and gt11, we'll be discussing later, lambda phage that can handle up to 23 kbp of DNA. The problem of "insertion size" and representation in the library may become more clear from the next example. |
||||||||||||
|
How many different recombinants must a library have? |
Suppose we want to protect a group of movie stars from the rain, while they're walking from their limos to the doors of the Dorothy Chandler Pavillion ( a journey of 100 killibradpitts (kbp) - a new measure of distance). We have 10 umbrellas on hand, each able to cover 1/10th of the distance from the street to the door. If we distribute them randomly, will they cover the entire path and keep the stars dry?
Of course, if we are thinking that umbrellas = clones and sidewalk = genome, then some sequences in the genome will not have been cloned, and will not be represented in your library. That's fine, so long as the sequences that you want to clone are not left out, but we are not always so fortunate. What would happen if we made the library twice as big? Let's go back to the Academy Awards, and see how we fare with twice the coverage:
As you can see, we got closer to complete coverage, but still have some unprotected areas. To be precise, only 86 killibradpitts of sidewalk are covered and 14 are uncovered. With our analogy of clones, the uncovered regions represent uncloned sequences in the genome. The way we can calculate what's missing is pretty easy -- it's based on the Poisson distribution. If we have one "covering" of the genome (as in the ten clone example) the chance of a sequence being cloned is (1- 1/exp(1)) or about 0.63 (or 63%), where exp(x) = the base of the natural logarithm e to the x power, . If we have two "coverings" of the genome (as in the twenty clone example) the chance of a sequence being cloned is (1- 1/exp(2)) or about 0.86 (or 86%), where exp(2) means the square of e. The pattern continues as your library contains more independent clones. With ten "coverings", the chances reach 0.99995 (or 99.995%), but you can see that you never reach absolute certainty of cloning any particular sequence. |