CA2510103A1 - Modified green fluorescent proteins - Google Patents
Modified green fluorescent proteins Download PDFInfo
- Publication number
- CA2510103A1 CA2510103A1 CA002510103A CA2510103A CA2510103A1 CA 2510103 A1 CA2510103 A1 CA 2510103A1 CA 002510103 A CA002510103 A CA 002510103A CA 2510103 A CA2510103 A CA 2510103A CA 2510103 A1 CA2510103 A1 CA 2510103A1
- Authority
- CA
- Canada
- Prior art keywords
- replacement
- cell
- fluorescent protein
- gfp
- wild
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 108010043121 Green Fluorescent Proteins Proteins 0.000 title description 64
- 102000004144 Green Fluorescent Proteins Human genes 0.000 title description 64
- 230000005284 excitation Effects 0.000 claims abstract description 46
- 108091005971 Wild-type GFP Proteins 0.000 claims abstract description 32
- 238000000295 emission spectrum Methods 0.000 claims abstract description 20
- 238000000695 excitation spectrum Methods 0.000 claims abstract description 20
- 238000012986 modification Methods 0.000 claims abstract description 12
- 230000004048 modification Effects 0.000 claims abstract description 12
- 102000034287 fluorescent proteins Human genes 0.000 claims description 24
- 108091006047 fluorescent proteins Proteins 0.000 claims description 24
- 229920001184 polypeptide Polymers 0.000 claims description 21
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 21
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 21
- 239000003086 colorant Substances 0.000 claims description 13
- 150000001413 amino acids Chemical class 0.000 claims description 11
- 238000006467 substitution reaction Methods 0.000 claims description 11
- 230000004927 fusion Effects 0.000 claims description 6
- 230000001580 bacterial effect Effects 0.000 claims description 4
- 108020004707 nucleic acids Proteins 0.000 claims description 3
- 102000039446 nucleic acids Human genes 0.000 claims description 3
- 150000007523 nucleic acids Chemical class 0.000 claims description 3
- 125000003275 alpha amino acid group Chemical group 0.000 claims 2
- 241000243290 Aequorea Species 0.000 abstract description 13
- 230000004075 alteration Effects 0.000 abstract description 2
- 108090000623 proteins and genes Proteins 0.000 description 73
- 102000004169 proteins and genes Human genes 0.000 description 60
- 239000005090 green fluorescent protein Substances 0.000 description 58
- 210000004027 cell Anatomy 0.000 description 25
- 230000035772 mutation Effects 0.000 description 16
- 239000002299 complementary DNA Substances 0.000 description 14
- 238000001228 spectrum Methods 0.000 description 13
- 102220566455 GDNF family receptor alpha-1_Y66W_mutation Human genes 0.000 description 11
- 230000014509 gene expression Effects 0.000 description 11
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical class O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 10
- 108020004414 DNA Proteins 0.000 description 9
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 9
- 230000015572 biosynthetic process Effects 0.000 description 8
- 241000588724 Escherichia coli Species 0.000 description 7
- 102220566451 GDNF family receptor alpha-1_Y66H_mutation Human genes 0.000 description 7
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 7
- 108091034117 Oligonucleotide Proteins 0.000 description 7
- 238000000034 method Methods 0.000 description 7
- 238000002703 mutagenesis Methods 0.000 description 6
- 231100000350 mutagenesis Toxicity 0.000 description 6
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 5
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 5
- 239000000284 extract Substances 0.000 description 5
- 239000000499 gel Substances 0.000 description 5
- 210000003000 inclusion body Anatomy 0.000 description 5
- 230000003647 oxidation Effects 0.000 description 5
- 238000007254 oxidation reaction Methods 0.000 description 5
- 229940035893 uracil Drugs 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 241000242583 Scyphozoa Species 0.000 description 4
- 239000007983 Tris buffer Substances 0.000 description 4
- 238000002835 absorbance Methods 0.000 description 4
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 229940088598 enzyme Drugs 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 230000007935 neutral effect Effects 0.000 description 4
- 238000007699 photoisomerization reaction Methods 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 230000003595 spectral effect Effects 0.000 description 4
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 4
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 4
- QFVHZQCOUORWEI-UHFFFAOYSA-N 4-[(4-anilino-5-sulfonaphthalen-1-yl)diazenyl]-5-hydroxynaphthalene-2,7-disulfonic acid Chemical compound C=12C(O)=CC(S(O)(=O)=O)=CC2=CC(S(O)(=O)=O)=CC=1N=NC(C1=CC=CC(=C11)S(O)(=O)=O)=CC=C1NC1=CC=CC=C1 QFVHZQCOUORWEI-UHFFFAOYSA-N 0.000 description 3
- 229920001817 Agar Polymers 0.000 description 3
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- 241000880493 Leptailurus serval Species 0.000 description 3
- 241000242739 Renilla Species 0.000 description 3
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical group OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 102000006943 Uracil-DNA Glycosidase Human genes 0.000 description 3
- 108010072685 Uracil-DNA Glycosidase Proteins 0.000 description 3
- 239000008272 agar Substances 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 230000009274 differential gene expression Effects 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 239000007850 fluorescent dye Substances 0.000 description 3
- 239000006166 lysate Substances 0.000 description 3
- 102000035118 modified proteins Human genes 0.000 description 3
- 108091005573 modified proteins Proteins 0.000 description 3
- 231100000219 mutagenic Toxicity 0.000 description 3
- 230000003505 mutagenic effect Effects 0.000 description 3
- 239000002773 nucleotide Substances 0.000 description 3
- 125000003729 nucleotide group Chemical group 0.000 description 3
- 238000006862 quantum yield reaction Methods 0.000 description 3
- 238000002708 random mutagenesis Methods 0.000 description 3
- 238000007363 ring formation reaction Methods 0.000 description 3
- 241000894007 species Species 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- 241000242764 Aequorea victoria Species 0.000 description 2
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 2
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 2
- BMNVSPMWMICFRV-DCAQKATOSA-N Arg-His-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CN=CN1 BMNVSPMWMICFRV-DCAQKATOSA-N 0.000 description 2
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 2
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 2
- VBVKSAFJPVXMFJ-CIUDSAMLSA-N Asp-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N VBVKSAFJPVXMFJ-CIUDSAMLSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 102220566467 GDNF family receptor alpha-1_S65A_mutation Human genes 0.000 description 2
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 2
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 2
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 2
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 2
- YYOCMTFVGKDNQP-IHRRRGAJSA-N His-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N YYOCMTFVGKDNQP-IHRRRGAJSA-N 0.000 description 2
- LVQDUPQUJZWKSU-PYJNHQTQSA-N Ile-Arg-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LVQDUPQUJZWKSU-PYJNHQTQSA-N 0.000 description 2
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 2
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 2
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 2
- 229910021380 Manganese Chloride Inorganic materials 0.000 description 2
- GLFNIEUTAYBVOC-UHFFFAOYSA-L Manganese chloride Chemical compound Cl[Mn]Cl GLFNIEUTAYBVOC-UHFFFAOYSA-L 0.000 description 2
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 2
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 2
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 2
- 108700008625 Reporter Genes Proteins 0.000 description 2
- PXIPVTKHYLBLMZ-UHFFFAOYSA-N Sodium azide Chemical compound [Na+].[N-]=[N+]=[N-] PXIPVTKHYLBLMZ-UHFFFAOYSA-N 0.000 description 2
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 2
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 2
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 2
- 102220615016 Transcription elongation regulator 1_S65C_mutation Human genes 0.000 description 2
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 2
- HFJJDMOFTCQGEI-STECZYCISA-N Tyr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HFJJDMOFTCQGEI-STECZYCISA-N 0.000 description 2
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 2
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 2
- LMSBRIVOCYOKMU-NRPADANISA-N Val-Gln-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N LMSBRIVOCYOKMU-NRPADANISA-N 0.000 description 2
- 230000008033 biological extinction Effects 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 238000000326 densiometry Methods 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 2
- 238000002189 fluorescence spectrum Methods 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 2
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 238000005286 illumination Methods 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 239000011565 manganese chloride Substances 0.000 description 2
- 235000002867 manganese chloride Nutrition 0.000 description 2
- 238000007243 oxidative cyclization reaction Methods 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N phenol group Chemical group C1(=CC=CC=C1)O ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- -1 phenolic anion Chemical class 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 229920002704 polyhistidine Polymers 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000000750 progressive effect Effects 0.000 description 2
- 230000026447 protein localization Effects 0.000 description 2
- 238000001243 protein synthesis Methods 0.000 description 2
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 2
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 2
- 229940043267 rhodamine b Drugs 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 229940113082 thymine Drugs 0.000 description 2
- 230000014616 translation Effects 0.000 description 2
- 108700004896 tripeptide FEG Proteins 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- VXXKWPUPMFOLKM-UHFFFAOYSA-N 5-[(4-hydroxyphenyl)methylidene]imidazol-2-one Chemical compound C1=CC(O)=CC=C1C=C1C=NC(=O)N1 VXXKWPUPMFOLKM-UHFFFAOYSA-N 0.000 description 1
- XJGFWWJLMVZSIG-UHFFFAOYSA-N 9-aminoacridine Chemical compound C1=CC=C2C(N)=C(C=CC=C3)C3=NC2=C1 XJGFWWJLMVZSIG-UHFFFAOYSA-N 0.000 description 1
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- 108010083590 Apoproteins Proteins 0.000 description 1
- 102000006410 Apoproteins Human genes 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- YFBGNGASPGRWEM-DCAQKATOSA-N Arg-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFBGNGASPGRWEM-DCAQKATOSA-N 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- LNENWJXDHCFVOF-DCAQKATOSA-N Asp-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LNENWJXDHCFVOF-DCAQKATOSA-N 0.000 description 1
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 238000011537 Coomassie blue staining Methods 0.000 description 1
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 102100023933 Deoxyuridine 5'-triphosphate nucleotidohydrolase, mitochondrial Human genes 0.000 description 1
- MYMOFIZGZYHOMD-UHFFFAOYSA-N Dioxygen Chemical compound O=O MYMOFIZGZYHOMD-UHFFFAOYSA-N 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 1
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 description 1
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 108091027305 Heteroduplex Proteins 0.000 description 1
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 1
- AVXURJPOCDRRFD-UHFFFAOYSA-N Hydroxylamine Chemical compound ON AVXURJPOCDRRFD-UHFFFAOYSA-N 0.000 description 1
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 1
- GMUYXHHJAGQHGB-TUBUOCAGSA-N Ile-Thr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMUYXHHJAGQHGB-TUBUOCAGSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 101710173438 Late L2 mu core protein Proteins 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 1
- BQHLZUMZOXUWNU-DCAQKATOSA-N Met-Pro-Glu Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BQHLZUMZOXUWNU-DCAQKATOSA-N 0.000 description 1
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- SNSYSBUTTJBPDG-OKZBNKHCSA-N Pro-Trp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N4CCC[C@@H]4C(=O)O SNSYSBUTTJBPDG-OKZBNKHCSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- 101710188315 Protein X Proteins 0.000 description 1
- 101710188306 Protein Y Proteins 0.000 description 1
- 229940123573 Protein synthesis inhibitor Drugs 0.000 description 1
- 241000242743 Renilla reniformis Species 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 241001620634 Roger Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 1
- IKUMWSDCGQVGHC-UMPQAUOISA-N Trp-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O IKUMWSDCGQVGHC-UMPQAUOISA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 1
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 1
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 1
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- 239000000370 acceptor Substances 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 229960001441 aminoacridine Drugs 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000005415 bioluminescence Methods 0.000 description 1
- 230000029918 bioluminescence Effects 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000013522 chelant Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 1
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 1
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 1
- 108010011219 dUTP pyrophosphatase Proteins 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000006356 dehydrogenation reaction Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 229910001882 dioxygen Inorganic materials 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000005281 excited state Effects 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000002866 fluorescence resonance energy transfer Methods 0.000 description 1
- 238000001506 fluorescence spectroscopy Methods 0.000 description 1
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 1
- 238000001215 fluorescent labelling Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 230000005283 ground state Effects 0.000 description 1
- 235000008216 herbs Nutrition 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- YAMHXTCMCPHKLN-UHFFFAOYSA-N imidazolidin-2-one Chemical compound O=C1NCCN1 YAMHXTCMCPHKLN-UHFFFAOYSA-N 0.000 description 1
- GVONPBONFIJAHJ-UHFFFAOYSA-N imidazolidin-4-one Chemical class O=C1CNCN1 GVONPBONFIJAHJ-UHFFFAOYSA-N 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 108010012988 lysyl-glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 150000002989 phenols Chemical class 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 230000005892 protein maturation Effects 0.000 description 1
- 239000000007 protein synthesis inhibitor Substances 0.000 description 1
- 229950010131 puromycin Drugs 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 238000002165 resonance energy transfer Methods 0.000 description 1
- 230000002207 retinal effect Effects 0.000 description 1
- 239000000790 retinal pigment Substances 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Landscapes
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Modifications in the sequence of Aequorea wild-type GFP provide products having markedly different excitation and emission spectra from corresponding products from wild-type GFP. In one class of modifications, the product derived from the modified GFP exhibits an alteration in the ratio of two main excitation peaks observed with the product derived from wild-type GFP. In another class, the product derived from the modified GFP fluoresces at a shorter wavelength than the corresponding product from wild-type GFP. In yet another class of modifications, the product derived from the modified GFP exhibits only a single excitation peak and enhanced emission relative to the product derived from wild-type GFP.
Description
lA
Background of the Invention This invention relates generally to the fields of biology and chemistry.
More particularly, the invention is directed to modified fluorescent proteins and to - methods for the preparation and use thereof.
In biochemistry, molecular biology and medical diagnostics, it is often desirable to add a fluorescent label to a protein so that the protein can be easily tracked and quantified. The normal procedures for labeling requires that the protein be covalently reacted in vitro with fluorescent dyes, then repurified to remove excess dye and any damaged protein. If the labeled protein is to be used inside cells, it usually has to be microinjected; this is a difficult and time-consuming operation that caj-mot be performed-on large numbers of cells. These problems may, however, bL eliminated by joining a rucleotide sequence coding for the protein of interest with the sequence for a naturally fluorescent protein, then expressing the fusion protein.
The green fluorescent protein (GFP) of the jellyfish Aequorea vicroria is a remarkable protein with strong visible absorbance and' fluorescence from a p-hydroxybenzylideneimidazolone chromophore, which is generated by cyclization and oxidation of the protein's own Ser-Tyr-Gly sequence at positions 65 to 67.
A
cDNA sequence [SEQ ID NO:1] for one isotype of GFP has been reported [Prasher, D. C. et al:, Gene lil, 229-233 (1992)]; cloning of this cDNA has enabled GFP expression in different organisms. The finding that the expressed protein becomes fluorescent in cells from a wide variety of argani.sms [Chalfie, M.
et al., Science 263, 802-805 (I994)] -makes GFP a powerful new tool in molecular and cell biology and indicates that the oxidative cyclization must be either spontaneous or dependent only on ubiquitous enzymes and reactants.
Background of the Invention This invention relates generally to the fields of biology and chemistry.
More particularly, the invention is directed to modified fluorescent proteins and to - methods for the preparation and use thereof.
In biochemistry, molecular biology and medical diagnostics, it is often desirable to add a fluorescent label to a protein so that the protein can be easily tracked and quantified. The normal procedures for labeling requires that the protein be covalently reacted in vitro with fluorescent dyes, then repurified to remove excess dye and any damaged protein. If the labeled protein is to be used inside cells, it usually has to be microinjected; this is a difficult and time-consuming operation that caj-mot be performed-on large numbers of cells. These problems may, however, bL eliminated by joining a rucleotide sequence coding for the protein of interest with the sequence for a naturally fluorescent protein, then expressing the fusion protein.
The green fluorescent protein (GFP) of the jellyfish Aequorea vicroria is a remarkable protein with strong visible absorbance and' fluorescence from a p-hydroxybenzylideneimidazolone chromophore, which is generated by cyclization and oxidation of the protein's own Ser-Tyr-Gly sequence at positions 65 to 67.
A
cDNA sequence [SEQ ID NO:1] for one isotype of GFP has been reported [Prasher, D. C. et al:, Gene lil, 229-233 (1992)]; cloning of this cDNA has enabled GFP expression in different organisms. The finding that the expressed protein becomes fluorescent in cells from a wide variety of argani.sms [Chalfie, M.
et al., Science 263, 802-805 (I994)] -makes GFP a powerful new tool in molecular and cell biology and indicates that the oxidative cyclization must be either spontaneous or dependent only on ubiquitous enzymes and reactants.
A major question in protein photophysics is how a single chromophore can give widely different spectra depending on its local protein environment. This question has received the most attention with respect to the multiple colors of visual pigments based on retinal [herbs, S. L. & Nathans, J. Science 258, 464-S (1992)], but is also important in GFP. The GFP from Aequorea and that of the sea pansy Renilla reniformis share the same chromophore, yet Aequorea GFP has two absorbance peaks at 395 and 475 nm, whereas Renilla GFP has only a single absorbance peak at 498 nm, with about 5.5 fold greater monomer extinction coefficient than the major 395 nm peak of the Aequorea protein [Ward, W. W. in Bioluminescence and Chemiluminescence (eds. DeLuca, M.A. & McElroy, W_D.) 235-242 (Academic Press, New York, 1981)]. The spectra of the isolated chromophore and denatured protein at neutral pH do not match the spectra of either native protein [Cody, C. W. et al., Biochemistry 32, 1212-1218 (1993)].
For many practical applications, the spectrum of Renilla GFP would be preferable to that of Aequorea, because wavelength discrimination between different fluorophores and detection of resonance energy transfer are easier if the component -spectra are tall and narrow rather than low and broad. Furthermore, the longer wavelength excitation peak (475 nm) of Aequorea GFP is almost ideal for fluorescein filter sets and is resistant to photobleaching, but has tower .
amplitude than the shorter wavelength peak at 395. nm, which is more susceptible to photobleaching [Chalfie et al. (1994), supra]. For all these reasons, it would clearly be advantageous to convert the Aequorea GFP excitation spectrum to a single peak, and preferably at longer wavelengths.
There is also a need in the art for proteins which fluoresce at different wavelengths. Variants of fluorescent proteins with different colors would also be very useful for simultaneous comparisons of multiple protein fates, developmental lineages, and gene expression levels.
Accordingly, it is an object of the present invention to provide improved fluorescent proteins which do not suffer from the drawbacks of native Aequorea GFP.
For many practical applications, the spectrum of Renilla GFP would be preferable to that of Aequorea, because wavelength discrimination between different fluorophores and detection of resonance energy transfer are easier if the component -spectra are tall and narrow rather than low and broad. Furthermore, the longer wavelength excitation peak (475 nm) of Aequorea GFP is almost ideal for fluorescein filter sets and is resistant to photobleaching, but has tower .
amplitude than the shorter wavelength peak at 395. nm, which is more susceptible to photobleaching [Chalfie et al. (1994), supra]. For all these reasons, it would clearly be advantageous to convert the Aequorea GFP excitation spectrum to a single peak, and preferably at longer wavelengths.
There is also a need in the art for proteins which fluoresce at different wavelengths. Variants of fluorescent proteins with different colors would also be very useful for simultaneous comparisons of multiple protein fates, developmental lineages, and gene expression levels.
Accordingly, it is an object of the present invention to provide improved fluorescent proteins which do not suffer from the drawbacks of native Aequorea GFP.
Summary of the Invention In accordance with the present invention, it has been determined that particular modifications in the polypeptide sequence of an Aequorea wild-type GFP
[SEQ ID N0:2] lead to formation of products having markedly different excitation S and emission spectra from corresponding products derived from wild-type GFP.
Visibly distinct colors and/or increased intensities of emission make these products useful in a wide variety of contexts, such as tracking of differential gene expression and protein localization.
Brief Description of the Drawings The invention may be better understood with reference to the accompanying drawings, in which:
Fig. 1 compares different versions of GFP by gel electrophoresis and Coomassie blue staining;
Fig. 2 illustrates a proposed biosynthetic scheme for GFP;
Figs. 3a and 3b illustrate the excitation and emission spectra of wild-type and a first group of mutant GFPs;
Figs. 4a and 4b illustrate the excitation and emission spectra of wild-type and a second group of mutant GFPs;
Fig. 5 illustrates the rate of fluorophore formation in the wild-type GFP and the Ser 65-~Thr mutant;
Figs. 6a and 6b illustrate the, behavior of wild-type GFP and the Ser 65~Thr mutant, respectively, upon progressive irradiation with ultraviolet light; and Fig. 7 illustrates fluorescence excitation and emission spectra of a third group of GFP mutants.
Detailed Description of the Invention GFP was expressed in E. coli under the control of a T7 promoter for quantitative analysis of the properties of the recombinant protein. Gel electrophoresis under denaturing conditions showed protein of the expected molecular weight (27 kDa) as a dominant band (Fig. 1), which could be quantified simply by densitometry of staining with Coomassie blue. Soluble recombinant GFP proved to have identical spectra and the same or even slightly more fluorescence per mole of protein as GFP purified from Aequorea victoria, showing that the soluble protein in E. coli undergoes correct folding and oxidative cyclization with as high an efficiency as in the jellyfish.
The bacteria also contained inclusion bodies consisting of protein indistinguishable from jellyfish or soluble recombinant protein on denaturing gels (Fig. ~). However, this material was completely non-fluorescent, lacked the visible absorbance bands of the chromophore, and could not be made fluorescent even when solubilized and subjected to protocols that renature GFP [Ward, W.
W.
& Bokman, S. H., Biochemistry 21, 4535-4540 (1982); Surpin, M. A. & Ward, W. W., Photochem. Photobiol. 49, Abstract, 25S (1989)J. Therefore, protein from inclusion bodies seemed permanently unable to generate the internal chromophore. An interesting intermediate stage in protein maturation could be generated by growing the bacteria anaerobically. The soluble protein again looked the same as GFP on denaturing gels (Fig. 1) but was non-fluorescent. In this case, fluorescence gradually developed after admission of air, even when fresh protein synthesis was blocked using puromycin and tetracycline. Evidently, the soluble non-fluorescent protein synthesized under anaerobic conditions was ready to become fluorescent once atmospheric oxygen was readmitted. The fluorescence per protein molecule approached its final asymptotic value with a single-exponential time course and a rate constant of 0.24 ~ .06 hr~' (at 22°C) measured either in intact cells with protein-synthesis inhibitors or in a lysate in which the soluble proteins and cofactors were a thousand fold more dilute. Such pseudo-first order kinetics strongly suggest that no enzymes or cofactors are necessary for the final step of fluorophore formation in GFP.
It has thus been determined that formation of the final fluorophore requires molecular oxygen and proceeds in wild-type protein with a time constant of - 4 h at 22°C and atmospheric p02. This was independent of dilution, implying that the oxidation does not require enzymes or cofactors.
A molecular interpretation is presented in Fig. 2. If the newly translated apoprotein (top left) evades precipitation into inclusion bodies, the amino group of Gly 67 might cyclize onto the carbonyl group of Ser 65 to form an imidazolidin-one, where the process would stop (top center) if OZ is absent. The new N=C
S double bond would be expected to promote dehydrogenation to form a conjugated chromophore; imidazolidin-5-ones are indeed known to undergo autoxidative formation of double bonds at the 4-position [Kjaer, A. Acta Chem. Scand. 7, 1030-1035 (1953); Kidwai, A. R. & Devasia, G. M. J. Org. Chem. 27, 4527-4531 (1962)], which is exactly what is necessary to complete the fluorophore (upper right). The protonated and deprotonated species (upper and lower right) may be responsible for the 395 and 470-475 nm excitation peaks, respectively. The excited states of phenols are much more acidic than their ground states, so that emission would come only from a deprotonated species.
The Aequorea GFP cDNA was subjected to random mutagenesis by hydroxylamine treatment or polymerase chain reaction. Approximately six thousand bacterial colonies on agar plates were illuminated with alternating 395 and 475 nm excitation and visually screened for ~;.:ed excitation properties or emission colors.
According to a first aspect of the present invention, modifications are provided which result in a shift in the ratio of the two excitations peaks of the product after oxidation and cyclization relative to the wild type. Three mutants were found with significant alterations in the ratio of the two main excitation peaks (Table I). The mutations were sequenced and recombined with the wild-type gene in different ways to eliminate neutral mutations and assign the fluorescence effects to single amino acid substitutions, except for H9 where two neighboring mutations have not yet been separated. They all lay in the C terminal part of the protein (Table I), remote in primary sequence from the chromophore formed from residues 65-67.
These and other modifications are defined herein with reference to the amino acid sequence [SEQ ID N0:2) encoded by the reported cDNA [SEQ ID
NO:1]; the first amino acid identified is the one found at the indicated location in the reported sequence, while the second indicates the substitution found in the modified form. The fluorescent product derived from a wild-type or modified GFP
polypeptide sequence is no longer strictly speaking a simple polypeptide after oxidation and cyclization; however, reference is sometimes made for sake of simplicity herein to the polypeptide (e.g., "wild-type GFP" or "modified GFP") where what is intended would be obvious from the context. Compared with wild-type GFP, H9 (Ser 2D2~Phe, Thr 203~I1e) had increased fluorescence at 395 nm excitation; P9 (Ile 167~Va1) and P11 (Ile 167-~Thr) were more fluorescent at nm excitation.
One possibility for these spectral perturbatrons in P9 and P11 is that the mutations at Ile 167 shift a positive charge slightly closer to the phenolic group of the fluorophore; this should both increase the percentage of phenolic anion, which is probably the species responsible for the 470-475 nm excitation peak, and shift the emission peak hypsochromically. However, the hypothesized ionizable phenolic group would have to be buried inside the protein at normal pH, because the ratio of 471 to 396 nm peaks in the mutants could not be further affected by external pH until it was raised to 10, just below the threshold for denaturation.
The pH-sensitivity of wild-type GFP is similar [Ward, W. W. et al., Photochem.
Photobiol. 35, 803-808 (1982)].
According to another aspect of the invention, a mutant P4 (Tyr 66->His) was identified which was excitable by ultraviolet and fluoresced bright blue in contrast to the green of wild type protein. The excitation and emission maxima were hypsochromically shifted by 14 and 60 nm respectively from those of wild-type GFP: The mutated DNA was sequenced and found to contain five amino acid substitutions, only one of which proved to be critical: replacement of Tyr 66 in the center of the chromophore by His (corresponding to a change in the GFP cDNA
sequence [SEQ ID N0:1] at 196-198 from TAT to CAT).
The surprising tolerance for substitution at this key residue prompted further site-directed mutagenesis to Trp and Phe at this position. Trp gave excitation and emission wavelengths intermediate between Tyr and His (Table I) but was only weakly fluorescent, perhaps due to inefficiency of folding or chromophore formation due to steric considerations. Phe gave weak fluorescence with an excitation maximum at 358 nm and an emission maximum at 442 nm.
Accordingly, pursuant to this aspect of the invention modified GFP proteins which fluoresce at different wavelengths (preferably, different by at least 10 nm and more S preferably, by at least 50 nm) relative to the native protein are provided, for example, those wherein Tyr 66 is replaced by Phe, His or Trp.
In a further embodiment of this aspect of the invention, a double mutant Y66H, YI45F was identified which had almost the same wavelengths as the single mutant Y66H but almost twice the brightness, due mainly to a higher quantum efficiency of fluorescence. The double mutant also developed its fluorescence during overnight growth, whereas the single mutant required several days.
In accordance with further embodiments of this aspect of the invention, a first round of mutagenesis to increase the brightness of Y66~V yielded ' M153T/V163A/N212K as additional substitutions. This mutant was subjected to another round of mutagenesis, resulting in two further sets, N146I and I123V/Y145H/H148R (Table II). The quantum efficiency of these mutants is now comparable to wild-type GFP. The clustering of the substitutions in residues to 163 suggest that those residues lie relatively close to the chromophore and that reductions in the size of their side chains might be compensating for the larger size of tryptophan compared to tyrosine.
Pursuant to yet another aspect of the present invention, modified GFP
proteins are provided which provide substantially more intense fluorescence per molecule than the wild type protein. Modifications at Ser 65 to Ala, Leu, Cys, Val, Ile or Thr provide proteins with red-shifted and brighter spectra relative to the native protein. In particular, the Thr mutant (corresponding to a change in the GFP cDNA sequence [SEQ ID NO:1) at I93-195 from TCT to ACT) and Cys mutant (corresponding to a change in the GFP cDNA sequence [SEQ ID NO:1] at 193-195 from TCT to TGT) are about six times brighter than wild type when excited at the preferred long-wavelength band above 450 nm. As a consequence, these modified proteins are superior to wild type proteins for practically all applications. Further, the brightness of these modified proteins matches the brightness reported in the literature for Renilla GFP; thus, these proteins clearly obviate the objections to the dimness of Aequorea GFP. In fact, it is speculated that the chromophores in these modified proteins may exhibit the optimum brightness which could be achieved with a general structure derived from the Aequorea GFP chromophore. In particular, these mutations provide products exhibiting one or more of the following salient characteristics which distinguish them clearly over the corresponding product from a wild-type GFP: reduced efficiency of excitation by wavelengths between about 350 and 420 nm; enhanced excitation and emission efficiency when excited with wavelengths longer than about 450 nm; increased resistance to light-induced shifts in the excitation spectrum; and faster kinetics of fluorophore generation. In contrast, mutations to Trp, Arg, Asn, Phe and Asp did not provide improved brightness.
Mutagenesis of S65T to shift its wavelengths further to the red yielded M153A/K238E (Table II) as the GFP variant with the longest-wavelength excitation maximum yet described, 504 nm vs. 490 nm for S65T. Surprisingly, the emission peak hardly changed (514 nm vs. 511 nm), so that the separation between the excitation and emission peaks (Stokes' shift) is extremely narrow, only 10 nm.
This is one of the smallest values reported for any fluorophore in aqueous solution at room temperature. As in the Y66W series, M153 seems to be influential. It is doubtful that K238E is important, because this substitution has been found to be without effect in other mutants.
As would be readily apparent to those working in the field, to provide the desired fluorescent protein it would not be necessary to include the entire sequence of GFP. In particular, minor deletions at either end of the protein sequence are expected to have little or no impact on the fluorescence spectrum of the protein.
Therefore, by a mutant or wild-type GFP sequence for purposes of the present invention are contemplated not only the complete polypeptide and oligonucleotide sequences discussed herein, but also functionally-equivalent portions thereof (i.e., portions of the polypeptide sequences which exhibit the desired fluorescence properties and oligonucleotide sequences encoding these polypeptide sequences).
For example, whereas the chromophore itself (position 65-67) is obviously crucial, the locations of known neutral mutations suggest that amino acids 76-115 are less critical to the spectroscopic properties of the product. In addition, as would be immediately apparent to those working in the field, the use of various types of fusion sequences which lengthen the resultant protein and serve some functional purpose in the preparation or purification of the protein would also be routine and are contemplated as within the scope of the present invention. For example, it is common practice to add amino acid sequences including a polyhistidine tag to facilitate purification of the product proteins. As such fusions do not significantly alter the salient properties of the molecules comprising same, modified GFPs as described herein including such fusion sequences-at either end thereof are also clearly contemplated as within the scope of the present invention.
Similarly, in addition to the specific mutations disclosed herein, it is well understood by those working in the field that in many instances modifications in particular locations in the polypeptide sequence may have no effect upon the properties of the resultant polypeptide. Unlike the specific mutations described in detail herein, other mutations provide polypeptides which have properties a o, r , c ~ ~rs ic' r ~
..ss~:~tially a sub~t .ntially ~,_d ~i _g,~ah?~° ~_w: n ~_:~~se of the spec'fic polypeptides disclosed herein. For example, the following substitutions have been found to be neutral (i.e., have no significant impact on the properties of the product):
Lys 3-~Arg; Asp 76-~Gly; Phe 99-~Ile; Asn 105-~Ser; GIu I 15-jVal; Thr 225-~Ser;
and Lys 238~G1u. These equivalent polypeptides (and oligonucleotide sequences encoding these polypeptides) are also regarded as within the scope of the present invention. In general, the polypeptides and oligonucleotide sequences of the present invention (in addition to containing at least one of the specific mutations identified herein) will be at least about 85 % homologous, more preferably at least about 90% homologous, and most preferably at least about 95% homologous, to the wild-type GFP described herein. Because of the significant difference in properties observed upon introduction of the specified modifications into a GFP
sequence, the presence of the specified modifications relative to the corresponding reported sequence for wild-type GFP [SEQ ID N0:2] are regarded as central to the invention.
The oligonucleotide sequences of the present invention are particularly useful in processes for labelling polypeptides of interest, e.g., by the construction of genes encoding fluorescent fusion proteins. Fluorescence labeling via gene fusion is site-specific and eliminates the present need to purify and label proteins 5 in vitro and microinject them into cells. Sequences encoding the modified GFPs of the present invention may be used for a wide variety of purposes as are well known to those working in the field. For example, the sequences may be employed as reporter genes for monitoring the expression of the sequence fused thereto; unlike other reporter genes, the sequences require neither substrates nor 10 cell disruption to evaluate whether expression has- be achieved. Similarly, the sequences of the present invention may be used as a means to trace lineage of a gene fused thereto during the development of a cell or organism. Further, the sequences of the present invention may be used as a genetic marker; cells or organisms labeled in this manner can be selected by, e.g.; fluorescence-activated cell sorting. The sequences of the present invention may also be used as a fluorescent tag to monitor protein expression in vivo, or to encode donors or acceptors for fluorescence resonance energy transfer. Other uses for the sequences of the present invention would be readily apparent to those working in the field, as would appropriate techniques for fusing a gene of interest to an oligonucleotide sequence of the present invention .in the proper reading frame and in a suitable expression vector so as to achieve expression of the combined sequence.
The availability of several forms of GFP with such different spectral properties should facilitate two-color assessment of differential gene expression, developmental fate, or protein trafficking. For example, if one wanted to screen for a drug that is specific to activate expression of gene A but not gene B, one could fuse the cDNA for one color of GFP to the promoter region of gene A and fuse the cDNA for another color to the promoter region of gene B. Both constructs would be transfected into target cells and the candidate drugs could be assayed to determine if they stimulate fluorescence of the desired color, but not fluorescence of the undesired color. Similarly, one could test for the simultaneous _- 11 expression of both A and B by searching for the presence of both colors simultaneously.
As another example, to examine the precise temporal or spatial relationship between the generation or location of recombinant proteins X and Y within a cell or an organism, one could fuse genes for different colors of GFP to the genes for proteins X and Y, respectively. If desired, DNA sequences encoding flexible oligopeptide spacers could be included to allow the linked domains to function autonomously in a single construct. By examining the appearance of the two distinguishable colors of fluorescence in the very same cells or organisms, one could compare and contrast the generation or location of the proteins X and Y
with much greater precision and less biological variability than if one had to compare two separate sets of cells or organisms, each containing just one color of GFP
fused to either protein X or Y. Other examples of the usefulness of two colors would be obvious to those skilled in the art.
The further mutations to brighten the Y66H and Y66W variants of GFP
enhance the possibility of using two or three colors of fluorescent protein to track differential gene expression, protein localizations or cell fates. For example, mutants P4-3 (Y66H1YI45F), W7 (Y66W/NI46I/M153T/V163A/N212K) and S65T can all be distinguished from each other. P4-3 is specifically detected by exciting at 290-370 nm and collecting emission at 420-460 nm. W7 is specifically detected by exciting at 410-4.57 nm and collecting emission at 465-495 nm.
is specifically detected by exciting at 483-493 nm and collecting emission at wavelengths greater than 510 nm. Bacteria carrying these three proteins are readily discriminated under a microscope using the above wavelength bandpass filters.
The chromophore in GFP is well buried inside the rest of the protein, so much of the dimness of the original point mutants was presumably due to steric mismatch between the substituted amino acid and the cavity optimized for tyrosine.
The location of the beneficial mutations implies that residues 145-163 are probably close to the chromophore. The M153A/S65T mutant has the longest wavelengths and smallest Stokes' shift of any known fluorescent protein that does not use a cofactor.
The invention may be better understood with reference to the accompanying examples, which are intended for purposes of illustration only and should not be construed as in any sense limiting the scope of the invention as defined by the claims appended hereto.
Example 1 The coding region of GFP clone 10.1 [Prasher et al. (1992), supra] was amplified by PCR to create NdeI and BamHI sites at the 5' and 3' ends, respectively, and was cloned behind the T7 promoter of pGEMEX2 (Promega) replacing most of the T7 gene 10. The resulting plasmid was transformed into the strain JM109(DE3) (Promega Corp., Madison, WI), and high level expression was achieved by growing the cultures at 24°C to saturation without induction by IPTG.
To prepare soluble extracts, 1.5 ml cell suspension were collected, washed and resuspended in 150 ~cl 50 mM Tris/HCI, pH 8.0, 2 mM EDTA. Lysozyme and DNAse I were added to 0.2 mg/ml and 20 ~g/ml, respectively, and the samples were incubated on ice until lysis occurred (1-2 hours). The lysates were then clarified by centrifuging at 12,000 x g for 15 minutes. Inclusion bodies were obtained as described in the literature [Sambrook, J. et al. in Molecular Cloning:
A Laboratory Manual Vol. 2, 17.37-17.41 (Cold Spring Harbor Press, Cold Spring Harbor, New York, 1989)] .
As illustrated in Fig. 1, soluble extracts of E. coli expressing GFP show a predominant band which is absent in extracts from control cells and has the same electrophoretic mobility as native GFP isolated from the jellyfish A.
victoria.
Inclusion bodies of expressing cells consist mainly of non-fluorescent GFP
which has the same mobility as soluble GFP. Non-fluorescent soluble GFP of anaerobically grown cultures is also a major band with correct mobility.
Soluble extracts of the mutated clones H9, P9, P11 and P4 again contain a dominant protein with essentially the same molecular weight.
Random mutagenesis of the GFP cDNA was done by increasing the error rate of the polymerise chain reaction with 0.1 mM MnCl2, 50 ~cM dATP and 200 tcM of dGTP, dCTP, and dTTP [Muhlrad, D. et al., Yeast 8, 79-82 (1992)]. The product was ligated into pGEMEX2 and subsequently transformed into JM109(DE3). Colonies on agar were visually screened for different emission colors and ratios of brightness when excited at 475 vs. 395 nm.
5 Figs. 3a and 3b illustrate the excitation and emission spectra of wild-type and mutant GFPs. In Figs. 3a and 3b, wild-type; - - S202F,T203I; - -- I167T; ----- Y66W; - ~ - ~ Y66H. Samples were soluble fractions from E.
coli expressing the proteins at high level, except for Y66W, which was obtained in very low yield and measured on intact cells. Autofluorescence was negligible 10 for all spectra except those of Y66W, whose excitation spectrum below 380 nm may be contaminated by autofluorescence. Excitation and emission spectra were measured with 1.8 nm bandwidths and the non-scanning wavelength set to the appropriate peak. Excitation spectra were corrected with a rhodamine B quantum counter, while emission spectra (except for Y66W) were corrected for 15 monochromator and detector efficiencies using manufacturer-supplied correction spectra. All amplitudes have been arbitrarily normalized to a maximum value of 1Ø A comparison of brightness at equal protein concentrations is provided in Table I.
Table I
i Characteristics of mutated vs. wild-type GFP
Variant Mutation Excitation Emission Relative' Maxima (nm)= Maxima (nm)b ~ Fluorescence Wild type none 396 (476) 508 (503) ( =100 % ) H9 Ser 202->Phe, 398 511 117 % ~' j Thr 203-IIe P9 _ _ Ile 167--~Val 471 (396) 502 (507) 166 % ' _Pli _ Ile 167-~Thr_ 471 (396) 502 (507) 18g%' P4 Tyr 66-->His 382 448 57 % f W Tyr 66->Tip 458 480 . n.d.
aValues in parentheses are lower-amplitude peaks.
Primary values were observed when exciting at the main excitation peak; values in parentheses were observed when illuminating at the lower-amplitude excitation . peak.
Equal amounts of protein were used based on densitometry of gels stained with Coomassie Blue (Fig. 1).
j ~ Emission .maxima of spectra recorded at excitation 395 nm were compared.
'Emission maxima.of spectra recorded at excitation 475 rlm were compared.
(Emission spectrum of P4 recorded at 378 nm excitation was integrated and compared to the integrated emission spectrum of wild type recorded at 475 nm v excitation; both excitation and emission characteristics were corrected.
', Example 2 -Oligonucleotide-directed mutagenesis~at the codon for Ser-65 of GFP cDNA
.
y was performed by the literature method [Kunkel, T.A. (1985) Proc. Natl.
Acad.
Sci. USA 82, 488J using the Muta-Gene Phagemid in Yitro Mutagenesis Kit version 2, commercially available from Bio-Rad, Richmond, CA. The method employs i a bacterial host strain deficient for dUTPase (dut) and uracil-N-glycosylase (ung), which results in an occasional substitution of uracil for thymine in newly-i synthesized DNA. When the uracil-containing DNA is used as a wild-type template for oligonucleotide-directed in vitro mutagenesis, the complementary (mutant) strand can be synthesized in the presence of deoxynucleotides, ligase and polymerase using the mutagenic oligonucleotide to prime DNA synthesis; the Version 2 kit utilizes unmodified T7 DNA polymerase to synthesize the complementary strand. When the heteroduplex molecule is transformed into a host with an active uracil-N-glycosylase (which cleaves the bond between the uracil base 5 and the ribose molecule, yielding an apyrimidic site), the uracil-containing wild-type strand is inactivated, resulting in an enrichment of the mutant strand.
The coding region of GFP cDNA was cloned into the BamHl site of the phagemid pRSETB from Invitrogen (San Diego, CA). This construct was introduced into the dut, ung double mutant E. coli strain CJ236 provided with the 10 Muta-Gene kit and superinfected with helper phage VCSM13 (Stratagene, La Jolla, CA) to produce phagemid particles with single-stranded DNA containing some uracils in place of thymine. The uracil-containing DNA was purified to serve as templates for in vitro synthesis of the second strands using the mutagenic nucleotides as primers. The DNA hybrids were transformed into the strain 15 XLlblue (available from Stratagene), which has a functional uracil-N-glycosylase;
this enzyme inactivates the parent wild-type DNA strand and selects for mutant clones. DNA of several colonies were isolated and checked for proper mutation by sequencing.
To express the mutant proteins, the DNA constructs obtained by mutagenesis were transformed into E. coli strain BL21 (DE3)LysS (Novagen, Madison, WI), which has a chromosomal copy of T7 polymerase to drive expression from the strong T7 promotor. At room temperature 3 ml cultures were grown to saturation (typically, overnight) without induction. Cells from 1 ml of culture were collected, washed and finally resuspended in 100 ~1 of 50 mM Tris pH 8.0, 300 mM NaCI. The cells were then lysed by three cycles of freeze/thawing (liquid nitrogen/30° C water bath). The soluble fraction was obtained by pelletting cell debris and unbroken cells in a microfuge.
To facilitate purification of the recombinant proteins, the vector used fuses a histidine tag (6 consecutive His) to the N-terminus of the expressed proteins .
The strong interaction between histidine hexamers and Ni2+ ions permitted purification of the proteins by NI-NTA resin (available commercially from Qiagen, Chatsworth, CA). Microcolumns (10 ~1 bed volume) were loaded with 100 ~,l soluble extract (in 50 mM Tris pH 8.0, 300 mM NaCI), washed with 10 bed volumes of the same buffer and with 10 volumes of the buffer containing 20 mM
imidazole. The recombinant proteins were then eluted with the same buffer containing 100 mM imidazole.
Aliquots of the purified mutant GFP proteins were run along with wild-type GFP on a denaturing polyacrylamide gel. The gel was stained with Coomassie blue and the protein bands were quantified by scanning on a densitometer.
Based on these results, equal amounts of each version of protein were used to run fluorescence emission and excitation spectra.
Figs. 4a and 4b compare the excitation and emission spectra of wild-type and Ser 65 mutants. In Fig. 4a, -- S65T; - - S65A; - - - S65C; - ~ -wild-type (emission at 508 nm). In Fig. 4B, -- S65T; - - S65A; - - - S65C;
~ ~ ~ wild-type (excitation at 395 nm); - ~ - ~ wild-type (excitation at 475 nm).
Excitation and emission spectra were measured with 1.8 nm bandwidths and the non-scanning wavelength set to the appropriate peak. As is apparent from Fig.
4b, all three mutants exhibited substantially higher intensity of emission relative to the wild-type protein.
Fig. 5 illustrates the rates of fluorophore formation in wild-type GFP and in the Ser 6~-~Thr mutant. E. coli expressing either wild-type or mutant GFP
were grown anaerobically. At time=0, each sample was exposed to air; further growth and protein synthesis were prevented by transferring the cells to nutrient-free medium also containing sodium azide as a metabolic inhibitor. Fluorescence was subsequently monitored as a function of time. For each culture, the fluorescence intensities are expressed as a fraction of the final fluorescence intensity obtained at t = 18 to 20 hours, after oxidation had proceeded to completion. From Fig.
5, it is apparent that development of fluorescence proceeds much more quickly in the mutant than in wild-type GFP, even after normalization of the absolute brightnesses (Figs. 4a and 4b). Therefore, when the development of GFP fluorescence is used as an assay for promotor activation and gene expression, the mutant clearly gives a more rapid and faithful measure than wild-type protein.
Figs. 6a and 6b illustrate the behavior of wild-type GFP and the Ser 65->Thr mutant, respectively, upon progressive irradiation with ultraviolet light.
Numbers indicate minutes of exposure to illumination at 280 nm; intensity was the same for both samples. Wild-type GFP (Fig. 6a) suffered photoisomerization, as shown by a major change in the shape of the excitation spectrum. Illumination with broad band (240-400 nm) UV caused qualitatively similar behavior but with less increase of amplitude in the 430-500 nm region of the spectrum: The photoisomerization was not reversible upon standing in the dark. This photoisomerization would clearly be undesirable for most uses of wild-type GFP, I0 because the protein rapidly loses brightness when excited at its main peak near 395 nm. The mutant (Fig. 6b) showed no such photoisomerization or spectral shift.
Examvle 3 GFP cDNAs encoding for Tyr6C»His (Y66H), Tyr66-~Trp (Y66W), or Ser65-~Thr (S65T) were separately further mutagenized by the polymerise chain reaction and transformed into E. coli for visual screening of colonies with unusual intensities or colors. Isolation, spectral characterization (Table II and Fig.
7), and DNA sequencing yielded several additional useful variants.
Random mutagenesis of the gfp cDNA was done by increasing the error rate of the PCR with 0.1 mM MnCl2 and unbalanced nucleotide concentrations.
The GFP mutants S65T, Y66H and Y66W had been cloned into the BamHl site of the expression vector pRSETB (Invitrogen), which includes a T7 promoter and <. a polyhistidine tag. The GFP coding region (shown in bold) was flanked 'by the following 5' and 3' sequences: S'-G GAT CCC CCC GCT GAA TTC ATG ...
AAA TAA TAA GGA TCC-3'. The 5' primer for the mutagenic PCR was the T7 primer matching the vector sequence; the 3' primer was 5'-GGT AAG CTT TTA
TTT GTA TAG TTC ATC CAT GCC-3', specific for the 3' end of GFP, creating a HindIII restriction site next to the stop codon. Amplification was over 25 cycles TM
(1 min at 94' C, 1 min 52' C, I min 72' C) using the AmpliTaq polymerise from ~
Perkin Elmer. Four separate reactions were run in which the concentration of a different nucleotide was lowered from 200 ZcM to 50 ~M. The PCR products were combined, digested with BamHI and HindIII and ligated to the pRSETB cut with BamHI and HindIII. The ligation mixture was dialyzed against water, dried and subsequently transformed into the bacterial strain BL21(DE3) by electroporation (50 ~cl electrocompetent cells in 0.1 cm cuvettes, 1900 V, 200 ohm, 25 ~F).
Colonies on agar were visually screened for brightness as previously described herein. The selected clones were sequenced with the Sequenase version 2.0 kit from United States Biochemical Cultures with freshly transformed cells were grown at 37' C to an optical density of 0.8 at 600 nm, then induced with 0.4 mM isopropylthiogaIactoside overnight at room temperature. Cells were washed in PBS pH 7.4, resuspended in 50 mM Tris pH 8,.0, 300 mM~ NaCI and lysed in a French press. The polyhistidine-tagged GFP proteins were purified from cleared lysates on nickel-chelate columns (Qiagen) using 100 mM imidazole in the above buffer to elute the protein.
Excitation spectra were obtained by collecting emission at the respective peak wavelengths and were corrected by a Rhodamine B quantum counter.
Emission spectra were likewise measured at the respective excitation peaks and were corrected using factors from the fIuorometer manufacturer (Spex Industries, Edison, NJ). In cleavage experiments emission spectra were recorded at excitation 368 nm. For measuring molar extinction eoe~cients, 20 to 30 beg of protein were used in 1 ml of PBS pH 7.4. Quantum yields of wild-type GFP, S65T, and P4-1 mutants were estimated by comparison with fluorescein in 0.1 N NaOH as a standard of quantum yield 0.91 [ed. Miller, J.N., Standards in Fluorescence Spectrometry (Chapman and Hall, New York, 1981)]. Mutants=P4 and P4-3 were likewise compared to 9-amino-acridine in water (quantum yield 0.98). W2 and W7 were compared -to both standards, which fortunately gave concordant results.
Fig. 7 illustrates the fluorescence excitation and emission spectra of different GFP mutants. All spectra were normalized to a maximal value of 1.
Each pair of excitation and emission spectrum is depicted by a distinct Iine style.
The fluorescence properties of the obtained GFP mutants are reported in Table II.
Table II
Fluorescence properties of GFP
mutants Clone Mutations Excitation EmissionExtinct.Coeff.Quantum max (nm) max (nm) (M-lcai 1) yield P4-3 Y66H 381 445 14,000 0.38 W7 Y66W 433 (453) 475 (501)18,000 (17,100)0.67 W2 Y66W 432 (453) 480 10,000 (9,600)0.72 P4-1 S65T 504 (396) 514 14,500 (8;600)0.54 SECUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: Tsien, Roger Y.
Heim, Roger (ii) TITLE OF INVENTION: MODIFIED GREEN FLUORESCENT PROTEINS
(iii) NUMBER OF SECUENCES: 2 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Robbins, Berliner 8 Carson (B) STREET: 201 North Figueroa Street, Suite 500 (C) CITY: Los Angeles (D) STATE: California (E) COUNTRY: USA
(F) ZIP: 90012 (v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy desk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Spitals, John P.
(B) REGISTRATION NUMBER: 29,215 (C) REFERENGE/DOCKET NUMBER: 1279-178 (ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (213) 977-1001 (B) TELEFAX: (213) 977-1003 (2) INFORMATION FOR SEO ID N0:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH:-717 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..717 (xi) SEQUENCE DESCRIPTION: SEO ID N0:1:
Met Ser Lys Gly Gtu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Vai Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Mei Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 1le Glu Leu Lys Gly Ile Asp Phe Lys Gtu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn 130 135 i40 _ Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gty Ile Lys Val Asn Phe Lys Ile Arg His Asn lle Gtu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro 180. 185 190 Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys (2) INFORMATION FOR SEA ID N0:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 238 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID N0:2:
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Giu Leu Asp Gly Asp Vai Asn Gly His Lys Phe Ser Val Ser Giy Giu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Vat Thr Thr Phe Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ata Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Vat Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly t45 i50 155 160 Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Vat Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser AIa Leu Ser lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
[SEQ ID N0:2] lead to formation of products having markedly different excitation S and emission spectra from corresponding products derived from wild-type GFP.
Visibly distinct colors and/or increased intensities of emission make these products useful in a wide variety of contexts, such as tracking of differential gene expression and protein localization.
Brief Description of the Drawings The invention may be better understood with reference to the accompanying drawings, in which:
Fig. 1 compares different versions of GFP by gel electrophoresis and Coomassie blue staining;
Fig. 2 illustrates a proposed biosynthetic scheme for GFP;
Figs. 3a and 3b illustrate the excitation and emission spectra of wild-type and a first group of mutant GFPs;
Figs. 4a and 4b illustrate the excitation and emission spectra of wild-type and a second group of mutant GFPs;
Fig. 5 illustrates the rate of fluorophore formation in the wild-type GFP and the Ser 65-~Thr mutant;
Figs. 6a and 6b illustrate the, behavior of wild-type GFP and the Ser 65~Thr mutant, respectively, upon progressive irradiation with ultraviolet light; and Fig. 7 illustrates fluorescence excitation and emission spectra of a third group of GFP mutants.
Detailed Description of the Invention GFP was expressed in E. coli under the control of a T7 promoter for quantitative analysis of the properties of the recombinant protein. Gel electrophoresis under denaturing conditions showed protein of the expected molecular weight (27 kDa) as a dominant band (Fig. 1), which could be quantified simply by densitometry of staining with Coomassie blue. Soluble recombinant GFP proved to have identical spectra and the same or even slightly more fluorescence per mole of protein as GFP purified from Aequorea victoria, showing that the soluble protein in E. coli undergoes correct folding and oxidative cyclization with as high an efficiency as in the jellyfish.
The bacteria also contained inclusion bodies consisting of protein indistinguishable from jellyfish or soluble recombinant protein on denaturing gels (Fig. ~). However, this material was completely non-fluorescent, lacked the visible absorbance bands of the chromophore, and could not be made fluorescent even when solubilized and subjected to protocols that renature GFP [Ward, W.
W.
& Bokman, S. H., Biochemistry 21, 4535-4540 (1982); Surpin, M. A. & Ward, W. W., Photochem. Photobiol. 49, Abstract, 25S (1989)J. Therefore, protein from inclusion bodies seemed permanently unable to generate the internal chromophore. An interesting intermediate stage in protein maturation could be generated by growing the bacteria anaerobically. The soluble protein again looked the same as GFP on denaturing gels (Fig. 1) but was non-fluorescent. In this case, fluorescence gradually developed after admission of air, even when fresh protein synthesis was blocked using puromycin and tetracycline. Evidently, the soluble non-fluorescent protein synthesized under anaerobic conditions was ready to become fluorescent once atmospheric oxygen was readmitted. The fluorescence per protein molecule approached its final asymptotic value with a single-exponential time course and a rate constant of 0.24 ~ .06 hr~' (at 22°C) measured either in intact cells with protein-synthesis inhibitors or in a lysate in which the soluble proteins and cofactors were a thousand fold more dilute. Such pseudo-first order kinetics strongly suggest that no enzymes or cofactors are necessary for the final step of fluorophore formation in GFP.
It has thus been determined that formation of the final fluorophore requires molecular oxygen and proceeds in wild-type protein with a time constant of - 4 h at 22°C and atmospheric p02. This was independent of dilution, implying that the oxidation does not require enzymes or cofactors.
A molecular interpretation is presented in Fig. 2. If the newly translated apoprotein (top left) evades precipitation into inclusion bodies, the amino group of Gly 67 might cyclize onto the carbonyl group of Ser 65 to form an imidazolidin-one, where the process would stop (top center) if OZ is absent. The new N=C
S double bond would be expected to promote dehydrogenation to form a conjugated chromophore; imidazolidin-5-ones are indeed known to undergo autoxidative formation of double bonds at the 4-position [Kjaer, A. Acta Chem. Scand. 7, 1030-1035 (1953); Kidwai, A. R. & Devasia, G. M. J. Org. Chem. 27, 4527-4531 (1962)], which is exactly what is necessary to complete the fluorophore (upper right). The protonated and deprotonated species (upper and lower right) may be responsible for the 395 and 470-475 nm excitation peaks, respectively. The excited states of phenols are much more acidic than their ground states, so that emission would come only from a deprotonated species.
The Aequorea GFP cDNA was subjected to random mutagenesis by hydroxylamine treatment or polymerase chain reaction. Approximately six thousand bacterial colonies on agar plates were illuminated with alternating 395 and 475 nm excitation and visually screened for ~;.:ed excitation properties or emission colors.
According to a first aspect of the present invention, modifications are provided which result in a shift in the ratio of the two excitations peaks of the product after oxidation and cyclization relative to the wild type. Three mutants were found with significant alterations in the ratio of the two main excitation peaks (Table I). The mutations were sequenced and recombined with the wild-type gene in different ways to eliminate neutral mutations and assign the fluorescence effects to single amino acid substitutions, except for H9 where two neighboring mutations have not yet been separated. They all lay in the C terminal part of the protein (Table I), remote in primary sequence from the chromophore formed from residues 65-67.
These and other modifications are defined herein with reference to the amino acid sequence [SEQ ID N0:2) encoded by the reported cDNA [SEQ ID
NO:1]; the first amino acid identified is the one found at the indicated location in the reported sequence, while the second indicates the substitution found in the modified form. The fluorescent product derived from a wild-type or modified GFP
polypeptide sequence is no longer strictly speaking a simple polypeptide after oxidation and cyclization; however, reference is sometimes made for sake of simplicity herein to the polypeptide (e.g., "wild-type GFP" or "modified GFP") where what is intended would be obvious from the context. Compared with wild-type GFP, H9 (Ser 2D2~Phe, Thr 203~I1e) had increased fluorescence at 395 nm excitation; P9 (Ile 167~Va1) and P11 (Ile 167-~Thr) were more fluorescent at nm excitation.
One possibility for these spectral perturbatrons in P9 and P11 is that the mutations at Ile 167 shift a positive charge slightly closer to the phenolic group of the fluorophore; this should both increase the percentage of phenolic anion, which is probably the species responsible for the 470-475 nm excitation peak, and shift the emission peak hypsochromically. However, the hypothesized ionizable phenolic group would have to be buried inside the protein at normal pH, because the ratio of 471 to 396 nm peaks in the mutants could not be further affected by external pH until it was raised to 10, just below the threshold for denaturation.
The pH-sensitivity of wild-type GFP is similar [Ward, W. W. et al., Photochem.
Photobiol. 35, 803-808 (1982)].
According to another aspect of the invention, a mutant P4 (Tyr 66->His) was identified which was excitable by ultraviolet and fluoresced bright blue in contrast to the green of wild type protein. The excitation and emission maxima were hypsochromically shifted by 14 and 60 nm respectively from those of wild-type GFP: The mutated DNA was sequenced and found to contain five amino acid substitutions, only one of which proved to be critical: replacement of Tyr 66 in the center of the chromophore by His (corresponding to a change in the GFP cDNA
sequence [SEQ ID N0:1] at 196-198 from TAT to CAT).
The surprising tolerance for substitution at this key residue prompted further site-directed mutagenesis to Trp and Phe at this position. Trp gave excitation and emission wavelengths intermediate between Tyr and His (Table I) but was only weakly fluorescent, perhaps due to inefficiency of folding or chromophore formation due to steric considerations. Phe gave weak fluorescence with an excitation maximum at 358 nm and an emission maximum at 442 nm.
Accordingly, pursuant to this aspect of the invention modified GFP proteins which fluoresce at different wavelengths (preferably, different by at least 10 nm and more S preferably, by at least 50 nm) relative to the native protein are provided, for example, those wherein Tyr 66 is replaced by Phe, His or Trp.
In a further embodiment of this aspect of the invention, a double mutant Y66H, YI45F was identified which had almost the same wavelengths as the single mutant Y66H but almost twice the brightness, due mainly to a higher quantum efficiency of fluorescence. The double mutant also developed its fluorescence during overnight growth, whereas the single mutant required several days.
In accordance with further embodiments of this aspect of the invention, a first round of mutagenesis to increase the brightness of Y66~V yielded ' M153T/V163A/N212K as additional substitutions. This mutant was subjected to another round of mutagenesis, resulting in two further sets, N146I and I123V/Y145H/H148R (Table II). The quantum efficiency of these mutants is now comparable to wild-type GFP. The clustering of the substitutions in residues to 163 suggest that those residues lie relatively close to the chromophore and that reductions in the size of their side chains might be compensating for the larger size of tryptophan compared to tyrosine.
Pursuant to yet another aspect of the present invention, modified GFP
proteins are provided which provide substantially more intense fluorescence per molecule than the wild type protein. Modifications at Ser 65 to Ala, Leu, Cys, Val, Ile or Thr provide proteins with red-shifted and brighter spectra relative to the native protein. In particular, the Thr mutant (corresponding to a change in the GFP cDNA sequence [SEQ ID NO:1) at I93-195 from TCT to ACT) and Cys mutant (corresponding to a change in the GFP cDNA sequence [SEQ ID NO:1] at 193-195 from TCT to TGT) are about six times brighter than wild type when excited at the preferred long-wavelength band above 450 nm. As a consequence, these modified proteins are superior to wild type proteins for practically all applications. Further, the brightness of these modified proteins matches the brightness reported in the literature for Renilla GFP; thus, these proteins clearly obviate the objections to the dimness of Aequorea GFP. In fact, it is speculated that the chromophores in these modified proteins may exhibit the optimum brightness which could be achieved with a general structure derived from the Aequorea GFP chromophore. In particular, these mutations provide products exhibiting one or more of the following salient characteristics which distinguish them clearly over the corresponding product from a wild-type GFP: reduced efficiency of excitation by wavelengths between about 350 and 420 nm; enhanced excitation and emission efficiency when excited with wavelengths longer than about 450 nm; increased resistance to light-induced shifts in the excitation spectrum; and faster kinetics of fluorophore generation. In contrast, mutations to Trp, Arg, Asn, Phe and Asp did not provide improved brightness.
Mutagenesis of S65T to shift its wavelengths further to the red yielded M153A/K238E (Table II) as the GFP variant with the longest-wavelength excitation maximum yet described, 504 nm vs. 490 nm for S65T. Surprisingly, the emission peak hardly changed (514 nm vs. 511 nm), so that the separation between the excitation and emission peaks (Stokes' shift) is extremely narrow, only 10 nm.
This is one of the smallest values reported for any fluorophore in aqueous solution at room temperature. As in the Y66W series, M153 seems to be influential. It is doubtful that K238E is important, because this substitution has been found to be without effect in other mutants.
As would be readily apparent to those working in the field, to provide the desired fluorescent protein it would not be necessary to include the entire sequence of GFP. In particular, minor deletions at either end of the protein sequence are expected to have little or no impact on the fluorescence spectrum of the protein.
Therefore, by a mutant or wild-type GFP sequence for purposes of the present invention are contemplated not only the complete polypeptide and oligonucleotide sequences discussed herein, but also functionally-equivalent portions thereof (i.e., portions of the polypeptide sequences which exhibit the desired fluorescence properties and oligonucleotide sequences encoding these polypeptide sequences).
For example, whereas the chromophore itself (position 65-67) is obviously crucial, the locations of known neutral mutations suggest that amino acids 76-115 are less critical to the spectroscopic properties of the product. In addition, as would be immediately apparent to those working in the field, the use of various types of fusion sequences which lengthen the resultant protein and serve some functional purpose in the preparation or purification of the protein would also be routine and are contemplated as within the scope of the present invention. For example, it is common practice to add amino acid sequences including a polyhistidine tag to facilitate purification of the product proteins. As such fusions do not significantly alter the salient properties of the molecules comprising same, modified GFPs as described herein including such fusion sequences-at either end thereof are also clearly contemplated as within the scope of the present invention.
Similarly, in addition to the specific mutations disclosed herein, it is well understood by those working in the field that in many instances modifications in particular locations in the polypeptide sequence may have no effect upon the properties of the resultant polypeptide. Unlike the specific mutations described in detail herein, other mutations provide polypeptides which have properties a o, r , c ~ ~rs ic' r ~
..ss~:~tially a sub~t .ntially ~,_d ~i _g,~ah?~° ~_w: n ~_:~~se of the spec'fic polypeptides disclosed herein. For example, the following substitutions have been found to be neutral (i.e., have no significant impact on the properties of the product):
Lys 3-~Arg; Asp 76-~Gly; Phe 99-~Ile; Asn 105-~Ser; GIu I 15-jVal; Thr 225-~Ser;
and Lys 238~G1u. These equivalent polypeptides (and oligonucleotide sequences encoding these polypeptides) are also regarded as within the scope of the present invention. In general, the polypeptides and oligonucleotide sequences of the present invention (in addition to containing at least one of the specific mutations identified herein) will be at least about 85 % homologous, more preferably at least about 90% homologous, and most preferably at least about 95% homologous, to the wild-type GFP described herein. Because of the significant difference in properties observed upon introduction of the specified modifications into a GFP
sequence, the presence of the specified modifications relative to the corresponding reported sequence for wild-type GFP [SEQ ID N0:2] are regarded as central to the invention.
The oligonucleotide sequences of the present invention are particularly useful in processes for labelling polypeptides of interest, e.g., by the construction of genes encoding fluorescent fusion proteins. Fluorescence labeling via gene fusion is site-specific and eliminates the present need to purify and label proteins 5 in vitro and microinject them into cells. Sequences encoding the modified GFPs of the present invention may be used for a wide variety of purposes as are well known to those working in the field. For example, the sequences may be employed as reporter genes for monitoring the expression of the sequence fused thereto; unlike other reporter genes, the sequences require neither substrates nor 10 cell disruption to evaluate whether expression has- be achieved. Similarly, the sequences of the present invention may be used as a means to trace lineage of a gene fused thereto during the development of a cell or organism. Further, the sequences of the present invention may be used as a genetic marker; cells or organisms labeled in this manner can be selected by, e.g.; fluorescence-activated cell sorting. The sequences of the present invention may also be used as a fluorescent tag to monitor protein expression in vivo, or to encode donors or acceptors for fluorescence resonance energy transfer. Other uses for the sequences of the present invention would be readily apparent to those working in the field, as would appropriate techniques for fusing a gene of interest to an oligonucleotide sequence of the present invention .in the proper reading frame and in a suitable expression vector so as to achieve expression of the combined sequence.
The availability of several forms of GFP with such different spectral properties should facilitate two-color assessment of differential gene expression, developmental fate, or protein trafficking. For example, if one wanted to screen for a drug that is specific to activate expression of gene A but not gene B, one could fuse the cDNA for one color of GFP to the promoter region of gene A and fuse the cDNA for another color to the promoter region of gene B. Both constructs would be transfected into target cells and the candidate drugs could be assayed to determine if they stimulate fluorescence of the desired color, but not fluorescence of the undesired color. Similarly, one could test for the simultaneous _- 11 expression of both A and B by searching for the presence of both colors simultaneously.
As another example, to examine the precise temporal or spatial relationship between the generation or location of recombinant proteins X and Y within a cell or an organism, one could fuse genes for different colors of GFP to the genes for proteins X and Y, respectively. If desired, DNA sequences encoding flexible oligopeptide spacers could be included to allow the linked domains to function autonomously in a single construct. By examining the appearance of the two distinguishable colors of fluorescence in the very same cells or organisms, one could compare and contrast the generation or location of the proteins X and Y
with much greater precision and less biological variability than if one had to compare two separate sets of cells or organisms, each containing just one color of GFP
fused to either protein X or Y. Other examples of the usefulness of two colors would be obvious to those skilled in the art.
The further mutations to brighten the Y66H and Y66W variants of GFP
enhance the possibility of using two or three colors of fluorescent protein to track differential gene expression, protein localizations or cell fates. For example, mutants P4-3 (Y66H1YI45F), W7 (Y66W/NI46I/M153T/V163A/N212K) and S65T can all be distinguished from each other. P4-3 is specifically detected by exciting at 290-370 nm and collecting emission at 420-460 nm. W7 is specifically detected by exciting at 410-4.57 nm and collecting emission at 465-495 nm.
is specifically detected by exciting at 483-493 nm and collecting emission at wavelengths greater than 510 nm. Bacteria carrying these three proteins are readily discriminated under a microscope using the above wavelength bandpass filters.
The chromophore in GFP is well buried inside the rest of the protein, so much of the dimness of the original point mutants was presumably due to steric mismatch between the substituted amino acid and the cavity optimized for tyrosine.
The location of the beneficial mutations implies that residues 145-163 are probably close to the chromophore. The M153A/S65T mutant has the longest wavelengths and smallest Stokes' shift of any known fluorescent protein that does not use a cofactor.
The invention may be better understood with reference to the accompanying examples, which are intended for purposes of illustration only and should not be construed as in any sense limiting the scope of the invention as defined by the claims appended hereto.
Example 1 The coding region of GFP clone 10.1 [Prasher et al. (1992), supra] was amplified by PCR to create NdeI and BamHI sites at the 5' and 3' ends, respectively, and was cloned behind the T7 promoter of pGEMEX2 (Promega) replacing most of the T7 gene 10. The resulting plasmid was transformed into the strain JM109(DE3) (Promega Corp., Madison, WI), and high level expression was achieved by growing the cultures at 24°C to saturation without induction by IPTG.
To prepare soluble extracts, 1.5 ml cell suspension were collected, washed and resuspended in 150 ~cl 50 mM Tris/HCI, pH 8.0, 2 mM EDTA. Lysozyme and DNAse I were added to 0.2 mg/ml and 20 ~g/ml, respectively, and the samples were incubated on ice until lysis occurred (1-2 hours). The lysates were then clarified by centrifuging at 12,000 x g for 15 minutes. Inclusion bodies were obtained as described in the literature [Sambrook, J. et al. in Molecular Cloning:
A Laboratory Manual Vol. 2, 17.37-17.41 (Cold Spring Harbor Press, Cold Spring Harbor, New York, 1989)] .
As illustrated in Fig. 1, soluble extracts of E. coli expressing GFP show a predominant band which is absent in extracts from control cells and has the same electrophoretic mobility as native GFP isolated from the jellyfish A.
victoria.
Inclusion bodies of expressing cells consist mainly of non-fluorescent GFP
which has the same mobility as soluble GFP. Non-fluorescent soluble GFP of anaerobically grown cultures is also a major band with correct mobility.
Soluble extracts of the mutated clones H9, P9, P11 and P4 again contain a dominant protein with essentially the same molecular weight.
Random mutagenesis of the GFP cDNA was done by increasing the error rate of the polymerise chain reaction with 0.1 mM MnCl2, 50 ~cM dATP and 200 tcM of dGTP, dCTP, and dTTP [Muhlrad, D. et al., Yeast 8, 79-82 (1992)]. The product was ligated into pGEMEX2 and subsequently transformed into JM109(DE3). Colonies on agar were visually screened for different emission colors and ratios of brightness when excited at 475 vs. 395 nm.
5 Figs. 3a and 3b illustrate the excitation and emission spectra of wild-type and mutant GFPs. In Figs. 3a and 3b, wild-type; - - S202F,T203I; - -- I167T; ----- Y66W; - ~ - ~ Y66H. Samples were soluble fractions from E.
coli expressing the proteins at high level, except for Y66W, which was obtained in very low yield and measured on intact cells. Autofluorescence was negligible 10 for all spectra except those of Y66W, whose excitation spectrum below 380 nm may be contaminated by autofluorescence. Excitation and emission spectra were measured with 1.8 nm bandwidths and the non-scanning wavelength set to the appropriate peak. Excitation spectra were corrected with a rhodamine B quantum counter, while emission spectra (except for Y66W) were corrected for 15 monochromator and detector efficiencies using manufacturer-supplied correction spectra. All amplitudes have been arbitrarily normalized to a maximum value of 1Ø A comparison of brightness at equal protein concentrations is provided in Table I.
Table I
i Characteristics of mutated vs. wild-type GFP
Variant Mutation Excitation Emission Relative' Maxima (nm)= Maxima (nm)b ~ Fluorescence Wild type none 396 (476) 508 (503) ( =100 % ) H9 Ser 202->Phe, 398 511 117 % ~' j Thr 203-IIe P9 _ _ Ile 167--~Val 471 (396) 502 (507) 166 % ' _Pli _ Ile 167-~Thr_ 471 (396) 502 (507) 18g%' P4 Tyr 66-->His 382 448 57 % f W Tyr 66->Tip 458 480 . n.d.
aValues in parentheses are lower-amplitude peaks.
Primary values were observed when exciting at the main excitation peak; values in parentheses were observed when illuminating at the lower-amplitude excitation . peak.
Equal amounts of protein were used based on densitometry of gels stained with Coomassie Blue (Fig. 1).
j ~ Emission .maxima of spectra recorded at excitation 395 nm were compared.
'Emission maxima.of spectra recorded at excitation 475 rlm were compared.
(Emission spectrum of P4 recorded at 378 nm excitation was integrated and compared to the integrated emission spectrum of wild type recorded at 475 nm v excitation; both excitation and emission characteristics were corrected.
', Example 2 -Oligonucleotide-directed mutagenesis~at the codon for Ser-65 of GFP cDNA
.
y was performed by the literature method [Kunkel, T.A. (1985) Proc. Natl.
Acad.
Sci. USA 82, 488J using the Muta-Gene Phagemid in Yitro Mutagenesis Kit version 2, commercially available from Bio-Rad, Richmond, CA. The method employs i a bacterial host strain deficient for dUTPase (dut) and uracil-N-glycosylase (ung), which results in an occasional substitution of uracil for thymine in newly-i synthesized DNA. When the uracil-containing DNA is used as a wild-type template for oligonucleotide-directed in vitro mutagenesis, the complementary (mutant) strand can be synthesized in the presence of deoxynucleotides, ligase and polymerase using the mutagenic oligonucleotide to prime DNA synthesis; the Version 2 kit utilizes unmodified T7 DNA polymerase to synthesize the complementary strand. When the heteroduplex molecule is transformed into a host with an active uracil-N-glycosylase (which cleaves the bond between the uracil base 5 and the ribose molecule, yielding an apyrimidic site), the uracil-containing wild-type strand is inactivated, resulting in an enrichment of the mutant strand.
The coding region of GFP cDNA was cloned into the BamHl site of the phagemid pRSETB from Invitrogen (San Diego, CA). This construct was introduced into the dut, ung double mutant E. coli strain CJ236 provided with the 10 Muta-Gene kit and superinfected with helper phage VCSM13 (Stratagene, La Jolla, CA) to produce phagemid particles with single-stranded DNA containing some uracils in place of thymine. The uracil-containing DNA was purified to serve as templates for in vitro synthesis of the second strands using the mutagenic nucleotides as primers. The DNA hybrids were transformed into the strain 15 XLlblue (available from Stratagene), which has a functional uracil-N-glycosylase;
this enzyme inactivates the parent wild-type DNA strand and selects for mutant clones. DNA of several colonies were isolated and checked for proper mutation by sequencing.
To express the mutant proteins, the DNA constructs obtained by mutagenesis were transformed into E. coli strain BL21 (DE3)LysS (Novagen, Madison, WI), which has a chromosomal copy of T7 polymerase to drive expression from the strong T7 promotor. At room temperature 3 ml cultures were grown to saturation (typically, overnight) without induction. Cells from 1 ml of culture were collected, washed and finally resuspended in 100 ~1 of 50 mM Tris pH 8.0, 300 mM NaCI. The cells were then lysed by three cycles of freeze/thawing (liquid nitrogen/30° C water bath). The soluble fraction was obtained by pelletting cell debris and unbroken cells in a microfuge.
To facilitate purification of the recombinant proteins, the vector used fuses a histidine tag (6 consecutive His) to the N-terminus of the expressed proteins .
The strong interaction between histidine hexamers and Ni2+ ions permitted purification of the proteins by NI-NTA resin (available commercially from Qiagen, Chatsworth, CA). Microcolumns (10 ~1 bed volume) were loaded with 100 ~,l soluble extract (in 50 mM Tris pH 8.0, 300 mM NaCI), washed with 10 bed volumes of the same buffer and with 10 volumes of the buffer containing 20 mM
imidazole. The recombinant proteins were then eluted with the same buffer containing 100 mM imidazole.
Aliquots of the purified mutant GFP proteins were run along with wild-type GFP on a denaturing polyacrylamide gel. The gel was stained with Coomassie blue and the protein bands were quantified by scanning on a densitometer.
Based on these results, equal amounts of each version of protein were used to run fluorescence emission and excitation spectra.
Figs. 4a and 4b compare the excitation and emission spectra of wild-type and Ser 65 mutants. In Fig. 4a, -- S65T; - - S65A; - - - S65C; - ~ -wild-type (emission at 508 nm). In Fig. 4B, -- S65T; - - S65A; - - - S65C;
~ ~ ~ wild-type (excitation at 395 nm); - ~ - ~ wild-type (excitation at 475 nm).
Excitation and emission spectra were measured with 1.8 nm bandwidths and the non-scanning wavelength set to the appropriate peak. As is apparent from Fig.
4b, all three mutants exhibited substantially higher intensity of emission relative to the wild-type protein.
Fig. 5 illustrates the rates of fluorophore formation in wild-type GFP and in the Ser 6~-~Thr mutant. E. coli expressing either wild-type or mutant GFP
were grown anaerobically. At time=0, each sample was exposed to air; further growth and protein synthesis were prevented by transferring the cells to nutrient-free medium also containing sodium azide as a metabolic inhibitor. Fluorescence was subsequently monitored as a function of time. For each culture, the fluorescence intensities are expressed as a fraction of the final fluorescence intensity obtained at t = 18 to 20 hours, after oxidation had proceeded to completion. From Fig.
5, it is apparent that development of fluorescence proceeds much more quickly in the mutant than in wild-type GFP, even after normalization of the absolute brightnesses (Figs. 4a and 4b). Therefore, when the development of GFP fluorescence is used as an assay for promotor activation and gene expression, the mutant clearly gives a more rapid and faithful measure than wild-type protein.
Figs. 6a and 6b illustrate the behavior of wild-type GFP and the Ser 65->Thr mutant, respectively, upon progressive irradiation with ultraviolet light.
Numbers indicate minutes of exposure to illumination at 280 nm; intensity was the same for both samples. Wild-type GFP (Fig. 6a) suffered photoisomerization, as shown by a major change in the shape of the excitation spectrum. Illumination with broad band (240-400 nm) UV caused qualitatively similar behavior but with less increase of amplitude in the 430-500 nm region of the spectrum: The photoisomerization was not reversible upon standing in the dark. This photoisomerization would clearly be undesirable for most uses of wild-type GFP, I0 because the protein rapidly loses brightness when excited at its main peak near 395 nm. The mutant (Fig. 6b) showed no such photoisomerization or spectral shift.
Examvle 3 GFP cDNAs encoding for Tyr6C»His (Y66H), Tyr66-~Trp (Y66W), or Ser65-~Thr (S65T) were separately further mutagenized by the polymerise chain reaction and transformed into E. coli for visual screening of colonies with unusual intensities or colors. Isolation, spectral characterization (Table II and Fig.
7), and DNA sequencing yielded several additional useful variants.
Random mutagenesis of the gfp cDNA was done by increasing the error rate of the PCR with 0.1 mM MnCl2 and unbalanced nucleotide concentrations.
The GFP mutants S65T, Y66H and Y66W had been cloned into the BamHl site of the expression vector pRSETB (Invitrogen), which includes a T7 promoter and <. a polyhistidine tag. The GFP coding region (shown in bold) was flanked 'by the following 5' and 3' sequences: S'-G GAT CCC CCC GCT GAA TTC ATG ...
AAA TAA TAA GGA TCC-3'. The 5' primer for the mutagenic PCR was the T7 primer matching the vector sequence; the 3' primer was 5'-GGT AAG CTT TTA
TTT GTA TAG TTC ATC CAT GCC-3', specific for the 3' end of GFP, creating a HindIII restriction site next to the stop codon. Amplification was over 25 cycles TM
(1 min at 94' C, 1 min 52' C, I min 72' C) using the AmpliTaq polymerise from ~
Perkin Elmer. Four separate reactions were run in which the concentration of a different nucleotide was lowered from 200 ZcM to 50 ~M. The PCR products were combined, digested with BamHI and HindIII and ligated to the pRSETB cut with BamHI and HindIII. The ligation mixture was dialyzed against water, dried and subsequently transformed into the bacterial strain BL21(DE3) by electroporation (50 ~cl electrocompetent cells in 0.1 cm cuvettes, 1900 V, 200 ohm, 25 ~F).
Colonies on agar were visually screened for brightness as previously described herein. The selected clones were sequenced with the Sequenase version 2.0 kit from United States Biochemical Cultures with freshly transformed cells were grown at 37' C to an optical density of 0.8 at 600 nm, then induced with 0.4 mM isopropylthiogaIactoside overnight at room temperature. Cells were washed in PBS pH 7.4, resuspended in 50 mM Tris pH 8,.0, 300 mM~ NaCI and lysed in a French press. The polyhistidine-tagged GFP proteins were purified from cleared lysates on nickel-chelate columns (Qiagen) using 100 mM imidazole in the above buffer to elute the protein.
Excitation spectra were obtained by collecting emission at the respective peak wavelengths and were corrected by a Rhodamine B quantum counter.
Emission spectra were likewise measured at the respective excitation peaks and were corrected using factors from the fIuorometer manufacturer (Spex Industries, Edison, NJ). In cleavage experiments emission spectra were recorded at excitation 368 nm. For measuring molar extinction eoe~cients, 20 to 30 beg of protein were used in 1 ml of PBS pH 7.4. Quantum yields of wild-type GFP, S65T, and P4-1 mutants were estimated by comparison with fluorescein in 0.1 N NaOH as a standard of quantum yield 0.91 [ed. Miller, J.N., Standards in Fluorescence Spectrometry (Chapman and Hall, New York, 1981)]. Mutants=P4 and P4-3 were likewise compared to 9-amino-acridine in water (quantum yield 0.98). W2 and W7 were compared -to both standards, which fortunately gave concordant results.
Fig. 7 illustrates the fluorescence excitation and emission spectra of different GFP mutants. All spectra were normalized to a maximal value of 1.
Each pair of excitation and emission spectrum is depicted by a distinct Iine style.
The fluorescence properties of the obtained GFP mutants are reported in Table II.
Table II
Fluorescence properties of GFP
mutants Clone Mutations Excitation EmissionExtinct.Coeff.Quantum max (nm) max (nm) (M-lcai 1) yield P4-3 Y66H 381 445 14,000 0.38 W7 Y66W 433 (453) 475 (501)18,000 (17,100)0.67 W2 Y66W 432 (453) 480 10,000 (9,600)0.72 P4-1 S65T 504 (396) 514 14,500 (8;600)0.54 SECUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: Tsien, Roger Y.
Heim, Roger (ii) TITLE OF INVENTION: MODIFIED GREEN FLUORESCENT PROTEINS
(iii) NUMBER OF SECUENCES: 2 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Robbins, Berliner 8 Carson (B) STREET: 201 North Figueroa Street, Suite 500 (C) CITY: Los Angeles (D) STATE: California (E) COUNTRY: USA
(F) ZIP: 90012 (v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy desk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Spitals, John P.
(B) REGISTRATION NUMBER: 29,215 (C) REFERENGE/DOCKET NUMBER: 1279-178 (ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (213) 977-1001 (B) TELEFAX: (213) 977-1003 (2) INFORMATION FOR SEO ID N0:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH:-717 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..717 (xi) SEQUENCE DESCRIPTION: SEO ID N0:1:
Met Ser Lys Gly Gtu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Vai Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Mei Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 1le Glu Leu Lys Gly Ile Asp Phe Lys Gtu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn 130 135 i40 _ Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gty Ile Lys Val Asn Phe Lys Ile Arg His Asn lle Gtu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro 180. 185 190 Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys (2) INFORMATION FOR SEA ID N0:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 238 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID N0:2:
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Giu Leu Asp Gly Asp Vai Asn Gly His Lys Phe Ser Val Ser Giy Giu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Vat Thr Thr Phe Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ata Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Vat Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly t45 i50 155 160 Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Vat Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser AIa Leu Ser lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
Claims (17)
1. An organism comprising a cell, said cell comprising a nucleic acid having a sequence encoding a fluorescent protein having a polypeptide sequence which is modified by amino acid substitution compared to the polypeptide sequence of wild type GFP
shown in SEQ. ID. NO:2, whereby said modification of the polypeptide sequence results in a different excitation and / or emission spectrum, the emission spectrum showing visible distinct colors and / or increased intensities of emission compared to wild-type GFP, or functionally equivalent portions or fusions thereof.
shown in SEQ. ID. NO:2, whereby said modification of the polypeptide sequence results in a different excitation and / or emission spectrum, the emission spectrum showing visible distinct colors and / or increased intensities of emission compared to wild-type GFP, or functionally equivalent portions or fusions thereof.
2. A cell comprising a nucleic acid having a sequence encoding a fluorescent protein having a polypeptide sequence which is modified by amino acid substitution compared to the polypeptide sequence of wild type GFP shown in SEQ. ID. NO:2, whereby said modification of the polypeptide sequence results in a different excitation and/or emission spectrum, the emission spectrum showing visible distinct colors and/or increased intensities of emission compared to wild-type GFP, or functionally equivalent portions or fusions thereof.
3. The cell of claim 1 or 2, wherein the fluorescent protein exhibits increased fluorescence at a shorter wavelength peak of the two main excitation peaks.
4. The cell of claim 3, wherein the fluorescent protein exhibits increased fluorescence at a shorter wavelength peak of the two main excitation peaks.
5. The cell of claim 4, wherein the fluorescent protein comprises a replacement of Ser 202 by Phe and a replacement of Thr 203 by Ile.
6. The cell of claim 3, wherein the fluorescent protein exhibits increased fluorescence at a longer wavelength peak of the two main excitation peaks.
7. The cell of claim 6, wherein the fluorescent protein comprises a replacement of Ile 167 by Val or Thr.
8. The cell of claim 6, wherein the fluorescent protein comprises a replacement of Ser 65 by Thr, a replacement of Met 153 by Ala, and a replacement of Lys 238 by Glu.
9. The cell of claim 6, wherein the fluorescent protein exhibits fluorescence at a shorter wavelength than the corresponding product derived from wild-type GFP.
10. The cell of claim 6, wherein the fluorescent protein comprises a replacement of Tyr 66 by His or Phe.
11. The cell of claim 6, wherein the fluorescent protein comprises a replacement of Tyr 66 by His and a replacement of Tyr 145 by Phe.
12. The cell of claim 6, wherein the fluorescent protein comprises a replacement of Tyr 66 by Trp, a replacement of Asn 146 by Ile, a replacement of Met 153 by Thr, a replacement of Val 163 by Ala, and a replacement of Asn 212 by Lys.
13, The cell of claim 6, wherein the fluorescent protein comprises a replacement of Tyr 66 by Trp, a replacement of Ile 123 by Val, a replacement of Tyr 145 by His, a replacement of His 148 by Arg, a replacement of Met 153 by Thr, a replacement of Val 163 by Ala, and a replacement of Asn 212 by Lys.
14. The cell of claim 6, wherein the fluorescent protein exhibits enhanced emission relative to the corresponding product derived from wild-type GFP.
15. The cell of claim 6, wherein the fluorescent protein comprises a replacement of Ser 65 by an amino acid selected from the group consisting of Ala, Cys, Thr, Leu, Val and Ile.
16. The cell of claim 15, wherein the fluorescent protein comprises a replacement of Ser 65 by an amino acid selected from the group consisting of Cys and Thr.
17. A cell according to any one of claims 2-15 wherein said cell is a bacterial cell.
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US08/337,915 US5625048A (en) | 1994-11-10 | 1994-11-10 | Modified green fluorescent proteins |
| US08/337,915 | 1994-11-10 | ||
| CA002343586A CA2343586C (en) | 1994-11-10 | 1995-11-13 | Modified green fluorescent proteins |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CA002343586A Division CA2343586C (en) | 1994-11-10 | 1995-11-13 | Modified green fluorescent proteins |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CA2510103A1 true CA2510103A1 (en) | 1996-08-08 |
Family
ID=35253773
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CA002510103A Abandoned CA2510103A1 (en) | 1994-11-10 | 1995-11-13 | Modified green fluorescent proteins |
Country Status (1)
| Country | Link |
|---|---|
| CA (1) | CA2510103A1 (en) |
-
1995
- 1995-11-13 CA CA002510103A patent/CA2510103A1/en not_active Abandoned
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| AU702205B2 (en) | Modified green fluorescenct proteins | |
| US5777079A (en) | Modified green fluorescent proteins | |
| Heim et al. | Wavelength mutations and posttranslational autoxidation of green fluorescent protein. | |
| DE69713059T2 (en) | TANDEM FLUORESCENT PROTEIN CONSTRUCTS | |
| Ward | Biochemical and physical properties of green fluorescent protein | |
| Inouye et al. | Evidence for redox forms of the Aequorea green fluorescent protein | |
| US7015310B2 (en) | Oxidation reduction sensitive green fluorescent protein variants | |
| US20060003420A1 (en) | Monomeric and dimeric fluorescent protein variants and methods for making same | |
| EP1494697B1 (en) | Monomeric and dimeric fluorescent protein variants and methods for making same | |
| US20050196768A1 (en) | Monomeric and dimeric fluorescent protein variants and methods for making same | |
| US20020177189A1 (en) | Novel fluorescent proteins | |
| EP1954713A2 (en) | Modified green fluorescent proteins and methods for using same | |
| CA2343586C (en) | Modified green fluorescent proteins | |
| CN100384871C (en) | Isolated fluorescent protein (CGFP) from CLYTIA GREGARIA and its application | |
| CA2510103A1 (en) | Modified green fluorescent proteins | |
| JP2005511027A (en) | Fluorescent protein | |
| DE29522103U1 (en) | Green fluorescent, modified proteins | |
| US20040072995A1 (en) | Novel fluorescent proteins | |
| HK1088016B (en) | Isolated fluorescent protein from clytia gregaria cgfp and use thereof |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| EEER | Examination request | ||
| FZDE | Dead |