LOCUS pLeu100 7418 bp DNA circular 20-JUL-2008 FEATURES Location/Qualifiers misc_feature complement(8..712) /function="rRNA spacer" /note="Complement of 36 to 694 of accession number Z46903 and complement of 809632 to 810308 on chromosome 1 (AL929603) (the entire 'intergenic region' on chromosome 1 is about 10 kb." 3'UTR 3921..4632 /function="ALD 3' UTR" /note="long aldolase 3' UTR from pHD103 for optimized BF expression (about 6x higher than the short ALD UTR used in pLew79 & pLew82)." CDS 5563..6420 /function="LACTAMASE" 3'UTR complement(744..1021) /function="ACT 3' UTR" CDS complement(1082..1456) /note="BLE" promoter complement(1599..1620) /function="T7 Promoter" /note="T7 promoter, full-strength, constitutive. Note that there are no T7 terminators downstream of T7 transcription unit in this construct, which could be a problem in some insertion sites!" promoter 1629..1877 /function="GPEET Promoter" /note="Best match with GPEET2 promoter on chromosome 6: only 2 nt differ in 249 BLASTed. Originally Christine's PARP-A promoter region." protein_bind 1883..1923 /function="Tet Operators" 5'UTR 1927..2012 /function="GPEET SAS" /note="To SAS" 5'UTR 2013..2043 /function="GPEET 5´ UTR" CDS 2075..2116 /function="Extraneous ORF" /note="Present in Clontech vectors, from which LUC appears to have been cloned as a convenient HindIII-BamHI fragment. Genbank Accession number (U02437)" CDS 2118..3770 /note="LUCIFERASE" 5'UTR complement(1477..1593) /function="ACT 5' UTR & SAS" conflict 588..594 /note="The Sac site that the sequence indicates is not present" prim_transcript 1872..1873 /note="'Approximate' tanscription initiation site according to Sherman et al 1991" BASE COUNT 1948 a 1710 c 1862 g 1895 t 3 others ORIGIN 1 GAATTCGAGC TCATATAGTT GGTATGTATT CTAATTCCAG ACTACTGGCG TGGATAAACA 61 TGTCCCCTGA TTAAAAGGAA AGATTCCATA GCCCATTAGT GCAAAGATAA TTGGTACACC 121 GTCAAAAACA CATGGGAGAC CACACGATAC GACACTGTGA CCGTAGCATC AATTCCGCAC 181 TCGATATAAC TCTATCGAAG AACTTCCATG GTACAAACTG GTCGCCACGG TCTGCTCCAC 241 ACGGAGATCA TCGTATCATT TTTATCGATA GCGGCCGCTA TCGATGTATG CCTTGGCCCT 301 GATGGCATGC CAATTTCACT ACAACGGACT ATGTGGACCC CGTTATCATG GAAATGCGCT 361 AGTTGGAGGA AGTTAGACCG CGCCGGAAAA GAGAGGGGTA GAGAAAATGA CAACTTGGAA 421 GATATCCACA CACGCACGGT GAAACGTTAG CAACAATTAT TAGGGAAGCA CGCTTGCCCT 481 AGTCCCACGA GTAACCAAGA CTCCAAAAGC CTTTCTGGCA CAGAGAGCGA GCCGAAATGG 541 AAAAGAGAAA CAATGCCTGC ACTAACACTA CTGAGCGATT CGCCTCGCCG CGGAGGACCG 601 AATACTAATA ACGACACTTG CGGTCAAAAA GTAGAAGAAC AAATGCTCAA CGATGAGTGA 661 ATCAGGTTAG GGTAGTTGGA AAATTATACA GAATGTCTTT GGCAACACAC CGGCnnnCTT 721 GCATGCCTGC AAGGCCTTGC AGAATACTGC ATAGATAACA AACGCATCAA ACACAACTGG 781 TGGCACTATT TCAATCATGT CGACACACCA AGACGAGAAA ATTTGAGTTA AACAAGATGC 841 CTACAACCCT ATATTTATTA TCTTCAAGTG ATCTTCTCTT CTCTTGTGCG CTGTACGTAA 901 ATGTGTTGCA AATAAGGCAT AGGGCTGAGT ACAGGCACCA CACGCGAGGA AAATCATAAG 961 CACTAGCCGT GTCAACTCAC AGCGACTACA GAACAATTTT GGCCACACAA CCCGGTGTTA 1021 GGATCTCCGA GGCCTGGGAC CCGTGGGCCG CCGTCGGACC GGCGGTGTTG GTCGGCGTCG 1081 GTCAGTCCTG CTCCTCGGCC ACGAAGTGCA CGCAGTTGCC GGCCGGGTCG CGCAGGGCGA 1141 ACTCCCGCCC CCACGGCTGC TCGCCGATCT CGGTCATGGC CGGCCCGGAG GCGTCCCGGA 1201 AGTTCGTGGA CACGACCTCC GACCACTCGG CGTACAGCTC GTCCAGGCCG CGCACCCACA 1261 CCCAGGCCAG GGTGTTGTCC GGCACCACCT GGTCCTGGAC CGCGCTGATG AACAGGGTCA 1321 CGTCGTCCCG GACCACACCG GCGAAGTCGT CCTCCACGAA GTCCCGGGAG AACCCGAGCC 1381 GGTCGGTCCA GAACTCGACC GCTCCGGCGA CGTCGCGCGC GGTGAGCACC GGAACGGCAC 1441 TGGTCAACTT GGCCATGGTG ATATAGCTTA TTTTATGGCA GCAACGAGAC CTTACGTAAA 1501 GCAAAACTAA ATAAGAGCGG AGACTGCAAT GCAGAGTAAA AAAAAAAAAA ACATGAATTT 1561 CAGAAGACCT TGCTGTGCCC CCGGTACGGG CTAGATCTCC CTATAGTGAG TCGTATTAAT 1621 CAGGTACCGT CATTGGGGTT AAGCGGAAAG GTGTGTGTCA GTAGGTTGTG AGGTGAAAGC 1681 GTTTTCAGAT GCATAGTGAG CTTAATGTCC TTTTCACAGT ATATCGTGTC TGATAGGTAT 1741 CTCTTATTAG TATAGTCGAA TACTAGTCAA TAGTGCGTTT TGTGCAAAAT GTCCATTTTG 1801 TGGCAGTGAT GGGGTTGTTT TATGCTATTC CGTGTCTCTG GGTGGGCGTG CATTGAAAAT 1861 AGGGGTTATC GGGTGAGGGA TCTCCCTATC AGTGATAGAG ATCTCCCTAT CAGTGATAGA 1921 GATCCCTGAG TACTGAGTTT AACATGTTCT CGTCCCGGGC TGCACGCGCC TTCGAGTTTT 1981 TTTTCCTTTT CCCCATTTTT TTCAACTTGA AGACTTCAAT TACACCAAAA AGTAAAATTC 2041 ACAAGCTTGG AATTCCTTTG TGTTACATTC TTGAATGTCG CTCGCAGTGA CATTAGCATT 2101 CCGGTACTGT TGGTAAAATG GAAGACGCCA AAAACATAAA GAAAGGCCCG GCGCCATTCT 2161 ATCCTCTAGA GGATGGAACC GCTGGAGAGC AACTGCATAA GGCTATGAAG AGATACGCCC 2221 TGGTTCCTGG AACAATTGCT TTTACAGATG CACATATCGA GGTGAACATC ACGTACGCGG 2281 AATACTTCGA AATGTCCGTT CGGTTGGCAG AAGCTATGAA ACGATATGGG CTGAATACAA 2341 ATCACAGAAT CGTCGTATGC AGTGAAAACT CTCTTCAATT CTTTATGCCG GTGTTGGGCG 2401 CGTTATTTAT CGGAGTTGCA GTTGCGCCCG CGAACGACAT TTATAATGAA CGTGAATTGC 2461 TCAACAGTAT GAACATTTCG CAGCCTACCG TAGTGTTTGT TTCCAAAAAG GGGTTGCAAA 2521 AAATTTTGAA CGTGCAAAAA AAATTACCAA TAATCCAGAA AATTATTATC ATGGATTCTA 2581 AAACGGATTA CCAGGGATTT CAGTCGATGT ACACGTTCGT CACATCTCAT CTACCTCCCG 2641 GTTTTAATGA ATACGATTTT GTACCAGAGT CCTTTGATCG TGACAAAACA ATTGCACTGA 2701 TAATGAATTC CTCTGGATCT ACTGGGTTAC CTAAGGGTGT GGCCCTTCCG CATAGAACTG 2761 CCTGCGTCAG ATTCTCGCAT GCCAGAGATC CTATTTTTGG CAATCAAATC ATTCCGGATA 2821 CTGCGATTTT AAGTGTTGTT CCATTCCATC ACGGTTTTGG AATGTTTACT ACACTCGGAT 2881 ATTTGATATG TGGATTTCGA GTCGTCTTAA TGTATAGATT TGAAGAAGAG CTGTTTTTAC 2941 GATCCCTTCA GGATTACAAA ATTCAAAGTG CGTTGCTAGT ACCAACCCTA TTTTCATTCT 3001 TCGCCAAAAG CACTCTGATT GACAAATACG ATTTATCTAA TTTACACGAA ATTGCTTCTG 3061 GGGGCGCACC TCTTTCGAAA GAAGTCGGGG AAGCGGTTGC AAAACGCTTC CATCTTCCAG 3121 GGATACGACA AGGATATGGG CTCACTGAGA CTACATCAGC TATTCTGATT ACACCCGAGG 3181 GGGATGATAA ACCGGGCGCG GTCGGTAAAG TTGTTCCATT TTTTGAAGCG AAGGTTGTGG 3241 ATCTGGATAC CGGGAAAACG CTGGGCGTTA ATCAGAGAGG CGAATTATGT GTCAGAGGAC 3301 CTATGATTAT GTCCGGTTAT GTAAACAATC CGGAAGCGAC CAACGCCTTG ATTGACAAGG 3361 ATGGATGGCT ACATTCTGGA GACATAGCTT ACTGGGACGA AGACGAACAC TTCTTCATAG 3421 TTGACCGCTT GAAGTCTTTA ATTAAATACA AAGGATATCA GGTGGCCCCC GCTGAATTGG 3481 AATCGATATT GTTACAACAC CCCAACATCT TCGACGCGGG CGTGGCAGGT CTTCCCGACG 3541 ATGACGCCGG TGAACTTCCC GCCGCCGTTG TTGTTTTGGA GCACGGAAAG ACGATGACGG 3601 AAAAAGAGAT CGTGGATTAC GTCGCCAGTC AAGTAACAAC CGCGAAAAAG TTGCGCGGAG 3661 GAGTTGTGTT TGTGGACGAA GTACCGAAAG GTCTTACCGG AAAACTCGAC GCAAGAAAAA 3721 TCAGAGAGAT CCTCATAAAG GCCAAGAAGG GCGGAAAGTC CAAATTGTAA AATGTAACTG 3781 TATTCAGCGA TGACGAAATT CTTAGCTATT GTAATATTAT ATGCAAATTG ATGAATGGTA 3841 ATTTTGTAAT TGTGGGTCAC TGTACTATTT TAACGAATAA TAAAATCAGG TATAGGTAAC 3901 TAAAAAGGAA TTCGAGCTCG GATCCTGCCC ATTTAGTTGG CTTTTCCCTT GTCTCGTGTC 3961 TTTTCCGTGG AAAGGTTCCC GGAGTAATCT GATGGCACAG CAGGGAGGTG CGCCTGCAGG 4021 TTGGTTAGGA AGGGGGGATG ATGTAAAAGA AGAAAATGGG GGGATATCTT ATGTTTAAGA 4081 GGATAAATAA TGTGAAGGGG CTTTATATGC TTGCTTGGTT GTCTCGTTGG TGCATGGGGA 4141 ATCTGCATGT TTGCTTTGGA GCACGCGTGG TACATCAATG GGATCATATC TTGCTGCCTC 4201 CCGCAGCTCA CCGTGCGAGC TGCCGGGACC CCGTTTCTCA TAGGCGTGTG CACCTTCCGT 4261 CTGAGACCTG TAAATGGTTA AAAAGGAATA TATAATTACC TTTTGAAAAG TGGTAAAAAA 4321 CGAATATATC TTTTTTTTTG GTTTTAATAC GCTCTTTTGT GTGTATGAGA GGAATAAATA 4381 AGTGTGTGTG TGTGTGTGTG TTGTTGTTGT TGAAAAAATG AAGGACACGA AAGCGCTCAT 4441 TAGGGGTGGG GATACACCAG GCATAATTAT CCGAAAATAT TTTGATGCAC CAAATAAGTG 4501 AACATACTGA CGAAATCAAA GAGTTGGAGG ATGGATAGGG AGGCCTCATT GGCAGTGGTA 4561 AATTGATTTG ACTTGAATAT GATACCCTCT TACTGTTTCA TTGCCACTAC AAGTTGGTTT 4621 CCTTCCCCTG CAGGCATGCA AGCTAGCTTG TATTCTATAG TGTCACCTAA ATCGTATGTG 4681 TATGATACAT AAGGTTATGT ATTAATTGTA GCCGCGTTCT AACGACAATA TGTACAAGCC 4741 TAATTGTGTA GCATCTGGCT TACTGAAGCA GACCCTATCA TCTCTCTCGT AAACTGCCGT 4801 CAGAGTCGGT TTGGTTGGAC GAACCTTCTG AGTTTCTGGT AACGCCGTTC CGCACCCCGG 4861 AAATGGTCAG CGAACCAATC AGCAGGGTCA TCGCTAGCCA GATCCTCTAC GCCGGACGCA 4921 TCGTGGCCGG CATCACCGGC GCCACAGGTG CGGTTGCTGG CGCCTATATC GCCGACATCA 4981 CCGATGGGGA AGATCGGGCT CGCCACTTCG GGCTCATGAG CGCTTGTTTC GGCGTGGGTA 5041 TGGTGGCAGG CCCCGTGGCC GGGGGACTGT TGGGCGCCAT CTCCTTGCAC CATTCCTTGC 5101 GGCGGCGGTG CTCAACGGCC TCAACCTACT ACTGGGCTGC TTCCTAATGC AGGAGTCGCA 5161 TAAGGGAGAG CGTCGATATG GTGCACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA 5221 AGCCAGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG TCTGCTCCCG 5281 GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA GAGGTTTTCA 5341 CCGTCATCAC CGAAACGCGC GAGACGAAAG GGCCTCGTGA TACGCCTATT TTTATAGGTT 5401 AATGTCATGA TAATAATGGT TTCTTAGACG TCAGGTGGCA CTTTTCGGGG AAATGTGCGC 5461 GGAACCCCTA TTTGTTTATT TTTCTAAATA CATTCAAATA TGTATCCGCT CATGAGACAA 5521 TAACCCTGAT AAATGCTTCA ATAATATTGA AAAAGGAAGA GTATGAGTAT TCAACATTTC 5581 CGTGTCGCCC TTATTCCCTT TTTTGCGGCA TTTTGCCTTC CTGTTTTTGC TCACCCAGAA 5641 ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG CACGAGTGGG TTACATCGAA 5701 CTGGATCTCA ACAGCGGTAA GATCCTTGAG AGTTTTCGCC CCGAAGAACG TTTTCCAATG 5761 ATGAGCACTT TTAAAGTTCT GCTATGTGGC GCGGTATTAT CCCGTATTGA CGCCGGGCAA 5821 GAGCAACTCG GTCGCCGCAT ACACTATTCT CAGAATGACT TGGTTGAGTA CTCACCAGTC 5881 ACAGAAAAGC ATCTTACGGA TGGCATGACA GTAAGAGAAT TATGCAGTGC TGCCATAACC 5941 ATGAGTGATA ACACTGCGGC CAACTTACTT CTGACAACGA TCGGAGGACC GAAGGAGCTA 6001 ACCGCTTTTT TGCACAGCAT GGGGGATCAT GTAACTCGCC TTGATCGTTG GGAACCGGAG 6061 CTGAATGAAG CCATACCAAA CGACGAGCGT GACACCACGA TGCCTGTAGC AATGGCAACA 6121 ACGTTGCGCA AACTATTAAC TGGCGAACTA CTTACTCTAG CTTCCCGGCA ACAATTAATA 6181 GACTGGATGG AGGCGGATAA AGTTGCAGGA CCACTTCTGC GCTCGGCCCT TCCGGCTGGC 6241 TGGTTTATTG CTGATAAATC TGGAGCCGGT GAGCGTGGGT CTCGCGGTAT CATTGCAGCA 6301 CTGGGGCCAG ATGGTAAGCC CTCCCGTATC GTAGTTATCT ACACGACGGG GAGTCAGGCA 6361 ACTATGGATG AACGAAATAG ACAGATCGCT GAGATAGGTG CCTCACTGAT TAAGCATTGG 6421 TAACTGTCAG ACCAAGTTTA CTCATATATA CTTTAGATTG ATTTAAAACT TCATTTTTAA 6481 TTTAAAAGGA TCTAGGTGAA GATCCTTTTT GATAATCTCA TGACCAAAAT CCCTTAACGT 6541 GAGTTTTCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC TTCTTGAGAT 6601 CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT ACCAGCGGTG 6661 GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG CTTCAGCAGA 6721 GCGCAGATAC CAAATACTGT CCTTCTAGTG TAGCCGTAGT TAGGCCACCA CTTCAAGAAC 6781 TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC TGCTGCCAGT 6841 GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA TAAGGCGCAG 6901 CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC GACCTACACC 6961 GAACTGAGAT ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG 7021 GCGGACAGGT ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCCA 7081 GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG ACTTGAGCGT 7141 CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG CAACGCGGCC 7201 TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC TGCGTTATCC 7261 CCTGATTCTG TGGATAACCG TATTACCGCC TTTGAGTGAG CTGATACCGC TCGCCGCAGC 7321 CGAACGACCG AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGAGCGCCC AATACGCAAA 7381 CCGCCTCTCC CCGCGCGTTG GCCGATTCAT TAATGCAG //