Published: 2026/4/13 8:22:44
Human papillomavirus type 31 (HPV-31) complete genome
GenBank: J04353.1
LOCUS PPH31A 7912 bp DNA circular VRL 18-MAR-1994
DEFINITION Human papillomavirus type 31 (HPV-31) complete genome.
ACCESSION J04353
VERSION J04353.1
KEYWORDS complete genome.
SOURCE human papillomavirus 31
ORGANISM human papillomavirus 31
Viruses; Monodnaviria; Shotokuvirae; Cossaviricota;
Papovaviricetes; Zurhausenvirales; Papillomaviridae;
Firstpapillomavirinae; Alphapapillomavirus.
REFERENCE 1 (bases 1 to 7912)
AUTHORS Goldsborough,M.D., DiSilvestre,D., Temple,G.F. and Lorincz,A.T.
TITLE Nucleotide sequence of human papillomavirus type 31: a cervical
neoplasia-associated virus
JOURNAL Virology 171 (1), 306-311 (1989)
PUBMED 2545036
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by M.D.Goldsborough, 05-JUL-1989.
FEATURES Location/Qualifiers
source 1..7912
/organism="human papillomavirus 31"
/mol_type="genomic DNA"
/db_xref="taxon:10585"
regulatory 19..24
/regulatory_class="TATA_box"
regulatory 69..74
/regulatory_class="TATA_box"
gene 108..557
/gene="E6"
CDS 108..557
/gene="E6"
/note="ORF E6 from bp 39 to 557"
/codon_start=1
/product="transforming protein"
/protein_id="AAA46950.1"
/translation="MFKNPAERPRKLHELSSALEIPYDELRLNCVYCKGQLTETEVLD
FAFTDLTIVYRDDTPHGVCTKCLRFYSKVSEFRWYRYSVYGTTLEKLTNKGICDLLIR
CITCQRPLCPEEKQRHLDKKKRFHNIGGRWTGRCIACWRRPRTETQV"
misc_feature 228..236
/gene="E6"
/standard_name="Splice donor"
misc_feature 403..414
/gene="E6"
/standard_name="Splice acceptor"
gene 560..856
/gene="E7"
CDS 560..856
/gene="E7"
/note="ORF E7 from bp 545 to 856"
/codon_start=1
/product="transforming protein"
/protein_id="AAA46951.1"
/translation="MRGETPTLQDYVLDLQPEATDLHCYEQLPDSSDEEDVIDSPAGQ
AEPDTSNYNIVTFCCQCKSTLRLCVQSTQVDIRILQELLMGSFGIVCPNCSTRL"
gene 862..2751
/gene="E1"
CDS 862..2751
/gene="E1"
/note="ORF E1 from bp 850 to bp 2751"
/codon_start=1
/product="replication protein"
/protein_id="AAA46952.1"
/translation="MADPAGTDGEGTGCNGWFYVEAVIDRQTGDNISEDENEDSSDTG
EDMVDFIDNCNVYNNQAEAETAQALFHAQEAEEHAEAVQVLKRKYVGSPLSDISSCVD
YNISPRLKAICIENNSKTAKRRLFELPDSGYGNTEVETQQMVQVEEQQTTLSCNGSDG
THSERENETPTRNILQVLKTSNGKAAMLGKFKELYGVSFMELIRPFQSNKSTCTDWCV
AAFGVTGTVAEGFKTLLQPYCLYCHLQSLACSWGMVMLMLVRFKCAKNRITIEKLLEK
LLCISTNCMLIQPPKLRSTAAALYWYRTGMSNISDVYGETPEWIERQTVLQHSFNDTT
FDLSQMVQWAYDNDVMDDSEIAYKYAQLADSDSNACAFLKSNSQAKIVKDCGTMCRHY
KRAEKRQMSMGQWIKSRCDKVSDEGDWRDIVKFLRYQQIEFVSFLSALKLFLKGVPKK
NCILIHGAPNTGKSYFGMSLISFLQGCIISYANSKSHFWLQPLADAKIGMLDDATTPC
WHYIDNYLRNALDGNPVSIDVKHKALMQLKCPPLLITSNINAGKDDRWPYLHSRLVVF
TFPNPFPFDKNGNPVYELSDKNWKSFFSRTWCRLNLHEEEDKENDGDSFSTFKCVSGQ
NIRTL"
gene 2693..3811
/gene="E2"
CDS 2693..3811
/gene="E2"
/note="ORF E2 from bp 2663 to 3811"
/codon_start=1
/product="regulatory protein"
/protein_id="AAA46953.1"
/translation="METLSQRLNVCQDKILEHYENDSKRLCDHIDYWKHIRLECVLMY
KAREMGIHSINHQVVPALSVSKAKALQAIELQMMLETLNNTEYKNEDWTMQQTSLELY
LTAPTGCLKKHGYTVEVQFDGDVHNTMHYTNWKFIYLCIDGQCTVVEGQVNCKGIYYV
HEGHITYFVNFTEEAKKYGTGKKWEVHAGGQVIVFPESVFSSDEISFAGIVTKLPTAN
NTTTSNSKTCALGTSEGVRRATTSTKRPRTEPEHRNTHHPNKLLRGDSVDSVNCGVIS
AAACTNQTRAVSCPATTPIIHLKGDANILKCLRYRLSKYKQLYEQVSSTWHWTCTDGK
HKNAIVTLTYISTSQRDDFLNTVKIPNTVSVSTGYMTI"
gene 3270..3578
/gene="E4"
CDS <3270..3578
/gene="E4"
/note="ORF E4 from bp 3270 to 3578"
/codon_start=1
/protein_id="AAA46949.1"
/translation="LFFLNLYLAVTKYPLLGLLQSYQQPTTPPHRIPKPAPWAPVKVC
GGRRRLLSDQEQSQSTETPTTPTSCCEATPWTVSTVGLSVQLHAQTKQGLSVVLQLHL
"
gene 3816..4070
/gene="E5"
CDS 3816..4070
/gene="E5"
/note="ORF E5 from bp 3804 to 4070"
/codon_start=1
/protein_id="AAA46954.1"
/translation="MIELNISTVSIVLCFLLCFCVLLFVCLVIRPLVLSVSVYATLLL
LIVILWVIATSPLRCFCIYVVFIYIPLFVIHTHASFLSQQ"
repeat_region 4099..4134
regulatory 4138..4143
/regulatory_class="polyA_signal_sequence"
/note="putative"
repeat_region 4143..4158
gene 4171..5571
/gene="L2"
CDS 4171..5571
/gene="L2"
/note="ORF L2 from bp 4060 to bp 5571"
/codon_start=1
/product="minor capsid protein"
/protein_id="AAA46955.1"
/translation="MRSKRSTKRTKRASATQLYQTCKAAGTCPSDVIPKIEHTTIADQ
ILRYGSMGVFFGGLGIGSGSGTGGRTGYVPLSTRPSTVSEASIPIRPPVSIDPVGPLD
PSIVSLVEESGIVDVGAPAPIPHPPTTSGFDIATTADTTPAILDVTSVSTHENPTFTD
PSVLQPPTPAETSGHLLLSSSSISTHNYEEIPMDTFIVSTNNENITSSTPIPGVRRPA
RLGLYSKATQQVKVIDPTFLSAPKQLITYENPAYETVNAEESLYFSNTSHNIAPDPDF
LDIIALHRPALTSRRNTVRYSRLGNKQTLRTRSGATIGARVHYYYDISSINPAGESIE
MQPLGASATTTSTLNDGLYDIYADTDFTVDTPATHNVSPSTAVQSTSAVSAYVPTNTT
VPLSTGFDIPIFSGPDVPIEHAPTQVFPFPLAPTTPQVSIFVDGGDFYLHPSYYMLKR
RRKRVSYFFTDVSVAA"
gene 5552..7066
/gene="L1"
CDS 5552..7066
/gene="L1"
/note="ORF L1 from bp 5516 to 7066"
/codon_start=1
/product="major capsid protein"
/protein_id="AAA46956.1"
/translation="MSLWRPSEATVYLPPVPVSKVVSTDEYVTRTNIYYHAGSARLLT
VGHPYYSIPKSDNPKKIVVPKVSGLQYRVFRVRLPDPNKFGFPDTSFYNPETQRLVWA
CVGLEVGRGQPLGVGISGHPLLNKFDDTENSNRYAGGPGTDNRECISMDYKQTQLCLL
GCKPPIGEHWGKGSPCSNNAITPGDCPPLELKNSVIQDGDMVDTGFGAMDFTALQDTK
SNVPLDICNSICKYPDYLKMVAEPYGDTLFFYLRREQMFVRHFFNRSGTVGESVPTDL
YIKGSGSTATLANSTYFPTPSGSMVTSDAQIFNKPYWMQRAQGHNNGICWGNQLFVTV
VDTTRSTNMSVCAAIANSDTTFKSSNFKEYLRHGEEFDLQFIFQLCKITLSADIMTYI
HSMNPAILEDWNFGLTTPPSGSLEDTYRFVTSQAITCQKTAPQKPKEDPFKDYVFWEV
NLKEKFSADLDQFPLGRKFLLQAGYRARPKFKAGKRSAPSASTTTPAKRKKTKK"
regulatory 7227..7231
/regulatory_class="polyA_signal_sequence"
/note="putative"
repeat_region 7291..7302
repeat_region 7314..7333
/note="putative"
protein_bind 7406..7420
/standard_name="glucocorticoid responsive element"
/bound_moiety="hormone receptor"
protein_bind 7477..7488
/bound_moiety="E2"
/function="gene transcription"
regulatory 7542..7549
/regulatory_class="enhancer"
/standard_name="keratinocyte-dependent enhancer"
protein_bind 7868..7879
/bound_moiety="E2"
/function="gene transcription"
ORIGIN
1 taataataat aatcttagta taaaaaagta gggagtgacc gaaagtggtg aaccgaaaac
61 ggttggtata taaagcacat agtattttgt gcaaacctac agacgccatg ttcaaaaatc
121 ctgcagaaag acctcggaaa ttgcatgaac taagctcggc attggaaata ccctacgatg
181 aactaagatt gaattgtgtc tactgcaaag gtcagttaac agaaacagag gtattagatt
241 ttgcatttac agatttaaca atagtatata gggacgacac accacacgga gtgtgtacaa
301 aatgtttaag attttattca aaagtaagtg aatttagatg gtatagatat agtgtgtatg
361 gaacaacatt agaaaaattg acaaacaaag gtatatgtga tttgttaatt aggtgtataa
421 cgtgtcaaag accgttgtgt ccagaagaaa aacaaagaca tttggataaa aagaaacgat
481 tccacaacat aggaggaagg tggacaggac gttgcatagc atgttggaga agacctcgta
541 ctgaaaccca agtgtaaaca tgcgtggaga aacacctacg ttgcaagact atgtgttaga
601 tttgcaacct gaggcaactg acctccactg ttatgagcaa ttacccgaca gctcagatga
661 ggaggatgtc atagacagtc cagctggaca agcagaaccg gacacatcca attacaatat
721 cgttaccttt tgttgtcagt gtaagtctac acttcgtttg tgtgtacaga gcacacaagt
781 agatattcgc atattgcaag agctgttaat gggctcattt ggaatcgtgt gccccaactg
841 ttctactaga ctgtaactac aatggctgat ccagcaggta cagatgggga ggggacggga
901 tgcaatggtt ggttttatgt agaagcagta attgacagac agacagggga caacatttca
961 gaggacgaaa atgaagacag tagtgatact ggggaggata tggttgactt tattgacaat
1021 tgtaatgtat acaacaatca ggcagaagca gagacagcac aggcattgtt tcatgcacag
1081 gaagcggagg aacatgcaga ggctgtgcag gttctaaaac gaaagtatgt aggtagtcct
1141 ttaagtgata ttagtagttg tgtggattat aatattagtc cacggttaaa agctatatgc
1201 atagaaaata acagtaaaac agcaaaacga agactctttg aacttccaga cagcgggtat
1261 ggcaatactg aagtggaaac gcagcagatg gtacaggtag aggagcaaca aacaacatta
1321 agttgtaatg gtagtgacgg gacacatagt gaacgagaga atgaaactcc aacacgtaat
1381 atattgcaag tgttaaaaac tagcaatggt aaagctgcta tgttaggtaa atttaaagaa
1441 ttatatggtg taagttttat ggaactaatt aggccatttc aaagcaataa aagcacatgt
1501 actgattggt gtgtagctgc gtttggagtt acaggtacag ttgcagaagg atttaaaacc
1561 ctattgcaac catattgttt gtattgccat ttacaaagtt tagcatgttc ctggggcatg
1621 gttatgttaa tgcttgtgag atttaaatgt gcaaaaaata gaataacaat tgaaaaatta
1681 ttagaaaaat tattgtgtat atctacaaat tgtatgttaa ttcagccacc caaattacgt
1741 agcacagctg cagcattata ttggtacaga acaggaatgt caaacattag cgatgtatat
1801 ggtgaaacac cagaatggat agaaagacaa acagtattac agcatagttt taatgacaca
1861 acatttgatt tgtcccaaat ggtacaatgg gcatatgaca atgatgttat ggatgatagt
1921 gaaattgcct ataaatatgc acaattagct gacagtgata gtaatgcatg tgcattttta
1981 aaaagtaatt cgcaggcaaa aatagttaaa gattgtggaa caatgtgtag acattataaa
2041 cgagcagaaa aacgacaaat gtccatggga cagtggatta aaagtagatg tgacaaagtt
2101 agtgacgaag gtgactggag ggacatagta aagtttttaa gatatcaaca aatagaattt
2161 gtgtcatttt tatctgcatt aaagctgttt ttaaaaggag tgccaaagaa aaactgtatt
2221 ttaatacatg gtgcacctaa tacaggtaaa tcatattttg gaatgagcct tattagcttt
2281 ttacaaggat gtataatatc atatgcaaat tcaaaaagtc atttttggtt acaaccactg
2341 gctgatgcta aaataggcat gttagatgat gctacaacgc catgttggca ttatatagac
2401 aattacctac gaaatgcact agatggcaac cctgtatcta tagatgtaaa gcataaagct
2461 ttaatgcagt taaaatgtcc tcctttattg attacatcta atataaatgc aggtaaggat
2521 gacagatggc catacctaca tagcagactg gtggttttta catttccaaa tccatttcca
2581 tttgacaaaa acggaaatcc agtatatgaa ttaagtgata aaaactggaa atcctttttc
2641 tcaaggacgt ggtgcagatt aaatttgcac gaggaagagg acaaagaaaa cgatggagac
2701 tctttctcaa cgtttaaatg tgtgtcagga caaaatatta gaacattatg aaaatgatag
2761 taaacgactt tgtgatcata tagactattg gaaacatatt cgacttgaat gtgtattaat
2821 gtataaagca agagaaatgg gaatacacag tattaaccac caggtggtgc cagcgttgtc
2881 agtatcaaag gccaaagcct tacaagctat tgaactacaa atgatgttgg aaacattaaa
2941 taacactgaa tacaaaaatg aggactggac aatgcagcaa acaagtcttg aactgtattt
3001 aactgcacct acagggtgtt taaaaaaaca tggatatact gtagaggtgc aatttgatgg
3061 tgatgtacac aacaccatgc attatactaa ctggaaattt atatacctat gtatagatgg
3121 ccaatgtact gttgtggaag ggcaagttaa ttgtaagggc atttattatg tacatgaagg
3181 acatataaca tattttgtaa attttacaga agaggcaaaa aaatatggga ctggtaaaaa
3241 atgggaagtg catgcgggtg gtcaggtaat tgtttttcct gaatctgtat ttagcagtga
3301 cgaaatatcc tttgctggga ttgttacaaa gctaccaaca gccaacaaca ccaccacatc
3361 gaattccaaa acctgcgcct tgggcaccag tgaaggtgtg cggcgggcga cgacgtctac
3421 taagcgacca agaacagagc cagagcacag aaacacccac caccccaaca agttgttgcg
3481 aggcgactcc gtggacagtg tcaactgtgg ggttatcagt gcagctgcat gcacaaacca
3541 aacaagggct gtcagttgtc ctgcaactac acctataata cacttaaaag gtgatgcaaa
3601 tatattaaaa tgtttaagat ataggctgtc aaaatataaa caattgtatg aacaagtgtc
3661 atctacatgg cattggacat gtacagatgg aaaacataaa aatgctattg taaccttaac
3721 atatataagt acatcacaaa gagacgattt tttaaatact gtaaaaatac ctaacacagt
3781 atcagtgtca acaggatata tgactattta gcctaatgat tgaactaaat atttctacag
3841 taagcattgt gctatgcttt ttgctttgct tttgtgtgct actatttgtg tgtcttgtca
3901 tacgtccact tgtgctgtct gtgtcggtat atgcaacact actattatta attgtgattt
3961 tatgggttat tgcaacctct ccattacgtt gtttttgtat atatgttgtg tttatatata
4021 ttccattatt tgtaattcat acacatgcat cttttttaag tcaacagtaa cttttttact
4081 tgtgtatact gttgtttgta ttggtattgg tattggtatt ggtattggta ttggtataat
4141 aaactttttt actttttttt tattattacc atgcggtcca aacgctctac aaaacgcact
4201 aaacgtgcgt ctgctacaca attatatcaa acatgtaaag cagcaggtac ttgtccatca
4261 gacgttatac ctaaaataga acatactacc attgcagacc aaatattaag gtatggtagt
4321 atgggtgttt tttttggtgg gttgggtatt gggtccggct ctggtactgg gggtcgcact
4381 ggatatgtcc ctcttagtac acgtccttct acagtatctg aggcaagtat acctattaga
4441 ccaccagtta gcattgaccc tgtaggtccc ttggacccct ctatagtaag tcttgttgaa
4501 gaatctggaa ttgttgatgt tggtgcccct gctcctatac cacaccctcc tacaacatct
4561 gggtttgaca ttgctacaac tgcagacaca acacctgcaa ttttagatgt aacaagtgtt
4621 agcacacatg aaaatcctac ttttactgat ccatctgtat tgcagcctcc tacacctgca
4681 gaaacatcag gtcatttact actttcatca tcatctatta gcacacataa ttatgaggaa
4741 atacctatgg atacatttat tgtttctact aataatgaaa acataacaag tagcacaccc
4801 attccagggg tgcgccgtcc tgcacgttta gggttatata gtaaggctac acaacaagta
4861 aaagttattg atccaacgtt tcttagtgct ccaaaacagc taattacata tgaaaaccct
4921 gcctatgaaa ctgtaaatgc tgaagaatct ttatactttt ccaatacatc gcataatata
4981 gcccctgatc ccgactttct agatattata gcattacata ggcctgccct tacctcacgt
5041 aggaacactg ttagatatag tagactaggt aataaacaaa ctttgcgcac tcgtagtggt
5101 gctactattg gtgcaagggt gcattattat tatgatatta gtagtattaa tcctgcaggt
5161 gaaagtattg aaatgcaacc tttaggggcg tctgcaacta ctacttctac tttaaatgat
5221 ggcttatatg acatttatgc agacactgat tttactgtgg atacacctgc cacacataat
5281 gtttcccctt ctactgctgt acagtccaca tctgctgtgt ctgcctatgt acctacaaat
5341 accactgtgc cactaagtac aggttttgac attcccatat tttctgggcc tgatgtacct
5401 atagagcatg cacctacaca ggttttccca tttcctttgg cccctacaac gccacaagtg
5461 tctatttttg ttgatggggg tgatttttat ttgcacccta gttattatat gttaaaacgt
5521 cgacgtaaac gtgtatcata tttttttaca gatgtctctg tggcggccta gcgaggctac
5581 tgtctactta ccacctgtcc cagtgtctaa agttgtaagc acggatgaat atgtaacacg
5641 aaccaacata tattatcacg caggcagtgc taggctgctt acagtaggcc atccatatta
5701 ttccatacct aaatctgaca atcctaaaaa aatagttgta ccaaaggtgt caggattaca
5761 atatagggta tttagggttc gtttaccaga tccaaacaaa tttggatttc ctgatacatc
5821 tttttataat cctgaaactc aacgcttagt ttgggcctgt gttggtttag aggtaggtcg
5881 cgggcagcca ttaggtgtag gtattagtgg tcatccatta ttaaataaat ttgatgacac
5941 tgaaaactct aatagatatg ccggtggtcc tggcactgat aatagggaat gtatatcaat
6001 ggattataaa caaacacaac tgtgtttact tggttgcaaa ccacctattg gagagcattg
6061 gggtaaaggt agtccttgta gtaacaatgc tattacccct ggtgattgtc ctccattaga
6121 attaaaaaat tcagttatac aagatgggga tatggttgat acaggctttg gagctatgga
6181 ttttactgct ttacaagaca ctaaaagtaa tgttcctttg gacatttgta attctatttg
6241 taaatatcca gattatctta aaatggttgc tgagccatat ggcgatacat tattttttta
6301 tttacgtagg gaacaaatgt ttgtaaggca tttttttaat agatcaggca cggttggtga
6361 atcggtccct actgacttat atattaaagg ctccggttca acagctactt tagctaacag
6421 tacatacttt cctacaccta gcggctccat ggttacttca gatgcacaaa tttttaataa
6481 accatattgg atgcaacgtg ctcagggaca caataatggt atttgttggg gcaatcagtt
6541 atttgttact gtggtagata ccacacgtag taccaatatg tctgtttgtg ctgcaattgc
6601 aaacagtgat actacattta aaagtagtaa ttttaaagag tatttaagac atggtgagga
6661 atttgattta caatttatat ttcagttatg caaaataaca ttatctgcag acataatgac
6721 atatattcac agtatgaatc ctgctatttt ggaagattgg aattttggat tgaccacacc
6781 tccctcaggt tctttggagg atacctatag gtttgtcacc tcacaggcca ttacatgtca
6841 aaaaactgcc ccccaaaagc ccaaggaaga tccatttaaa gattatgtat tttgggaggt
6901 taatttaaaa gaaaagtttt ctgcagattt agatcagttt ccactgggtc gcaaattttt
6961 attacaggca ggatataggg cacgtcctaa atttaaagca ggtaaacgta gtgcaccctc
7021 agcatctacc actacaccag caaaacgtaa aaaaactaaa aagtaatgga tgtgtatgta
7081 atacatgtgt ctgtatgtgt atgtgcttgt gctgtattgt atatgtgtgt gtttgtgtgt
7141 tatatatggt atatgtatgt ttatgtatgc gtgtgtactt gtatatatgt atagtatgtt
7201 atgtgtgtat gtatgctatg tatgttaata aatatgtgta tacctgtgtg tgttgtgtat
7261 gttgtcctta tatacaccct attagtaaca tactattact attttataaa ctattgttcc
7321 tacttgttcc tacttgttcc tgctcctccc aatagtcatg tacttatttc tgcctataat
7381 ttaggtgtca cgccatagta aaagttgtac acccggtccg ttttttgcaa ctaaagctac
7441 tccattttga ttttatgcag ccattttaaa tccctaaccg ttttcggttg cattgtttaa
7501 acatgctagt acaactatgc tgatgcagta gttctgcggt ttttggtttc ctgaatacta
7561 gtttttgcca acattctggc ttgtagtttc ctgcctaaca caccttgcca acatataatc
7621 cagtccaact ttgcaattat actatgaatc atgtttgttt aaatacaact gtagttcaac
7681 tatgtgtcat gcacatatat tatattatcc tacacacctt aaactgcttt taggcacata
7741 ttttgtagat tatctatatc cttgattgca gtgctggctt ttgcacatgt ttaaactgcc
7801 aaggttgtgt catgcattat aaataagttg tatgttactc atataattaa ttgcatatag
7861 gtattacacc gttttcggtt acagttttac aagcaattgt tctttttata ct
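
The coordinates in the FEATURES table are 1-based and inclusive, and the ORIGIN block stores the sequence in groups of ten bases behind a position counter. The following is a minimal sketch, not part of the GenBank record, of how those two conventions can be combined to pull a feature out of the sequence and check it against the annotated translation. The helper names (`parse_origin`, `extract`, `translate`) are hypothetical, the standard genetic code is assumed, and only the first three ORIGIN lines are embedded to keep the example short. Circular wrap-around and partial locations such as `<3270..3578` are not handled.

```python
# Sketch: read GenBank ORIGIN lines, slice features by 1-based inclusive
# coordinates, and translate with the standard genetic code (an assumption).

BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Build the 64-entry codon table from the conventional T, C, A, G ordering.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def parse_origin(lines):
    """Join ORIGIN lines into one lowercase string, dropping the leading
    position counter and the spaces between 10-base groups."""
    chunks = []
    for line in lines:
        fields = line.split()
        if fields and fields[0].isdigit():
            chunks.extend(fields[1:])
    return "".join(chunks).lower()


def extract(seq, start, end):
    """Return bases start..end using GenBank's 1-based inclusive numbering."""
    return seq[start - 1:end]


def translate(dna):
    """Translate complete codons only; unrecognised codons become 'X'."""
    usable = len(dna) - len(dna) % 3
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First three ORIGIN lines of the record, copied verbatim.
origin_lines = [
    "1 taataataat aatcttagta taaaaaagta gggagtgacc gaaagtggtg aaccgaaaac",
    "61 ggttggtata taaagcacat agtattttgt gcaaacctac agacgccatg ttcaaaaatc",
    "121 ctgcagaaag acctcggaaa ttgcatgaac taagctcggc attggaaata ccctacgatg",
]

seq = parse_origin(origin_lines)
tata_1 = extract(seq, 19, 24)                  # regulatory 19..24
tata_2 = extract(seq, 69, 74)                  # regulatory 69..74
e6_start = translate(extract(seq, 108, 140))   # first 11 codons of CDS 108..557

print(tata_1, tata_2, e6_start)  # tataaa tataaa MFKNPAERPRK
```

Both annotated TATA boxes resolve to `tataaa`, and the first eleven codons of the E6 CDS reproduce the start of its `/translation` qualifier (`MFKNPAERPRK`). This gives a quick sanity check that the sequence and the feature coordinates agree. For real work on the full record, an established parser such as Biopython's `SeqIO` is likely a better choice than hand-rolled helpers.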


