CGDB Gene Information
Tag | Content |
---|---|
CGDB ID | CGD-DaR-072764 |
Ensembl Accession | ENSDARP00000007258.6 |
Protein Name | Period 4 |
Gene Name | per1b; per4 |
EMBL (Genbank) ID | AAR05819.1 |
Organism | Danio rerio |
NCBI Taxa ID | 7955 |
Circadian Information | |
Protein Sequence (Fasta) |
MSDDNSDSAP SNDADSGAGG IEKKAGRSCG MSESSPSSNP ESSGSGGLSG PKGSAGGNRG 60 VNSDDTDGLS SGNDSGERES EGGMQRGSGS RGRQSNRSYQ SSSSQNGKDS AMGMETTESN 120 KSSNSHSPSP PSSSLAYSLL SASSEQDPPS TSGCSSDQSA RVQTQKELMR ALNELKIRLP 180 PERKMKGRSS TLNALKYALS CVRQVRANRE YYHQWNVEEC HGCSLDLSTF TVEELDNITS 240 EYTLKNTDTF TMAVSFLSGK VVYISPQGSS LLRSKPERLH GVLFSELLAP QDVSTFYSNT 300 APCKLPAWAS CIGSVSPPME CTQEKSMSCR ISGDVSSSSD VRYYPFRLTP YLLTLRDSDM 360 AFPQPCCLLI AERVHSGYEA PRIPLDKRIF TTSHTPSCVF QEVDERAVPL LGYLPQDLVG 420 TPVLLCIHPD DRHIMVAIHK KILQFAGQPF EHSPLRMCAR NGEYMTIDTS WSSFINPWSR 480 KVAFIVGRHK VRTSPLNEDV FTPPRGLEER ALTPDIVQLS EQIHRLLVQP VHCGSSQGYG 540 SLPSNGSHEH QPSAASSSDS SGPGLEDPSQ LHKPMTFQQI CKDVHMVKTN GQQVFIDSRN 600 RPPPKKHSTA GALKAGQSAE VCRSLVPCAA PPSKSSAPSL IVQKEPPTTF SYQQINCLDS 660 IIRYLESCNV PNTVKRKCGS SSCTASSTSD DDKQQEAPGN AKGPSVSLVD DSALLPPLAL 720 HNKAESVASV TSQCSFSSTI VHVGDKKPPE SDIVMEEAPP TPNTALPVTQ PQFPPMATPS 780 LPLSPAPDRD AGRRGGPGAS AGGERLGLTK EVLSAHTQQE EQNFMCRFGD LSKLRVFDPT 840 SAVRRRPNAP LSRGVRCSRD YPAAGSSGRR RGRGGKRLKH QESSEQTGSC SPAGPIRGLL 900 PGVPALGRPS NPSIPMGPTA SSSSWPTSGS QASVPNVQYP PTVLPLYPVY PPISHPVSDP 960 SMQPGLRFPL QNSQMAPPMV PPMMALVLPN YMFPQPSVGM AQPFYSPNSA FPFAAANMGS 1020 PAPCQIQTPI QRAHSRSSTP HSYSQRENGA EREGAESPLF QSRCSSPLNL LQLEESPSNR 1080 FEVASGQQTT SPMVGQGGGA GGQASSNQGG SAVDSKTNEN GETNESNQDA MSTSSDLLDL 1140 LLQEDSRSGT GSAASGSGSS GTGSSGSGSG SSGSGSNGCS SSGSGTRSSQ SSNTSKYFGS 1200 VDSSENSHSR KQTAEGDGEA QLIKCVLQDP IWLLMANTDE KTMMTYQLPI RDRDSVLKED 1260 RAALRAMQEH QPRFTEEQKS ELSQVHPWIR TGRLPRAINI SACAGCRSPP SVPSATPFDV 1320 EIHEMEFCSV LAVAEEKQTP TDTVMEKSET DGQNETCKEN NGTVTTAQIN DQEMLTEEQE 1380 MTSQIEEEMG ASHTQMTH 1398 |
Nucleotide Sequence (Fasta) |
ATGAGTGATG ACAACTCAGA CTCCGCCCCC AGCAATGATG CAGACAGCGG AGCAGGAGGG 60 ATAGAGAAAA AGGCTGGGCG ATCCTGTGGC ATGAGCGAGA GCTCGCCCTC ATCAAATCCA 120 GAGAGTAGTG GGTCAGGAGG GCTATCCGGT CCCAAAGGAT CTGCAGGTGG CAACCGAGGT 180 GTGAATTCTG ATGATACAGA TGGACTGTCC AGTGGAAATG ACTCTGGGGA GAGGGAAAGT 240 GAAGGAGGGA TGCAGAGAGG GAGCGGATCC AGAGGGAGAC AGTCAAACAG AAGTTACCAG 300 AGCTCCTCCA GCCAGAATGG GAAAGACTCT GCCATGGGTA TGGAGACAAC GGAGAGCAAC 360 AAGAGCTCAA ACTCTCACAG CCCTTCTCCA CCCAGCAGTT CTCTGGCATA CAGTCTGTTG 420 AGTGCCAGCT CTGAGCAGGA CCCGCCATCC ACCAGCGGCT GCAGCAGTGA TCAGTCTGCC 480 CGCGTCCAGA CTCAGAAAGA GCTAATGAGA GCTCTGAATG AGCTGAAGAT TCGTCTGCCG 540 CCAGAGAGGA AGATGAAGGG ACGCTCCAGC ACCCTGAACG CCCTCAAATA CGCCCTCAGC 600 TGTGTCAGAC AAGTCAGAGC CAATCGGGAG TATTACCACC AGTGGAATGT AGAGGAGTGC 660 CATGGCTGTA GTCTAGATCT CTCCACCTTC ACTGTCGAAG AACTGGACAA TATCACCTCA 720 GAATACACAC TCAAAAACAC AGACACATTC ACCATGGCGG TGTCATTCCT GTCTGGGAAG 780 GTTGTGTACA TCTCTCCGCA GGGCTCATCT CTTCTGCGCA GTAAGCCAGA GAGGCTGCAC 840 GGGGTGTTGT TTTCAGAGCT GCTGGCTCCC CAGGACGTGA GCACCTTTTA CAGCAACACA 900 GCACCCTGCA AACTGCCAGC GTGGGCCTCC TGCATCGGCT CTGTTTCTCC ACCAATGGAG 960 TGCACCCAAG AGAAGTCGAT GTCTTGTCGC ATCAGTGGAG ACGTCTCATC CAGCAGTGAT 1020 GTGAGGTACT ACCCTTTCAG ACTGACCCCG TACCTGCTGA CCCTCAGAGA CTCCGACATG 1080 GCTTTCCCTC AGCCCTGCTG TCTGCTCATC GCCGAGAGAG TGCACTCGGG ATATGAGGCC 1140 CCTCGCATTC CTCTGGATAA GAGGATATTC ACCACTAGTC ACACCCCCAG CTGTGTCTTC 1200 CAGGAGGTGG ATGAGAGGGC AGTGCCTCTG TTAGGATACC TGCCTCAGGA CCTGGTTGGA 1260 ACGCCGGTCC TGCTGTGCAT TCATCCAGAT GACAGACACA TCATGGTGGC AATCCATAAG 1320 AAGATTCTTC AGTTTGCTGG GCAGCCGTTC GAGCACTCTC CCTTGAGGAT GTGTGCGCGT 1380 AATGGAGAGT ATATGACTAT AGACACCTCT TGGTCCTCCT TCATCAACCC CTGGAGCAGG 1440 AAAGTGGCCT TCATCGTTGG ACGGCATAAA GTCAGAACCT CTCCACTGAA CGAAGATGTG 1500 TTCACGCCGC CGCGTGGTCT GGAAGAGCGA GCCCTGACTC CAGACATTGT TCAGCTCAGT 1560 GAACAGATCC ACAGGCTCCT AGTTCAGCCC GTCCACTGTG GCAGCTCTCA GGGCTATGGC 1620 AGTCTGCCCA GCAACGGCTC CCACGAGCAT CAGCCCAGCG CAGCCTCTTC ATCAGACAGC 1680 AGCGGGCCTG GTCTGGAGGA 
CCCCTCACAG CTACACAAAC CTATGACCTT CCAGCAAATC 1740 TGTAAAGATG TTCACATGGT CAAGACAAAC GGACAGCAAG TCTTCATTGA TTCCCGCAAC 1800 CGACCTCCTC CCAAAAAACA TTCCACTGCA GGTGCTCTGA AAGCAGGCCA GTCAGCAGAA 1860 GTGTGTAGAA GTCTGGTTCC TTGTGCTGCT CCACCCTCAA AGAGCTCCGC GCCTTCACTA 1920 ATCGTGCAAA AAGAACCACC AACAACATTC TCGTACCAGC AGATTAACTG TTTGGACAGC 1980 ATCATAAGGT ATCTTGAGAG CTGTAATGTC CCAAACACGG TCAAGAGGAA GTGTGGCTCC 2040 TCCTCCTGCA CCGCCTCCTC TACGTCGGAT GATGACAAAC AGCAGGAAGC TCCAGGCAAC 2100 GCTAAAGGTC CGTCTGTGTC TCTGGTAGAT GATAGCGCTT TGTTACCTCC ACTGGCCCTG 2160 CACAATAAAG CTGAGAGCGT TGCCTCTGTC ACATCCCAGT GCAGTTTCAG CAGCACCATC 2220 GTCCATGTTG GGGACAAGAA ACCTCCCGAG TCAGATATCG TAATGGAGGA AGCTCCTCCC 2280 ACTCCAAATA CTGCTCTGCC TGTCACTCAA CCTCAATTTC CTCCCATGGC CACACCCTCT 2340 CTGCCTCTTA GCCCTGCTCC TGATAGGGAT GCTGGGAGAA GAGGAGGACC AGGGGCTTCT 2400 GCAGGAGGAG AAAGGTTGGG TCTTACGAAG GAAGTCCTGT CAGCTCACAC ACAACAAGAG 2460 GAGCAAAACT TCATGTGCCG CTTCGGAGAC CTGAGCAAAC TTAGAGTCTT TGACCCAACT 2520 TCAGCTGTCC GCCGACGACC AAATGCACCT TTATCAAGAG GTGTGCGCTG CTCTCGTGAT 2580 TATCCAGCGG CAGGCAGCAG TGGGCGCAGA CGTGGTCGAG GAGGCAAAAG ACTAAAGCAT 2640 CAGGAATCTT CAGAGCAAAC AGGTTCTTGT AGTCCAGCCG GACCAATAAG AGGCCTCCTC 2700 CCTGGAGTTC CCGCTCTGGG CAGACCCTCT AACCCCTCCA TACCCATGGG TCCTACAGCC 2760 AGCTCCTCGT CCTGGCCCAC ATCAGGGTCA CAAGCCAGTG TTCCCAATGT CCAATATCCA 2820 CCCACCGTCC TTCCTTTATA TCCAGTCTAT CCCCCAATCT CCCATCCCGT CTCAGACCCC 2880 AGCATGCAGC CAGGCCTTCG CTTTCCCCTT CAGAACTCAC AAATGGCGCC TCCGATGGTC 2940 CCCCCAATGA TGGCATTGGT ATTACCCAAC TACATGTTCC CACAGCCCAG CGTAGGCATG 3000 GCTCAGCCCT TCTACAGCCC AAACTCGGCT TTCCCCTTCG CCGCTGCCAA CATGGGCTCG 3060 CCTGCTCCCT GTCAGATCCA GACCCCAATA CAACGCGCTC ATTCTCGCTC CAGCACACCT 3120 CACTCCTATA GCCAAAGAGA AAACGGTGCA GAGAGAGAAG GAGCAGAGTC TCCCCTCTTT 3180 CAGTCCCGAT GCTCATCGCC CTTAAATCTG CTGCAGCTGG AAGAGTCGCC AAGCAATCGG 3240 TTCGAGGTGG CGTCAGGACA GCAGACTACA TCTCCTATGG TGGGACAAGG AGGTGGAGCT 3300 GGAGGACAGG CCTCATCTAA CCAGGGAGGG TCAGCGGTGG ATTCAAAAAC AAACGAAAAT 3360 GGTGAAACAA ATGAGTCCAA TCAAGACGCC 
ATGTCCACCT CCAGTGATCT GCTGGACCTG 3420 CTGTTACAGG AGGATTCCCG CTCAGGCACT GGCTCGGCTG CCTCAGGCTC GGGCTCTTCA 3480 GGCACAGGGT CCTCAGGCTC AGGCTCCGGT TCCTCTGGAT CCGGCTCCAA TGGCTGCAGC 3540 TCTTCCGGAA GTGGAACTAG AAGCAGTCAG AGCAGCAATA CCAGCAAATA CTTTGGCAGC 3600 GTGGACTCAT CAGAGAACAG TCATTCCCGC AAGCAGACAG CAGAGGGCGA CGGAGAGGCG 3660 CAGCTCATCA AATGTGTCCT TCAGGACCCC ATCTGGCTCC TCATGGCCAA CACAGATGAA 3720 AAAACTATGA TGACCTACCA GCTGCCAATA AGAGACAGAG ATTCAGTGTT GAAAGAGGAC 3780 AGAGCTGCGC TGAGGGCCAT GCAGGAACAT CAGCCTCGCT TCACTGAAGA ACAGAAGAGC 3840 GAGTTGAGTC AGGTTCACCC ATGGATCCGC ACCGGACGCC TTCCTCGTGC CATTAACATT 3900 TCAGCATGTG CAGGCTGTAG ATCCCCTCCT TCAGTCCCCT CTGCGACACC ATTCGACGTG 3960 GAGATCCACG AGATGGAGTT CTGCAGTGTT TTAGCAGTGG CGGAAGAGAA ACAAACCCCT 4020 ACAGATACAG TTATGGAGAA AAGCGAAACT GACGGACAAA ATGAGACATG CAAAGAGAAC 4080 AATGGGACTG TTACGACAGC ACAAATCAAT GACCAAGAAA TGCTGACGGA AGAACAGGAA 4140 ATGACCTCGC AGATCGAGGA AGAAATGGGT GCCTCACATA CACAGATGAC ACACTAA 4197 |
Sequence Source | Ensembl |
Gene Ontology | GO:0071542--P:dopaminergic neuron differentiation |
Interpro | IPR000014--PAS |
PROSITE | PS50112--PAS |
Pfam | |
SMART | SM00091--PAS |
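
Both sequence fields in this record are printed in blocks of ten characters with running position counters (60, 120, ...), so they need cleaning before use in other tools. The following is a minimal sketch of that cleanup, not part of CGDB itself. The helper names `clean_blocked_sequence` and `expected_cds_length` are hypothetical, and the example assumes the nucleotide field is a coding sequence that ends in a single stop codon. This is consistent with the lengths shown in the record, where 1398 residues correspond to 4197 nucleotides.

```python
import re


def clean_blocked_sequence(text: str) -> str:
    """Return only the sequence letters from a blocked, numbered sequence field.

    Position counters (digits), spaces, line breaks, and table pipes are dropped.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()


def expected_cds_length(protein_length: int) -> int:
    """Length of a coding sequence: three nucleotides per residue plus one stop codon.

    Assumes the nucleotide field includes the stop codon, as it appears to here.
    """
    return 3 * (protein_length + 1)


# Short excerpts copied from the start of the two sequence fields in this record.
protein_start = "MSDDNSDSAP SNDADSGAGG IEKKAGRSCG MSESSPSSNP ESSGSGGLSG PKGSAGGNRG 60"
nucleotide_start = "ATGAGTGATG ACAACTCAGA CTCCGCCCCC AGCAATGATG CAGACAGCGG AGCAGGAGGG 60"

protein = clean_blocked_sequence(protein_start)
cds = clean_blocked_sequence(nucleotide_start)

print(len(protein), len(cds))     # both excerpts hold 60 sequence characters
print(expected_cds_length(1398))  # compare with the final counter of the nucleotide field
```

Applying `clean_blocked_sequence` to the full fields and comparing the nucleotide length with `expected_cds_length(len(protein))` gives a quick consistency check on a copied record.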