SFI142
Library SF
(Link to library)
Clone ID SFI142
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U13888-1
Original site URL
Representative seq. ID SFI142P
(Link to Original site)
Representative DNA sequence
>SFI142 (SFI142Q) /CSM/SF/SFI1-B/SFI142Q.Seq.d/
ACTGTTGGCCTACTGGNATTTTTTTTTATTTTTATTTTCACAACTAATAACAGNACTAAA
TTAATAAAAATAAAAATAAAAATAAACAAACCAAATTAAAAATATTAAATAAAATGGAAT
CTAACACAAATTCTCAAGGACAAGGTATTATTCCTCAATCATATCATTCATCAATTTTCT
TTTCAATTTCAAAAGGATCTGATAAGATTGGTGGATTATTAGAGTATTTAGAGATTATTA
AAAAACATAATATCAACATTACAAGAATTGAATCAAGACCATCAAAAACCGAAAAAAAAG
ATTATGATTTCTTTTTAGATTTAGAATATCCAACAGAAAACAATAAAGAAGTTGAAAAGG
TTATTAAAGATCTCGAAGAAAAAGGTGTAAAAGCTACAACCCTTCAAGAAAGTTCAAACC
AAACTTATGCTCCATGGTTTCCAAGAAAAATTTCAGATTTAGATTTATTTGCAAATAAAG
TATTAGAAATGGGATCAGATTTAACTTCAGATCATCCAGGTGCTTCAGATCCAGTTTACA
GAGAAAXXXXXXXXXXTTCGTCCAGTACAAGGTTTACTCTCTGCTAGAGATTTCTTAAAT
GGTTTAGCTTTCCGTGTATTCCATGCAACTCAATATATTAGACATCCATCCGTACCATTA
TATACACCAGAACCAGATTGTTGTCATGAATTATTAGGTCATGTTCCATTATTAGCTGAT
CCTGATTTCGCTGATTTTAGTCAAGAGATTGGTTTAGCTTCAATTGGTGCTTCTGATGAA
GATATTCAATTACTTAGTACTTGTTATTGGTTTACAGTTGAATTTGGATTATGTAAAGAA
GGTGATACAATTAGAGCATATGGTGCAGGTATTTTATCATCAACAGGTGAAATGGAACAC
TTTTTAACTGATAAAGCAAAAAAATTACCATTTAATCCATTTGACGCATGCAATACTGAA
TATCCAATTACAACATTCCAACCACTTTACTATGTTGCAGAAAGTTTCCAAAAAGCAAAA
GAACAAATGAGACAATTTGCTGATAGCTTTAAAAAACCATTTTCAATTCGTTACAATCCA
TACACTCAATCAATTGAAATACTTGATAACAAAGATAAATTATTAAATATTTGCAATNAT
ATTAGAAATCAATC
sequence update 2001.11.22
Translated Amino Acid sequence
cwptgiffyfyfhn**qx*INKNKNKNKQTKLKILNKMESNTNSQGQGIIPQSYHSSIFF
SISKGSDKIGGLLEYLEIIKKHNINITRIESRPSKTEKKDYDFFLDLEYPTENNKEVEKV
IKDLEEKGVKATTLQESSNQTYAPWFPRKISDLDLFANKVLEMGSDLTSDHPGASDPVYR
E---

---RPVQGLLSARDFLNGLAFRVFHATQYIRHPSVPLYTPEPDCCHELLGHVPLLADPDF
ADFSQEIGLASIGASDEDIQLLSTCYWFTVEFGLCKEGDTIRAYGAGILSSTGEMEHFLT
DKAKKLPFNPFDACNTEYPITTFQPLYYVAESFQKAKEQMRQFADSFKKPFSIRYNPYTQ
SIEILDNKDKLLNICNXIRNQ


Translated Amino Acid sequence (All Frames)
Frame A:
tvgllxfffififttnnxtklikikikinkpn*ky*ikwnltqilkdkvlflnhiihqfs
fqfqkdlirlvdy*si*rllkniistlqelnqdhqkpkkkimisf*i*niqqktikklkr
llkiskkkv*klqpfkkvqtklmlhgfqekfqi*iylqiky*kwdqi*lqiiqvlqiqft
ek---

---fvqykvyslleis*mv*lsvysmqlnildihpyhyihqnqivvmny*vmfhy*lili
slilvkrlv*lqlvllmkifnylvlviglqlnldyvkkviqlehmvqvfyhqqvkwntf*
likqknyhlihlthailniqlqhsnhftmlqkvskkqknk*dnllialknhfqfvtihtl
nqlkylitkiny*ifaxilein

Frame B:
llaywxfflflfsqlitxln**k*k*k*tnqiknik*ngi*hkfsrtryyssiisfinfl
fnfkri**dwwiirvfrdy*kt*yqhykn*iktiknrkkrl*flfrfrisnrkq*rs*kg
y*rsrrkrcksynpsrkfkpnlcsmvskknfrfrfick*sirngirfnfrssrcfrsslq
r---

---ssstrftlc*rflkwfsfpcipcnsiy*tsirtiiytrtrlls*iirscsiis*s*f
r*f*srdwfsfnwcf**rysit*ylllvys*iwim*rr*yn*siwcryfiinr*ngtlfn
**skkiti*si*rmqy*isnynipttllccrkfpkskrtnetic**l*ktifnslqsihs
in*nt**qr*iikylqxy*ksi

Frame C:
cwptgiffyfyfhn**qx*INKNKNKNKQTKLKILNKMESNTNSQGQGIIPQSYHSSIFF
SISKGSDKIGGLLEYLEIIKKHNINITRIESRPSKTEKKDYDFFLDLEYPTENNKEVEKV
IKDLEEKGVKATTLQESSNQTYAPWFPRKISDLDLFANKVLEMGSDLTSDHPGASDPVYR
E---

---RPVQGLLSARDFLNGLAFRVFHATQYIRHPSVPLYTPEPDCCHELLGHVPLLADPDF
ADFSQEIGLASIGASDEDIQLLSTCYWFTVEFGLCKEGDTIRAYGAGILSSTGEMEHFLT
DKAKKLPFNPFDACNTEYPITTFQPLYYVAESFQKAKEQMRQFADSFKKPFSIRYNPYTQ
SIEILDNKDKLLNICNXIRNQ

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

SFI142 (SFI142Q) /CSM/SF/SFI1-B/SFI142Q.Seq.d/ 1907 0.0
SSM681 (SSM681Q) /CSM/SS/SSM6-D/SSM681Q.Seq.d/ 1138 0.0
SFK632 (SFK632Q) /CSM/SF/SFK6-B/SFK632Q.Seq.d/ 1138 0.0
SFK580 (SFK580Q) /CSM/SF/SFK5-D/SFK580Q.Seq.d/ 1138 0.0
SFB840 (SFB840Q) /CSM/SF/SFB8-B/SFB840Q.Seq.d/ 1138 0.0
AFI283 (AFI283Q) /CSM/AF/AFI2-D/AFI283Q.Seq.d/ 1138 0.0
AFC478 (AFC478Q) /CSM/AF/AFC4-D/AFC478Q.Seq.d/ 1138 0.0
SFK602 (SFK602Q) /CSM/SF/SFK6-A/SFK602Q.Seq.d/ 1122 0.0
SFJ456 (SFJ456Q) /CSM/SF/SFJ4-C/SFJ456Q.Seq.d/ 1122 0.0
SFJ372 (SFJ372Q) /CSM/SF/SFJ3-C/SFJ372Q.Seq.d/ 1122 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

BG225890|0 90 8e-14 1
AC116984|AC116984.2 Dictyostelium discoideum chromosome 2 map 2567470-3108875 strain AX4, complete sequence. 36 5e-07 22
AP006628|AP006628.1 Onion yellows phytoplasma DNA, complete genome. 38 4e-04 21
AP003188|AP003188.2 Clostridium perfringens DNA, complete genome, section 4/10. 40 0.001 14
AC115598|AC115598.2 Dictyostelium discoideum chromosome 2 map 581427-735498 strain AX4, complete sequence. 36 0.017 14
AL671873|AL671873.15 Mouse DNA sequence from clone RP23-382D17 on chromosome X. 42 0.017 2
CC139550|CC139550.1 NDL.48G14.SP6 Notre Dame Liverpool Aedes aegypti genomic clone NDL.48G14, DNA sequence. 52 0.017 1
CC852580|CC852580.1 NDL.130C7.T7 Notre Dame Liverpool Aedes aegypti genomic clone NotreDame Liverpool-130C7, genomic survey sequence. 52 0.017 1
AC016647|AC016647.7 Homo sapiens chromosome 16 clone RP11-89G14, complete sequence. 40 0.018 9
BU770913| (AF031034) tryptophan hydroxylase; SmTPH [Schistosoma mansoni], mRNA sequence. 48 0.020 2
dna update 2003.12.16
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

(Q54XS1) RecName: Full=Probable phenylalanine-4-hydroxylase; ... 410 e-113
BC056537_1(BC056537|pid:none) Danio rerio phenylalanine hydroxyl... 278 2e-73
AY330224_1(AY330224|pid:none) Danio rerio Pah mRNA, complete cds... 278 2e-73
AY615523_1(AY615523|pid:none) Gallus gallus neuronal tryptophan ... 270 7e-71
BC013458_1(BC013458|pid:none) Mus musculus phenylalanine hydroxy... 268 2e-70
EU730760_1(EU730760|pid:none) Micropogonias undulatus tryptophan... 267 4e-70
(Q8IWU9) RecName: Full=Tryptophan 5-hydroxylase 2; EC=1... 267 6e-70
BC114499_1(BC114499|pid:none) Homo sapiens tryptophan hydroxylas... 267 6e-70
BC114442_1(BC114442|pid:none) Homo sapiens tryptophan hydroxylas... 267 6e-70
(Q2HZ26) RecName: Full=Tryptophan 5-hydroxylase 2; EC=1... 267 6e-70
protein update 2009. 5.25
PSORT

psg: 0.58 gvh: 0.35 alm: 0.43 top: 0.53 tms: 0.00 mit: 0.29 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

40.0 %: cytoplasmic
28.0 %: nuclear
16.0 %: mitochondrial
8.0 %: cytoskeletal
4.0 %: vacuolar
4.0 %: vesicles of secretory system

>> prediction for SFI142 is cyt

5' end seq. ID SFI142F
5' end seq.
>SFI142F.Seq
ACTGTTGGCCTACTGGNATTTTTTTTTATTTTTATTTTCACAACTAATAACAGNACTAAA
TTAATAAAAATAAAAATAAAAATAAACAAACCAAATTAAAAATATTAAATAAAATGGAAT
CTAACACAAATTCTCAAGGACAAGGTATTATTCCTCAATCATATCATTCATCAATTTTCT
TTTCAATTTCAAAAGGATCTGATAAGATTGGTGGATTATTAGAGTATTTAGAGATTATTA
AAAAACATAATATCAACATTACAAGAATTGAATCAAGACCATCAAAAACCGAAAAAAAAG
ATTATGATTTCTTTTTAGATTTAGAATATCCAACAGAAAACAATAAAGAAGTTGAAAAGG
TTATTAAAGATCTCGAAGAAAAAGGTGTAAAAGCTACAACCCTTCAAGAAAGTTCAAACC
AAACTTATGCTCCATGGTTTCCAAGAAAAATTTCAGATTTAGATTTATTTGCAAATAAAG
TATTAGAAATGGGATCAGATTTAACTTCAGATCATCCAGGTGCTTCAGATCCAGTTTACA
GAGAAA----------
Length of 5' end seq. 546
3' end seq. ID SFI142Z
3' end seq.
>SFI142Z.Seq
NNNNNNNNNNTTCGTCCAGTACAAGGTTTACTCTCTGCTAGAGATTTCTTAAATGGTTTA
GCTTTCCGTGTATTCCATGCAACTCAATATATTAGACATCCATCCGTACCATTATATACA
CCAGAACCAGATTGTTGTCATGAATTATTAGGTCATGTTCCATTATTAGCTGATCCTGAT
TTCGCTGATTTTAGTCAAGAGATTGGTTTAGCTTCAATTGGTGCTTCTGATGAAGATATT
CAATTACTTAGTACTTGTTATTGGTTTACAGTTGAATTTGGATTATGTAAAGAAGGTGAT
ACAATTAGAGCATATGGTGCAGGTATTTTATCATCAACAGGTGAAATGGAACACTTTTTA
ACTGATAAAGCAAAAAAATTACCATTTAATCCATTTGACGCATGCAATACTGAATATCCA
ATTACAACATTCCAACCACTTTACTATGTTGCAGAAAGTTTCCAAAAAGCAAAAGAACAA
ATGAGACAATTTGCTGATAGCTTTAAAAAACCATTTTCAATTCGTTACAATCCATACACT
CAATCAATTGAAATACTTGATAACAAAGATAAATTATTAAATATTTGCAATNATATTAGA
AATCAATC
Length of 3' end seq. 608
Connected seq. ID SFI142P
Connected seq.
>SFI142P.Seq
ACTGTTGGCCTACTGGNATTTTTTTTTATTTTTATTTTCACAACTAATAACAGNACTAAA
TTAATAAAAATAAAAATAAAAATAAACAAACCAAATTAAAAATATTAAATAAAATGGAAT
CTAACACAAATTCTCAAGGACAAGGTATTATTCCTCAATCATATCATTCATCAATTTTCT
TTTCAATTTCAAAAGGATCTGATAAGATTGGTGGATTATTAGAGTATTTAGAGATTATTA
AAAAACATAATATCAACATTACAAGAATTGAATCAAGACCATCAAAAACCGAAAAAAAAG
ATTATGATTTCTTTTTAGATTTAGAATATCCAACAGAAAACAATAAAGAAGTTGAAAAGG
TTATTAAAGATCTCGAAGAAAAAGGTGTAAAAGCTACAACCCTTCAAGAAAGTTCAAACC
AAACTTATGCTCCATGGTTTCCAAGAAAAATTTCAGATTTAGATTTATTTGCAAATAAAG
TATTAGAAATGGGATCAGATTTAACTTCAGATCATCCAGGTGCTTCAGATCCAGTTTACA
GAGAAA----------TTCGTCCAGTACAAGGTTTACTCTCTGCTAGAGATTTCTTAAAT
GGTTTAGCTTTCCGTGTATTCCATGCAACTCAATATATTAGACATCCATCCGTACCATTA
TATACACCAGAACCAGATTGTTGTCATGAATTATTAGGTCATGTTCCATTATTAGCTGAT
CCTGATTTCGCTGATTTTAGTCAAGAGATTGGTTTAGCTTCAATTGGTGCTTCTGATGAA
GATATTCAATTACTTAGTACTTGTTATTGGTTTACAGTTGAATTTGGATTATGTAAAGAA
GGTGATACAATTAGAGCATATGGTGCAGGTATTTTATCATCAACAGGTGAAATGGAACAC
TTTTTAACTGATAAAGCAAAAAAATTACCATTTAATCCATTTGACGCATGCAATACTGAA
TATCCAATTACAACATTCCAACCACTTTACTATGTTGCAGAAAGTTTCCAAAAAGCAAAA
GAACAAATGAGACAATTTGCTGATAGCTTTAAAAAACCATTTTCAATTCGTTACAATCCA
TACACTCAATCAATTGAAATACTTGATAACAAAGATAAATTATTAAATATTTGCAATNAT
ATTAGAAATCAATC
Length of connected seq. 1144
Full length Seq ID -
Full length Seq. -
Length of full length seq. -