SFG668
Library SF
Clone ID SFG668
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U16442-1
Representative seq. ID SFG668E
Representative DNA sequence
>SFG668 (SFG668Q) /CSM/SF/SFG6-C/SFG668Q.Seq.d/
AAATAAATGAAATTATAAATTATTTAAAATTTATTGAAGATAATAATTTTGAAAATACTA
AACATAAAAATTATAAATTTTTACATGATGAAAACAGAAAAGAAAAATTATTTAGAATTG
AAAAAAAAGTCATGAATTGTGGCGGAGATAGAAATATATCAAAAGAAGTGTGTTTAGAAG
TAATTAATGATTATAGAATGGTATCACTCTGGGAAGAAATGCATAAAGGTCATATAGGAA
GAGATGCAACCTATGGGAACTACGAGACTAAATATTATAATATGGGATTGTATTCTTTTG
TATCTGATGCAGTAGATACGTGTGACATTTGCCAACGAAATAGAATAAAAGGTATAACAA
AGGATTTTGCTCCAATTGTAGATACCGAGGAATACTCAAGATTGGTTTATGATTTAACAT
CAATTAAAGGTGAACATAAAGAAAAAGTTACATACGATGATGATAATGAAAAGATACTAA
CTAAACTAGATGATCTTATCCAATATGACTCTGTACAACCGTACGACACCCGATGTTGTT
TACATTATTCTATGCATAGATTCTTTTACAAAATTTGCTACTGGAAGATGTTTGACAACA
AAGAGAACAGTTCCCATATACAATTTTTTGGCTCTCACGTACTTTGGCAAACCTGTGAAA
GTATGGCACTGCGATAATGGACGTGAATTTAAAAACAAAGTCCAAAAAGAATTCCTAAAA
CTTTTTCCAGGCTCCAAGTCAGCACATGGAGCTCCTCGTACACCAACAACTCAAGGTATG
GTAGAAAGATTGAATCGAACTATCAAGGAGAGGATCTCAAAATTAAAACAACAAGATTTT
CTTGATGGTACTTCTAGGTCTCTTTCTGAACTATTAAAACAAGCTTTGTATGATTACAAT
AATACAAAAACAAGAACAATTAAAATGACACCATCTCAAGCTGTTGGTATTGTTCCTTTG
TTTATTAATGTTCAATCAGAACAAGACTCTCAATCAATTGGTGTTTCAGATGTTTCAAAA
GAAGAAAGAACAGCTATTATTCTTGAAAATCTCACAAGTTACCAAAATCAAGGAATTCAA
AACCACCAAAGGGAT
sequence update 2001. 6. 9
Translated Amino Acid sequence
INEIINYLKFIEDNNFENTKHKNYKFLHDENRKEKLFRIEKKVMNCGGDRNISKEVCLEV
INDYRMVSLWEEMHKGHIGRDATYGNYETKYYNMGLYSFVSDAVDTCDICQRNRIKGITK
DFAPIVDTEEYSRLVYDLTSIKGEHKEKVTYDDDNEKILTKLDDLIQYDSVQPYDTRCCL
HYSMHRFFYKICYWKMFDNKENSSHIQFFGSHVLWQTCESMALR*wt*i*kqspkripkt
fsrlqvstwsssytnnsrygrkiesnyqgedlkikttrfs*wyf*vsf*tiktsfv*lq*
yknknn*ndtisscwycsfvy*csirtrlsinwcfrcfkrrknsyys*kshklpksrnsk
ppkg


Translated Amino Acid sequence (All Frames)
Frame A:
k*mkl*ii*nllkiiilkilnikiinfymmktekknylelkkks*ivaeieiyqkkcv*k
*lmiiewyhsgkkcikvi*eemqpmgttrlniiiwdcillylmq*irvtfaneie*kv*q
rillql*iprntqdwfmi*hqlkvnikkklhtmmimkry*ln*milsnmtlynrttpdvv
yiilcidsftkfatgrclttkrtvpiynflaltyfgkpvkvwhcdngrefknkvqkeflk
lfpgsksahgaprtpttqgmverlnrtikerisklkqqdfldgtsrslsellkqalydyn
ntktrtikmtpsqavgivplfinvqseqdsqsigvsdvskeertaiilenltsyqnqgiq
nhqrd


Frame B:
nk*nyklfkiy*r**f*ky*t*kl*ift**kqkrkii*n*kkshelwrr*kyikrsvfrs
n**l*ngitlgrna*rsyrkrcnlwelrd*il*ygivffci*csryv*hlptk*nkrynk
gfcsncryrgilkigl*fnin*r*t*rksyir****kdtn*tr*sypi*lcttvrhpmlf
tlfya*illqnllledv*qqreqfpytifwlsrtlanl*kygtaimdvnlktkskkns*n
ffqapsqhmellvhqqlkvw*kd*ielsrrgsqn*nnkiflmvllglflny*nklcmiti
iqkqeqlk*hhlkllvlflcllmfnqnktlnqlvfqmfqkkkeqllflkisqvtkikefk
ttkg


Frame C:
INEIINYLKFIEDNNFENTKHKNYKFLHDENRKEKLFRIEKKVMNCGGDRNISKEVCLEV
INDYRMVSLWEEMHKGHIGRDATYGNYETKYYNMGLYSFVSDAVDTCDICQRNRIKGITK
DFAPIVDTEEYSRLVYDLTSIKGEHKEKVTYDDDNEKILTKLDDLIQYDSVQPYDTRCCL
HYSMHRFFYKICYWKMFDNKENSSHIQFFGSHVLWQTCESMALR*wt*i*kqspkripkt
fsrlqvstwsssytnnsrygrkiesnyqgedlkikttrfs*wyf*vsf*tiktsfv*lq*
yknknn*ndtisscwycsfvy*csirtrlsinwcfrcfkrrknsyys*kshklpksrnsk
ppkg
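
The three frames above appear to be translations of the representative DNA sequence starting at its first, second, and third nucleotide with the standard genetic code (for example, the leading codons AAA TAA ATG give k*m in Frame A). A minimal Python sketch of that translation follows. It assumes the standard code; the names `translate` and `three_frames` are hypothetical helpers, not part of the database, and the sketch does not reproduce the record's upper/lower-case convention.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna, offset=0):
    """Translate dna from the given 0-based offset; '*' marks a stop codon.

    Trailing bases that do not fill a complete codon are ignored, and any
    codon containing a non-ACGT character is reported as 'X'.
    """
    dna = dna.upper()
    end = len(dna) - (len(dna) - offset) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(offset, end, 3))


def three_frames(dna):
    """Return the forward frames labelled A, B, C (offsets 0, 1, 2)."""
    return {label: translate(dna, k) for k, label in enumerate("ABC")}


if __name__ == "__main__":
    # First 18 nucleotides of the representative sequence.
    for label, protein in three_frames("AAATAAATGAAATTATAA").items():
        print(label, protein)
```

Run on the first 18 nucleotides, the sketch gives K*MKL*, NK*NY, and INEII, which agree (ignoring case) with the starts of Frames A, B, and C.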


Homology vs CSM-cDNA

                                                   Score    E
Sequences producing significant alignments:        (bits)  Value

SFG668 (SFG668Q) /CSM/SF/SFG6-C/SFG668Q.Seq.d/      1917   0.0
SFE576 (SFE576Q) /CSM/SF/SFE5-D/SFE576Q.Seq.d/      1326   0.0
SFJ664 (SFJ664Q) /CSM/SF/SFJ6-C/SFJ664Q.Seq.d/      1211   0.0
AFO751 (AFO751Q) /CSM/AF/AFO7-C/AFO751Q.Seq.d/      1166   0.0
SFF392 (SFF392Q) /CSM/SF/SFF3-D/SFF392Q.Seq.d/      1156   0.0
VFL688 (VFL688Q) /CSM/VF/VFL6-D/VFL688Q.Seq.d/      1152   0.0
SHI355 (SHI355Q) /CSM/SH/SHI3-C/SHI355Q.Seq.d/      1124   0.0
AHO242 (AHO242Q) /CSM/AH/AHO2-B/AHO242Q.Seq.d/      1114   0.0
CHE831 (CHE831Q) /CSM/CH/CHE8-B/CHE831Q.Seq.d/      1112   0.0
VHC812 (VHC812Q) /CSM/VH/VHC8-A/VHC812Q.Seq.d/      1108   0.0

own update 2002.12. 1
Homology vs DNA

Sequences producing significant alignments:   Score (bits)   E Value   N

U57081|U57081.1 Dictyostelium discoideum Tdd-4 transposable element encodes putative transposition inhibitor and putative transposase, complete cds.   498   0.0   6
AC116957|AC116957.2 Dictyostelium discoideum chromosome 2 map 1685067-2090751 strain AX4, complete sequence.   498   0.0   17
AF298206|AF298206.2 Dictyostelium discoideum transposon Tdd-5, complete sequence.   66   2e-33   6
AL844509|AL844509.1 Plasmodium falciparum chromosome 13.   34   0.003   28
BS000212|BS000212.1 Pan troglodytes chromosome 22 clone:RP43-022K21, map 22, complete sequences.   40   0.005   7
CA128714|CA128714.1 SCJLLR2028A05.g LR2 Saccharum officinarum cDNA clone SCJLLR2028A05 5', mRNA sequence.   50   0.068   1
AE015937|AE015937.1 Clostridium tetani E88, section 2 of 10 of the complete genome.   34   0.12   13
AL844505|AL844505.1 Plasmodium falciparum chromosome 6.   32   0.13   26
AC114263|AC114263.2 Dictyostelium discoideum chromosome 2 map 215673-367476 strain AX4, complete sequence.   30   0.13   13
AE014837|AE014837.1 Plasmodium falciparum 3D7 chromosome 11 section 2 of 8 of the complete sequence.   32   0.15   14
dna update 2004. 2.21
Homology vs Protein

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

U57081_2(U57081|pid:none) Dictyostelium discoideum Tdd-4 transpo...   367   e-100
U57081_1(U57081|pid:none) Dictyostelium discoideum Tdd-4 transpo...   367   e-100
AY634221_2(AY634221|pid:none) Oikopleura dioica transposon LTR r...    60   1e-07
DQ444472_1(DQ444472|pid:none) Nosema bombycis retrotransposon Nb...    59   3e-07
AF098806_1(AF098806|pid:none) Sorghum bicolor Gypsy-Ty3 type ret...    56   2e-06
T27231(T27231)hypothetical protein Y57G11C.19 - Caenorhabditis e...    54   1e-05
U89994_2(U89994|pid:none) Drosophila melanogaster burdock retrot...    52   3e-05
EF591042_2(EF591042|pid:none) Drosophila melanogaster clone 8.1 ...    52   3e-05
AY009101_1(AY009101|pid:none) Aedes aegypti LTR retrotransposon ...    52   4e-05
AY613856_20(AY613856|pid:none) Oikopleura dioica clone BACOIKO00...    52   5e-05
protein update 2009. 7. 3
PSORT

psg: 0.50 gvh: 0.24 alm: 0.48 top: 0.53 tms: 0.00 mit: 0.23 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

52.0 %: nuclear
24.0 %: cytoplasmic
8.0 %: cytoskeletal
8.0 %: mitochondrial
4.0 %: vacuolar
4.0 %: vesicles of secretory system

>> prediction for SFG668 is nuc

5' end seq. ID SFG668F
5' end seq.
>SFG668F.Seq
AAATAAATGAAATTATAAATTATTTAAAATTTATTGAAGATAATAATTTTGAAAATACTA
AACATAAAAATTATAAATTTTTACATGATGAAAACAGAAAAGAAAAATTATTTAGAATTG
AAAAAAAAGTCATGAATTGTGGCGGAGATAGAAATATATCAAAAGAAGTGTGTTTAGAAG
TAATTAATGATTATAGAATGGTATCACTCTGGGAAGAAATGCATAAAGGTCATATAGGAA
GAGATGCAACCTATGGGAACTACGAGACTAAATATTATAATATGGGATTGTATTCTTTTG
TATCTGATGCAGTAGATACGTGTGACATTTGCCAACGAAATAGAATAAAAGGTATAACAA
AGGATTTTGCTCCAATTGTAGATACCGAGGAATACTCAAGATTGGTTTATGATTTAACAT
CAATTAAAGGTGAACATAAAGAAAAAGTTACATACGATGATGATAATGAAAAGATACTAA
CTAAACTAGATGATCTTATCCAATATGACTCTGTACAACCGTACGACACCCGATGTTGTT
TACATTATTCTATGCATAGATTCTTTTACAAAATTTGCTACTGGAAGATGTTTGACAACA
AAG----------
Length of 5' end seq. 603
3' end seq. ID SFG668Z
3' end seq.
>SFG668Z.Seq
----------ATTTAACATCAATTAAAGGTGAACATAAAGAAAAAGTTACATACGATGAT
GATAATGAAAAGATACTAACTAAACTAGATGATCTTATCCAATATGACTCTGTACAACCG
TACGACACCGATGTTGTTTACATTATTCTATGCATAGATTCTTTTACAAAATTTGCTACT
GGAAGATGTTTGACAACAAAGAGAACAGTTCCCATATACAATTTTTTGGCTCTCACGTAC
TTTGGCAAACCTGTGAAAGTATGGCACTGCGATAATGGACGTGAATTTAAAAACAAAGTC
CAAAAAGAATTCCTAAAACTTTTTCCAGGCTCCAAGTCAGCACATGGAGCTCCTCGTACA
CCAACAACTCAAGGTATGGTAGAAAGATTGAATCGAACTATCAAGGAGAGGATCTCAAAA
TTAAAACAACAAGATTTTCTTGATGGTACTTCTAGGTCTCTTTCTGAACTATTAAAACAA
GCTTTGTATGATTACAATAATACAAAAACAAGAACAATTAAAATGACACCATCTCAAGCT
GTTGGTATTGTTCCTTTGTTTATTAATGTTCAATCAGAACAAGACTCTCAATCAATTGGT
GTTTCAGATGTTTCAAAAGAAGAAAGAACAGCTATTATTCTTGAAAATCTCACAAGTTAC
CAAAATCAAGGAATTCAAAACCACCAAAGGGAT
Length of 3' end seq. 683
Connected seq. ID SFG668P
Connected seq.
>SFG668P.Seq
AAATAAATGAAATTATAAATTATTTAAAATTTATTGAAGATAATAATTTTGAAAATACTA
AACATAAAAATTATAAATTTTTACATGATGAAAACAGAAAAGAAAAATTATTTAGAATTG
AAAAAAAAGTCATGAATTGTGGCGGAGATAGAAATATATCAAAAGAAGTGTGTTTAGAAG
TAATTAATGATTATAGAATGGTATCACTCTGGGAAGAAATGCATAAAGGTCATATAGGAA
GAGATGCAACCTATGGGAACTACGAGACTAAATATTATAATATGGGATTGTATTCTTTTG
TATCTGATGCAGTAGATACGTGTGACATTTGCCAACGAAATAGAATAAAAGGTATAACAA
AGGATTTTGCTCCAATTGTAGATACCGAGGAATACTCAAGATTGGTTTATGATTTAACAT
CAATTAAAGGTGAACATAAAGAAAAAGTTACATACGATGATGATAATGAAAAGATACTAA
CTAAACTAGATGATCTTATCCAATATGACTCTGTACAACCGTACGACACCCGATGTTGTT
TACATTATTCTATGCATAGATTCTTTTACAAAATTTGCTACTGGAAGATGTTTGACAACA
AAG----------ATTTAACATCAATTAAAGGTGAACATAAAGAAAAAGTTACATACGAT
GATGATAATGAAAAGATACTAACTAAACTAGATGATCTTATCCAATATGACTCTGTACAA
CCGTACGACACCGATGTTGTTTACATTATTCTATGCATAGATTCTTTTACAAAATTTGCT
ACTGGAAGATGTTTGACAACAAAGAGAACAGTTCCCATATACAATTTTTTGGCTCTCACG
TACTTTGGCAAACCTGTGAAAGTATGGCACTGCGATAATGGACGTGAATTTAAAAACAAA
GTCCAAAAAGAATTCCTAAAACTTTTTCCAGGCTCCAAGTCAGCACATGGAGCTCCTCGT
ACACCAACAACTCAAGGTATGGTAGAAAGATTGAATCGAACTATCAAGGAGAGGATCTCA
AAATTAAAACAACAAGATTTTCTTGATGGTACTTCTAGGTCTCTTTCTGAACTATTAAAA
CAAGCTTTGTATGATTACAATAATACAAAAACAAGAACAATTAAAATGACACCATCTCAA
GCTGTTGGTATTGTTCCTTTGTTTATTAATGTTCAATCAGAACAAGACTCTCAATCAATT
GGTGTTTCAGATGTTTCAAAAGAAGAAAGAACAGCTATTATTCTTGAAAATCTCACAAGT
TACCAAAATCAAGGAATTCAAAACCACCAAAGGGAT
Length of connected seq. 1286
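
The reported lengths appear to count nucleotides only, leaving out the run of "-" characters that marks the gap between the 5' and 3' end reads, so the connected length is 603 + 683 = 1286. A small Python sketch of that count follows. It assumes "-" is the only gap character and that header lines begin with ">"; `seq_length` is a hypothetical helper name, not a database tool.

```python
def seq_length(lines):
    """Count residues in FASTA-style lines, skipping '>' headers and '-' gaps."""
    total = 0
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            continue  # header line, carries no sequence
        total += sum(1 for ch in line if ch != "-")
    return total


if __name__ == "__main__":
    # Last line of the 5' end read and first line of the 3' end read.
    five_prime_tail = [">SFG668F.Seq", "AAG----------"]
    three_prime_head = [
        ">SFG668Z.Seq",
        "----------ATTTAACATCAATTAAAGGTGAACATAAAGAAAAAGTTACATACGATGAT",
    ]
    print(seq_length(five_prime_tail))   # gap dashes are not counted
    print(seq_length(three_prime_head))
```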
Full length Seq ID SFG668E
Full length Seq.
>SFG668E.Seq
AAATAAATGAAATTATAAATTATTTAAAATTTATTGAAGATAATAATTTTGAAAATACTA
AACATAAAAATTATAAATTTTTACATGATGAAAACAGAAAAGAAAAATTATTTAGAATTG
AAAAAAAAGTCATGAATTGTGGCGGAGATAGAAATATATCAAAAGAAGTGTGTTTAGAAG
TAATTAATGATTATAGAATGGTATCACTCTGGGAAGAAATGCATAAAGGTCATATAGGAA
GAGATGCAACCTATGGGAACTACGAGACTAAATATTATAATATGGGATTGTATTCTTTTG
TATCTGATGCAGTAGATACGTGTGACATTTGCCAACGAAATAGAATAAAAGGTATAACAA
AGGATTTTGCTCCAATTGTAGATACCGAGGAATACTCAAGATTGGTTTATGATTTAACAT
CAATTAAAGGTGAACATAAAGAAAAAGTTACATACGATGATGATAATGAAAAGATACTAA
CTAAACTAGATGATCTTATCCAATATGACTCTGTACAACCGTACGACACCCGATGTTGTT
TACATTATTCTATGCATAGATTCTTTTACAAAATTTGCTACTGGAAGATGTTTGACAACA
AAGAGAACAGTTCCCATATACAATTTTTTGGCTCTCACGTACTTTGGCAAACCTGTGAAA
GTATGGCACTGCGATAATGGACGTGAATTTAAAAACAAAGTCCAAAAAGAATTCCTAAAA
CTTTTTCCAGGCTCCAAGTCAGCACATGGAGCTCCTCGTACACCAACAACTCAAGGTATG
GTAGAAAGATTGAATCGAACTATCAAGGAGAGGATCTCAAAATTAAAACAACAAGATTTT
CTTGATGGTACTTCTAGGTCTCTTTCTGAACTATTAAAACAAGCTTTGTATGATTACAAT
AATACAAAAACAAGAACAATTAAAATGACACCATCTCAAGCTGTTGGTATTGTTCCTTTG
TTTATTAATGTTCAATCAGAACAAGACTCTCAATCAATTGGTGTTTCAGATGTTTCAAAA
GAAGAAAGAACAGCTATTATTCTTGAAAATCTCACAAGTTACCAAAATCAAGGAATTCAA
AACCACCAAAGGGAT
Length of full length seq. 1095