dcGO - A comprehensive domain-centric ontology resource for post-genomic research on functions, phenotypes, diseases and more   
  
  

gi|125973736|ref|YP_001037646.1| (Clostridium thermocellum ATCC 27405)

(
show help)
Now Using Gene Ontology (GO). You can switch to another ontology:

Protein sequence

Comment glycosyltransferase [Clostridium thermocellum ATCC 27405]
Sequence length 2922
Sequence
MQMQLYILYLLGLFGILLCLFLLAIFSNCNERQRQLKVQDASLTFDELEAYAKEIAIEHS
VSGKKSMFSWPIPRMNDNYRYIMSVYKEMNEDVQKGISTTPAAEWLLDNFYIIEEQVKSL
RRDLTKEVYAKLPVLDSGHLKGYARIYSIALELLSHTDGRIDEKVLVNYIKAYQSNNVLT
GRELWAFPIMLKLVLIEKTRYICEKIAKAQEQRRKVEEILKAFDENIENTTQLITAIDNE
LKGKYEVNSAFIEYLAYKFRKMGRAYTHVLRYIDERLGESGTTVDDITQKEHNEQTASKA
SIGNCIMSLKFISTVNWVDIFEQLSKVEQILREDPSGFYSLMDFDSRNYYRNRVEKLALK
YKVSESHVAKKAVELARNAVENGNLTDKRLTHVGYYLVGKGICELEKEIGYEKSFNQRMF
ERIKEHPACLYFGFIGFITVLLLLCVTKYSLFRAEKYGIALSIIAVLATIIPATDIAVNF
VNWVLCKMIKPSLLPKLDFENGIPEEYATMVVIPALLPDENRARELIDNLEVYYLANREK
NLYFSIAGDFKDAPNKEMAGDKKIIETALGRIAELNEKYGRKNEGGEKDSRDIFYYFHRH
RQFNEKQNKWMGWERKRGALLEFNEVLLGSRTTSYSIMSHDVSQLPKIKYVITLDADTIL
PLGAARKLIGTMAHPLHRPVIDEQKGIVTEGYGLLQPRIGFDIESVNKSLFSRIFAGEEG
IDPYASAISDVYQDLFGEGIFTGKGIYDLEVFQKLLKDAIPDNTVLSHDLLEGSYVRAGL
VTDIEFIDGYPSKLNSYAMRLHRWVRGDWQLLPWLRGKTKDRKGNVIKNPLSLISRWKIL
DNLRRSIVAPSITLLIALGFSILPGSSLFWLGASLLTIYFPLITGTIDYIASKPLGAITS
KRYKPAICGLKASFLQMTLQFVFLPYNAWLMVHAAVLSLVRVLFTKRNMLEWVTALDAER
GLKNSLKGYVIKMKAAAFQALVVVVLAFAFKTGFSAAVSVLPFAVWVSSPFIAYWISKET
VYKTETLSDEENLELRRIARKTWRYYEEFVNRRNNYLAPDNFQEDPPNGIAYRTSPTNIG
LGMLAALTARDLGYIGTLELCDIISRTMSTVEKMEKWNGHLYNWYDTRTLETLRPRYIST
VDSGNFVCYLITLKEGLAEYLNRPLEDRAFIDGIRDTASLIADENENPYKDISCLKECIV
ISEGRSYVDIPQMMKALTKLSEDGNKMKDSKDVWKAKVDSMIEMLKIELYTYMPWCDMID
ELTEAFEKSEADIKEAFHGIIRKLNSDYSLKAMPVVYRETIKQIEKLRKKLKDGQQKNIE
GLDRLKEALEGATESADKLVKRYVDLINRICRIADETEFVHLYDKKKQLFSIGYNIEENS
LTNSYYDLLASEARQTSYIAIARGEVDQQHWFKLGRTLTQIDRYKGMVSWSGTMFEYFMP
LLIMKSHKNTLLDETYSFVVRSQKKYGKQRNLPWGISESGFYSFDINLDYQYKAFGVPWL
GLKRGLVEDMVVSPYATMLVLPLVPRDAMDNLKRLIAEGAYGHYGMYEAIDYTPERIPLG
EKKGIVKSYMAHHQGMSILALNNYFNDNIMQKRFHADPVVDAAKLLLMEKVPSNIVFTKE
NKEKILPFKDVVYDEKDFLRECGMPDPVLPKAHILSNGNYSVMVTDRGTGYSRWKNLDVT
RWREDVTLDNYGMFFYIRDVQNDEVWTSTFAPGRKKPDEYKVEFTSGKAKYYRKDGDIDT
LTEIVVCAGENAEIRSITLANHGQESCVMEITSYFEPVLSHHGADIAHPAFGNLFIRTEF
LAEHNCLIAGRRPRSEKEKPVWIMNTVVLEGEGVGSLQYETDRMQFIGRGRNVSEPVALE
PHRPLTNSVGAVLDPVMSFRQIVRVEPGKSVKISFVTAVANSREDVVEMATKFKSPQVIK
DELGMAVTKSRVEARYLNLDTEEIELYQDMISHILFISPLQRQKQKWVMNNKKGQPGLWP
YGISGDIPIVLVMLDKTDDIDIVREVLKAHEYWRLKKLAVDLVILNEEENSYTNPVNSLL
MDIIAESHAHDLINKPGGVFILKKSNMPPEDIDLICSVSRIILKGDAGDLKDQVKYARSI
ALAEFKQFEKKPASYDSKLAKDLELNFYNGLGGFGKDGKEYVIFLENGQNTPLPWINVIS
NQRFGFIVTESGSGYTWFENSRENKLTPWSNDPVSDTPGEILYVMDEHAGDVWSVTPLPV
REKEPYMIRHGFGYTVFSHASHGIEQEMVQFVPVDDSVKISILKLKNQSQENRGLSLTYY
IRPVLGVSDQFTAMHINTKADNGMIVIKNNYNDEFPGRVAFIDSSLKVNSLTCDRKEFFG
AGDIANPEGIKRTSLSGTTGAGFDPCAAISVSVNLKPDEEKEIIFLLGAGRDEEEARQLS
AKYKKLEEAKKALGEVKKFWELKLGALQFETPNTAMDILLNGWLLYQVVSCRLWTRSGFY
QSGGAYGFRDQLQDSISLTHIWPEATRNQILLHSRHQFIEGDVQHWWHEEKYKGTRTKFS
DDLLWMPYATIEYIRITGDYDILYEETPFLEDEPLKEFEDEAYRVPRISHTVSTLYDHCI
RAINRSLKFGEHGIPLIGSGDWNDGMNTVGNKGKGESVWLGWFLYSILKNFAPLCERMGD
NELAKRYLDTADRIVENIEKNAWDGKWYRRAYFDNGVPLGSIQNSECQIDSLAQSWAVIS
EGGDKERIAEAMSALENYLVKRDEGLIKLLTPPFDEGDLEPGYIKSYVPGVRENGGQYTH
AAAWVVMAFAKMGDGEKAMELFDLLNPINHSRTHIEYSRYKVEPYVMAADVYSVPPHTGR
GGWTWYTGSAGWIYRVGFEYILGFKKRGETLEIDPCIPGKWTDFTIKYRYYDTDYIIEVK
NPEGVNTGVKKVIVDGKVCDDGKVQLVNDKDTHKVEVYMGKK

Jump to [ Top · Protein sequence · Domain architecture ]

Domain architecture

1

Jump to [ Top · Protein sequence · Domain architecture ]