ENSMGAP00000014297 (Meleagris gallopavo 76_2)
For a target sequence (see Protein sequence), the dcGO predictor uses the following procedure to predict the ontology terms of the target:
First, obtain the domain architecture of the target, together with its residual domains and supra-domains, from the SUPERFAMILY database.
Then, use the domain-centric annotations to predict the ontology terms of the target:
- If the target contains a domain/supra-domain, all ontology terms associated with that domain/supra-domain are transferred to the target (together with their hypergeometric scores, h-scores);
- When a target-to-term transfer is supported by one or more domains/supra-domains in the target, the h-scores are summed to give the predictive score (p-score);
- The p-score is then rescaled to the range 0-1 within each namespace (e.g., each of the three sub-ontologies of GO): p-score = (SUM - MIN)/(MAX - MIN), where SUM is the sum of all h-scores supporting a term transferred to the target, and MIN and MAX are respectively the minimum and maximum of SUM over the whole list of predicted terms for the target;
Finally, the rescaled p-score is used to rank the predictions: the higher the p-score, the stronger the evidence for the prediction. In dcGO, each ontology has its own slim version, containing ontology terms at four levels of increasing specificity (highly general, general, specific, and highly specific). The table lists the top 5 predictions for each specificity level and each namespace. In addition to this specificity-restricted list, i.e., Export prediction (slim version), the full list of predictions is also available for download, i.e., Export prediction (full version).
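The scoring steps above can be sketched in a few lines of Python. This is only an illustration of the arithmetic described here, not dcGO's actual code: the function name `predict_terms`, the input layout, and the domain and term identifiers are all hypothetical. The text does not say what happens when MAX equals MIN in a namespace, so the sketch assumes a p-score of 1.0 in that case.

```python
from collections import defaultdict


def predict_terms(domain_annotations, target_domains):
    """Rank ontology terms for a target by rescaled p-score.

    domain_annotations: {domain: [(term, namespace, h_score), ...]}
        (hypothetical layout for the domain-centric annotations)
    target_domains: domains/supra-domains found in the target
    Returns {namespace: [(term, p_score), ...]}, highest p-score first.
    """
    # SUM: add up the h-scores of every domain supporting a term.
    sums = defaultdict(lambda: defaultdict(float))
    for domain in target_domains:
        for term, namespace, h_score in domain_annotations.get(domain, []):
            sums[namespace][term] += h_score

    ranked = {}
    for namespace, term_sums in sums.items():
        lo = min(term_sums.values())  # MIN over predicted terms
        hi = max(term_sums.values())  # MAX over predicted terms
        span = hi - lo
        scored = [
            # Assumption: when MAX == MIN the formula is undefined; use 1.0.
            (term, (total - lo) / span if span else 1.0)
            for term, total in term_sums.items()
        ]
        # Sort by descending p-score, breaking ties by term name.
        ranked[namespace] = sorted(scored, key=lambda pair: (-pair[1], pair[0]))
    return ranked


# Made-up annotations: two domains, two namespaces.
annotations = {
    "dom_A": [("term_1", "BP", 2.0), ("term_2", "BP", 1.0)],
    "dom_B": [("term_1", "BP", 3.0), ("term_3", "BP", 2.0), ("term_4", "MF", 4.0)],
}
ranked = predict_terms(annotations, ["dom_A", "dom_B"])
for namespace, terms in sorted(ranked.items()):
    print(namespace, terms)
```

In this toy input, term_1 is supported by both domains, so its SUM is 2.0 + 3.0 = 5.0, the maximum in its namespace, and it rescales to a p-score of 1.0. The term with the smallest SUM rescales to 0.0.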
Protein sequence
Comment: pep:known_by_projection chromosome:UMD2:5:55388078:55539914:-1 gene:ENSMGAG00000013516 transcript:ENSMGAT00000015225 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence length: 6888
Sequence:
GCFPQLPDKGGSYFLGTISHLSFKMQVQRECTQKKTFTTWINSILAKHTPPSVISDLYTD
IQQGHMLLDLLEVLSGQHLPREKGCNTFQCRSNIENALTFLRSKSLKLINIHVADIIEGK
PSIVLGLIWTIIFHFHIEELARTLACTYNQPSLECSSTVDSSPKARRSAKKSAKIKERWK
ISATKALLFWAKEQCSLCGSISVNDFKSSWRSGLPFLAIINALRPGLIDLEKAKARSNKE
NLDEAFRIAELELNIPRLLEPEDVDVMNPDEKSIMTYVAQFLQCSRNMPETEDMQGKVTE
TVSWLAVQGKKLANLLMDTENETYYQKYKEMMSFMEAFNREKKAFLPLLSSKRSGAELSE
GQQQMREEWSKLISQINEWKAQLDQMLPPPLDGIEAWLQKVEHLQAEDLPDLQDPFKAMS
LFSEIISVFKGLMNCFDSHLETLQSFENKDENNMPLVVPEKLEEMKRRFNDIRFTNSSSL
LEYHYGLCSAFANEVTLKLNIWDVKYGTKESVESLLENWNNFIKEKRLKTQLETAADICE
DLKNKNISNLDLEISKLFKTVESKISMCKDYISNVNTTLQKVLSSWSSYTEKLYLLKTWL
EEKQNEYPIEVPAEVLANWNSVHASLNEAGNYLIEVSKEQVGSNIAKELKKLNRKWAKFI
KKTRFVSQLSISSFKTRGTENTQRYPIRKSLPLSISVGDELNENVNKLGEENTDTAEKAR
ELPGGMEKIEELLGSVESWATETECLLSETGHKTPVEPQMEEYLQSLAAKGISYQEQVLA
VEEILQSLMKNASSQVSQQHCHTTGLPARIKEARDRIEAVMGDMNNILPKSEKKPSQREA
LLNFEESQKDIEVSISKAVELVSQKLSREVHISRYEEAFNALDNKVLDNFLKAVEQLKSI
SSDQEKLAVEEKSEDIRKRWEAAHNEIISCVQLLLEREKKKFNTTFKKINKQLGKEKKLL
NMNKTKGLIKEHEAFFSHEGSLKELSKSLEDLKILGKLAEETVPDLQTFAADSEQKLEKL
QQTVADTYTALLSHLKFLLNHRSSIVASENGEKPSSTQQIDILSVQETSRPSEDMALETD
MKHHNGESCEKMFEEGSDSPTPALLESYNTQRRNLEELLQVSRDKVASGFTDEIKGTSCL
QNKLLELQMVENDINSGWAQLGNISSTLENLVDEVKQAAISETRGKLEGELKELQDIIST
RTNSLRTALEIVLPIENESILLCELDQRLNKKGIQQFNLVNSDLAYGELKELQRSILNQI
EVCKQLEHLDSSARDEFNPIDLQAASKIMFYYQNQLEEMSHKMQIRETVLKDLEAFMASL
RKIKSSVKHLIDPLGKPEIQGKTRREAAQELSHMAKEAQCLDERLKTVDICLEDAECGRN
TSCEKLLQTLSEELNVTGDHSSEQMLMEGNDLYRVFSTRNEELLKNIQDLRDRINKIGLK
DPTVPAIQQRVKSLMELEKELDCSAVEMKSMREIANKLLQMEEEKAEESNEQCRVTERSW
EDTKLLLAECQEQCTRALELLKQYQSCKSSLTSIIHKQEIVLSQQNSYMGKDNLNRLITK
IEEAKEEFNDHAEDVDKINQICKNLQFQLNKMRSFEEPPFENEANIIVDRWLDINEKTEN
YCDNLGRAQALWDKLLGLSGTIDAWANAELKNTEGRCLTDEDLTQLKACLSVQEQKLQKF
DNTVAEIEELLNSNEPPLELQVIRSSIVQKMELIKDLLTAKRRTSELSVNTAELKGDLDL
AKTQIGMTESLLKALSPSDTLEIFTKLEEIHQKILQQKHHVTLLQEETDCPDVDELNKQL
KCVTDLFNKKKHVFQDHFIGVLNRQCKNFNDWFSSTQLSLKDCFDPSETKQMLEEKLQKL
KHFFTSEGKDGDIQEVKTLLNKVKHYLPKASINQLSSRVRDQEAELQRLISKCQKREKEL
GTSLQQLNSLQESNTVLEKWLTTQEEKLHEIKKDEIKLENLYKTLLMQREPFDSLAQLAN
SLRETGLTEDEIISETSSLVSRYQTLMTNVNEMAGNTQGLAVDENFKELTQDMSNWIKKL
EKAINNLSSQESELAPDERINQIKEIMALKDAGDAKIQNIVSVGEMLIRNDKIKKPEIQQ
TVSDLQNQWERTCQLATAYRSPQEQLLFNREQYEQSKDDLRLALTELKKQHQEMDFALQP
GLLEKQAQLASYTKLLQKAEDLTSQLNELESRDHVQSFAENPRFTEESWLELKHQHENLL
SQLQAAVETLERHVQEHQQFQDMVTVLNTEIKTVSKKLADYVSPTVEQTSTEQKQLKSQE
QEAPLCQFEDMLKKILVLAETVKQNTSSPGQKFIEDEIETLQSEHRSLEEKLENVKQKKE
NIFSEALELKDDLTDGVQMENKLKREMQIISDTPVAAEEGTTTAELRECLEMPTKEGAAG
VDQKELFPGVQVNTEYFEEEQKVNKLKNEVAELDIVLINEQLKELENLQTELETWKAKSL
CLSQETFPDATRSNGAELQSPEPVVPCWDRLLQELDAVKAVKQQQSCLVNEYQKNLSAAQ
SSMKNLATEKDNIKMGPMNNTVLLEKIKACVQSLHKERDVLNRLKTQQELLSQHLTCMDK
VLTASKMRQLEQWWQHMEQAVQKKHDQVLAEINEFNTLMDKAQDIQRLIQEQYLQTESCS
SAGEKAKCPIIWTAELQNIKHGLSLLKRKIELQMQRIWSDEEKIALESCIQDLQSKLEAL
SEQHTPQDEVRVTGPAVKKQDIMKKLKENVSWVKDSLSSLDQKAALFPCDVKSQIRNCEL
MRNEVLDREPVIEALGNELQHILPNLKPEEISDMTFLLQALQNSYKALVLKSVERLQHLQ
LQLEERQRLTAEVEKVHCQLRNAEALGRPDMNQTSTCSELISQQDILKEILKDVQEIEGL
ISSHCEESQITAGELSLSEQLFLIDQLRSLKNRARKTQRQIQSKCHEVGKKIAVYREFAE
GITSLQKDLSDLQCSELKLEEELTGATQEVKNKCKALKEKVLSFQSNLSQIMKFKEIFEC
IGLNWDSFQLDELHELQTQFFKIRNKIKGKITHFGNVIRECDKFHALLNEIRTMTSAVRK
EANILSDRSNSSPAENLVSAQILLQTVQQILYLIQEVENQINKNEVFHTLFKESKRQEIK
SLETDVEELNLFLQNLVSSLQCVNKEDIQNVAEHLSHAIRHVQLELQQPMVVDMKMMQYE
KMRWESIQNMMSAEFSAIKCIMEKERGSQEEKSLAAGVETKLRALADHEIQLKTDIAARV
SALEEACKAGELYTKAVERAAKFLEDCEAQVRSAAVELSSSEDAYQTPQWKQEEFDSAKA
DIEQLYSKLKNLVKPEDKICLENTLRELINKSLALKGKIQRNEADKQSYLEKYKSYSKCK
DKVCDDLNNLGKMLGQSLSQTPMSYKEALENLQECKILVSNIDSAEDDLVKLRQVSGELM
RLCKGSDRALGRIVTALWENWLGLLEAAKELEINCEELKQEWKFINEELEREAIILDKLQ
EEQPESLKEKEKATREELVELLDSVYAFEENISRQQLLLLLLLHRIRNILNTPENVEAET
ALPALCEIKTMQDRCKKLFEKTQDRKNLIQSEIQERSKITEEINAVKNALQNALSVLSQD
AVGKAAQLKEVQSVIDKESQNLKDIMEKLRIRYSEMYTIVPAETEAQLEDCKKTLQDLED
KISFENLQSSPQYVLKGKAETINNGLQAIEKMLEQKTESVAKAKEVQKQIWDMLDLWHYK
LNELDAEVHDIVEQDSCHAQELMDILMIPLQHYQQVSQLAERRTAILNKAANKMEEYDEL
LKDMKVWIENTNSLLRAGAQNDSAKRLHKYTDGLQMALEDSEQKQNLLHSVYLELEELTP
VFETDSIMQQLNEAEEEVATLQQEIAEILPQIQHVADELDAIESHVKMFEKDVMKMKTIL
SSEDLLELSPKDQLKHGQVILDHIGPMQKTIVHIQSYEEDLQLPGVKKQPVSVFRRARQL
LRELKKIEKITKEQNELLEEAVKKTEECEQEIEKLKQFLKNNLAEKSHEYQPHTQKTSCP
EFQGEMEAIKEEIIKLCQRKEDILTGMKNSMSELHQRLQEEVPEPGDEPTASAFGDSIGT
DVSDVQLKKRGLMSLLPCLDEESEDAFHSKQEKGSTVTEIDVSFWPSLKYKEKHKEDSAT
SWSSGTLSDGGAQVMNSAVIHSINSWGGSKMKRDGVLSAPFASKGDESVTPPPVPNATQG
PRGKEGDPEEAVDGGRPEPGTILQACKAQVAELEQWLDKTKVSLGSDPQTPKMQQMVELQ
LGDFQVMLSEIEQKVLSLLEDGGNSAGHQQEAEDLSLKLKEVKCNLEKVQMMLQDKYSEE
QIPNRERIDPESLKMLHSNGSSMPQLGLAEQPFLQQNGFQHPQDLKVKAAEQKSLIDFIE
SCVEKMQPQFEDSVKSEPRYKATKSLPRFSNGESDPKSKKADPTLAPKDQTGNKWQYLQQ
ELSSKMKSPLCQLVEPQITTKLNMLPRGVFPNAGASTVEELKTYTVQLGDLSQEANVVHA
QENVAEEVSSNLDKKLFELLLAISRCLDNMEEMLNTSVLSTEEAAVQQVLYETLSVELQK
LHADLSDKKDDLLKSITCAGGSTDVFCECFNNLQAWLEQTQAATASRSNSVKAGLDHNTS
YQNETRLLYDQLTEKKATLQQCLNAIRGHNVSEQLQKTDACTLELQNFENQVAKLRGYGE
RFQLPVTLIQEAYKLEDVLDDMWGILKAKYVELNSPSISESQYEDLLCGFAELVAIGQEK
IAQDAKQLTKSRAALQSHLENHKDFFHNLMTHMAFMQAFSKKVNPSVLQKRENFWTGLVN
EVKLLEQKASQYGIRLENLLKEWTEFDDECLAFNKELEALTSTLPSVNLVEETEERLMER
IALLQQIKNNVDEKHARLYQMVKEGKKLLTAVSCPEITNQIGKLEEQWLSLTKKVGNELH
KLQTLLKLLVSYNRDSEELRKWLDSAEQRMKFWKEQSLNVSQDLPTIRDNMDSLFTFSKE
VDDKSSLKSSVVSTANQLFHVKQADTAGLRSSLAKFEQKWGELITQLPAIQEKLHQLQME
KLSSREAIAELMTWLDHVEQQQGHEEPINAQSSVAQVRSLLQKYKEYVMEMNFKQWMVDF
VNQSLLQMSTCDVESKRYERTEFAECLGEVNLRWHRLQASLNRKIQDLEHLLEDATENEN
KAQTLCNWLEAQSDRMRSLQTPASLISAQNTLDDCKDLENQLAMKSKMLDELKQSMSLNG
GTEQTPEVLSFRIADLCEMKDSVVSQVAQLKVSMQSILEQWKVYDDIYAEVSLLMTRYLY
CIDQCKPSVPSLEALKNQVKTLQSLQDELENSEESWAKLQVAANNLKKNCSPSFAEIIDQ
KCTEAHTRWSSVNEDITDQLRTAQATLQLWEPYDSLCTEAAAKLQQHEEQCTQLLDARMP
EDNMIETLKQRIQDVKNLQNGLQNIVGCRSQISELADQIKQQAGTAAQAVLLEKLQPLQR
ASYLEKMLQRKLDELEFNLLQLEDFKNCLETLEGHVKNCTDAFDSLHLEGETDNSELLMN
HTLELAALSPSIESLNEASIKLPVSDFILKKMQSLTRQWSQKTATALEHCSVLEGTQTDE
KKFLQKCENWMKFLEKMKEALKTDVPGRFEELQEQQRVYEMLQTEISINQQTFNSIIAKV
LLFLESGEAEKRTEFISKLTLLKEQWQNVIWLVQQRKKDIDGLVSQWQLFRGSLQSLSRF
LADTNSFLTAVKSQNCYSLYHLRNLIHDFKSKAVILQRWQGMYSSIIDVGEKLRTDSDPE
TSAVLQEELSQLQQSWGDTQVQLEKMKTQLSSILQQSWDSCEKHTKELESRLRELKDEVK
DPLPVEHEELYKSKEHIKELEQSLADWAHNMKELQAMKAELAHCILTEDMMVLQEQVEHL
HRQWEELCLRVSLRKQEIEDRLNAWTVFNEKNKELCSWLVQMESKVLQTADVSIEDMIDK
LQKDCMEEINLFSENKLHLKQMGDQLIKASNKSRVAEIDDKLNKINDRWQHLFDVIGARV
KKLKETFAFIQLLDKNMSNLRTWLARIESELSKPVVYDICDDQEIQKRLAEQQDLQRDIE
QHTAGVESVFNICEVLLHDSDACANETECDSIQQTSRSLDRRWRNICAMSMERRMKIEET
WRLWQRFLDDYSRFEDWLKSAERTAASPNSSEVLYTHAKEELKKFEAFQRQIHERLTQLE
LINKQYRRLARENRTDSASKLKQMVHEGNQRWDNLQKRVAAILRRLKHFTNRRDEFEATR
ENILVWLTEMDLQLTNVEHFSKSNFDDKMRQLNGFQQEITLHTNKIDQLIVFGEQLIQKS
EPLDAILIEDELEELHRYCQEVFGRVARFHQRLTSRHPGLDDEKETSENETDPEDSREIQ
NDPWHKKAISEGPSSPQSLCHLMPPTQGHERSGCETPTPVSVDSIPLEWDHTGDVGGSSS
HEDEEEATYYSALSGKTVSEAHPWHSPESPVCRKHRYNQAEIVGDVLSGPETSTPYKPGY
VKQLSSASSSSVNKENITSANMSDEEPQDDQELVTITAAEKQSGIIDRWELIQAQDLRNK
LRIKQHLQQWQQVNSDLSDVSAWLDKTEEELEELQKAKPPASMQAMEQRVKKLKDTLKAF
DNYKAVVLSVNLSSKEFQKADSTEFKELQNRLRKVNLRWEKATHSLDNWRKGLRQALLHC
QDFHDQSQKLILWLASAEGRRNEAQITDPNADPHTILESQKELMQLEKELLEQQLKVNCL
QELSAYLLLKSDGEDYIEADEKVHVIGKKLKQLIEQVSHDLKSLQGSLDSRVFLPVPDDL
DSEVYHPVAVKSSPPVKKMTIRRTSDGRKNSNTRAESHAQPTHVVPRSPSFFYRVLRAAL
PLQLFFLLLLLLACMIPSSEEDYSCTQANNFARSFYPMLRYTNGPPPT
Domain architecture
