>X64318.1 H.sapiens E4BP4 gene
GCCCCTTTCTTTCTCCTCGTCGGCCCGAGAGCAGGAACACGATAACGAAGGAGGCCCAACTTCATTCAAT
AAGGAGCCTGACGGATTTATCCCAGACGGTAGAACAAAAGGAAGAATATTGATGGATTTTAAACCAGAGT
TTTTAAAGAGCTTGAGAATACGGGGAAATTAATTTGTTCTCCTACACACATAGATAGGGTAAGGTTGTTT
CTGATGCAGCTGAGAAAAATGCAGACCGTCAAAAAGGAGCAGGCGTCTCTTGATGCCAGTAGCAATGTGG
ACAAGATGATGGTCCTTAATTCTGCTTTAACGGAAGTGTCAGAAGACTCCACAACAGGTGAGGACGTGCT
TCTCAGTGAAGGAAGTGTGGGGAAGAACAAATCTTCTGCATGTCGGAGGAAACGGGAATTCATTCCTGAT
GAAAAGAAAGATGCTATGTATTGGGAAAAAAGGCGGAAAAATAATGAAGCTGCCAAAAGATCTCGTGAGA
AGCGTCGACTGAATGACCTGGTTTTAGAGAACAAACTAATTGCACTGGGAGAAGAAAACGCCACTTTAAA
AGCTGAGCTGCTTTCACTAAAATTAAAGTTTGGTTTAATTAGCTCCACAGCATATGCTCAAGAGATTCAG
AAACTCAGTAATTCTACAGCTGTGTACTTTCAAGATTACCAGACTTCCAAATCCAATGTGAGTTCATTTG
TGGACGAGCACGAACCCTCGATGGTGTCAAGTAGTTGTATTTCTGTCATTAAACACTCTCCACAAAGCTC
GCTGTCCGATGTTTCAGAAGTGTCCTCAGTAGAACACACGCAGGAGAGCTCTGTGCAGGGAAGCTGCAGA
AGTCCTGAAAACAAGTTCCAGATTATCAAGCAAGAGCCGATGGAATTAGAGAGCTACACAAGGGAGCCAA
GAGATGACCGAGGCTCTTACACAGCGTCCATCTATCAAAACTATATGGGGAATTCTTTCTCTGGGTACTC
ACACTCTCCCCCACTACTGCAAGTCAACCGATCCTCCAGCAACTCCCCGAGAACGTCGGAAACTGATGAT
GGTGTGGTAGGAAAGTCATCTGATGGAGAAGACGAGCAACAGGTCCCCAAGGGCCCCATCCATTCTCCAG
TTGAACTCAAGCATGTGCATGCAACTGTGGTTAAAGTTCCAGAAGTGAATTCCTCTGCCTTGCCACACAA
GCTCCGGATCAAAGCCAAAGCCATGCAGATCAAAGTAGAAGCCTTTGATAATGAATTTGAGGCCACGCAA
AAACTTTCCTCACCTATTGACATGACATCTAAAAGACATTTCGAACTCGAAAAGCATAGTGCCCCAAGTA
TGGTACATTCTTCTCTTACTCCTTTCTCAGTGCAAGTGACTAACATTCAAGATTGGTCTCTCAAATCGGA
GCACTGGCATCAAAAAGAACTGAGTGGCAAAACTCAGAATAGTTTCAAAACTGGAGTTGTTGAAATGAAA
GACAGTGGCTACAAAGTTTCTGACCCAGAGAACTTGTATTTGAAGCAGGGGATAGCAAACTTATCTGCAG
AGGTTGTCTCACTCAAGAGACTTATAGCCACACAACCAATCTCTGCTTCAGACTCTGGGTAAATTACTAC
TGAGTAAGAGCTGGGCATTTAGAAAGATGTCATTTGCAATAGAGCAGTCCATTTTGTATTATGCTGAATT
TTCACTGGACCTGTGATGTCATTTCACTGTGATGTGCACATGTTGTCTGTTTGGTGTCTTTTTGTGCACA
GATTATGATGAAGATTAGATTGTGTTATCACTCTGCCTGTGTATAGTCAGATAGTCATATGCGTAAGGCT
GTATATATTAAGNTTTTATTTTTGTTGTTCTATTATAAAGTGTGTAAGTTACCAGTTTCAATAAAGGATT
GGTGACAAACACAGAAAAAAAAAAAAAAAAAAA
>CAA45597.1 E4BP4 [Homo sapiens]
MQLRKMQTVKKEQASLDASSNVDKMMVLNSALTEVSEDSTTGEDVLLSEGSVGKNKSSACRRKREFIPDE
KKDAMYWEKRRKNNEAAKRSREKRRLNDLVLENKLIALGEENATLKAELLSLKLKFGLISSTAYAQEIQK
LSNSTAVYFQDYQTSKSNVSSFVDEHEPSMVSSSCISVIKHSPQSSLSDVSEVSSVEHTQESSVQGSCRS
PENKFQIIKQEPMELESYTREPRDDRGSYTASIYQNYMGNSFSGYSHSPPLLQVNRSSSNSPRTSETDDG
VVGKSSDGEDEQQVPKGPIHSPVELKHVHATVVKVPEVNSSALPHKLRIKAKAMQIKVEAFDNEFEATQK
LSSPIDMTSKRHFELEKHSAPSMVHSSLTPFSVQVTNIQDWSLKSEHWHQKELSGKTQNSFKTGVVEMKD
SGYKVSDPENLYLKQGIANLSAEVVSLKRLIATQPISASDSG
1. Which domains does the E4BP4 protein have? Are these domains all specific for E4BP4-like proteins, or
do they also occur in other proteins with a different function than E4BP4? Determine which domain(s)
you can use to study the evolution of E4BP4.
a. The basic leucine zipper domain (bZIP domain) is found in many different DNA-binding
proteins and is therefore not suitable. Nucleotide 72 to 12:
b. The vertebrate interleukin-3 regulated transcription factor domain is suitable, because it is
not found in many proteins. Nucleotide 130 to 461
2. Find homologs of the E4BP4 protein, align these sequences and make a phylogenetic tree. Check both
nucleotide and protein sequences to reach a conclusion on the age of NK cells.
3. Are NK cells part of the adaptive or innate immune system? How does your conclusion on the
evolution of NK cells fit into this? Was our initial assumption a good one?
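One way to check the nucleotide record against the protein record for question 2 is to translate the start of the coding region and compare it with the first residues of CAA45597.1. The sketch below is a minimal example, assuming the standard genetic code and assuming that the coding sequence starts at the ATG in `CTGATGCAGCTG...` of the X64318.1 record above. The function name `translate` and the variable names are only illustrative.

```python
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
# (first, second and third base each cycle through T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in reading frame 1 and stop at the first stop codon.

    Codons containing anything other than A, C, G or T become 'X'.
    """
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3].upper(), "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Fragment of X64318.1 starting at the assumed start codon.
cds_start = "ATGCAGCTGAGAAAAATGCAGACCGTCAAAAAGGAGCAGGCG"
# First 14 residues of CAA45597.1.
protein_start = "MQLRKMQTVKKEQA"

print(translate(cds_start) == protein_start)
```

If the comparison holds for the full coding region, the nucleotide and protein searches describe the same gene product, so hits from both can be compared directly.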
Protein database: UniProt is good, but not very large. A larger database is the non-redundant database, but
it gives a lot of overlap.
Nuclear factor interleukin-3-regulated protein [Cricetulus griseus] (E-value: 2E-71)
(Chinese hamster)
Sequence ID: EGW11462.1 Length: 457 Number of Matches: 1
>EGW11462.1 Nuclear factor interleukin-3-regulated protein [Cricetulus griseus] >ERE77468.1 nuclear factor interleukin-3-regulated protein [Cricetulus griseus]
MQTIKKEPAPLDPTSSSDKVMVLNSALAEVAEDLASGEDLLLNEGSVGKNKSSACRRKREFIPDEKKDAMYWEKRRKNNE
AAKRSREKRRLNDLVLENKLIALGEENATLKAELLSLKLKFGLISSTAYAQEIQKLSNSTAVYFQDYQTSKATVSSFVDE
HEPTMVAGSCISVIKHSPQSSLSDVSEVSSVEHTQESPAQGGCRSPENKFPVIKQEPVELESFARESREERGAYATSIYQ
SYMGSSFPTYSHSPPLLQVHGSTSNSPRTSEADEGVVGKSSDGEDEQQVPKGPIHSPVELQRVHATVVKVPEVHPSALPH
KLRIKAKAMQIKVEALDSEFEGMQKLSSPADVIAKRHFDLEKHSTPGMVHSSLTPFSVQVTNIQDWSLKPEHWHHKELNG
KTQSSFKTGVLEVKDSGYKISEAESLYLKQGMANLSAEVVSLKRFIATQPISASDSR
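Before aligning all homologs, a quick ungapped comparison can give a feel for how conserved a hit is. The sketch below counts identical positions between the first 30 residues of the hamster hit and the matching stretch of human E4BP4. It assumes that the two fragments line up without gaps once the five extra N-terminal residues (MQLRK) of the human sequence are skipped. A real analysis would use a proper alignment tool. The function name `identity` is only illustrative.

```python
def identity(a: str, b: str) -> tuple[int, int]:
    """Return (identical positions, compared positions) for two
    equal-length fragments that are assumed to be already aligned."""
    if len(a) != len(b):
        raise ValueError("fragments must have the same length")
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches, len(a)


# Fragments copied from the records in these notes.
human = "MQTVKKEQASLDASSNVDKMMVLNSALTEV"    # CAA45597.1, residues 6-35
hamster = "MQTIKKEPAPLDPTSSSDKVMVLNSALAEV"  # EGW11462.1, residues 1-30

matches, total = identity(human, hamster)
print(f"{matches}/{total} identical ({100 * matches / total:.0f}%)")
```

The fraction of differing positions is a simple distance measure. Distances of this kind, computed over a full alignment, are what distance-based tree methods take as input.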
Nuclear factor interleukin-3-regulated protein [Macaca mulatta] (rhesus macaque)
Sequence ID: NP_001253842.1 Length: 462 Number of Matches: 1
>EHH57509.1 E4 promoter-binding protein 4 [Macaca fascicularis]
MQLRKMQTIKKEQASLDASSNVDKMMVLNSALTEVSEDSTTGEELLLSEGSVGKNKSSACRRKREFIPDEKKDAMYWEKR
RKNNEAAKRSREKRRLNDLVLENKLIALGEENATLKAELLSLKLKFGLISSTAYAQEIQKLSNSTAVYFQDYQTSKSSVS