Here you are: IMGT Web resources > IMGT Repertoire (RPI) > RPI entries from gene to protein

IMGT reference sequences in FASTA format:

Amino acid sequences without gaps

The FASTA header contains:
IMGT accession number | gene and allele name | species | functionality | exon name | domain name | domain type | alternative splicing (if it is) | partial (if it is)
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX1|
MNMPNERLKWLMLFAAVALI
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX2|[D1]|C-LIKE|
ACGSQTLAANPPDADQKGPVFLKEPTNRIDFSNSTGAEIECKASGNPMPEIIWIRSDGTA
VGDVPGLRQISSDGKLVFPPFRAEDYRQEVHAQVYACLARNQFGSIISRDVHVRA
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX3|[D2]|C-LIKE|
VVSQFYITEAENEYVIKGNAAVVKCKIPSFVADFVQVEAWVDEEGMELWRNNATDAY
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX4|[D2]|C-LIKE|
DGKYLVLPSGELHIREVGPEDGYKSYQCRTKHRLTGETRLSATKGRLVIT
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX5|[D3]|C-LIKE|
EPVSSSPPKINALTYKPNIVESMASTAILCPAQGYPAPSFR
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX6|[D3,D4,D5,D6]|C-LIKE|
WYKFIEGTTRKQAVVLNDRVKQVSGTLIIKDAVVEDSGKYLCVVNNSVGGESVETVLTVT
APLSAKIDPPTQTVDFGRPAVFTCQYTGNPIKTVSWMKDGKAIGHSEPVLRIESVKKEDK
GMYQCFVRNDQESAEASAELKLGGRFDPPVIRQAFQEETMEPGPSVFLKCVAGGNPTPEI
SWELDGKKIANNDRYQVGQYVTVNGDVVSYLNITSVHANDGGLYKCIAKSKVGVAEHSAK
LNVYGLPYIRQMEKKAIVAGETLIVTCPVAGYPIDSIVWER
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX7|[D6]|C-LIKE|
DNRALPINRKQKVFPNGTLIIENVERNSDQATYTCVAKNQEGYSARGSLEVQVM
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX8|[D7]|C-LIKE|
VPPQVLPFSFGESAADVGDIASANCVVPKGDLPLEIRWSLNSAPIVNGENGFTLVRLNKR
TSLLNIDSLNAFHRGVYKCIATNPAGTSEYVAELQVN
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX9|[D8]|C-LIKE|
VPPRWILEPTDKAFAQGSDAKVECKADGFPKPQVTWKKAV
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX10|[D8,D9]|C-LIKE|[D10,D11,D12]|F-LIKE|
GDTPGEYKDLKKSDNIRVEEGTLHVDNIQKTNEGYYLCEAINGIGSGLSAVIMISVQAPP
EFTEKLRNQTARRGEPAVLQCEAKGEKPIGILWNMNNMRLDPKNDNRYTIREEILSTGVM
SSLSIKRTERSDSALFTCVATNAFGSDDASINMIVQEVPEMPYALKVLDKSGRSVQLSWA
QPYDGNSPLDRYIIEFKRSRASWSEIDRVIVPGHTTEAQVQKLSPATTYNIRIVAENAIG
TSQSSEAVTIITAEEAPSGKPQNIKVEPVNQTTMRVTWKPPPRTEWNGEILGYYVGYKLS
NTNSSYVFETINFITEEGKEHNLELQNLRVYTQYSVVIQAFNKIGAGPLSEEEKQFTAEG
TPSQPPSDTACTTLTSQTIRVGWVSPPLESANGVIKTYKVVYAPSDEWY
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX11|[D12]|F-LIKE|
DETKRHYKKTASSDTVLHGLKKYTNYTMQVLATTAGGDGVRSVPIHCQTEPD
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX12|[D13],[D15]|F-LIKE|[D14]|C-LIKE|
VPEAPTDVKALVMGNAAILVSWRPPAQPNGIITQYTVYSKAEGAETETKTQKVPHYQMSF
EATELEKNKPYEFWVTASTTIGEGQQSKSIVAMPSDQVPAKIASFDDTFTATFKEDAKMP
CLAVGAPQPEITWKIKGVEFSANDRMRVLPDGSLLIKSVNRQDAGDYSCHAENSIAKDSI
THKLIVLAPPQSPHVTLSATTTDALTVKLKPHEGDTAPLHGYTLHYKP
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX13|[D15]|F-LIKE|
EFGEWETSEVSVDSQKHNIEGLLCGSRYQVYATGFNN
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX14|[D16]|F-LIKE|
IGAGEASDILNTRTKGQKPKLPEKPRFIEVSSNSVSLHFKAWKDGGCPMSHFVVESKKR
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX15|[D16]|F-LIKE|
DQIEWNQISNNVKPDNNYVVLDLEPATWYNLRITAHNSAGFTVAEYDFATLTVTG
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX16|[CO+TM+CY]|
GTIAPSRDLPELSAEDTIRIILSNLNLVVPVVAALLVIIIAIIVICILRSKGNHHK
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX17|[CY]|
DDVVYNQTMGPGATLDKRRPDLRDELGYIAPPNRKLPPVPGSNYNTCDRIKR
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX18|[CY]|
GRGGLRSNHSTWDPRRNPNLYEELKAPPVPMHGNHYGHAHGNAECHYRHP
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX19|[CY]|
GMEDEICPYATFHLLGFREEMDPTKAMNFQTFPHQNGHAGPVPGHAGTMLPPGHPGHVHS
RSGSQSMPRANRYQRKNSQGGQSSIYTPAPEYDDPANCAEEDQY
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX20|[CY]|
RRYTRVNSQGGSLYSGPGPEYDDPANCAPEEDQYGSQYGGPYGQPYDHYGSRGSMGRRSI
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX21|[CY]|
GSARNPGNGSPEPPPPPPRNHDMSNSSFNDSKESNEISEAECDRDHGPRGNYG
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX22|[CY]|
AVKRSPQPKDQRTTEEMRKLIER
>AE003841|DSCAM*01|Drosophila melanogaster|F|EX23|[CY]|
NETGPKQLQLQQANGAGFTAYDTMAV
>DSCAM*01|Drosophila melanogaster|F|EX1-23|Splicing 12|
MNMPNERLKWLMLFAAVALIACGSQTLAANPPDADQKGPVFLKEPTNRIDFSNSTGAEIE
CKASGNPMPEIIWIRSDGTAVGDVPGLRQISSDGKLVFPPFRAEDYRQEVHAQVYACLAR
NQFGSIISRDVHVRAVVSQFYITEAENEYVIKGNAAVVKCKIPSFVADFVQVEAWVDEEG
MELWRNNATDAYDGKYLVLPSGELHIREVGPEDGYKSYQCRTKHRLTGETRLSATKGRLV
ITEPVSSSPPKINALTYKPNIVESMASTAILCPAQGYPAPSFRWYKFIEGTTRKQAVVLN
DRVKQVSGTLIIKDAVVEDSGKYLCVVNNSVGGESVETVLTVTAPLSAKIDPPTQTVDFG
RPAVFTCQYTGNPIKTVSWMKDGKAIGHSEPVLRIESVKKEDKGMYQCFVRNDQESAEAS
AELKLGGRFDPPVIRQAFQEETMEPGPSVFLKCVAGGNPTPEISWELDGKKIANNDRYQV
GQYVTVNGDVVSYLNITSVHANDGGLYKCIAKSKVGVAEHSAKLNVYGLPYIRQMEKKAI
VAGETLIVTCPVAGYPIDSIVWERDNRALPINRKQKVFPNGTLIIENVERNSDQATYTCV
AKNQEGYSARGSLEVQVMVPPQVLPFSFGESAADVGDIASANCVVPKGDLPLEIRWSLNS
APIVNGENGFTLVRLNKRTSLLNIDSLNAFHRGVYKCIATNPAGTSEYVAELQVNVPPRW
ILEPTDKAFAQGSDAKVECKADGFPKPQVTWKKAVGDTPGEYKDLKKSDNIRVEEGTLHV
DNIQKTNEGYYLCEAINGIGSGLSAVIMISVQAPPEFTEKLRNQTARRGEPAVLQCEAKG
EKPIGILWNMNNMRLDPKNDNRYTIREEILSTGVMSSLSIKRTERSDSALFTCVATNAFG
SDDASINMIVQEVPEMPYALKVLDKSGRSVQLSWAQPYDGNSPLDRYIIEFKRSRASWSE
IDRVIVPGHTTEAQVQKLSPATTYNIRIVAENAIGTSQSSEAVTIITAEEAPSGKPQNIK
VEPVNQTTMRVTWKPPPRTEWNGEILGYYVGYKLSNTNSSYVFETINFITEEGKEHNLEL
QNLRVYTQYSVVIQAFNKIGAGPLSEEEKQFTAEGTPSQPPSDTACTTLTSQTIRVGWVS
PPLESANGVIKTYKVVYAPSDEWYDETKRHYKKTASSDTVLHGLKKYTNYTMQVLATTAG
GDGVRSVPIHCQTEPDVPEAPTDVKALVMGNAAILVSWRPPAQPNGIITQYTVYSKAEGA
ETETKTQKVPHYQMSFEATELEKNKPYEFWVTASTTIGEGQQSKSIVAMPSDQVPAKIAS
FDDTFTATFKEDAKMPCLAVGAPQPEITWKIKGVEFSANDRMRVLPDGSLLIKSVNRQDA
GDYSCHAENSIAKDSITHKLIVLAPPQSPHVTLSATTTDALTVKLKPHEGDTAPLHGYTL
HYKPEFGEWETSEVSVDSQKHNIEGLLCGSRYQVYATGFNNIGAGEASDILNTRTKGQKP
KLPEKPRFIEVSSNSVSLHFKAWKDGGCPMSHFVVESKKRDQIEWNQISNNVKPDNNYVV
LDLEPATWYNLRITAHNSAGFTVAEYDFATLTVTGGTIAPSRDLPELSAEDTIRIILSNL
NLVVPVVAALLVIIIAIIVICILRSKGNHHKDDVVYNQTMGPGATLDKRRPDLRDELGYI
APPNRKLPPVPGSNYNTCDRIKRGRGGLRSNHSTWDPRRNPNLYEELKAPPVPMHGNHYG
HAHGNAECHYRHPGMEDEICPYATFHLLGFREEMDPTKAMNFQTFPHQNGHAGPVPGHAG
TMLPPGHPGHVHSRSGSQSMPRANRYQRKNSQGGQSSIYTPAPEYDDPANCAEEDQYRRY
TRVNSQGGSLYSGPGPEYDDPANCAPEEDQYGSQYGGPYGQPYDHYGSRGSMGRRSIGSA
RNPGNGSPEPPPPPPRNHDMSNSSFNDSKESNEISEAECDRDHGPRGNYGAVKRSPQPKD
QRTTEEMRKLIERNETGPKQLQLQQANGAGFTAYDTMAV