Here you are: IMGT Web resources > IMGT Repertoire (RPI) > RPI entries from gene to protein

IMGT reference sequences in FASTA format:

Amino acid sequences with gaps

The FASTA header contains:
IMGT accession number | gene and allele name | species | functionality | exon name | domain name | domain type | alternative splicing (if it is) | partial (if it is)
>P15812|CD1E*01|Homo sapiens|F|EX1||
MLLLFLLFEGLCCPGENTA
>P15812|CD1E*01|Homo sapiens|F|EX2|[D1]|G-LIKE-ALPHA1||
..AAEEQLSFRMLQTSSFANHSWAHSEGSGWLGDLQTHGWDTV..LGTIRFLK.......
.PWSHGNFSKQELKNLQSLFQLYFHSFIQIVQASAGQFQ..LE
>P15812|CD1E*01|Homo sapiens|F|EX3|[D2]|G-LIKE-ALPHA2||
....YPFEIQILAGCRMNAP.QIFLNMAYQGSDFLSFQGIS.WE.PSPGAGIRAQNI...
.CKVLNR.YLDIKEILQSLLGHTCPRFLAGLMEAGESELKRK
>P15812|CD1E*01|Homo sapiens|F|EX4|C-LIKE||
........VKPEAWLSCGPSPG.....PGRLQLVCHVSGFYP..KPVWVMWMRGEQEQR.
..GTQRGDVLPNAD......ETWYLRATLDVAA.....GEAAGLSCRVKHSS........
.....LGGHDLIIHW
>P15812|CD1E*01|Homo sapiens|F|EX5||
GGYSIFLILICLTVIVTLVILVVVDSRLKKQR
>P15812|CD1E*01|Homo sapiens|F||
MLLLFLLFEGLCCPGENTA..AAEEQLSFRMLQTSSFANHSWAHSEGSGWLGDLQTHGWD
TV..LGTIRFLK........PWSHGNFSKQELKNLQSLFQLYFHSFIQIVQASAGQFQ..
LE....YPFEIQILAGCRMNAP.QIFLNMAYQGSDFLSFQGIS.WE.PSPGAGIRAQNI.
...CKVLNR.YLDIKEILQSLLGHTCPRFLAGLMEAGESELKRK........VKPEAWLS
CGPSPG.....PGRLQLVCHVSGFYP..KPVWVMWMRGEQEQR...GTQRGDVLPNAD..
....ETWYLRATLDVAA.....GEAAGLSCRVKHSS.............LGGHDLIIHWG
GYSIFLILICLTVIVTLVILVVVDSRLKKQR
>CD1E*02|Homo sapiens|F|EX2|[D1]|G-LIKE-ALPHA1|partial||
..AAEEQLSFRMLQTSSFANHSWAHSEGSGWLGDLQTHGWDTV..LGTIRFLK.......
.PWSHGNFSKQELKNLQSLFQLYFHSFIRIVQASAGQFQ..LE
>CD1E*03|Homo sapiens|F|EX2|[D1]|G-LIKE-ALPHA1|partial||
..AAEEQLSFRMLQTSSFANHSWAHSEGSGWLGDLQTHGWDTV..LGTIRFLK.......
.PWSHGNFSKQELKNLQSLFQLYFRSFIRIVQASAGQFQ..LE
>CD1E*04|Homo sapiens|F|EX3|[D1]|G-LIKE-ALPHA1|partial||
....YPFEIQILAGCRMNAP.QIFLNMAYQGSDFLNFQGIS.WE.PSPGAGIWAQNI...
.CKVLNR.YLDIKEILQSLLGHTCPRFLAGLMEAGESELKRK
>CD1E*05|Homo sapiens|F|EX3|[D1]|G-LIKE-ALPHA1|partial||
....DPFEIQILAGCRMNAP.QIFLNMAYQGSDFLSFQGIS.WE.PSPGAGIWAQNI...
.CKVLNR.YLDIKEILQSLLGHTCPRFLAGLMEAGESELKRK
>CD1E*06|Homo sapiens|F|EX3|[D1]|G-LIKE-ALPHA1|partial||
....DPFEIQILAGCRMNAPQ.IFLNMAYQGSDFLSFQGIS.WE.PSPGAGIRAQNI...
.CKVLNRYLDIKEILQSLLGHTCPRFPAGLMEAGESELKRK