IMGT reference sequences in FASTA format:

Amino acid sequences with gaps

The FASTA header contains the following fields, separated by "|":
IMGT accession number | gene and allele name | species | functionality | exon name | domain name | domain type | alternative splicing (if applicable) | partial (if applicable)
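
The header layout above can be split into named fields with a few lines of Python. This is a minimal sketch, not an IMGT tool: the dictionary keys and the function name `parse_header` are illustrative choices, and it assumes the fields are positional, with trailing fields simply left out when they do not apply (as in the exon-only headers in this listing).

```python
from typing import Dict, List, Optional

# Key names are illustrative; their order follows the header description.
FIELDS: List[str] = [
    "accession",
    "gene_allele",
    "species",
    "functionality",
    "exon",
    "domain_name",
    "domain_type",
    "alternative_splicing",
    "partial",
]


def parse_header(line: str) -> Dict[str, Optional[str]]:
    """Split one '>' header line into a dict keyed by FIELDS.

    Assumption: fields are positional and missing trailing fields
    are omitted, so they are returned as None.
    """
    if not line.startswith(">"):
        raise ValueError("not a FASTA header line")
    parts: List[Optional[str]] = list(line[1:].strip().split("|"))
    # The trailing '|' produces an empty last element; drop such tails.
    while parts and parts[-1] == "":
        parts.pop()
    if len(parts) > len(FIELDS):
        raise ValueError("more fields than the header description lists")
    parts += [None] * (len(FIELDS) - len(parts))
    # Empty fields in the middle of a header are also reported as None.
    return {name: (value if value else None) for name, value in zip(FIELDS, parts)}


if __name__ == "__main__":
    header = parse_header(">J03858|CEACAM1*01|Homo sapiens|F|EX2|[D1]|V-LIKE|")
    print(header["gene_allele"], header["exon"], header["domain_type"])
```

Splitting the gene and allele on "*" (for example `header["gene_allele"].split("*")`) would separate `CEACAM1` from `01` if the two are needed individually.
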
>J03858|CEACAM1*01|Homo sapiens|F|EX1|
MGHLSAPLHRVRVPWQGLLLT
>J03858|CEACAM1*01|Homo sapiens|F|EX2|[D1]|V-LIKE|
ASLLTFWNPPTTAQLTTESMPFNVAEGKEVLLLVHNLPQQ......LFGYSWYKGERVDG
NRQIVGYAIG...TQQATPGPAN.SGRETIY..P..NASLLIQNVTQNDTGFYTLQVIKS
...DLVNEEATGQFHVY
>J03858|CEACAM1*01|Homo sapiens|F|EX3|[D2]|C-LIKE|
........PELPKPSISSNNSNPVEDKDAVAFTCEPETQD.....TTYLWWINNQSLPV.
.SPRLQLSN..............GNRTLTLLSVTR..NDTGPYECEIQNP....VSANRS
DPVTLNVT
>J03858|CEACAM1*01|Homo sapiens|F|EX4|[D3]|C-LIKE|
......YGPDTPTISPSDTYYRP...GANLSLSCYAASNP....PAQYSWLING......
.TFQQST..................QELFIPNITV..NNSGSYTCHANNS...VTGCNRT
TVKTIIVT
>J03858|CEACAM1*01|Homo sapiens|F|EX5|[D4]|C-LIKE|
.....ELSPVVAKPQIKASKTTVTGDKDSVNLTCSTNDTG.....ISIRWFFKNQSLPS.
.SERMKLSQ..............GNTTLSINPVKR..EDAGTYWCEVFNP....ISKNQS
DPIMLNVN
>J03858|CEACAM1*01|Homo sapiens|F|EX7|
YNALPQENGLSPGAIAGIVIGVVALVALIAVALACFLHFGKTG
>J03858|CEACAM1*01|Homo sapiens|F|EX8|
RASDQRDLTEHKPSVSNH
>J03858|CEACAM1*01|Homo sapiens|F|EX9|
TQDHSNDPPNK
>J03858|CEACAM1*01|Homo sapiens|F|EX10|
MNEVTYSTLNFEAQQPTQPTSASPSLTATEIIYSEVKKQ
>D12502|CEACAM1*02|Homo sapiens|F|EX1|
MGHLSAPLHRVRVPWQGLLLT
>D12502|CEACAM1*02|Homo sapiens|F|EX2|[D1]|V-LIKE|
ASLLTFWNPPTTAQLTTESMPFNVAEGKEVLLLVHNLPQQ......LFGYSWYKGERVDG
NRQIVGYAIG...TQQATPGPAN.SGRETIY..P..NASLLIQNVTQNDTGFYTLQVIKS
...DLVNEEATGQFHVY
>D12502|CEACAM1*02|Homo sapiens|F|EX3|[D2]|C-LIKE|
........PELPKPSISSNNSNPVEDKDAVAFTCEPETQD.....TTYLWWINNQSLPV.
.SPRLQLSN..............GNRTLTLLSVTR..NDTGPYECEIQNP....VSANRS
DPVTLNVT
>D12502|CEACAM1*02|Homo sapiens|F|EX4|[D3]|C-LIKE|
......YGPDTPTISPSYTYYRP...GANLSLSCYAASNP....PAQYSWLING......
.TFQQST..................QELFIPNITV..NNSGSYTCHANNS...VTGCNRT
TVKTIIVT
>D12502|CEACAM1*02|Homo sapiens|F|EX6|
ERQNLTMLPGLDSNSWAQAILPSVSQSAEIT
>D12502|CEACAM1*02|Homo sapiens|F|EX7|
DNALPQENGLSPGAIAGIVIGVVALVALIAVALACFLHFGKTG
>D12502|CEACAM1*02|Homo sapiens|F|EX8|
RASDQRDLTEHKPSVSNH
>D12502|CEACAM1*02|Homo sapiens|F|EX9|
TQDHSNDPPNK
>D12502|CEACAM1*02|Homo sapiens|F|EX10|
MNEVTYSTLNFEAQQPTQPTSASPSLTATEIIYSEVKKQ
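
Because the gapped sequences of two alleles line up column by column, records such as the ones above can be compared directly. The sketch below is a hedged illustration: it reads multi-line FASTA records, removes the gap dots, and lists the columns where two gapped sequences differ. The function names are hypothetical, the sample text is the pair of EX4 records copied from this listing, and the reported position is the 1-based column in the gapped string as printed here, which is not necessarily the IMGT numbering position.

```python
from typing import Iterator, List, Tuple

# The two EX4 records (CEACAM1*01 and CEACAM1*02) copied from the listing.
SAMPLE = """\
>J03858|CEACAM1*01|Homo sapiens|F|EX4|[D3]|C-LIKE|
......YGPDTPTISPSDTYYRP...GANLSLSCYAASNP....PAQYSWLING......
.TFQQST..................QELFIPNITV..NNSGSYTCHANNS...VTGCNRT
TVKTIIVT
>D12502|CEACAM1*02|Homo sapiens|F|EX4|[D3]|C-LIKE|
......YGPDTPTISPSYTYYRP...GANLSLSCYAASNP....PAQYSWLING......
.TFQQST..................QELFIPNITV..NNSGSYTCHANNS...VTGCNRT
TVKTIIVT
"""


def read_fasta(text: str) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs, joining wrapped sequence lines."""
    header = None
    chunks: List[str] = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def ungap(sequence: str) -> str:
    """Remove the '.' gap characters, leaving only amino acids."""
    return sequence.replace(".", "")


def differences(a: str, b: str) -> List[Tuple[int, str, str]]:
    """Return (1-based gapped column, residue in a, residue in b) per mismatch."""
    if len(a) != len(b):
        raise ValueError("gapped sequences must have the same length")
    return [(i + 1, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


if __name__ == "__main__":
    (head1, seq1), (head2, seq2) = read_fasta(SAMPLE)
    print(ungap(seq1))
    for column, res1, res2 in differences(seq1, seq2):
        print(f"column {column}: {res1} -> {res2}")
```

Comparing the gapped strings rather than the ungapped ones keeps the columns aligned, so a substitution is reported at the same column in both alleles.
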