Here you are: IMGT Web resources > IMGT Repertoire (RPI) > RPI entries from gene to protein

IMGT reference sequences in FASTA format:

Amino acid sequences without gaps

The FASTA header contains:
IMGT accession number | gene and allele name | species | functionality | exon name | domain name | domain type | alternative splicing (if it is) | partial (if it is)
>J03858|CEACAM1*01|Homo sapiens|F|EX1|
MGHLSAPLHRVRVPWQGLLLT
>J03858|CEACAM1*01|Homo sapiens|F|EX2|[D1]|V-LIKE|
ASLLTFWNPPTTAQLTTESMPFNVAEGKEVLLLVHNLPQQLFGYSWYKGERVDGNRQIVG
YAIGTQQATPGPANSGRETIYPNASLLIQNVTQNDTGFYTLQVIKSDLVNEEATGQFHVY
>J03858|CEACAM1*01|Homo sapiens|F|EX3|[D2]|C1-LIKE|
PELPKPSISSNNSNPVEDKDAVAFTCEPETQDTTYLWWINNQSLPVSPRLQLSNGNRTLT
LLSVTRNDTGPYECEIQNPVSANRSDPVTLNVT
>J03858|CEACAM1*01|Homo sapiens|F|EX4|[D3]|C2-LIKE|
YGPDTPTISPSDTYYRPGANLSLSCYAASNPPAQYSWLINGTFQQSTQELFIPNITVNNS
GSYTCH ANNSVTGCNRTTVKTIIVT
>J03858|CEACAM1*01|Homo sapiens|F|EX5|[D4]|C3-LIKE|
ELSPVVAKPQIKASKTTVTGDKDSVNLTCSTNDTGISIRWFFKNQSLPSSERMKLSQGNT
TLSINPVKREDAGTYWCEVFNPISKNQSDPIMLNVN
>J03858|CEACAM1*01|Homo sapiens|F|EX7|
YNALPQENGLSPGAIAGIVIGVVALVALIAVALACFLHFGKTG
>J03858|CEACAM1*01|Homo sapiens|F|EX8|
RASDQRDLTEHKPSVSNH
>J03858|CEACAM1*01|Homo sapiens|F|EX9|
TQDHSNDPPNK
>J03858|CEACAM1*01|Homo sapiens|F|EX10|
MNEVTYSTLNFEAQQPTQPTSASPSLTATEIIYSEVKKQ
>J03858|CEACAM1*01|Homo sapiens|F|EX1-5, EX7-10|
MGHLSAPLHRVRVPWQGLLLTASLLTFWNPPTTAQLTTESMPFNVAEGKEVLLLVHNLPQ
QLFGYSWYKGERVDGNRQIVGYAIGTQQATPGPANSGRETIYPNASLLIQNVTQNDTGFY
TLQVIKSDLVNEEATGQFHVYPELPKPSISSNNSNPVEDKDAVAFTCEPETQDTTYLWWI
NNQSLPVSPRLQLSNGNRTLTLLSVTRNDTGPYECEIQNPVSANRSDPVTLNVTYGPDTP
TISPSDTYYRPGANLSLSCYAASNPPAQYSWLINGTFQQSTQELFIPNITVNNSGSYTCH
 ANNSVTGCNRTTVKTIIVTELSPVVAKPQIKASKTTVTGDKDSVNLTCSTNDTGISIRW
FFKNQSLPSSERMKLSQGNTTLSINPVKREDAGTYWCEVFNPISKNQSDPIMLNVNYNAL
PQENGLSPGAIAGIVIGVVALVALIAVALACFLHFGKTGRASDQRDLTEHKPSVSNHTQD
HSNDPPNKMNEVTYSTLNFEAQQPTQPTSASPSLTATEIIYSEVKKQ*