Here you are: IMGT Web resources > IMGT Repertoire (RPI) > RPI entries from gene to protein

IMGT reference sequences in FASTA format:

Nucleotide sequences without gaps

The FASTA header contains:
IMGT accession number | gene and allele name | species | functionality | exon name | domain name | domain type | alternative splicing (if it is) | partial (if it is)
>J03858|CEACAM1*01|Homo sapiens|F|EX1|
atggggcacctctcagccccacttcacagagtgcgtgtaccctggcaggggcttctgctc
acag
>J03858|CEACAM1*01|Homo sapiens|F|EX2|[D1]|V-LIKE|
cctcacttctaaccttctggaacccgcccaccactgcccagctcactactgaatccatgc
cattcaatgttgcagaggggaaggaggttcttctccttgtccacaatctgccccagcaac
tttttggctacagctggtacaaaggggaaagagtggatggcaaccgtcaaattgtaggat
atgcaataggaactcaacaagctaccccagggcccgcaaacagcggtcgagagacaatat
accccaatgcatccctgctgatccagaacgtcacccagaatgacacaggattctacaccc
tacaagtcataaagtcagatcttgtgaatgaagaagcaactggacagttccatgtatacc
>J03858|CEACAM1*01|Homo sapiens|F|EX3|[D2]|C-LIKE|
cggagctgcccaagccctccatctccagcaacaactccaaccctgtggaggacaaggatg
ctgtggccttcacctgtgaacctgagactcaggacacaacctacctgtggtggataaaca
atcagagcctcccggtcagtcccaggctgcagctgtccaatggcaacaggaccctcactc
tactcagtgtcacaaggaatgacacaggaccctatgagtgtgaaatacagaacccagtga
gtgcgaaccgcagtgacccagtcaccttgaatgtcacct
>J03858|CEACAM1*01|Homo sapiens|F|EX4|[D3]|C-LIKE|
atggcccggacacccccaccatttccccttcagacacctattaccgtccaggggcaaacc
tcagcctctcctgctatgcagcctctaacccacctgcacagtactcctggcttatcaatg
gaacattccagcaaagcacacaagagctctttatccctaacatcactgtgaataatagtg
gatcctatacctgccacgccaataactcagtcactggctgcaacaggaccacagtcaaga
cgatcatagtcactg
>J03858|CEACAM1*01|Homo sapiens|F|EX5|[D4]|C-LIKE|
agctaagtccagtagtagcaaagccccaaatcaaagccagcaagaccacagtcacaggag
ataaggactctgtgaacctgacctgctccacaaatgacactggaatctccatccgttggt
tcttcaaaaaccagagtctcccgtcctcggagaggatgaagctgtcccagggcaacacca
ccctcagcataaaccctgtcaagagggaggatgctgggacgtattggtgtgaggtcttca
acccaatcagtaagaaccaaagcgaccccatcatgctgaacgtaaact
>J03858|CEACAM1*01|Homo sapiens|F|EX7|
ataatgctctaccacaagaaaatggcctctcacctggggccattgctggcattgtgattg
gagtagtggccctggttgctctgatagcagtagccctggcatgttttctgcatttcggga
agaccggcag
>J03858|CEACAM1*01|Homo sapiens|F|EX8|
ggcaagcgaccagcgtgatctcacagagcacaaaccctcagtctccaaccaca
>J03858|CEACAM1*01|Homo sapiens|F|EX9|
ctcaggaccactccaatgacccacctaacaag
>J03858|CEACAM1*01|Homo sapiens|F|EX10|
atgaatgaagttacttattctaccctgaactttgaagcccagcaacccacacaaccaact
tcagcctccccatccctaacagccacagaaataatttattcagaagtaaaaaagcagtaa
>D12502|CEACAM1*02|Homo sapiens|F|EX1|
atggggcacctctcagccccacttcacagagtgcgtgtaccctggcaggggcttctgctc
acag
>D12502|CEACAM1*02|Homo sapiens|F|EX2|[D1]|V-LIKE|
cctcacttctaaccttctggaacccgcccaccactgcccagctcactactgaatccatgc
cattcaatgttgcagaggggaaggaggttcttctccttgtccacaatctgccccagcaac
tttttggctacagctggtacaaaggggaaagagtggatggcaaccgtcaaattgtaggat
atgcaataggaactcaacaagctaccccagggcccgcaaacagcggtcgagagacaatat
accccaatgcatccctgctgatccagaacgtcacccagaatgacacaggattctacaccc
tacaagtcataaagtcagatcttgtgaatgaagaagcaactggacagttccatgtatacc
>D12502|CEACAM1*02|Homo sapiens|F|EX3|[D2]|C-LIKE|
cggagctgcccaagccctccatctccagcaacaactccaaccctgtggaggacaaggatg
ctgtggccttcacctgtgaacctgagactcaggacacaacctacctgtggtggataaaca
atcagagcctcccggtcagtcccaggctgcagctgtccaatggcaacaggaccctcactc
tactcagtgtcacaaggaatgacacaggaccctatgagtgtgaaatacagaacccagtga
gtgcgaaccgcagtgacccagtcaccttgaatgtcacct
>D12502|CEACAM1*02|Homo sapiens|F|EX4|[D3]|C-LIKE|
atggcccggacacccccaccatttccccttcatacacctattaccgtccaggggcaaacc
tcagcctctcctgctatgcagcctctaacccacctgcacagtactcctggcttatcaatg
gaacattccagcaaagcacacaagagctctttatccctaacatcactgtgaataatagtg
gatcctatacctgccacgccaataactcagtcactggctgcaacaggaccacagtcaaga
cgatcatagtcactg
>D12502|CEACAM1*02|Homo sapiens|F|EX6|
agagacagaatctcaccatgttgcccgggctggactcgaactcctgggctcaagcaatcc
tcccatctgtttcccaaagtgctgagattacag
>D12502|CEACAM1*02|Homo sapiens|F|EX7|
ataatgctctaccacaagaaaatggcctctcacctggggccattgctggcattgtgattg
gagtagtggccctggttgctctgatagcagtagccctggcatgttttctgcatttcggga
agaccggcag
>D12502|CEACAM1*02|Homo sapiens|F|EX8|
ggcaagcgaccagcgtgatctcacagagcacaaaccctcagtgtccaaccaca
>D12502|CEACAM1*02|Homo sapiens|F|EX9|
ctcaggaccactccaatgacccacctaacaag
>D12502|CEACAM1*02|Homo sapiens|F|EX10|
atgaatgaagttacttattctaccctgaactttgaagcccagcaacccacacaaccaact
tcagcctccccatccctaacagccacagaaataatttattcagaagtaaaaaagcagtaa