IMGT reference sequences in FASTA format:

Nucleotide sequences with gaps

The FASTA header contains:
IMGT accession number | gene and allele name | species | functionality | exon name | domain name | domain type | alternative splicing (if applicable) | partial (if applicable)
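
The header layout above can be split into named fields with a few lines of Python. This is a minimal sketch, not an IMGT tool: the function name `parse_header` and the field keys in `FIELD_NAMES` are hypothetical names chosen for illustration, and the code assumes that fields are separated by `|`, that the header may end with a trailing `|`, and that fields absent from a record (for example, no domain for EX1) should be read as empty strings.

```python
# Hypothetical field keys, in the order given by the header description.
FIELD_NAMES = (
    "accession",
    "gene_allele",
    "species",
    "functionality",
    "exon",
    "domain",
    "domain_type",
    "alternative_splicing",
    "partial",
)


def parse_header(line: str) -> dict:
    """Split one FASTA header line into a dict keyed by FIELD_NAMES."""
    if not line.startswith(">"):
        raise ValueError("a FASTA header line must start with '>'")
    parts = [p.strip() for p in line[1:].strip().split("|")]
    # Pad short headers so that every key is present (assumption: a
    # missing trailing field means "not applicable" for that record).
    parts += [""] * (len(FIELD_NAMES) - len(parts))
    return dict(zip(FIELD_NAMES, parts))


def split_gene_allele(gene_allele: str) -> tuple:
    """Separate 'CEACAM1*01' into ('CEACAM1', '01')."""
    gene, _, allele = gene_allele.partition("*")
    return gene, allele


fields = parse_header(">J03858|CEACAM1*01|Homo sapiens|F|EX2|[D1]|V-LIKE|")
gene, allele = split_gene_allele(fields["gene_allele"])
```

Padding to a fixed number of fields keeps lookups such as `fields["domain"]` safe for exons that carry no domain annotation.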
>J03858|CEACAM1*01|Homo sapiens|F|EX1|
atggggcacctctcagccccacttcacagagtgcgtgtaccctggcaggggcttctgctc
acag
>J03858|CEACAM1*01|Homo sapiens|F|EX2|[D1]|V-LIKE|
cctcacttctaaccttctggaacccgcccaccactgcccagctcactactgaatccatgc
cattcaatgttgcagaggggaaggaggttcttctccttgtccacaatctgccccagcaa.
.................ctttttggctacagctggtacaaaggggaaagagtggatggca
accgtcaaattgtaggatatgcaatagga.........actcaacaagctaccccagggc
ccgcaaac...agcggtcgagagacaatatac......ccc......aatgcatccctgc
tgatccagaacgtcacccagaatgacacaggattctacaccctacaagtcataaagtca.
........gatcttgtgaatgaagaagcaactggacagttccatgtatacc
>J03858|CEACAM1*01|Homo sapiens|F|EX3|[D2]|C-LIKE|
........................cggagctgcccaagccctccatctccagcaacaact
ccaaccctgtggaggacaaggatgctgtggccttcacctgtgaacctgagactcaggac.
..............acaacctacctgtggtggataaacaatcagagcctcccggtc....
..agtcccaggctgcagctgtccaat..................................
........ggcaacaggaccctcactctactcagtgtcacaagg......aatgacacag
gaccctatgagtgtgaaatacagaaccca............gtgagtgcgaaccgcagtg
acccagtcaccttgaatgtcacct
>J03858|CEACAM1*01|Homo sapiens|F|EX4|[D3]|C-LIKE|
..................atggcccggacacccccaccatttccccttcagacacctatt
accgtcca.........ggggcaaacctcagcctctcctgctatgcagcctctaaccca.
...........cctgcacagtactcctggcttatcaatgga...................
..acattccagcaaagcaca........................................
..............caagagctctttatccctaacatcactgtg......aataatagtg
gatcctatacctgccacgccaataactca.........gtcactggctgcaacaggacca
cagtcaagacgatcatagtcactg
>J03858|CEACAM1*01|Homo sapiens|F|EX5|[D4]|C-LIKE|
...............agctaagtccagtagtagcaaagccccaaatcaaagccagcaaga
ccacagtcacaggagataaggactctgtgaacctgacctgctccacaaatgacactgga.
..............atctccatccgttggttcttcaaaaaccagagtctcccgtcc....
..tcggagaggatgaagctgtcccag..................................
........ggcaacaccaccctcagcataaaccctgtcaagagg......gaggatgctg
ggacgtattggtgtgaggtcttcaaccca............atcagtaagaaccaaagcg
accccatcatgctgaacgtaaact
>J03858|CEACAM1*01|Homo sapiens|F|EX7|
ataatgctctaccacaagaaaatggcctctcacctggggccattgctggcattgtgattg
gagtagtggccctggttgctctgatagcagtagccctggcatgttttctgcatttcggga
agaccggcag
>J03858|CEACAM1*01|Homo sapiens|F|EX8|
ggcaagcgaccagcgtgatctcacagagcacaaaccctcagtctccaaccaca
>J03858|CEACAM1*01|Homo sapiens|F|EX9|
ctcaggaccactccaatgacccacctaacaag
>J03858|CEACAM1*01|Homo sapiens|F|EX10|
atgaatgaagttacttattctaccctgaactttgaagcccagcaacccacacaaccaact
tcagcctccccatccctaacagccacagaaataatttattcagaagtaaaaaagcagtaa
>D12502|CEACAM1*02|Homo sapiens|F|EX1|
atggggcacctctcagccccacttcacagagtgcgtgtaccctggcaggggcttctgctc
acag
>D12502|CEACAM1*02|Homo sapiens|F|EX2|[D1]|V-LIKE|
cctcacttctaaccttctggaacccgcccaccactgcccagctcactactgaatccatgc
cattcaatgttgcagaggggaaggaggttcttctccttgtccacaatctgccccagcaa.
.................ctttttggctacagctggtacaaaggggaaagagtggatggca
accgtcaaattgtaggatatgcaatagga.........actcaacaagctaccccagggc
ccgcaaac...agcggtcgagagacaatatac......ccc......aatgcatccctgc
tgatccagaacgtcacccagaatgacacaggattctacaccctacaagtcataaagtca.
........gatcttgtgaatgaagaagcaactggacagttccatgtatacc
>D12502|CEACAM1*02|Homo sapiens|F|EX3|[D2]|C-LIKE|
........................cggagctgcccaagccctccatctccagcaacaact
ccaaccctgtggaggacaaggatgctgtggccttcacctgtgaacctgagactcaggac.
..............acaacctacctgtggtggataaacaatcagagcctcccggtc....
..agtcccaggctgcagctgtccaat..................................
........ggcaacaggaccctcactctactcagtgtcacaagg......aatgacacag
gaccctatgagtgtgaaatacagaaccca............gtgagtgcgaaccgcagtg
acccagtcaccttgaatgtcacct
>D12502|CEACAM1*02|Homo sapiens|F|EX4|[D3]|C-LIKE|
..................atggcccggacacccccaccatttccccttcatacacctatt
accgtcca.........ggggcaaacctcagcctctcctgctatgcagcctctaaccca.
...........cctgcacagtactcctggcttatcaatgga...................
..acattccagcaaagcaca........................................
..............caagagctctttatccctaacatcactgtg......aataatagtg
gatcctatacctgccacgccaataactca.........gtcactggctgcaacaggacca
cagtcaagacgatcatagtcactg
>D12502|CEACAM1*02|Homo sapiens|F|EX6|
agagacagaatctcaccatgttgcccgggctggactcgaactcctgggctcaagcaatcc
tcccatctgtttcccaaagtgctgagattacag
>D12502|CEACAM1*02|Homo sapiens|F|EX7|
ataatgctctaccacaagaaaatggcctctcacctggggccattgctggcattgtgattg
gagtagtggccctggttgctctgatagcagtagccctggcatgttttctgcatttcggga
agaccggcag
>D12502|CEACAM1*02|Homo sapiens|F|EX8|
ggcaagcgaccagcgtgatctcacagagcacaaaccctcagtgtccaaccaca
>D12502|CEACAM1*02|Homo sapiens|F|EX9|
ctcaggaccactccaatgacccacctaacaag
>D12502|CEACAM1*02|Homo sapiens|F|EX10|
atgaatgaagttacttattctaccctgaactttgaagcccagcaacccacacaaccaact
tcagcctccccatccctaacagccacagaaataatttattcagaagtaaaaaagcagtaa
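
The records above wrap each sequence over several lines and use dots as gap characters. A minimal sketch for reading such a listing and recovering the ungapped nucleotide sequence might look like the following. The helper names `read_fasta` and `ungap` are hypothetical, and the code assumes that every `.` in a sequence line is a gap and nothing else.

```python
def read_fasta(text: str) -> list:
    """Return (header, sequence) pairs, joining wrapped sequence lines."""
    records = []
    header = None
    chunks = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                records.append((header, "".join(chunks)))
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def ungap(sequence: str) -> str:
    """Drop gap dots (assumption: '.' is used only as a gap character)."""
    return sequence.replace(".", "")


# Usage with one short record copied from the listing above.
sample = """>J03858|CEACAM1*01|Homo sapiens|F|EX9|
ctcaggaccactccaatgacccacctaacaag
"""
for header, gapped in read_fasta(sample):
    nucleotides = ungap(gapped)
```

Keeping the gapped string alongside the ungapped one is useful when positions need to stay aligned between domains or alleles.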