IMGT reference sequences in FASTA format:

Nucleotide sequences without gaps

The FASTA header contains:
IMGT accession number | gene and allele name | species | functionality | exon name | domain name | domain type | alternative splicing (if applicable) | partial (if applicable)
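The pipe-separated header layout described above can be split into named fields. The following is a minimal sketch, not an IMGT tool: the function name `parse_imgt_header` and the key names in `FIELDS` are illustrative assumptions, and trailing fields that a header omits are returned as empty strings.

```python
from typing import Dict

# Key names are hypothetical labels for the nine header positions listed above.
FIELDS = [
    "accession",
    "gene_allele",
    "species",
    "functionality",
    "exon",
    "domain_name",
    "domain_type",
    "alternative_splicing",
    "partial",
]


def parse_imgt_header(line: str) -> Dict[str, str]:
    """Split one FASTA header line into a dict keyed by FIELDS."""
    parts = line.strip().lstrip(">").split("|")
    # Headers in this listing stop after the last field they use,
    # so pad the missing trailing positions with empty strings.
    parts += [""] * (len(FIELDS) - len(parts))
    return dict(zip(FIELDS, (p.strip() for p in parts)))


header = parse_imgt_header(">J03858|CEACAM1*01|Homo sapiens|F|EX2|[D1]|V-LIKE|")
# Gene and allele are joined by "*" in the second field.
gene, _, allele = header["gene_allele"].partition("*")
```

With the header shown, `gene` holds the gene name and `allele` the allele number, while `alternative_splicing` and `partial` stay empty because that header does not supply them.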
>J03858|CEACAM1*01|Homo sapiens|F|EX1|
atggggcacctctcagccccacttcacagagtgcgtgtaccctggcaggggcttctgctc
acag
>J03858|CEACAM1*01|Homo sapiens|F|EX2|[D1]|V-LIKE|
cctcacttctaaccttctggaacccgcccaccactgcccagctcactactgaatccatgc
cattcaatgttgcagaggggaaggaggttcttctccttgtccacaatctgccccagcaac
tttttggctacagctggtacaaaggggaaagagtggatggcaaccgtcaaattgtaggat
atgcaataggaactcaacaagctaccccagggcccgcaaacagcggtcgagagacaatat
accccaatgcatccctgctgatccagaacgtcacccagaatgacacaggattctacaccc
tacaagtcataaagtcagatcttgtgaatgaagaagcaactggacagttccatgtatacc
>J03858|CEACAM1*01|Homo sapiens|F|EX3|[D2]|C1-LIKE|
cggagctgcccaagccctccatctccagcaacaactccaaccctgtggaggacaaggatg
ctgtggccttcacctgtgaacctgagactcaggacacaacctacctgtggtggataaaca
atcagagcctcccggtcagtcccaggctgcagctgtccaatggcaacaggaccctcactc
tactcagtgtcacaaggaatgacacaggaccctatgagtgtgaaatacagaacccagtga
gtgcgaaccgcagtgacccagtcaccttgaatgtcacct
>J03858|CEACAM1*01|Homo sapiens|F|EX4|[D3]|C2-LIKE|
atggcccggacacccccaccatttccccttcagacacctattaccgtccaggggcaaacc
tcagcctctcctgctatgcagcctctaacccacctgcacagtactcctggcttatcaatg
gaacattccagcaaagcacacaagagctctttatccctaacatcactgtgaataatagtg
gatcctatacctgccacgccaataactcagtcactggctgcaacaggaccacagtcaaga
cgatcatagtcactg
>J03858|CEACAM1*01|Homo sapiens|F|EX5|[D4]|C3-LIKE|
agctaagtccagtagtagcaaagccccaaatcaaagccagcaagaccacagtcacaggag
ataaggactctgtgaacctgacctgctccacaaatgacactggaatctccatccgttggt
tcttcaaaaaccagagtctcccgtcctcggagaggatgaagctgtcccagggcaacacca
ccctcagcataaaccctgtcaagagggaggatgctgggacgtattggtgtgaggtcttca
acccaatcagtaagaaccaaagcgaccccatcatgctgaacgtaaact
>J03858|CEACAM1*01|Homo sapiens|F|EX7|
ataatgctctaccacaagaaaatggcctctcacctggggccattgctggcattgtgattg
gagtagtggccctggttgctctgatagcagtagccctggcatgttttctgcatttcggga
agaccggcag
>J03858|CEACAM1*01|Homo sapiens|F|EX8|
ggcaagcgaccagcgtgatctcacagagcacaaaccctcagtctccaaccaca
>J03858|CEACAM1*01|Homo sapiens|F|EX9|
ctcaggaccactccaatgacccacctaacaag
>J03858|CEACAM1*01|Homo sapiens|F|EX10|
atgaatgaagttacttattctaccctgaactttgaagcccagcaacccacacaaccaact
tcagcctccccatccctaacagccacagaaataatttattcagaagtaaaaaagcagtaa
>J03858|CEACAM1*01|Homo sapiens|F|EX1-5, EX7-10|
atggggcacctctcagccccacttcacagagtgcgtgtaccctggcaggggcttctgctc
acagcctcacttctaaccttctggaacccgcccaccactgcccagctcactactgaatcc
atgccattcaatgttgcagaggggaaggaggttcttctccttgtccacaatctgccccag
caactttttggctacagctggtacaaaggggaaagagtggatggcaaccgtcaaattgta
ggatatgcaataggaactcaacaagctaccccagggcccgcaaacagcggtcgagagaca
atataccccaatgcatccctgctgatccagaacgtcacccagaatgacacaggattctac
accctacaagtcataaagtcagatcttgtgaatgaagaagcaactggacagttccatgta
tacccggagctgcccaagccctccatctccagcaacaactccaaccctgtggaggacaag
gatgctgtggccttcacctgtgaacctgagactcaggacacaacctacctgtggtggata
aacaatcagagcctcccggtcagtcccaggctgcagctgtccaatggcaacaggaccctc
actctactcagtgtcacaaggaatgacacaggaccctatgagtgtgaaatacagaaccca
gtgagtgcgaaccgcagtgacccagtcaccttgaatgtcacctatggcccggacaccccc
accatttccccttcagacacctattaccgtccaggggcaaacctcagcctctcctgctat
gcagcctctaacccacctgcacagtactcctggcttatcaatggaacattccagcaaagc
acacaagagctctttatccctaacatcactgtgaataatagtggatcctatacctgccac
gccaataactcagtcactggctgcaacaggaccacagtcaagacgatcatagtcactgag
ctaagtccagtagtagcaaagccccaaatcaaagccagcaagaccacagtcacaggagat
aaggactctgtgaacctgacctgctccacaaatgacactggaatctccatccgttggttc
ttcaaaaaccagagtctcccgtcctcggagaggatgaagctgtcccagggcaacaccacc
ctcagcataaaccctgtcaagagggaggatgctgggacgtattggtgtgaggtcttcaac
ccaatcagtaagaaccaaagcgaccccatcatgctgaacgtaaactataatgctctacca
caagaaaatggcctctcacctggggccattgctggcattgtgattggagtagtggccctg
gttgctctgatagcagtagccctggcatgttttctgcatttcgggaagaccggcagggca
agcgaccagcgtgatctcacagagcacaaaccctcagtctccaaccacactcaggaccac
tccaatgacccacctaacaagatgaatgaagttacttattctaccctgaactttgaagcc
cagcaacccacacaaccaacttcagcctccccatccctaacagccacagaaataatttat
tcagaagtaaaaaagcagtaa
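
The records above wrap each sequence over several lines, and the final record (EX1-5, EX7-10) appears to be the per-exon sequences joined in order. The sketch below shows one way to read such records and join selected exons. The helper names `read_fasta`, `exon_label`, and `join_exons` are assumptions made for illustration, and the example text is a three-record excerpt copied from this listing rather than the full file.

```python
from typing import List, Tuple


def read_fasta(text: str) -> List[Tuple[str, str]]:
    """Return (header, sequence) pairs, joining wrapped sequence lines."""
    records: List[Tuple[str, str]] = []
    header = None
    chunks: List[str] = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records.append((header, "".join(chunks)))
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def exon_label(header: str) -> str:
    """Exon name is the fifth pipe-separated field of the header."""
    return header.split("|")[4].strip()


def join_exons(records: List[Tuple[str, str]], labels: List[str]) -> str:
    """Concatenate the sequences of the named exons in the given order."""
    by_label = {exon_label(h): s for h, s in records}
    return "".join(by_label[name] for name in labels)


example = """\
>J03858|CEACAM1*01|Homo sapiens|F|EX1|
atggggcacctctcagccccacttcacagagtgcgtgtaccctggcaggggcttctgctc
acag
>J03858|CEACAM1*01|Homo sapiens|F|EX8|
ggcaagcgaccagcgtgatctcacagagcacaaaccctcagtctccaaccaca
>J03858|CEACAM1*01|Homo sapiens|F|EX9|
ctcaggaccactccaatgacccacctaacaag
"""

records = read_fasta(example)
joined = join_exons(records, ["EX8", "EX9"])
```

Run against the complete listing, joining EX1 to EX5 and EX7 to EX10 in order and comparing the result with the last record would be a quick consistency check on a downloaded file.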