Blood Res 2021; 56(S1):
Published online April 30, 2021
https://doi.org/10.5045/br.2021.2020332
© The Korean Society of Hematology
Correspondence to : Moon-Woo Seong, M.D., Ph.D.
Department of Laboratory Medicine, Seoul National University Hospital, Seoul National University College of Medicine, 101 Daehak-ro, Jongno-gu, Seoul 03080, Korea
E-mail: mwseong@snu.ac.kr
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Thalassemia is characterized by the impaired synthesis of globin chains due to disease-causing variants in α- or β-globin genes. In this review, we provide an overview of the molecular basis underlying α- and β-thalassemia, and of the current technologies used to characterize these disease-causing variants for the diagnosis of thalassemia. Understanding these molecular basis and technologies will prove to be beneficial for the accurate diagnosis of thalassemia.
Keywords Thalassemia, α-globin gene, β-globin gene
Inherited hemoglobin (Hb) disorders are classified into two disease groups: hemoglobinopathy caused by the structural defects in Hb and thalassemia caused by the impaired synthesis of globin, usually of normal structure; both are typically inherited in an autosomal recessive manner [1]. It has been estimated that 70,000 children are born with various forms of thalassemia annually [2]. The term “thalassemia” is derived from the Greek word “thalassa (sea),” as this inherited disease was first described in the Mediterranean region. Thalassemia is also highly prevalent in other regions of the world (e.g., Southeast Asia, the Indian subcontinent, and Africa), where malaria is endemic. Moreover, the prevalence of thalassemia has increased in recent years, due to an increase in migrant populations in other regions, including North America, Northern Europe, Australia, and South Korea. It is estimated that approximately 7% of the global population are thalassemia carriers [3].
Thalassemia is classified into α-, β-, δβ-, γδβ-, δ-, and γ-thalassemia, which are identified by the specific type of globin chain(s) where impaired synthesis occurs. For example, α- and β-thalassemia are deficient in the synthesis of α- and β-globin chains, respectively [4]. Among these various forms, the two most common forms are α- and β-thalassemia [5]. This review describes the genetic basis and molecular diagnosis of thalassemia, focusing on α- and β-thalassemia.
Human Hb consists of proteins with symmetric pairing of α-like and β-like globin dimers, which form a tetrameric structure, as well as functional units [6]. Individual α-like and β-like globin chains are encoded by two distinct gene clusters: the α-globin gene cluster on the short arm of chromosome 16 and the β-globin gene cluster on the short arm of chromosome 11. The α-globin gene cluster comprises three functional globin genes, the embryonic ζ gene (
The major pathophysiology of thalassemia is an imbalance in the ratio of globin chains, which is normally well controlled. Unbound globin chains (i.e., α-globin in β-thalassemia and β-globin in α-thalassemia) precipitate, leading to the destruction of erythroid precursors. RBC precursor damage leads to ineffective erythropoiesis in the bone marrow, and RBC hemolysis in circulation [8, 9].
The major cause of α-thalassemia is deletions involving one or more α-globin genes with variable lengths of the α-globin locus, which account for approximately 95% of α-thalassemia cases [10]. α-globin genes are duplicated and localized into two highly homologous units, and unequal crossover between these units during meiosis is likely to be the underlying mechanism of gene deletion [1]. Normal individuals have two α-globin genes on each chromosome. In diagnostic practice, α-thalassemia is classified into 1) α0-thalassemia, in which both α-globin genes are deleted (--/), and 2) α+-thalassemia, in which one of two α-globin genes is deleted (-α/) [11]. Less frequently, α+-thalassemia results from non-deletional variants [i.e., single nucleotide variants (SNV) or short insertion/deletions] [αTα/(α2-globin gene is affected) or ααT/(α1-globin gene is affected)] (Table 1).
Table 1 Common variants of each variant type of α-thalassemia.
Gene | Variant type | Common variant | Effect |
---|---|---|---|
α-globin gene | --/ | --SEA, --MED, --FIL | α0 |
-α/ | -α3.7 and -α4.2 | α+ | |
αTα/ | αIVS1(-5nt)α, αPA(AATAAG)α, αCSα | α+ | |
ααT/ | α+ |
In α0-thalassemia, the two most common deletion forms are (/--SEA, South Asia) and (/--MED, Mediterranean). Another type of rare deletion leading to α0-thalassemia is the deletion of the MCS, in which the α-globin genes remain intact, but are completely inactivated [5, 11]. In α-thalassemia, -α3.7 and -α4.2 are the most common forms. Unequal recombination between two homologous segments (Z boxes) that are 3.7 kb apart results in the formation of a chromosome with one α-globin gene (/-α3.7), and similarly, the other mispaired homologous segment (X boxes), 4.2 kb apart, produces the /-α4.2 allele [12]. The non-deletional type includes SNVs or short insertion/deletions in the α-globin gene, or in regions that affect α-globin expression. More than two-thirds of these variants are observed in the α2-globin gene, while less than one-third of the variants are observed in the α1-globin gene. The products of α2-globin genes account for the majority (∼two-thirds) of the total α-globin, while the α1-globin gene accounts for the remainder. Therefore, non-deletional variants in the α2-globin gene would elicit more severe effects than non-deletional variants in the α1-globin gene [13]. In addition, non-deletional variants lead to a greater reduction in α-globin chain expression than the single α-globin gene deletion form of thalassemia [12, 13]. The most common forms that occur in the α2-globin gene are αIVS1(-5nt)α (Mediterranean, 5 nucleotide deletion in IVS1), αPA(AATAAG)α (Middle East Asia, 3′ untranslated region [UTR] polyadenylation site variant), and αCSα (South Asia, stop codon variant resulting in protein extension by an additional 32 amino acids) [14]. Table 2 presents the distribution of the α-thalassemia genotypes identified in the Seoul National University Hospital, South Korea.
Table 2 Distribution of α-thalassemia genotypes identified in Seoul National University Hospital (South Korea).
α-thalassemia genotypes | N | % |
---|---|---|
--SEA/αα | 8 | 33.3 |
α3.7/αα | 5 | 20.8 |
--SEA/α3.7 | 4 | 16.7 |
α3.7/α3.7 | 2 | 8.3 |
α3.7/α212 patchwork | 2 | 8.3 |
--/αα (ATRA-16 syndrome) | 1 | 4.2 |
αα/αα (HS-40 deletion) | 1 | 4.2 |
--SEA/-α4.2 | 1 | 4.2 |
Total | 24 |
Unlike α-thalassemia, which is primarily caused by deletions, the majority of β-thalassemia-causing variants are non-deletions, including single nucleotide substitutions and short insertion/deletions leading to frameshift [15, 16]. This disorder is heterogeneous at the molecular level, and more than 300 variants of the β-globin gene have been identified thus far [16]. According to the degree of quantitative reduction in the production of β-globin, β-thalassemia alleles are classified into three categories: 1) the absence of β-globin (β0); 2) β-globin is produced but reduced (β+); and 3) β-globin production is minimally reduced (β++, also known as silent).
The categories of non-deletional variants of β-thalassemia are presented in Table 3. Non-deletional variants in β-thalassemia affect one of the following steps: transcription, RNA processing, or translation. Transcriptional variants involve promoter regions or the 5′ UTR. This category of variants generally results in mild, and occasionally silent, reduction of β-globin synthesis (β+ and β++). Variants that interfere with primary RNA transcript processing are usually found on either canonical or cryptic splice sites, and can lead to either β0, β+, or β++ thalassemia, depending on the proportion of abnormal mRNA transcripts produced. Other variants that reduce the efficiency of RNA processing include polyadenylation (poly-A) signal variants and those in the 3′ UTR, which generally cause β+ thalassemia [15, 17]. Variants that produce a premature termination codon (e.g., nonsense or frameshift) account for the most common types of β-thalassemia, and lead to β0 thalassemia. However, truncating variants in the last exon (exon 3) and the 3′ half of exon 2 are predicted to escape nonsense-mediated decay and produce truncated β-globin to form hemoglobin tetramers that are highly unstable and non-functional, with a dominant negative effect [16]. Variants involving the initiation codon (ATG) leading to β0 thalassemia have also been identified. In rare cases of β-thalassemia, deletions have been reported and classified into three categories: 1) deletions restricted to the β-globin gene from 105 bp to 67 kb in size; 2) deletions of β-LCR, leaving the β-globin gene intact, yet inactivated; and 3) deletions of β-LCR, which removes most of the β-globin gene cluster, including the β-globin gene [15, 16, 18]. Table 4 presents the molecular spectrum of β-thalassemia identified at the Seoul National University Hospital, South Korea.
Table 3 Types of variants in β-thalassemia.
Gene | Variant type | Effect |
---|---|---|
β-globin gene | Transcriptional variants | β+ or β++ |
Primary RNA transcript processing | β0, β+, or β++ | |
3’ UTR or poly-A site | β+ | |
Translation | ||
Initiation codon | β0 | |
Premature termination codon | β0 |
Abbreviation: UTR, untranslated region.
Table 4 Molecular spectrum of β-thalassemia identified in Seoul National University Hospital (South Korea).
HGVS nomenclature | Amino acid change | N of alleles | % |
---|---|---|---|
c.-138C>A | 1 | 0.6 | |
c.-81A>G | 2 | 1.2 | |
c.79G>A | p.Glu27Lys | 18 | 11.0 |
c.92G>A | p.Arg31Lys | 1 | 0.6 |
c.92+1G>A | 2 | 1.2 | |
c.92+1G>T | 2 | 1.2 | |
c.92+2T>C | 1 | 0.6 | |
c.92+5G>C | 3 | 1.8 | |
c.93-21G>A | 3 | 1.8 | |
c.93-1G>C | 2 | 1.2 | |
c.315+1G>A | 12 | 7.4 | |
c.316-197C>T | 1 | 0.6 | |
c.316-2A>G | 1 | 0.6 | |
c.1A>G | Start loss | 2 | 1.2 |
c.2T>G | start loss | 25 | 15.3 |
c.48G>A | p.Trp16* | 6 | 3.7 |
c.52A>T | p.Lys18* | 18 | 11.0 |
c.114G>A | p.Trp38* | 4 | 2.5 |
c.118C>T | p.Gln40* | 1 | 0.6 |
c.364G>T | p.Glu122* | 13 | 8.0 |
c.394C>T | p.Gln132* | 1 | 0.6 |
c.25_26del | p.Lys9Valfs*14 | 1 | 0.6 |
c.27dup | p.Ser10Valfs*14 | 8 | 4.9 |
c.126_129del | p.Phe42Leufs*19 | 15 | 9.2 |
c.201del | p.Val68Cysfs*22 | 1 | 0.6 |
c.217dup | p.Ser72Lysfs*2 | 2 | 1.2 |
c.253_254del | p.Thr85Leufs*6 | 1 | 0.6 |
c.270_271del | p.Ser90Argfs*5 | 2 | 1.2 |
c.287dup | p.Leu97Alafs*6 | 1 | 0.6 |
c.378_379del | p.Val127Alafs*13 | 1 | 0.6 |
c.182T>A | p.Val61Glu | 1 | 0.6 |
c.383A>G | p.Gln128Arg | 2 | 1.2 |
7 | 4.3 | ||
Beta-globin gene cluster deletion | 1 | 0.6 | |
1 | 0.6 | ||
Total | 163 |
A number of molecular techniques for detecting globin gene variants have been developed. Different strategies should be applied to each variant type, which can be divided into two groups: 1) non-deletional variants, including single nucleotide substitutions and short insertion/deletions, and 2) large deletions and duplications. Disease-causing variants in thalassemia are often population specific, and each population has frequently detected thalassemia alleles [19-21]. Occasionally, the clinical manifestations of thalassemia depend on the type of variant and its location within the gene. For example, in α-thalassemia, non-deletional variants of α-globin genes are associated with more severe phenotypes compared to large deletions [7]. Thus, strategic selection according to the type of variant associated with a specific population and clinical phenotype need to be made in a diagnostic laboratory.
The following discussion focuses on the molecular techniques available in a clinical laboratory, depending on the type of variant and prior knowledge of the variant to be examined (Table 5).
Table 5 Molecular diagnostic methods for thalassemia.
Mutation type | Method |
---|---|
Deletion | Gap PCR |
MLPAa,b) | |
Non-deletion | Allele-specific PCR |
Reverse dot blotting | |
Denaturing gradient gel electrophoresis | |
ARMS | |
Sanger sequencinga) | |
Next-generation sequencinga) |
a)Methods currently applied in clinical laboratories in Korea. b)Commercial kits available [SALSA MLPA Probemix P140 and P102 (MRC-Holland, Amsterdam, The Netherlands)].
Abbreviations: ARMS, amplification refractory mutation system; MLPA, multiplex ligation-dependent probe amplification; PCR, polymerase chain reaction.
Gap-polymerase chain reaction can be applied to common deletions in a specific population, using primers flanking known breakpoints. Common single α-globin-gene deletions include a 3.7 kb deletion (-α3.7) and a 4.2 kb deletion (-α4.2). Moreover, two common α-globin-gene deletions include founder variants in specific populations, such as --SEA (Southeast Asian), --FIL (Filipino), and --MED (Mediterranean). The multiplex ligation-dependent probe amplification (MLPA) method is another technique for characterizing deletions in thalassemia, which can detect both known and unknown deletions. MLPA uses two separate oligonucleotide probes (left probe oligonucleotide and right probe oligonucleotide) that are hybridized to adjacent target sequences and ligated. Ligated probes are amplified via PCR, and the amount of amplified probe ligation products enables the quantification of gene copy numbers. MLPA is simple to perform in clinical laboratories and is suitable for the detection of various deletions [4, 22].
Several cost-effective methods, such as allele-specific PCR, reverse dot blotting, denaturing gradient gel electrophoresis, and the amplification refractory mutation system can be applied to detect common sequence variants (e.g., Hb Constant Spring in South Asia). Sanger sequencing (i.e., direct DNA sequencing) is currently the most practical method to comprehensively detect all variants without prior knowledge of variants [22]. During Sanger sequencing, the PCR product is obtained and subsequently sequenced using the Sanger dideoxy termination method [4]. However, sequencing of α-globin genes is complex, as two α-globin genes (
Thalassemia is one of the most commonly inherited Hb disorders. The genetic basis of α- and β-thalassemia and molecular techniques applicable in a clinical laboratory for the diagnosis of thalassemia have been described. Understanding the genetic basis of thalassemia and these molecular techniques will have a strong impact on the accurate molecular diagnosis of thalassemia.
No potential conflicts of interest relevant to this article were reported.
Blood Res 2021; 56(S1): S39-S43
Published online April 30, 2021 https://doi.org/10.5045/br.2021.2020332
Copyright © The Korean Society of Hematology.
Jee-Soo Lee, Sung Im Cho, Sung Sup Park, Moon-Woo Seong
Department of Laboratory Medicine, Seoul National University Hospital, Seoul National University College of Medicine, Seoul, Korea
Correspondence to:Moon-Woo Seong, M.D., Ph.D.
Department of Laboratory Medicine, Seoul National University Hospital, Seoul National University College of Medicine, 101 Daehak-ro, Jongno-gu, Seoul 03080, Korea
E-mail: mwseong@snu.ac.kr
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Thalassemia is characterized by the impaired synthesis of globin chains due to disease-causing variants in α- or β-globin genes. In this review, we provide an overview of the molecular basis underlying α- and β-thalassemia, and of the current technologies used to characterize these disease-causing variants for the diagnosis of thalassemia. Understanding these molecular basis and technologies will prove to be beneficial for the accurate diagnosis of thalassemia.
Keywords: Thalassemia, α-globin gene, β-globin gene
Inherited hemoglobin (Hb) disorders are classified into two disease groups: hemoglobinopathy caused by the structural defects in Hb and thalassemia caused by the impaired synthesis of globin, usually of normal structure; both are typically inherited in an autosomal recessive manner [1]. It has been estimated that 70,000 children are born with various forms of thalassemia annually [2]. The term “thalassemia” is derived from the Greek word “thalassa (sea),” as this inherited disease was first described in the Mediterranean region. Thalassemia is also highly prevalent in other regions of the world (e.g., Southeast Asia, the Indian subcontinent, and Africa), where malaria is endemic. Moreover, the prevalence of thalassemia has increased in recent years, due to an increase in migrant populations in other regions, including North America, Northern Europe, Australia, and South Korea. It is estimated that approximately 7% of the global population are thalassemia carriers [3].
Thalassemia is classified into α-, β-, δβ-, γδβ-, δ-, and γ-thalassemia, which are identified by the specific type of globin chain(s) where impaired synthesis occurs. For example, α- and β-thalassemia are deficient in the synthesis of α- and β-globin chains, respectively [4]. Among these various forms, the two most common forms are α- and β-thalassemia [5]. This review describes the genetic basis and molecular diagnosis of thalassemia, focusing on α- and β-thalassemia.
Human Hb consists of proteins with symmetric pairing of α-like and β-like globin dimers, which form a tetrameric structure, as well as functional units [6]. Individual α-like and β-like globin chains are encoded by two distinct gene clusters: the α-globin gene cluster on the short arm of chromosome 16 and the β-globin gene cluster on the short arm of chromosome 11. The α-globin gene cluster comprises three functional globin genes, the embryonic ζ gene (
The major pathophysiology of thalassemia is an imbalance in the ratio of globin chains, which is normally well controlled. Unbound globin chains (i.e., α-globin in β-thalassemia and β-globin in α-thalassemia) precipitate, leading to the destruction of erythroid precursors. RBC precursor damage leads to ineffective erythropoiesis in the bone marrow, and RBC hemolysis in circulation [8, 9].
The major cause of α-thalassemia is deletions involving one or more α-globin genes with variable lengths of the α-globin locus, which account for approximately 95% of α-thalassemia cases [10]. α-globin genes are duplicated and localized into two highly homologous units, and unequal crossover between these units during meiosis is likely to be the underlying mechanism of gene deletion [1]. Normal individuals have two α-globin genes on each chromosome. In diagnostic practice, α-thalassemia is classified into 1) α0-thalassemia, in which both α-globin genes are deleted (--/), and 2) α+-thalassemia, in which one of two α-globin genes is deleted (-α/) [11]. Less frequently, α+-thalassemia results from non-deletional variants [i.e., single nucleotide variants (SNV) or short insertion/deletions] [αTα/(α2-globin gene is affected) or ααT/(α1-globin gene is affected)] (Table 1).
Table 1 . Common variants of each variant type of α-thalassemia..
Gene | Variant type | Common variant | Effect |
---|---|---|---|
α-globin gene | --/ | --SEA, --MED, --FIL | α0 |
-α/ | -α3.7 and -α4.2 | α+ | |
αTα/ | αIVS1(-5nt)α, αPA(AATAAG)α, αCSα | α+ | |
ααT/ | α+ |
In α0-thalassemia, the two most common deletion forms are (/--SEA, South Asia) and (/--MED, Mediterranean). Another type of rare deletion leading to α0-thalassemia is the deletion of the MCS, in which the α-globin genes remain intact, but are completely inactivated [5, 11]. In α-thalassemia, -α3.7 and -α4.2 are the most common forms. Unequal recombination between two homologous segments (Z boxes) that are 3.7 kb apart results in the formation of a chromosome with one α-globin gene (/-α3.7), and similarly, the other mispaired homologous segment (X boxes), 4.2 kb apart, produces the /-α4.2 allele [12]. The non-deletional type includes SNVs or short insertion/deletions in the α-globin gene, or in regions that affect α-globin expression. More than two-thirds of these variants are observed in the α2-globin gene, while less than one-third of the variants are observed in the α1-globin gene. The products of α2-globin genes account for the majority (∼two-thirds) of the total α-globin, while the α1-globin gene accounts for the remainder. Therefore, non-deletional variants in the α2-globin gene would elicit more severe effects than non-deletional variants in the α1-globin gene [13]. In addition, non-deletional variants lead to a greater reduction in α-globin chain expression than the single α-globin gene deletion form of thalassemia [12, 13]. The most common forms that occur in the α2-globin gene are αIVS1(-5nt)α (Mediterranean, 5 nucleotide deletion in IVS1), αPA(AATAAG)α (Middle East Asia, 3′ untranslated region [UTR] polyadenylation site variant), and αCSα (South Asia, stop codon variant resulting in protein extension by an additional 32 amino acids) [14]. Table 2 presents the distribution of the α-thalassemia genotypes identified in the Seoul National University Hospital, South Korea.
Table 2 . Distribution of α-thalassemia genotypes identified in Seoul National University Hospital (South Korea)..
α-thalassemia genotypes | N | % |
---|---|---|
--SEA/αα | 8 | 33.3 |
α3.7/αα | 5 | 20.8 |
--SEA/α3.7 | 4 | 16.7 |
α3.7/α3.7 | 2 | 8.3 |
α3.7/α212 patchwork | 2 | 8.3 |
--/αα (ATRA-16 syndrome) | 1 | 4.2 |
αα/αα (HS-40 deletion) | 1 | 4.2 |
--SEA/-α4.2 | 1 | 4.2 |
Total | 24 |
Unlike α-thalassemia, which is primarily caused by deletions, the majority of β-thalassemia-causing variants are non-deletions, including single nucleotide substitutions and short insertion/deletions leading to frameshift [15, 16]. This disorder is heterogeneous at the molecular level, and more than 300 variants of the β-globin gene have been identified thus far [16]. According to the degree of quantitative reduction in the production of β-globin, β-thalassemia alleles are classified into three categories: 1) the absence of β-globin (β0); 2) β-globin is produced but reduced (β+); and 3) β-globin production is minimally reduced (β++, also known as silent).
The categories of non-deletional variants of β-thalassemia are presented in Table 3. Non-deletional variants in β-thalassemia affect one of the following steps: transcription, RNA processing, or translation. Transcriptional variants involve promoter regions or the 5′ UTR. This category of variants generally results in mild, and occasionally silent, reduction of β-globin synthesis (β+ and β++). Variants that interfere with primary RNA transcript processing are usually found on either canonical or cryptic splice sites, and can lead to either β0, β+, or β++ thalassemia, depending on the proportion of abnormal mRNA transcripts produced. Other variants that reduce the efficiency of RNA processing include polyadenylation (poly-A) signal variants and those in the 3′ UTR, which generally cause β+ thalassemia [15, 17]. Variants that produce a premature termination codon (e.g., nonsense or frameshift) account for the most common types of β-thalassemia, and lead to β0 thalassemia. However, truncating variants in the last exon (exon 3) and the 3′ half of exon 2 are predicted to escape nonsense-mediated decay and produce truncated β-globin to form hemoglobin tetramers that are highly unstable and non-functional, with a dominant negative effect [16]. Variants involving the initiation codon (ATG) leading to β0 thalassemia have also been identified. In rare cases of β-thalassemia, deletions have been reported and classified into three categories: 1) deletions restricted to the β-globin gene from 105 bp to 67 kb in size; 2) deletions of β-LCR, leaving the β-globin gene intact, yet inactivated; and 3) deletions of β-LCR, which removes most of the β-globin gene cluster, including the β-globin gene [15, 16, 18]. Table 4 presents the molecular spectrum of β-thalassemia identified at the Seoul National University Hospital, South Korea.
Table 3 . Types of variants in β-thalassemia..
Gene | Variant type | Effect |
---|---|---|
β-globin gene | Transcriptional variants | β+ or β++ |
Primary RNA transcript processing | β0, β+, or β++ | |
3’ UTR or poly-A site | β+ | |
Translation | ||
Initiation codon | β0 | |
Premature termination codon | β0 |
Abbreviation: UTR, untranslated region..
Table 4 . Molecular spectrum of β-thalassemia identified in Seoul National University Hospital (South Korea)..
HGVS nomenclature | Amino acid change | N of alleles | % |
---|---|---|---|
c.-138C>A | 1 | 0.6 | |
c.-81A>G | 2 | 1.2 | |
c.79G>A | p.Glu27Lys | 18 | 11.0 |
c.92G>A | p.Arg31Lys | 1 | 0.6 |
c.92+1G>A | 2 | 1.2 | |
c.92+1G>T | 2 | 1.2 | |
c.92+2T>C | 1 | 0.6 | |
c.92+5G>C | 3 | 1.8 | |
c.93-21G>A | 3 | 1.8 | |
c.93-1G>C | 2 | 1.2 | |
c.315+1G>A | 12 | 7.4 | |
c.316-197C>T | 1 | 0.6 | |
c.316-2A>G | 1 | 0.6 | |
c.1A>G | Start loss | 2 | 1.2 |
c.2T>G | start loss | 25 | 15.3 |
c.48G>A | p.Trp16* | 6 | 3.7 |
c.52A>T | p.Lys18* | 18 | 11.0 |
c.114G>A | p.Trp38* | 4 | 2.5 |
c.118C>T | p.Gln40* | 1 | 0.6 |
c.364G>T | p.Glu122* | 13 | 8.0 |
c.394C>T | p.Gln132* | 1 | 0.6 |
c.25_26del | p.Lys9Valfs*14 | 1 | 0.6 |
c.27dup | p.Ser10Valfs*14 | 8 | 4.9 |
c.126_129del | p.Phe42Leufs*19 | 15 | 9.2 |
c.201del | p.Val68Cysfs*22 | 1 | 0.6 |
c.217dup | p.Ser72Lysfs*2 | 2 | 1.2 |
c.253_254del | p.Thr85Leufs*6 | 1 | 0.6 |
c.270_271del | p.Ser90Argfs*5 | 2 | 1.2 |
c.287dup | p.Leu97Alafs*6 | 1 | 0.6 |
c.378_379del | p.Val127Alafs*13 | 1 | 0.6 |
c.182T>A | p.Val61Glu | 1 | 0.6 |
c.383A>G | p.Gln128Arg | 2 | 1.2 |
7 | 4.3 | ||
Beta-globin gene cluster deletion | 1 | 0.6 | |
1 | 0.6 | ||
Total | 163 |
A number of molecular techniques for detecting globin gene variants have been developed. Different strategies should be applied to each variant type, which can be divided into two groups: 1) non-deletional variants, including single nucleotide substitutions and short insertion/deletions, and 2) large deletions and duplications. Disease-causing variants in thalassemia are often population specific, and each population has frequently detected thalassemia alleles [19-21]. Occasionally, the clinical manifestations of thalassemia depend on the type of variant and its location within the gene. For example, in α-thalassemia, non-deletional variants of α-globin genes are associated with more severe phenotypes compared to large deletions [7]. Thus, strategic selection according to the type of variant associated with a specific population and clinical phenotype need to be made in a diagnostic laboratory.
The following discussion focuses on the molecular techniques available in a clinical laboratory, depending on the type of variant and prior knowledge of the variant to be examined (Table 5).
Table 5 . Molecular diagnostic methods for thalassemia..
Mutation type | Method |
---|---|
Deletion | Gap PCR |
MLPAa,b) | |
Non-deletion | Allele-specific PCR |
Reverse dot blotting | |
Denaturing gradient gel electrophoresis | |
ARMS | |
Sanger sequencinga) | |
Next-generation sequencinga) |
a)Methods currently applied in clinical laboratories in Korea. b)Commercial kits available [SALSA MLPA Probemix P140 and P102 (MRC-Holland, Amsterdam, The Netherlands)]..
Abbreviations: ARMS, amplification refractory mutation system; MLPA, multiplex ligation-dependent probe amplification; PCR, polymerase chain reaction..
Gap-polymerase chain reaction can be applied to common deletions in a specific population, using primers flanking known breakpoints. Common single α-globin-gene deletions include a 3.7 kb deletion (-α3.7) and a 4.2 kb deletion (-α4.2). Moreover, two common α-globin-gene deletions include founder variants in specific populations, such as --SEA (Southeast Asian), --FIL (Filipino), and --MED (Mediterranean). The multiplex ligation-dependent probe amplification (MLPA) method is another technique for characterizing deletions in thalassemia, which can detect both known and unknown deletions. MLPA uses two separate oligonucleotide probes (left probe oligonucleotide and right probe oligonucleotide) that are hybridized to adjacent target sequences and ligated. Ligated probes are amplified via PCR, and the amount of amplified probe ligation products enables the quantification of gene copy numbers. MLPA is simple to perform in clinical laboratories and is suitable for the detection of various deletions [4, 22].
Several cost-effective methods, such as allele-specific PCR, reverse dot blotting, denaturing gradient gel electrophoresis, and the amplification refractory mutation system can be applied to detect common sequence variants (e.g., Hb Constant Spring in South Asia). Sanger sequencing (i.e., direct DNA sequencing) is currently the most practical method to comprehensively detect all variants without prior knowledge of variants [22]. During Sanger sequencing, the PCR product is obtained and subsequently sequenced using the Sanger dideoxy termination method [4]. However, sequencing of α-globin genes is complex, as two α-globin genes (
Thalassemia is one of the most commonly inherited Hb disorders. The genetic basis of α- and β-thalassemia and molecular techniques applicable in a clinical laboratory for the diagnosis of thalassemia have been described. Understanding the genetic basis of thalassemia and these molecular techniques will have a strong impact on the accurate molecular diagnosis of thalassemia.
No potential conflicts of interest relevant to this article were reported.
Table 1 . Common variants of each variant type of α-thalassemia..
Gene | Variant type | Common variant | Effect |
---|---|---|---|
α-globin gene | --/ | --SEA, --MED, --FIL | α0 |
-α/ | -α3.7 and -α4.2 | α+ | |
αTα/ | αIVS1(-5nt)α, αPA(AATAAG)α, αCSα | α+ | |
ααT/ | α+ |
Table 2 . Distribution of α-thalassemia genotypes identified in Seoul National University Hospital (South Korea)..
α-thalassemia genotypes | N | % |
---|---|---|
--SEA/αα | 8 | 33.3 |
α3.7/αα | 5 | 20.8 |
--SEA/α3.7 | 4 | 16.7 |
α3.7/α3.7 | 2 | 8.3 |
α3.7/α212 patchwork | 2 | 8.3 |
--/αα (ATRA-16 syndrome) | 1 | 4.2 |
αα/αα (HS-40 deletion) | 1 | 4.2 |
--SEA/-α4.2 | 1 | 4.2 |
Total | 24 |
Table 3 . Types of variants in β-thalassemia..
Gene | Variant type | Effect |
---|---|---|
β-globin gene | Transcriptional variants | β+ or β++ |
Primary RNA transcript processing | β0, β+, or β++ | |
3’ UTR or poly-A site | β+ | |
Translation | ||
Initiation codon | β0 | |
Premature termination codon | β0 |
Abbreviation: UTR, untranslated region..
Table 4 . Molecular spectrum of β-thalassemia identified in Seoul National University Hospital (South Korea)..
HGVS nomenclature | Amino acid change | N of alleles | % |
---|---|---|---|
c.-138C>A | 1 | 0.6 | |
c.-81A>G | 2 | 1.2 | |
c.79G>A | p.Glu27Lys | 18 | 11.0 |
c.92G>A | p.Arg31Lys | 1 | 0.6 |
c.92+1G>A | 2 | 1.2 | |
c.92+1G>T | 2 | 1.2 | |
c.92+2T>C | 1 | 0.6 | |
c.92+5G>C | 3 | 1.8 | |
c.93-21G>A | 3 | 1.8 | |
c.93-1G>C | 2 | 1.2 | |
c.315+1G>A | 12 | 7.4 | |
c.316-197C>T | 1 | 0.6 | |
c.316-2A>G | 1 | 0.6 | |
c.1A>G | Start loss | 2 | 1.2 |
c.2T>G | start loss | 25 | 15.3 |
c.48G>A | p.Trp16* | 6 | 3.7 |
c.52A>T | p.Lys18* | 18 | 11.0 |
c.114G>A | p.Trp38* | 4 | 2.5 |
c.118C>T | p.Gln40* | 1 | 0.6 |
c.364G>T | p.Glu122* | 13 | 8.0 |
c.394C>T | p.Gln132* | 1 | 0.6 |
c.25_26del | p.Lys9Valfs*14 | 1 | 0.6 |
c.27dup | p.Ser10Valfs*14 | 8 | 4.9 |
c.126_129del | p.Phe42Leufs*19 | 15 | 9.2 |
c.201del | p.Val68Cysfs*22 | 1 | 0.6 |
c.217dup | p.Ser72Lysfs*2 | 2 | 1.2 |
c.253_254del | p.Thr85Leufs*6 | 1 | 0.6 |
c.270_271del | p.Ser90Argfs*5 | 2 | 1.2 |
c.287dup | p.Leu97Alafs*6 | 1 | 0.6 |
c.378_379del | p.Val127Alafs*13 | 1 | 0.6 |
c.182T>A | p.Val61Glu | 1 | 0.6 |
c.383A>G | p.Gln128Arg | 2 | 1.2 |
7 | 4.3 | ||
Beta-globin gene cluster deletion | 1 | 0.6 | |
1 | 0.6 | ||
Total | 163 |
Table 5 . Molecular diagnostic methods for thalassemia..
Mutation type | Method |
---|---|
Deletion | Gap PCR |
MLPAa,b) | |
Non-deletion | Allele-specific PCR |
Reverse dot blotting | |
Denaturing gradient gel electrophoresis | |
ARMS | |
Sanger sequencinga) | |
Next-generation sequencinga) |
a)Methods currently applied in clinical laboratories in Korea. b)Commercial kits available [SALSA MLPA Probemix P140 and P102 (MRC-Holland, Amsterdam, The Netherlands)]..
Abbreviations: ARMS, amplification refractory mutation system; MLPA, multiplex ligation-dependent probe amplification; PCR, polymerase chain reaction..
Young Kyung Lee, Hee-Jin Kim, Kyunghoon Lee, Sang Hyuk Park, Sang Hoon Song, Moon-Woo Seong, Myungshin Kim, and Jin Yeong Han
Blood Res 2019; 54(1): 17-22Seyed Ali Mohammad Arabzadeh, Farideh Alizadeh, Ahmad Tavakoli, Hamidreza Mollaei, Farah Bokharaei-Salim, Gharib Karimi, Mohammad Farahmand, Helya Sadat Mortazavi, and Seyed Hamidreza Monavari
Blood Res 2017; 52(1): 50-54Eun Sil Park, Hye Lim Jung, Hee-Jin Kim, Sung Sup Park, Soon Hwan Bae, Hee Young Shin, Sang Hoon Song, Kyung-Nam Koh, Chuhl Joo Lyu, Young Tak Lim, Dong Kyun Han, and Jeong Ok Hah
Blood Res 2013; 48(3): 211-216