Introduction

The domain model of eukaryotic genome organization (Bodnar 1988; Goldman 1988) was proposed almost 30 years ago to account for the analysis of the generalized DNase I sensitivity of individual genes and genomic segments. The generalized sensitivity to DNase I is believed to reflect a partial distortion of higher orders of chromatin packaging and manifests itself in a randomly increased sensitivity toward DNase I over fairly long chromatin segments (up to hundreds of kilobases). In contrast, DNase I hypersensitive sites are commonly considered to be short nucleosome-free regions which are much more susceptible to DNase I than the regions of generalized sensitivity (Cockerill 2011). Although initially it was observed that active genes per se were preferentially digested by DNase I in isolated nuclei or permeabilized cells (Garel and Axel 1976; Weintraub and Groudine 1976; Weintraub et al. 1981), further studies demonstrated that relatively long genomic regions comprised of tissue-specific genes rather than individual genes resided in DNase I-sensitive or DNase I-resistant configurations in a lineage-specific fashion (Jantzen et al. 1986; Lawson et al. 1982). It was proposed that the whole genome was built from similarly organized structural-functional units (domains) that may be either active or repressed. These “chromatin domains” were defined as large genomic areas where changes of the transcriptional status and chromatin organization occurred independently of the flanking regions. The transcriptional status of domains was thought to be controlled by chromatin packaging. Each domain could be either folded (inactive) or unfolded (active) (Bodnar 1988; Goldman 1988). The differences in the pattern of chromatin domain folding could explain the differential sensitivity of these domains to DNase I (Fig. 1). Concurrently, several research teams developed the concept of the genome folded in closed loops and fixed to a proteinaceous skeletal structure both in interphase nuclei and metaphase chromosomes (Adolph et al. 1977; Benyajati and Worcel 1976; Cook et al. 1976). The sizes of these loops were reported to be in the range of 50–250 Kbs (reviewed in Razin et al. 1995). Although it was tempting to propose a direct link between these loops and the structural-functional genomic units postulated by the domain model (Fig. 1), subsequent studies did not provide substantial experimental support for this idea as profiles of DNA partitioning into loops were proved to be function-related (Ciejek et al. 1983; Cook et al. 1982; Small and Vogelstein 1985), while the domain model assumed them to be invariant.

Fig. 1
figure 1

Domain model of eukaryotic genome organization. Two compressed (inactive) and one unfolded (active) domains are shown. Active genes and regulatory elements (LCR) are shown in red. Red arrows show DNase I hypersensitive sites. Repressed genes and inactive regulatory elements are shown in blue. Insulators are shown by yellow diamonds

The transcriptional activity and general sensitivity of chromatin domains to DNase I was found to correlate with high levels of histone acetylation (Forsberg et al. 2000; Hebbes et al. 1988, 1994). The causal relationship between histone acetylation and generalized DNase I sensitivity of chromatin was demonstrated by Krajewski and Becker (1998). Consequently, histone acetylation was seen as a mechanism controlling the transcriptional status of chromatin domains (Eberharter and Becker 2002). Furthermore, the analysis of histone acetylation profiles allowed chromatin domains to be recognized as functional domains of the genome (Forsberg and Bresnick 2001). Henceforth, chromatin domains regardless of their loop structure are commonly considered to correspond to structural-functional units of the genome as postulated by the domain model.

The domain model of eukaryotic genome organization describes coregulation of gene clusters and switching between gene activation and repression by assuming that the domain as a whole constituted a primary unit of transcriptional regulation (Razin et al. 2007). The model postulated the existence of both domain-level regulatory elements and well-defined domain border elements and thus prompted several researchers to identify regulatory elements controlling the chromatin status of genomic domains. These studies resulted in identification of domain-bordering elements (insulators) (Kellum and Schedl 1991, 1992; Udvardy et al. 1986), nuclear scaffold/matrix attachment regions (S/MARs) (Cockerill and Garrard 1986; Mirkovitch et al. 1984), and locus control regions (LCRs) (Forrester et al. 1987, 1990; Grosveld et al. 1987; Li et al. 1990). The domain of human beta-globin genes represents a typical example of a structural-functional genomic unit considered by the domain model. This domain harbors several globin genes along with an upstream LCR flanked by insulators. In erythroid cells, the whole domain resides in a DNase I-sensitive configuration. However, individual globin genes present within the domain are transcribed in a developmental stage-specific manner. In non-erythroid cells, the whole domain resides in a DNase I-resistant configuration. Hence, the transcription of genes within the domain is activated in two steps: Firstly, activation of the domain as a whole in erythroid lineage cells makes all genes accessible for trans-acting factors; secondly, the individual genes are activated in a developmental stage-specific fashion. Interestingly, the timing of domain replication reveals lineage-specific switches of late-replication in non-erythroid cells to early-replication in erythroid cells (Forrester et al. 1990). Beta-globin gene domains within other vertebrates are organized similarly to the human domain (reviewed in Razin et al. 2003).

The domain model of eukaryotic genome organization was based on studies performed in a limited number of animal models (beta-globin gene domains of mammals and birds, an ovalbumin gene domain of birds, a chicken lysozyme gene domain, histone and heat shock gene loci of Drosophila); however, the deductions were extended to the entire genome. Further studies of numerous other domains led to the evolution of the model. Indeed, organization of the alpha-globin gene clusters of birds and mammals did not fit the predictions of the domain model.

The tissue-specific alpha-globin genes are located in an area also harboring housekeeping genes. In contrast to the beta-globin genes, alpha-globin genes reside in a DNase I-sensitive chromatin domain in both erythroid and non-erythroid cells (Craddock et al. 1995; Vyas et al. 1992). The major regulatory element of the alpha-globin gene cluster is situated in an intron of a housekeeping gene located upstream to the cluster (Jarman et al. 1991; Vyas et al. 1995). Furthermore, housekeeping genes were found in a few other tissue-specific gene domains (Chong et al. 2002).

To update the domain model of eukaryotic genome organization, the concept of “strong” and “weak” genomic domains was incorporated (Dillon and Sabbatini 2000) (reviewed in Razin et al. 2003). The weak domains such as those harboring the vertebrate alpha-globin genes were first considered to be an exception to the general rule, but subsequent studies have since shown that the typical eukaryotic genome is exceedingly constituted of these domains. Indeed, even in the “classical” mammalian beta-globin gene domains, the borders of functional domains (that include genes and regulatory modules) do not correspond to the borders of the region demonstrating lineage-dependent sensitivity to DNase I (Bulger et al. 2000; Schubeler et al. 2000). It is noteworthy that most known genes are housekeeping genes which are expressed in cells of different lineages; thus, they reside permanently in DNase I-sensitive chromatin domains. The expansion of genome-scale techniques in the twenty-first century has provided us with novel information on genome organization and folding. Below, we shall further review the evolution of the domain model of the eukaryotic genome organization alongside the progression of genome-wide data.

Structural units and domains of eukaryotic chromosomes

With the advent of genome-wide analysis techniques, the existence of chromatin domains was studied many times over, first with techniques that addressed the linear organization of the genome (ChIP-chip, ChIP-seq, transcriptome and exome analysis, genome-wide analysis of generalized DNase I sensitivity and DNaseI hypersensitive sites etc.) and then with increased precision, the 3D folding of the genome. We shall first consider the linear organization of the genome.

During the early times of the domain model, only two major types of chromatin were considered: the active and the repressed chromatin. Analysis of generalized sensitivity to DNase I was used to discriminate between genes present in active and repressed chromatin domains (Jantzen et al. 1986; Lawson et al. 1982; Weintraub and Groudine 1976; Weintraub et al. 1981). Recently, a strategy for the genome-wide analysis of generalized sensitivity to DNase I was developed (Milon et al. 2014). Using this strategy, it was demonstrated that the entire Drosophila genome illustrates a mosaic of interspersed DNase I-sensitive and DNase I-resistant segments (Milon et al. 2014).

Following the proposition of the histone code hypothesis (Strahl and Allis 2000), many histone marks corresponding to various chromatin states were extensively studied. Genome-wide combinatorial distribution analysis of histone modifications allowed identification of several types of distinct active and inactive chromatin regions (Filion et al. 2010; Kharchenko et al. 2011). Consequently, it became possible to distinguish several types of chromatin domains (Bickmore and van Steensel 2013). These include repressive polycomb domains (i.e., domains harboring genes inactivated in the course of cell differentiation) characterized by the presence of H3K27me3 (Tolhuis et al. 2006) and large organized chromatin K9-modifications (LOCKs) domains which may provide a cell type-heritable mechanism for phenotypic plasticity and are characterized by the presence of H3K9me2/3 (Wen et al. 2009). Domains associated with the nuclear lamina (LADs) (Guelen et al. 2008; Kind et al. 2013; Kind and van Steensel 2010; Meuleman et al. 2013) harbor mostly inactive genes.

Discrete genomic domains can be identified by analyzing the timing of replication. Extended early-replication and late-replication domains approximately correspond to active and repressed chromatin domains (Farkash-Amar and Simon 2010; Gilbert 2002). A more detailed analysis allowed the identification of four types of replication domains that colocalize with specific types of chromatin domains (Julienne et al. 2013). The Type I early-replicating domains correspond to active chromatin and are characterized by the presence of H3K4me3, H3K79me2, and H3K36me3, as well as by the presence of RNA polymerase II (RNApol II) and CTCF associated with chromatin. The intermediate- and late-replicating domains harbor different classes of repressed chromatin: facultative heterochromatin repressed by polycomb (Type 2), repressed chromatin without any distinctive histone modifications (LADs) (type 3), and classical constitutive heterochromatin containing H3K9me3 and HP1 (Type 4) (Julienne et al. 2013). The average sizes of these four types of replicative/chromatin domains vary significantly (275, 228, 325, and 718 Kb for types 1, 2, 3, and 4, respectively; Julienne et al. 2013). Of note, domains that switch their chromatin status in a lineage-specific manner, namely, active domains and domains repressed by polycomb, appear to correspond well to model domains considered at the early times of the domain model of eukaryotic genome organization.

The abovementioned domains were identified while studying linear (1D) genome organization. However, genomes are folded into a complex 3D structure and compacted over ×10,000 to fit into the nucleus. An assortment of methods based on chromosome conformation capture (3C) (Dekker et al. 2002) allowed for a genome-wide analysis of spatial proximity between various genomic regions (Belton et al. 2012; de Wit and de Laat 2012; Gibcus and Dekker 2013). Studies performed in mammals (Dixon et al. 2012; Lieberman-Aiden et al. 2009; Nora et al. 2012; Rao et al. 2014) and Drosophila (Hou et al. 2012; Sexton et al. 2012) demonstrated that the genome was partitioned into a set of self-interacting domains called topologically-associating domains (TADs). The average size of TADs constitutes ~100 Kb in Drosophila and ~1 Mb in mammals. However, TADs are often hierarchical, i.e., composed of smaller self-interacting domains separated by weak boundaries (Filippova et al. 2014; Sexton et al. 2012; Ulianov et al. 2015b; Weinreb and Raphael 2015).

Linear or three-dimensional?

Researchers engaged in genomics usually consider linear distances along DNA (i.e., in Kb). Consequently, different types of domains (either functional or structural) are also considered as linear, and they are represented as such in different genomic browsers. However, current studies demonstrate that the genome is spatially organized in a functionally dependent way (reviewed in Dekker and Mirny 2016; Rowley and Corces 2016). In particular, remote regulatory elements may be clustered in the nuclear space or even assembled in a common regulatory complex (reviewed in Bickmore 2013; de Laat and Grosveld 2003; Pombo and Dillon 2015; Razin et al. 2013; Sexton and Cavalli 2015). Consequently, genomic domains that are split on DNA may regain their integrity due to their spatial organization within the nuclear space. Enhancers usually harbor clusters of binding sites of various transcription factors. Even if binding of transcription factors to DNA is not very strong, the presence of several recognition sequences may lead to an increase of a local concentration of transcription factors in the vicinity of an enhancer; ultimately shifting the equilibrium between binding and dissociation towards binding. In fact, this mechanism does not require the presence of binding sites for transcription factors on the same DNA chain, only their spatial proximity appears to be important. Recent data strongly support a model where spatial organization of the genome occurs via looping between remote regulatory elements and their targets (Fanucchi et al. 2013; Mifsud et al. 2015; Tang et al. 2015). The role of this organization cannot be reduced to simple activation of a single gene by a remote regulatory element. Indeed, disruption of a single spatial contact within a complex of coregulated genes results in downregulation of all genes constituting the complex. Thus, the integrity of a spatial domain as a whole is important for the transcriptional regulation of all genes involved (Fanucchi et al. 2013).

Similar to split functional domains, the split chromatin domains may reassemble in the nuclear space. This can be demonstrated by an example of the mouse beta-globin gene domain. One of the regulatory modules of this domain (HS −62) is located within the array of inactive olfactory receptor genes ~62 Kbs upstream to the beta-globin gene cluster. In erythroid cells, HS −62 and LCR have a high level of histone acetylation. However, the chromatin area that separates these two regulatory elements is not highly acetylated. In the assembled active chromatin hub (de Laat and Grosveld 2003; Tolhuis et al. 2002), this fragment is looped out and HS −62, LCR, and the transcribed beta-globin genes that harbor hyperacetylated histones become located next to each other. Another example of the assembly of a spatial structural/functional genomic domain is provided by the murine HoxD gene locus. In mice, the expression of HoxD genes is controlled by remote enhancers that are spread over an extended (800 Kb) gene desert (Montavon et al. 2011). These remote regulatory modules form lineage-specific contacts with each other and with target genes resulting in the assembly of 3D regulatory domains which the authors refer to as “regulatory archipelagos” (Montavon et al. 2011). The principle of spatial assembly of split structural/functional domains in the 3D nuclear space is shown in Fig. 2.

Fig. 2
figure 2

A chromatin domain may not correspond to a continuous region on a linear chromatin fiber. Upper panel,  Map of a hypothetic genomic domain composed of three genes and two enhancers. The curve above the map demonstrates the profile of histone acetylation that is also illustrated by the intensity of red color of the line representing a linear chromatin fiber. The genes and enhancers are bound to highly acetylated histones while intermediate segments of chromatin do not contain highly acetylated histones. Lower panel, Within the cell nucleus, all acetylated regions of the domain shown in the upper section are assembled into a single hyperacetylated spatial chromatin domain

Splitting of a functional genomic domain into several blocks separated on a linear DNA molecule may be a consequence of integration of mobile elements and genome rearrangements (see Ulianov et al. 2015a for an extended discussion). However, it may also be advantageous as 3D combination of various regulatory elements could possibly provide additional opportunities for lineage-specific gene regulation or for the adaptation of a regulatory system to a changing environment.

Of note, the same genomic segment can acquire different 3D configurations in cells of different lineages or simply in different cells within a population (for example, depending on the stage of the cell cycle). The developmental stage-specific spatial configuration of the mouse beta-globin domain has long been described (de Laat and Grosveld 2003; Tolhuis et al. 2002). In this case, the functional relevance of various spatial configurations of the domain is quite obvious as the globin genes transcribed at each particular developmental stage are recruited to the active chromatin compartment. Similarly, different variants of Epstein-Barr virus latency programs were found to depend on the assembly of alternate spatial configurations of the viral genome (Tempera et al. 2011). The choice between these configurations is controlled by CTCF (Chen et al. 2014). In the both abovementioned cases, the alternate spatial configurations of the genomic domains were attributed to different cell populations. However, various genomic domains, including that of chicken alpha and beta-globin genes, can adopt various spatial configurations in the same cellular populations (Philonenko et al. 2009; Ulianov et al. 2012). Evidently, this allows alternate activation of different genes by the same enhancer element. In cultured chicken erythroblasts stimulated toward terminal erythroid differentiation, the assembly of two alternate spatial regulatory complexes was observed. One of them appears to be necessary for active transcription of a subset of globin genes (Gavrilov and Razin 2008), while the other activated the TMEM8 gene (Philonenko et al. 2009). Interestingly, these two distinct activator complexes share some regulatory elements and therefore simply cannot be assembled simultaneously in the same chromosome (Philonenko et al. 2009). Other studies performed on different experimental models strongly support the notion that spatial organization of the genome significantly differs in different cells within the same population. Likewise, in an NF-kB-regulated complex of coexpressed genes (Papantonis et al. 2010), the involved genes colocalize in the nuclear space and are essentially transcribed only in a small portion of cells in the population (Fanucchi et al. 2013). Similar observations were made when patterns of colocalization of various erythroid genes in individual cells were analyzed using 3D-FISH (Schoenfelder et al. 2010). It seems feasible that the 3D organization of the genome constantly changes thus providing a possibility for sorting and selection of optimal variants.

The question of how much the implementation of these alternative configurations correlates with changes in the structure of chromatin domains requires further study. Progress in this area of research will be determined by modeling 3D chromatin domains on the available epigenetic data. The first attempt to develop software for the prediction of 3D configurations of genomic domains based on 1D epigenetic profiles has already been reported (Zhu et al. 2016).

The current domain model suggests that histone modifications spread progressively along the chromatin fiber resulting in the expansion of various types of chromatin domains (Bannister and Kouzarides 2011; Bannister et al. 2001). However, the same mechanisms of chromatin domain expansion may also operate in the 3D space if chromatin fibers are located close enough to each other. In this case, a spherical domain of modified histones will be formed (for an extended discussion, see Razin and Gavrilov 2014). This domain may appear either continuous or discontinuous on a linear chromatin fiber, depending on the pattern of DNA packaging. The shape and size of the spatial chromatin domain will be determined by characteristics of self-assembled nucleosome conglomerates (Maeshima et al. 2016).

Topologically associating domains are the spatial units of the 3D genome

Hi-C data analysis performed on various organisms suggests that chromosomes of yeast, Drosophila, and mammals are organized into self-interacting chromatin domains which are referred to as topologically associating domains or TADs (Dixon et al. 2012; Hou et al. 2012; Hsieh et al. 2015; Sexton et al. 2012; Zhang et al. 2012). TADs are characterized by an increased level of intra-TAD spatial contacts and decreased level of inter-TAD spatial contacts (Lieberman-Aiden et al. 2009). TADs are usually identified with chromatin globules (Lieberman-Aiden et al. 2009), such as chromatin domains (CDs) reported by Cremer’s group (Cremer and Cremer 2001; Markaki et al. 2010; Smeets et al. 2014). 3D-FISH analysis of mutual positions of genomic segments located within the same and neighboring TADs indirectly supports this interpretation (Dixon et al. 2012; Fabre et al. 2015; Nora et al. 2012). TADs are larger in mammals than in Drosophila. However, in both organisms, TADs are hierarchical (i.e., composed of smaller self-interacting domains) (Fraser et al. 2015; Ulianov et al. 2015b; Weinreb and Raphael 2015). Profiles of chromosomes partitioning into TADs were reported to be cell lineage-independent and, furthermore, evolutionary conserved within syntenic regions (Dixon et al. 2012; Vietri Rudan et al. 2015). However, the degree of similarity of TAD profiles between cells of different lineages is limited and does not exceed 60–80 % in mammals (Dixon et al. 2012; Fraser et al. 2015) and 40–50 % in Drosophila (Hou et al. 2012; Ulianov et al. 2015a). Furthermore, the internal structure of TADs is not rigid. It undergoes certain changes in the course of cell differentiation and in response to stress (Bau et al. 2011; Berlivet et al. 2013; Dixon et al. 2015; Li et al. 2015). Nonetheless, organization of chromosomes into TADs restricts spatial contacts between remote genomic regions. Most of the observed contacts occur within TADs (Gibcus and Dekker 2013; Matharu and Ahituv 2015).

It is not yet clear how two adjacent TADs are separated. Our recent data strongly suggest that, in Drosophila, assembly of TADs is directed by internucleosomal contacts, and TAD separation occurs due to positioning of transcriptionally active chromatin segments between TADs. These segments are less likely to establish internucleosomal interactions since they are highly acetylated (Ulianov et al. 2015a). The folding of nucleosomal arrays into secondary chromatin structures and self-association of arrays into higher-order tertiary structures is an inherent property of chromatin fibers (Blacketer et al. 2010; Hansen 2002; Maeshima et al. 2016). This follows from the ability of nucleosomes to establish contacts via interaction of positively charged tails of histones H3 and H4 with the acidic patch on the surface of a nucleosomal globule (Kalashnikova et al. 2013; Pepenella et al. 2014; Schalch et al. 2005; Sinha and Shogren-Knaak 2010) and with negatively charged DNA (Arya and Schlick 2006). These interactions are adjusted by modifications of histone tails, in particular by acetylation (Allahverdi et al. 2011; Pepenella et al. 2014; Shogren-Knaak et al. 2006), which interferes with the establishment of internucleosomal contacts. In Drosophila, the TAD borders and inter-TAD regions harbor housekeeping genes and are enriched in active chromatin marks (Hou et al. 2012; Sexton et al. 2012; Ulianov et al. 2015a). We have proposed that clusters of housekeeping genes are located in spacers between TADs simply because they cannot be packed into compact structures due to a high level of histone acetylation (Ulianov et al. 2015b). This model was confirmed by computer simulations. A virtual polymer composed of 19 blocks of 500 nucleosomes capable of establishing saturating contacts (“inactive chromatin”) and 50 nucleosomes incapable to establishing such contacts (active chromatin) collapsed in a series of condensed blobs (structures similar to TADs) consisting mostly of inactive nucleosomes separated by spacers of active nucleosomes. Of note, 3D configurations of this polymer observed in 12 individual simulations significantly differed. However, averaging of the results of these individual simulations allowed the acquisition of a regular pattern of compact “TADs” separated by extended “inter-TADs” (Ulianov et al. 2015b). To this end, it should be stressed that the C-methods currently used for the analysis of 3D genome organization (de Wit and de Laat 2012) give only an integrated view of millions of cells. No high-resolution single-cell TAD maps have been reported so far. The results of our computer simulations suggest that TAD profiles in individual cells may differ from each other and from the integral picture that is obtained upon analyzing a cellular population.

Stochastic factors clearly play an important role in establishing the 3D organization of the genome (Kang et al. 2011). Self-organization of chromatin fiber into TADs and spatial restoration of otherwise split functional units of the genome are likely to happen simultaneously. They contribute to a pattern that may vary significantly within different cells of a population. Indeed, the 3D global structure of X-chromosome differ significantly in individual mouse cells, as shown by modeling based on low-resolution single-cell Hi-C data (Nagano et al. 2013). The distances between individual alleles located both inside a single chromosomal territory and in different chromosomal territories also vary (Kawamura et al. 2012; Kozubek et al. 2002; Schoenfelder et al. 2010).

In mammals, average TADs are larger than in Drosophila, and the mechanism of their assembly appears to be more sophisticated. Specifically, the extrusion of DNA loops constrained by CTCF/cohesion complexes is likely to shape the TAD profiles (Alipour and Marko 2012; Fudenberg et al. 2015; Sanborn et al. 2015). Even so, the assembly of compact chromatin blobs within the extruded DNA loops is likely to be governed by internucleosomal interactions.

Are there functional domains in the eukaryotic genome?

The finding that many tissue-specific functional gene domains do not change their sensitivity to DNase I in a lineage-specific manner puts forward a question about a suitable definition of these domains. Intuitively, one can define a functional genomic domain as a genomic segment harboring one or several genes along with the regulatory elements controlling the expression of these genes. In principle, any housekeeping gene along with regulatory elements in a CpG island would fit this definition. However, the term “functional domain” usually designates an extended genomic region such as mammalian beta-globin gene domain or Hox gene clusters. Yet, the minimal size of this region remains undefined. The clusters of coexpressing genes which (clusters) may be fairly long (Boutanaev et al. 2002; Spellman and Rubin 2002) also fit a definition of a functional domain assuming that these genes share at least some common regulatory modules. Interestingly, in Drosophila, these clusters are the same size as TADs. The clusters of highly expressed genes described in human and mouse also may be related to functional domains (Caron et al. 2001; Mijalski et al. 2005). Indeed, all genes present in such domains are influenced by the same regulatory mechanism. Transgenes randomly inserted within a cluster of highly expressed genes acquire a high expression level typical of the whole cluster (Mijalski et al. 2005). Using random insertion of a construct containing a reporter gene controlled by a minimal promoter over large genomic areas, Symmons et al. demonstrated the existence of functional genomic domains, designated by the authors as “regulatory domains,” harboring tissue-specific genes which impose tissue-specific expression patterns on the transgenes inserted into these domains (Symmons et al. 2014). Other authors suggested to name these large functional genomic domains “regulatory archipelagos” (Lonfat et al. 2014; Montavon et al. 2011) or “regulatory landscapes” (Spitz et al. 2003; Zuniga et al. 2004). The difference of all these domains from those that were studied 20 years ago is predominantly their size and complexity. Regulatory domains (landscapes) are not necessarily discontinuous and may overlap. The same genomic region may contain several non-related regulatory domains. The integrity of these domains composed of several split regions is only reestablished as a result of 3D folding (Lonfat et al. 2014). The identity of regulatory domains is determined by the coordinated action of numerous enhancers that may even be situated outside these domains (Marinic et al. 2013; Symmons and Spitz 2013; Symmons et al. 2014). Indeed, it is currently considered that tissue-specific gene expression is determined by enhancers rather than promoters (Heintzman et al. 2009). There are ~400,000 enhancers in the human genome (Consortium et al. 2012), 50,000–100,000 in Drosophila genome (Arnold et al. 2013), and 2300 in the genome of Caenorhabditis elegans (Chen et al. 2013). Taking into account that the known number of human genes is ~ 25,000, the expression of an average human gene might be controlled by 10–20 enhancers, and an enhancer can control the expression of more than one gene. Thus, the number of possible combinations further increases. Certainly, gene expression of higher eukaryotes is controlled by a complex network of regulatory elements that constitutes a basis of the regulatory domains. It appears that the above regulatory domains/archipelagos are very similar if not identical to the “weak domains” discussed above (Dillon and Sabbatini 2000; Razin et al. 2003). Interestingly, a naturally occurring repositioning of a non-related gene to the tissue-specific weak domain of alpha-globin genes resulted in acquisition of the expression profile typical for the domain (Philonenko et al. 2009). This naturally occurring change of tissue specificity of a repositioned gene reinforces the observations reported by Symmons et al. (2014).

The next important question is whether there is a correlation between functional and structural units of the 3D genome. Current evidence suggests that such a correlation does exist. Indeed, coordinated changes of gene expression occur within genomic segments that colocalize with replication domains and LADs (Letourneau et al. 2014). Regulatory domains and replicating domains also appear to colocalize with TADs (Dileep et al. 2015b; Le Dily et al. 2014; Le Dily and Beato 2015; Lonfat and Duboule 2015; Nora et al. 2012; Pope et al. 2014; Remeseiro et al. 2015; Symmons et al. 2014). Furthermore, replication programs and organization of chromosomes into TADs appear to be reestablished simultaneously after mitosis (Dileep et al. 2015a). Finally, disruption of TADs, in particular fusion of the neighboring TADs, results in dramatic changes of promoter-enhancer interactions and may cause various diseases (Ibn-Salem et al. 2014; Ji et al. 2016; Lupianez et al. 2015, 2016; Petrov et al. 2006, 2008). Thus, organization of the genome into spatial domains is directly related to the functioning of the genome (Pombo and Dillon 2015; Sexton and Cavalli 2015).

Concluding remarks

There are not many concepts in modern molecular biology that have remained valid for several decades. The emergence of modern genomics, which is based on high throughput genome-wide analysis, prompted us to reconsider many hypotheses and models. Some of them have been forgotten, while others were updated and improved. We believe that the domain model of eukaryotic genome organization belongs to the latter group. The basic postulate of this model, namely, the assumption that the genome can be subdivided into structural-functional units, remains authentic. However, our knowledge about the nature of these domains has changed significantly. Fundamentally, the understanding that spatial organization of the genome plays an important role in regulation of gene expression is new. Consequently, 3D genomics and 3D chromatin domains is becoming the trend of the time.