Introduction

The recent excitement generated by the analysis of human cancer genomes is tempered by the uncovered regulatory complexity. Interactions between heterogeneous multiprotein complex and genetic and regulatory modules form networks unique not only to different tumor types, but also, surprisingly, to similar type tumors generating molecular/cellular heterogeneity that ultimately control cancer cell proliferation and invasion, endowing tumors with robustness. It is, however, unknown how cancer-specific modules and their networks determine tumor behavior, or how they directly or indirectly reorganize signaling complexes into novel networks even within the same tumor, making tumor treatment problematic. Experimental and theoretical data from high-throughput studies suggest that the key lies in characterizing cancer-specific protein complex modules, identifying common nodes, that is, the key proteins (oncogenes or other molecules) shared by different complex modules and networks, and in dissecting adaptive, inter-module functional communication in the control of tumor growth by rearrangement of protein complex modules.

Altered genetic and epigenetic networks in cancer cells are organized in functional modules (1) and result in permanent changes in the types, numbers, and interactions of protein complex genes that endow tumors with heterogeneity and robustness, and largely determine disease progression. Robustness as used above is defined as the ability of genes to resist changes in function from perturbations in individual components (2,3). They originate from mutated/overexpressed oncogenic or tumor suppressor molecules (Figure 1) in key pathways that control cell and tissue growth, such as the cell cycle (4), signal transduction pathways (5), or chromatin remodeling pathways (6,7). Genomic, proteomic, and other high-throughput sequencing efforts are revealing an unprecedented complexity and heterogeneity of human cancer genomes (8,9).

Figure 1
figure 1

Model of hierarchical protein complex, gene, and regulatory module organization. Mutations and epimutations in “cancer genes” give rise to molecules (proteins and so on) that form altered multi-molecular complex functional modules. These are, in turn, reorganized into novel, tumor, stage, or even sample-specific, autonomous, physical, as well as temporal growth-promoting networks that control tumor tissue properties, such as proliferation and invasion.

No general principles have emerged that describe heterogeneity (10), and most methods currently in use, such as polymerase chain reaction (PCR), derive their utility from their clonal nature, which, although permitting genome-wide, high-throughput analyses of many samples, are limited by sensitivity, sample size, and noise. Recent computational and experimental results on yeast protein network architecture (11) open the possibility to formulate general experimental hypotheses that bridge theory and experiment in identifying cancer pathways that drive overall tumor behavior. Central to such hypotheses will be the identification of key node proteins (nodes) in different protein modules that permit inter-module communication between different cell populations and their networks (11). In this commentary, we discuss sources of human solid tumor heterogeneity, robustness, and their dependence on the existence of differently reorganized complexes and networks, using the example of epidermal growth factor receptor (EGFR)/MET complex reorganization in gefinitib resistance.

The Nature of Tumor Heterogeneity and Robustness

High-throughput gene (12,13) and protein expression data, as well as massively parallel signature sequencing (MPSS) which reveals the mutation landscape of tumors, generate a complex picture that can be analyzed in terms of genetic and transcriptional regulatory modules, that is, sets of transcription factors that regulate these gene modules (14,15) in a temporal and signal-dependent manner. Most solid human tumors are communities of several different types of cells with varied gene and protein expression profiles that generate hundreds of multimolecular complexes in cells, organized in functional signaling pathways (1618) (see Figure 1). Tumor sample heterogeneity also is reflected by the presence of different putative tumor driver mutations in different specimens in the same tumor samples (8). Heterogeneity within the same tumor is more clearly apparent in data from high-throughput, targeted sequencing of specific kinase families in different tumor types. For example, genome-wide sequencing of samples from 72 breast tumors and nine cell lines reveal that only 6 out of 518 kinases bear non-synonymous, putative driver mutations (19). Similar results were obtained from lung cancer data (1921) and testicular tumors (22).

Reducing Heterogeneity of Pathways and Genes to Genetic and Regulatory Modules

Reconstructing cancer pathways (23) addressed by genome-wide approaches is a formidable task (24,25). However, as mentioned in the previous section, these studies also reveal that, with few exceptions, profiles and distribution of mutations are variable, depending on sample, tumor type, and, within the same tumor type, on whether single cells or the gross tumor has been analyzed (8). Data also is confounded by noise and limited reproducibility, the non-concordance between mRNA, encoded protein, or small molecule (miRNA and so on) expression and processing. Genetic and regulatory modules are the units of cell- and tissue-level biological functions (26,27) and, in the long term, into targetable molecular mechanisms (28). Therefore, pattern identification relies on extracting coherent gene expression changes and on inferring the operative transcriptional and other cellular modules (26,27). More recent approaches attempt to integrate multiple data sources, including patient survival, with gene profile signatures in wound repair (29), to compile molecular concepts maps (MCMs) from gene/protein annotations, and to derive regulatory networks and expression profiles computationally (30), or to compare coherent sets of genes that already are mapped onto cancer pathways for association with either a single gene profile in microarrays or “phenotypes” using Kolmogorov-Smirnov sum statistics (31,32). Another approach uses “cancer gene and module compendium” from different studies to uncover shared genes and gene modules, and uses them to map onto and to predict clinical cancer properties (33,34). In many cases, use of gross tumor mass tissue results in an average profile that obfuscates the contribution of different cells and signals. Reduction of expression and other data to functional modules is more informative in simple model organisms, both because they are equipped with simpler genomes and because they consist of genetically similar or identical cells (less heterogeneous).

Identifying Key Protein Nodes in Complex and Functional Modules within Heterogeneous Tumors: Bridging Theory with Experiment

Although identification of gene and regulatory modules is revolutionizing our understanding of tumor growth, we are far from reconstructing shared molecular mechanisms in similar, let alone different, human cancers because there is a gap between experimental and theoretical approaches that define protein complex modules into realistic cancer cell networks. It is unknown how oncogene protein and other cancer protein complex modules are reorganized when tumors evolve, however, in spite of the complexity imposed by tumor cell genetic and biochemical heterogeneity, theoretical model building based on data from yeast interactomes and proteomes suggests that it is possible to deal with this issue by finding shared signatures from pure, isolated tumor cell populations. For example, analysis of the yeast protein interactome by Spirin and Mirny (11) revealed that two main modules exist: protein complex and functional modules with overlapping protein members in several growth-regulating pathways such as the cyclin-dependent kinase (CDK)/Cyclin; and mitogen-activated protein (MAP) kinase. This proposal remains purely hypothetical, however recent results on EGFR mutation-driven resistance to gefinitib in human lung cancers clearly support the idea that complex rearrangements are a major mechanism of robustness-driven resistance and growth within cells. EGFR mutations not affecting drug binding allow it to complex with the MET oncogene product, a receptor tyrosine kinase, thus permitting uncharacterized MET-driven growth pathways to sustain cancer cell survival.

Stable or transient multiprotein complex modules of known oncogene proteins, such as MET/ErbB3, form novel protein complex modules that cannot be predicted by any current high-throughput strategy, however, their central role in tumor robustness (35) makes it imperative that their topology and interactions in pure tumor cell populations be defined.

Conclusion

Recent experimental and theoretical results on biological networks, their topology, and characterization of oncogenic protein complex modules suggest that mapping oncogene products and other cancer-involved genes to actual pathways in simple, homogeneous cell populations determining tumor behavior is a desired approach to unravel tumor heterogeneity and vulnerability. How oncoprotein complex modules change in response to intratumor or tumor-stromal signaling, and how gene, protein, and regulatory modules change by rearranging polypeptide composition in response to disease progression remain unanswered questions, and are vital to successful, combination therapeutic interventions.