TY - JOUR
T1 - Motif grammar
T2 - The basis of the language of gene expression
AU - Nagy, Gergely
AU - Nagy, Laszlo
N1 - Funding Information:
The authors thank the members of the Nagy laboratory for discussions and comments on the manuscript. This work was supported by the National Research, Development and Innovation Office to the Nuclear Receptor Research Laboratory (KKP129909, K124298) and to G.N. (PD124843). In addition, G.N. is a recipient of the János Bolyai Research Scholarship of the Hungarian Academy of Sciences and supported by the ÚNKP-19-4-DE-173 New National Excellence Program of the Ministry of Human Capacities . L.N. is supported by the National Institutes of Health ( R01DK115924 ).
Publisher Copyright:
© 2020 The Authors
PY - 2020
Y1 - 2020
N2 - Collaboration of transcription factors (TFs) and their recognition motifs in DNA is the result of coevolution and forms the basis of gene regulation. However, the way how these short genomic sequences contribute to setting the level of gene products is not understood in sufficient detail. The biological problem to be solved by the cell is complex, because each gene requires a unique regulatory network in each cellular condition using the same genome. Thus far, only some components of these networks have been uncovered. In this review, we compiled the features and principles of the motif grammar, which dictates the characteristics and thus the likelihood of the interactions of the binding TFs and their coregulators. We present how sequence features provide specificity using, as examples, two major TF superfamilies, the bZIP proteins and nuclear receptors. We also discuss the phenomenon of “weak” (low affinity) binding sites, which appear to be components of several important genomic regulatory regions, but paradoxically are barely detectable by the currently used approaches. Assembling the complete set of regulatory regions composed of both weak and strong binding sites will allow one to get more comprehensive lists of factors playing roles in gene regulation, thus making possible the deeper understanding of regulatory networks.
AB - Collaboration of transcription factors (TFs) and their recognition motifs in DNA is the result of coevolution and forms the basis of gene regulation. However, the way how these short genomic sequences contribute to setting the level of gene products is not understood in sufficient detail. The biological problem to be solved by the cell is complex, because each gene requires a unique regulatory network in each cellular condition using the same genome. Thus far, only some components of these networks have been uncovered. In this review, we compiled the features and principles of the motif grammar, which dictates the characteristics and thus the likelihood of the interactions of the binding TFs and their coregulators. We present how sequence features provide specificity using, as examples, two major TF superfamilies, the bZIP proteins and nuclear receptors. We also discuss the phenomenon of “weak” (low affinity) binding sites, which appear to be components of several important genomic regulatory regions, but paradoxically are barely detectable by the currently used approaches. Assembling the complete set of regulatory regions composed of both weak and strong binding sites will allow one to get more comprehensive lists of factors playing roles in gene regulation, thus making possible the deeper understanding of regulatory networks.
KW - Basic leucine zipper
KW - Motif grammar
KW - Nuclear receptor
KW - Transcription factor
KW - Weak motif
UR - http://www.scopus.com/inward/record.url?scp=85088920358&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85088920358&partnerID=8YFLogxK
U2 - 10.1016/j.csbj.2020.07.007
DO - 10.1016/j.csbj.2020.07.007
M3 - Review article
C2 - 32802274
AN - SCOPUS:85088920358
SN - 2001-0370
VL - 18
SP - 2026
EP - 2032
JO - Computational and Structural Biotechnology Journal
JF - Computational and Structural Biotechnology Journal
ER -