WRKY transcription factor family


WRKY transcription factors are key regulators of many processes in plants. These include the responses to biotic and abiotic stresses, senescence, seed dormancy and seed germination and some developmental processes. Like many transcription factors, WRKY transcription factors are defined by the presence of a DNA-binding domain; in this case, it is the WRKY domain. The WRKY domain was named in 1996 after the almost invariant WRKY amino acid sequence at the N-terminus and is about 60 residues in length. In addition to containing the ‘WRKY signature’, WRKY domains also possess an atypical zinc-finger structure at the C-terminus. Most WRKY transcription factors bind to the W-box promoter element that has a sequence of TTGACC/T.

The History of WRKY Transcription Factors

In 1994 and 1995, the first two reports of WRKY transcription factors appeared. They described newly discovered but as yet ill-defined DNA binding proteins that played potential roles in the regulation of gene expression by sucrose or during germination. A third report appeared in 1996 that identified WRKY1, WRKY2 and WRKY3 from parsley. The authors named the new transcription factor family the WRKY family after a conserved amino acid sequence at the N-terminus of the DNA-binding domain. The parsley WRKY proteins also provided the first evidence that WRKY transcription factors play roles in regulating plant responses to pathogens. Numerous papers have now shown this to be a major function of WRKY transcription factors. Since these initial publications, it has become clear that the WRKY family is among the ten largest families of transcription factors in higher plants and that these transcription factors play key roles in regulating a number of plant processes including the responses to biotic and abiotic stresses, germination, senescence, and some developmental processes.

The Structure of the WRKY Domain

As soon as the WRKY domain was characterized, it was suggested that it contained a novel zinc finger structure and the first evidence to support this came from studies with 2-phenanthroline that chelates zinc ions. Addition of 2-phenenthroline to gel retardation assays that contained E. coli expressed WRKY proteins resulted in a loss of binding to the W-box target sequence. The other suggestion was that the WRKY signature amino acid sequence at the N-terminus of the WRKY domain directly binds to the W-box sequence in the DNA of target promoters. These suggestions were shown to be correct by publication of the solution structure of the C-terminal WRKY domain of the Arabidopsis WRKY4 protein. The WRKY domain was found to form a four-stranded β-sheet. Soon afterwards, a crystal structure of the C-terminal WRKY domain of the Arabidopsis WRKY1 protein was reported. This showed a similar result to the solution structure except that it may contain an additional β-strand at the N-terminus of the domain. From these two studies it appears that the conserved WRKYGQK signature amino acid sequence enters the major groove of the DNA to bind to the W-Box. Recently, the first structural determination of the WRKY domain complexed with a W-Box was reported. The NMR solution structure of the WRKY DNA-binding domain of Arabidopsis WRKY4 in complex with W Box DNA revealed that part of a four-stranded β-sheet enters the major groove of DNA in an atypical mode that the authors named the β-wedge, where this sheet is almost perpendicular to the DNA helical axis. As initially predicted, amino acids in the conserved WRKYGQK signature motif contact the W Box DNA bases mainly through extensive apolar contacts with thymine methyl groups. These structural data explain the conservation of both the WRKY signature sequence at the N-terminus of the WRKY domain and the conserved cysteine and histidine residues. It also provides the molecular basis for the previously noted remarkable conservation of both the WRKY amino acid signature sequence and the W Box DNA sequence.

The evolution of WRKY transcription factors

WRKY transcription factor genes are found throughout the plant lineage and also outside of the plant lineage in some diplomonads, social amoebae, fungi incertae sedis, and amoebozoa. This patchy distribution suggests that lateral gene transfer is responsible. These lateral gene transfer events appear to pre-date the formation of the WRKY groups in flowering plants, where there are seven well-defined groups, Groups I + IIc, Groups IIa + IIb, Groups IId + IIe, and Group III. Flowering plants also contain proteins with domains typical for both resistance proteins and WRKY transcription factors. R protein-WRKY genes have evolved numerous times in flowering plants, each type being restricted to specific flowering plant lineages. These chimeric proteins contain not only novel combinations of protein domains but also novel combinations and numbers of WRKY domains.

External Links and Useful Resources

*
*
*
*
*