Facioscapulohumeral muscular dystrophy (FSHD) is linked to the polymorphic D4Z4 locus on chromosome 4q35. In non-affected individuals, this locus comprises 10-100 tandem copies of members of the 3.3kb dispersed repeat family. Deletions leaving 1-8 such repeats have been associated with FSHD, for which no candidate gene has been identified. We have determined the complete nucleotide sequence of a 13.5kb EcoRI genomic fragment comprising the only two 3.3kb elements left in the affected D4Z4 locus of a patient with FSHD. Sequence analyses demonstrated that the two 3.3kb repeats were identical. They contain a putative promoter that was not previously detected, with a TACAA instead of a TATAA box, and a GC box. Transient expression of a luciferase reporter gene fused to 191bp of this promoter, demonstrated strong activity in transfected human rhabdomyosarcoma TE671 cells that was affected by mutations in the TACAA or GC box. In addition, these 3.3kb repeats include an open reading frame (ORF) starting 149bp downstream from the TACAA box and encoding a 391 residue protein with two homeodomains (DUX4). In-vitro transcription/translation of the ORF in a rabbit reticulocyte lysate yielded two (35)S Cys/ (35)S Met labeled products with apparent molecular weights of 38 and 75kDa on SDS-PAGE, corresponding to the DUX4 monomer and dimer, respectively. In conclusion, we propose that each of the 3.3kb elements in the partially deleted D4Z4 locus could include a DUX4 gene encoding a double homeodomain protein.
Sequence-specific DNA binding transcription factor activitydefinition[GO:0003700]‹silver
Interacting selectively and non-covalently with a specific DNA sequence in order to modulate transcription. The transcription factor may or may not also interact selectively with a protein or macromolecular complex.
IEAInterPro 2 GO
Transcription regulatory region sequence-specific DNA bindingdefinition[GO:0000976]‹silver
Interacting selectively and non-covalently with a specific sequence of DNA that is part of a regulatory region that controls transcription of that section of the DNA. The transcribed region might be described as a gene, cistron, or operon.
The cellular synthesis of RNA on a template of DNA.
IEAUniProtKB KW
Note
DUX genes are present in 3.3-kilobase elements, a tandem repeat family scattered in the genome found on the short arms of all acrocentric chromosomes as well as on several other chromosomes.
Protein involved in the transfer of genetic information from DNA to messenger RNA (mRNA) by DNA-directed RNA polymerase. In the case of some RNA viruses, protein involved in the transfer of genetic information from RNA to messenger RNA (mRNA) by RNA-directed RNA polymerase.
A reference proteome is a set of protein sequences derived from a complete proteome which constitutes a defined standard for a particular user community. Reference proteomes are manually defined according to a number of criteria. They cover the proteomes of well- studied model organisms and other proteomes of interest for biomedical and biotechnological research. Reference proteomes have been selected to provide broad coverage of the tree of life, and constitute a representative cross-section of the taxonomic diversity to be found within UniProtKB.