One of the multiple factors required for polyadenylation and 3'-end cleavage of mammalian pre-mRNAs. This subunit is directly involved in the binding to pre-mRNAs (By similarity).
Polyadenylation of mammalian mRNA precursors requires at least two signal sequences in the RNA: the nearly invariant AAUAAA, situated 5' to the site of polyadenylation, and a much more variable GU- or U-rich downstream element. At least some downstream sequences are recognized by the heterotrimeric polyadenylation factor CstF, although how, and indeed if, all variations of this diffuse element are bound by a single factor is unknown. Here we show that the RNP-type RNA binding domain of the 64-kDa subunit of CstF (CstF-64) (64K RBD) is sufficient to define a functional downstream element. Selection-amplification (SELEX) experiments employing a glutathione S-transferase (GST)-64K RBD fusion protein selected GU-rich sequences that defined consensus recognition motifs closely matching those present in natural poly(A) sites. Selected sequences were bound specifically, and with surprisingly high affinities, by intact CstF and were functional in reconstituted, CstF-dependent cleavage assays. Our results also indicate that GU- and U-rich sequences are variants of a single CstF recognition motif. For comparison, SELEX was performed with a GST fusion containing the RBD from the apparent yeast homolog of CstF-64, RNA15. Strikingly, although the two RBDs are almost 50% identical and yeast poly(A) signals are at least as degenerate as the mammalian downstream element, a nearly invariant 12-base U-rich sequence distinct from the CstF-64 consensus was identified. We discuss these results in terms of the function and evolution of mRNA 3'-end signals.
Interacting selectively and non-covalently with a nucleotide, any compound consisting of a nucleoside that is esterified with (ortho)phosphate or an oligophosphate at any hydroxyl group on the ribose or deoxyribose.
Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
Evidence
1:
Inferred from Physical InteractionIntAct
Polyadenylation of messenger RNA precursors requires a complex protein machinery that is closely integrated with the even more complex transcriptional apparatus. Here a polyadenylation factor, CstF-50 (cleavage stimulation factor), is shown to interact in vitro and in intact cells with a nuclear protein of previously unknown function, BRCA1-associated RING domain protein (BARD1). The BARD1-CstF-50 interaction inhibits polyadenylation in vitro. BARD1, like CstF-50, also interacts with RNA polymerase II. These results indicate that BARD1-mediated inhibition of polyadenylation may prevent inappropriate RNA processing during transcription, perhaps at sites of DNA repair, and they reveal an unanticipated integration of diverse nuclear events.
Evidence
2:
Inferred from Physical InteractionIntAct
Evidence for Iso 1
Tight connections exist between transcription and subsequent processing of mRNA precursors, and interactions between the transcription and polyadenylation machineries seem especially extensive. Using a yeast two-hybrid screen to identify factors that interact with the polyadenylation factor CstF-64, we uncovered an interaction with the transcriptional coactivator PC4. Both human proteins have yeast homologs, Rna15p and Sub1p, respectively, and we show that these two proteins also interact. Given evidence that certain polyadenylation factors, including Rna15p, are necessary for termination in yeast, we show that deletion or overexpression of SUB1 suppresses or enhances, respectively, both growth and termination defects detected in an rna15 mutant strain. Our findings provide an additional, unexpected connection between transcription and polyadenylation and suggest that PC4/Sub1p, via its interaction with CstF-64/Rna15p, possesses an evolutionarily conserved antitermination activity.
Evidence
3:
Inferred from Physical InteractionUniProtKB
In eukaryotes, the process of messenger RNA 3'-end formation involves endonucleolytic cleavage of the transcript followed by synthesis of the poly(A) tail. The complex machinery involved in this maturation process contains two proteins of the metallo-beta-lactamase (MBL) superfamily, the 73 and 100 kDa subunits of the cleavage and polyadenylation specificity factor (CPSF). By using an in vitro system to assess point mutations in these two mammalian proteins, we found that conserved residues from the MBL motifs of both polypeptides are required for assembly of the endonuclease activity that cleaves histone pre-mRNAs. This indicates that CPSF73 and CPSF100 act together in the process of maturation of eukaryotic pre-messenger RNAs, similar to other members of the MBL family, RNases Z and J, which function as homodimers.
Evidence
4:
Inferred from Physical InteractionUniProtKB
DEAD box proteins are putative RNA helicases that function in all aspects of RNA metabolism, including translation, ribosome biogenesis, and pre-mRNA splicing. Because many processes involving RNA metabolism are spatially organized within the cell, we examined the subcellular distribution of a human DEAD box protein, DDX1, to identify possible biological functions. Immunofluorescence labeling of DDX1 demonstrated that in addition to widespread punctate nucleoplasmic labeling, DDX1 is found in discrete nuclear foci approximately 0.5 microm in diameter. Costaining with anti-Sm and anti-promyelocytic leukemia (PML) antibodies indicates that DDX1 foci are frequently located next to Cajal (coiled) bodies and less frequently, to PML bodies. Most importantly, costaining with anti-CstF-64 antibody indicates that DDX1 foci colocalize with cleavage bodies. By microscopic fluorescence resonance energy transfer, we show that labeled DDX1 resides within a Förster distance of 10 nm of labeled CstF-64 protein in both the nucleoplasm and within cleavage bodies. Coimmunoprecipitation analysis indicates that a proportion of CstF-64 protein resides in the same complex as DDX1. These studies are the first to identify a DEAD box protein associating with factors involved in 3'-end cleavage and polyadenylation of pre-mRNAs.
Proc. Natl. Acad. Sci. U.S.A. 89, 1403-1407 (1992)[PubMed:1741396]
Cleavage stimulation factor is one of the multiple factors required for 3'-end cleavage of mammalian pre-mRNAs. We have shown previously that this factor is composed of three subunits with estimated molecular masses of 77, 64, and 50 kDa and that the 64-kDa subunit can be UV-crosslinked to RNA in a polyadenylylation signal (AAUAAA)-dependent manner. We have now isolated cDNAs encoding the 64-kDa subunit of human cleavage stimulation factor. The 64-kDa subunit contains a ribonucleoprotein-type RNA binding domain in the N-terminal region and a repeat structure in the C-terminal region in which a pentapeptide sequence (consensus MEARA/G) is repeated 12 times and the formation of a long alpha-helix stabilized by salt bridges is predicted. An approximately 270-amino acid segment surrounding this repeat structure is highly enriched in proline and glycine residues (approximately 20% for each). When cloned 64-kDa subunit was expressed in Escherichia coli, an N-terminal fragment containing the RNA binding domain bound to RNAs in a polyadenylylation-signal-independent manner, suggesting that the RNA binding domain is directly involved in the binding of the 64-kDa subunit to pre-mRNAs.
Proc. Natl. Acad. Sci. U.S.A. 89, 1403-1407 (1992)[PubMed:1741396]
Cleavage stimulation factor is one of the multiple factors required for 3'-end cleavage of mammalian pre-mRNAs. We have shown previously that this factor is composed of three subunits with estimated molecular masses of 77, 64, and 50 kDa and that the 64-kDa subunit can be UV-crosslinked to RNA in a polyadenylylation signal (AAUAAA)-dependent manner. We have now isolated cDNAs encoding the 64-kDa subunit of human cleavage stimulation factor. The 64-kDa subunit contains a ribonucleoprotein-type RNA binding domain in the N-terminal region and a repeat structure in the C-terminal region in which a pentapeptide sequence (consensus MEARA/G) is repeated 12 times and the formation of a long alpha-helix stabilized by salt bridges is predicted. An approximately 270-amino acid segment surrounding this repeat structure is highly enriched in proline and glycine residues (approximately 20% for each). When cloned 64-kDa subunit was expressed in Escherichia coli, an N-terminal fragment containing the RNA binding domain bound to RNAs in a polyadenylylation-signal-independent manner, suggesting that the RNA binding domain is directly involved in the binding of the 64-kDa subunit to pre-mRNAs.
Protein involved in the processing of the primary mRNA transcript to yield a functional mRNA. This includes 5' capping, 3' cleavage and polyadenylation, as well as mRNA splicing and RNA editing.
A reference proteome is a set of protein sequences derived from a complete proteome which constitutes a defined standard for a particular user community. Reference proteomes are manually defined according to a number of criteria. They cover the proteomes of well- studied model organisms and other proteomes of interest for biomedical and biotechnological research. Reference proteomes have been selected to provide broad coverage of the tree of life, and constitute a representative cross-section of the taxonomic diversity to be found within UniProtKB.