Bioinformatic approaches have increased their relevance in the latest years, being computational analysis a crucial tool for genomics and transcriptomics. To better understand how transforming growth factor beta (TGF-β) influence post-transcriptional regulation and RNA splicing we performed a wide computational analysis comparing RNA sequencing (RNA-seq) and Chromatinenriched RNA-seq (ChRNA-seq) data provided by Dr. Reyes’ group. TGF-β provoked changes in both nuclear and cytoplasmatic post-transcriptional regulation being the cytoplasmatic component crucial to gene expression direction. Thus, a positive (or negative) change in cytoplasmatic posttranscriptional regulation affect gene expression in the cell. In order to investigate splicing efficiency at the whole gene level we created the Splicing Index of the Gene (SIG) parameter. Genes with high SIG were characterized by high exon signal, a few reads in intronic regions and were less expressed in the nucleus. On the contrary, genes with low SIG were more expressed and had similar signal in exonic regions than in the intronic. Differences of SIG between genes were related to gene length and splicing site sequences. We observed a rough positive correlation between splicing efficiency and gene length, so we developed a model based on co-transcriptional nature of splicing where RNA polymerase II (RNAPII) would pass rapidly through the short gene, being all introns incapable to be matured, whereas in long genes, with long introns, spliceosome would have time to remove all introns from the pre-mRNA
This work was supported by the funded project called Epigenética del proceso de transición de epitelio a mesénquima y su reversión (BFU2017-85420-R).