Abstract: Objective To screen long non-coded RNA (lncRNA) associated with the prognosis of colon cancer, and to build a prognostic risk model of colon cancer. Methods The data were collected from the establishment to March 1,2022. The transcriptome data of colon cancer were downloaded and sorted from The Cancer Genome Atlas (TCGA), then we constructed an expression matrix of lncRNA about paired samples. Differentially expressed lncRNAs (DElncRNAs) were obtained by R-packet "edgeR". For DElncRNAs, univariate COX regression analysis, Lasso regression analysis, Kaplan-Meier (K-M) survival analysis, and multivariate COX regression analysis were performed to obtain the prognostic associated lncRNAs. The prognostic risk model of colon cancer was established based on the coefficient of multivariate COX regression model. Then we evaluated the accuracy through C-index value, time-dependent receiver operating characteristic curve (ROC), area under ROC (AUC) value, and K-M survival analysis. CeRNA network was constructed for the lncRNAs in our model. Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis were performed for related mRNAs to explore the mechanism of lncRNA affecting the progression of colon cancer. Results Five thousand four hundred and sixty lncRNAs were screened by arranging the transcriptome data. Eight hundred and sixty-eight DElncRNAs were obtained by paired-sample analysis, including 548 up-regulated genes and 320 down-regulated genes. After univariate COX regression analysis, 40 lncRNAs were obtained. Through lasso regression analysis, we got 34 lncRNAs. Fourteen lncRNAs remained after K-M survival analysis. Multivariate COX regression analysis revealed 7 prognostic related lncRNAs (down-regulated genes: LINC01132; up-regulated genes: ELFN1-AS1, RP5-884M6.1, LINC00461, RP1-79C4.4, RP4-816N1.7, and RP3-380B8.4). The prognostic assessment model was constructed according to the regression coefficient. The C-index value of the model was 0.82; the AUC values at 3 and 5 years were 0.79 and 0.84; K-M survival analysis showed a statistical difference in the survival rate between the high and low risk groups ( P <0.000 1). Next, we constructed the ceRNA network, and the KEGG enrichment analysis suggested that the down-regulation lncRNA inhibited the progression of colon cancer possibly through the pathways of regulation of actin cytoskeleton, proteoglycans in cancer, and PI3K-Akt signaling pathway; up-regulation lncRNAs promoted colon cancer possibly through the pathways of cellular adhesion molecules, focal adhesions, and phagosomes. Conclusions In our study, we constructed a prognostic risk model of colon cancer with 7 lncRNAs. It has a nice accuracy in predicting the patients' survival prognosis. Each lncRNA is a potential independently prognostic biomarker. The prognostic risk model has certain value for clinical prognostic assessment of colon cancer patients.

Key words: Colon cancer, Colorectal cancer, TCGA, lncRNA, Prognostic model

摘要: 目的 筛选与结肠癌预后相关长链非编码RNA(lncRNA),并构建结肠癌预后风险模型。方法 数据提取时间:建库至2022年3月1日。从癌症基因组图谱(TCGA)数据库下载并整理结肠癌转录组数据,构建配对样本lncRNA表达矩阵,利用“edgeR”R包筛选获得差异表达lncRNA(DElncRNA)。对DElncRNA先后行COX回归模型单变量分析、Lasso回归分析、Kaplan-Meier(K-M)生存分析、多元COX回归模型分析,获取预后相关lncRNA。依据多元COX回归模型中回归系数构建结肠癌预后风险模型。通过C指数值、时间依赖的受试者工作特征曲线(ROC)和ROC下的面积(AUC)及K-M生存分析评估模型预测的准确性。对模型中lncRNA构建竞争性内源RNA(ceRNA)网络,对相关的mRNA进行基因本体论(GO)、京都基因与基因组大百科全书数据库(KEGG)富集分析,探索lncRNA影响结肠癌进展的机制。结果 整理转录组数据得到5 460个lncRNA,配对样本分析获得DElncRNA 868个,其中上调548个、下调320个。单变量COX回归分析后获得40个lncRNA,经Lasso回归分析过滤共线性因素,得到lncRNA 34个,K-M生存分析后,得出14个候选lncRNA。再进行多元COX回归分析,得到7个预后相关lncRNA(下调:LINC01132;上调:ELFN1-AS1、RP5-884M6.1、LINC00461、RP1-79C4.4、RP4-816N1.7、RP3-380B8.4),依据回归系数构建预后风险模型。模型的C指数值为0.82;3年和5年的AUC值分别为0.79、0.84;进行K-M生存分析提示高低风险组生存率差异有统计学意义( P <0.000 1)。随后构建ceRNA网络,通过KEGG富集分析提示下调lncRNA可能是通过肌动蛋白细胞骨架的调控、癌症中蛋白聚糖、PI3K-Akt信号通路等抑制结肠癌进展,上调lncRNA可能是通过细胞粘附分子、局灶性粘连、吞噬体等通路促进结肠癌进展。结论 本研究构建了一个包含7个lncRNA的结肠癌预后风险模型,具有较好预测患者生存预后准确性,每个lncRNA是潜在单独的预后生物标志物,对临床上结肠癌患者预后评估具有一定参考价值。

结直肠癌, TCGA, lncRNA,

He Tian, Cao Tiansheng. Screening of lncRNA related to prognosis of colon cancer based on TCGA database and establishment of prognostic risk model [J]. International Medicine and Health Guidance News, 2022, 28(13): 1864-1871.

何天 曹天生. 基于TCGA筛选结肠癌预后相关lncRNA及建立预后风险模型[J]. 国际医药卫生导报, 2022, 28(13): 1864-1871.

[1] Sung H, Ferlay J, Siegel RL, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality  worldwide for 36 cancers in 185 Countries[J]. CA Cancer J Clin, 2021,71(3):209-249.DOI: 10.3322/caac.21660.
[2] Feng YL, Shu L, Zheng PF, et al. Dietary patterns and colorectal cancer risk: a meta-analysis[J]. Eur J Cancer Prev, 2017,26(3):201-211.DOI: 10.1097/CEJ. 0000000000000245.
[3] 李道娟,李倩,贺宇彤. 结直肠癌流行病学趋势[J]. 肿瘤防治研究,2015,42(3):305-310. DOI:10.3971/j.issn.1000- 8578.2015.03.020.
[4] Das V, Kalita J, Pal M. Predictive and prognostic biomarkers in colorectal cancer: a systematic review of recent advances and challenges[J]. Biomed Pharmacother, 2017,87:8-19. DOI: 10.1016/j.biopha.2016.12.064.
[5] Spizzo R, Almeida MI, Colombatti A, et al. Long non-coding RNAs and cancer: a new frontier of translational research?[J]. Oncogene, 2012,31(43):4577-4587.DOI: 10.1038/onc.2011.621.
[6] Dhamija S, Diederichs S. From junk to master regulators of invasion: lncRNA functions in migration, EMT and metastasis[J]. Int J Cancer, 2016,139(2):269-280. DOI: 10.1002/ijc.30039.
[7] Li J, Meng H, Bai Y, et al. Regulation of lncRNA and its role in cancer metastasis[J]. Oncol Res, 2016,23(5): 205-217. DOI: 10.3727/096504016X14549667334007.
[8] Salmena L, Poliseno L, Tay Y, et al. A ceRNA hypothesis: the rosetta stone of a hidden RNA language?[J]. Cell, 2011,146(3):353-358.DOI: 10.1016/j.cell.2011.07.014.
[9] Peng CL, Zhao XJ, Wei CC, et al. LncRNA HOTAIR promotes colon cancer development by down-regulating miRNA-34a[J]. Eur Rev Med Pharmacol Sci, 2019,23(13):5752-5761. DOI: 10.26355/eurrev_201907_18312.
[10] De Neve J, Gerds TA. On the interpretation of the hazard ratio in Cox regression[J]. Biom J, 2020,62(3):742-750.DOI: 10.1002/bimj.201800255.
[11] Goerdten J, Carrière I, Muniz-Terrera G. Comparison of Cox proportional hazards regression and generalized Cox regression  models applied in dementia risk prediction[J]. Alzheimers Dement (N Y), 2020,6(1):e12041. DOI: 10. 1002/trc2.12041.
[12] Brentnall AR, Cuzick J. Use of the concordance index for predictors of censored survival data[J]. Stat Methods Med Res, 2018,27(8):2359-2373.DOI: 10.1177/0962280216680245.
[13] Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating characteristic (ROC)  curve[J]. Radiology, 1982,143(1):29-36.DOI: 10.1148/radiology. 143.1.7063747.
[14] Gaudet P, Logie C, Lovering RC, et al. Gene ontology representation for transcription factor functions[J]. Biochim Biophys Acta Gene Regul Mech, 2021,1864(11-12):194752.DOI: 10.1016/j.bbagrm.2021. 194752.
[15] Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes[J]. Nucleic Acids Res, 2000,28(1):27-30. DOI: 10.1093/nar/28.1.27.
[16] 叶志华,付金伦,桂定文,等. 下调长链非编码RNA FGD5-AS1抑制肾癌细胞增殖和侵袭的机制研究[J]. 国际医药卫生导报,2021,27(19):2969-2972. DOI:10.3760/cma.j.issn.1007-1245.2021.19.002.
[17] Zha Z, Zhang P, Li D, et al. Identification and construction of a long noncoding RNA prognostic risk model for  stomach adenocarcinoma patients[J]. Dis Markers, 2021,2021:8895723.DOI: 10.1155/2021/8895723.
[18] Shen Y, Peng X, Shen C. Identification and validation of immune-related lncRNA prognostic signature for breast cancer[J]. Genomics, 2020,112(3):2640-2646. DOI: 10.1016/j.ygeno.2020.02.015.
[19] Ogunwobi OO, Mahmood F, Akingboye A. Biomarkers in colorectal cancer: current research and future prospects[J]. Int J Mol Sci, 2020,21(15): 5311.DOI: 10.3390/ijms21155311.
[20] Bao L, Chen Y, Lai HT, et al. Methylation of hypoxia-inducible factor (HIF)-1α by G9a/GLP inhibits HIF-1 transcriptional activity and cell migration [J]. Nucleic Acids Res, 2018,46(13):6576-6591.DOI: 10.1093/nar/gky449.
[21] Meng C, Zhou JQ, Liao YS. Autophagy-related long non-coding RNA signature for ovarian cancer[J]. J Int Med Res, 2020,48(11): 300060520970761.DOI: 10.1177/0300060520970761.
[22] Lei R, Feng L, Hong D. ELFN1-AS1 accelerates the proliferation and migration of colorectal cancer via regulation of miR-4644/TRIM44 axis[J]. Cancer Biomark, 2020,27(4): 433-443. DOI: 10.3233/CBM-190559.
[23] Yu H, Ma J, Chen J, et al. LncRNA LINC00461 promotes colorectal cancer progression via miRNA-323b-3p/NFIB axis[J]. Onco Targets Ther, 2019, 12: 11119-11129. DOI: 10.2147/OTT.S228798.
[24] Meng Q, Liu M, Cheng R. LINC00461/miR-4478/E2F1 feedback loop promotes non-small cell lung cancer cell  proliferation and migration[J]. Biosci Rep, 2020,40(2): BSR20191345. DOI: 10.1042/BSR20191345.
[25] Fu X, Duanmu J, Li T, et al. A 7-lncRNA signature associated with the prognosis of colon adenocarcinoma[J]. PeerJ, 2020,8:e8877.DOI: 10.7717/peerj.8877.
Liu Siping, Mao Haiyan. Multiphase contrast-enhanced MSCT in TN staging of colorectal cancer: comparative study with pathology [J]. International Medicine and Health Guidance News, 2022, 28(4): 464-467. Zhao Wenzhen, Lin Yuning. LINC01836 is a new biomarker for the diagnosis and prognosis of colorectal cancer [J]. International Medicine and Health Guidance News, 2022, 28(3): 363-367. Li Feng, Xuan Jinfeng, Gong Chao, Li jiongxian. Single hole laparoscopic radical resection for patients with colorectal cancer [J]. International Medicine and Health Guidance News, 2022, 28(13): 1829-1833. Peng Shuling, Zhang Weihua, Zhong Yubo, Liu Lili, Huang Hongzhen. Effect of Jianpiyiqi prescription combined with acupoint application in prevention and treatment of chemotherapeutic gastrointestinal reactions after colorectal cancer surgery [J]. International Medicine and Health Guidance News, 2022, 28(11): 1516-1520.