TY - JOUR JF - MGJ JO - MGj VL - 16 IS - 2 PY - 2021 Y1 - 2021/7/01 TI - Developing a novel algorithm to predict diagnostic biomarkers in lung cancer TT - ارائه ی روشی جدید برای کشف نشانگرهای زیستی پیش آگاهی دهنده در سرطان ریه N2 - Today, machine-learning approaches are widely used in the analysis of massive data. Due to new technology and the production of high-throughput data in biology (such as next-generation sequencing data), the use of machine learning methods on large biological data can help to understand the mechanism of complex diseases such as cancer. Extraction of candida genes as a therapeutic target or biomarkers from high-throughput biological data such as gene expression data considered as the first step in cancer treatment. Therefore, developing an efficient approach to analyzing such data plays a key role in bioinformatics and computational biology. In this paper, we try to identify lung cancer-related genes as potential biomarkers by applying the WCC algorithm and SVM to lung cancer gene expression data. The data from RNA-Seq technology used for lung cancer samples as well as healthy tissue samples obtained from the TCGA database. These data include the expression of mRNA genes in cancerous and healthy tissue samples. The results of the study led to important findings such as CASZ1 and ASNS, which according to previously published articles have an important role in the formation of cancer and their role in the formation of lung cancer can be examined in future studies. In addition, the validation results of the proposed method show the power of machine learning-based methods in analyzing gene expression data. SP - 103 EP - 112 AU - Kouhsar, Morteza AU - Masoudi-Sobhanzadeh, Yosef AU - Masoudi-Nejad, Ali AD - Tehran University KW - Lung Cancer KW - WCC Algorithm KW - Support Vector Machine KW - Biomarker KW - Machine Learning KW - Gene Expression Data UR - http://mg.genetics.ir/article-1-1688-en.html ER -