نشریه مهندسی مکانیک ایران

نشریه مهندسی مکانیک ایران

طبقه بندی پنج نوع داده سرطان بر اساس روش های شبکه عصبی و تحلیل و بررسی بیان ژن بر اساس روش همجوشی انتخاب ویژگی

نوع مقاله : مقاله علمی پژوهشی

نویسندگان
1 دانشجوی دکتری، دانشکده مهندسی مکانیک، دانشگاه صنعتی خواجه نصیرالدین طوسی، تهران، ایران
2 کارشناسی ارشد، دانشکده مهندسی مکانیک، دانشگاه صنعتی خواجه نصیرالدین طوسی، تهران، ایران
3 دانشیار، دانشکده مهندسی مکانیک، دانشگاه صنعتی خواجه نصیرالدین طوسی، تهران، ایران
چکیده
در داده های میکروآرایه با حجم بالا، تعداد کم نمونه ها و تغییرپذیری ذاتی در فرآیندهای بیولوژیکی باعث ایجاد مشکل افزایش هزینه محاسباتی و پیچیدگی طبقه بندی ها می شود. همچنین تفسیر ژن های عامل بیماری پیچیده است، چرا که از نظر بیولولوژیکی، تنها مجموعه کوچکی از ژن ها می توانند بیماری را با دقت بیشتری توصیف نمایند. اولین قدم در آنالیز داده های میکروآرایه، کاهش قابل توجه تعداد ژن ها یا به عبارتی انتخاب ژن های متمایز کننده در فرآیند طبقه بندی است. این مرحله انتخاب ژن نامیده می شود. در این مقاله از طبقه بندی بیان ژن پنج داده، سرطان روده، سرطان پستان، لوسمی، تومورهای پروستات و لنفوم های سلول های بزرگ پخش شده استفاده شده و هر یک از آن ها به تفکیک در چرخه انتخاب ویژگی و نیز دسته بندی با تعداد ویژگی های متغیر وارد می شوند.
کلیدواژه‌ها

موضوعات


[1]  A. D. Gordon, "Classification," 2nd Ed., CRC Press, USA, 1999, ISBN: 1584888539. https://books.google.com/books?id=_w5AJtbfEz4C.
 
[2] L. Boguslawski, "Influence of Pressure Fluctuations Distribution on Local Heat Transfer on Flat Surface Impinged by Turbulent Free Jet," in Thermal Sciences 2004, Proceedings of the ASME-ZSIS International Thermal Science Seminar II, 2004, doi: 10.1615/ICHMT.2004.IntThermSciSemin.
 
[3] R. G. Brereton, and G. R. Lloyd, "Support Vector Machines for Classification and Regression," Analyst, Vol. 135, No. 2, pp. 230-267, 2010, doi: 10.1039/B918972F.
 
[4] J. R. Quinlan, "Programs for Machine Learning," Morgan Kaufmann Publishers, Inc., Elsevier, San Mateo, California, 2014, ISBN: 0080500587, https://books.google.com/books?id=b3ujBQAAQBAJ.
 
[5] W.-Y. Loh, "Classification and Regression Trees," Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, Vol. 1, No. 1, pp. 14-23, 2011, First Edition, John Wiley & Sons, Inc., New Jersey, USA, doi: 10.1002/widm.8.
 
[6] K. P. Murphy, "Naive Bayes Classifiers," University of British Columbia, Vol. 18, No. 60, pp. 1-8, 2006, http://www.ic.unicamp.br/rocha/teaching/ 2011s1/mc906/aulas/naivebayes.pdf.
 
[7] L. Van, D. Maaten, E. Postma, and J. Van den Herik, "Dimensionality Reduction: A Comparative," J Mach Learn Res, Vol. 10, No. 66-71, p. 13, 2009. [Online] Available: https://members.loria.fr/moberger/Enseignement/AVR/Exposes/.
 
[8] L. Nanni, and A. Lumini, "Wavelet Selection for Disease Classification by DNA Microarray Data," Expert Systems with Applications, Vol. 38, No. 1, pp. 990-995, 2011, doi: 10.1016/j.eswa.2010.07.104.
 
[9] I. Porto-Diaz, V. Bolon-CanedoA. Alonso-Betanzos, and O. Fontenla-Romero, "A Study of Performance on Microarray Data Sets for a Classifier Based on Information Theoretic Learning," Neural Networks, Vol. 24, No. 8, pp. 888-896, 2011, doi: 10.1016/j.neunet.2011.05.010.
 
[10] M. Shah, M. Marchand, and J. Corbeil, "Feature Selection with Conjunctions of Decision Stumps and Learning from Microarray Data," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 34, No. 1, pp. 174-186, 2011, doi: https://doi.org/10.1109/TPAMI.2011.82.
 
[11] F. V. Sharbaf, S. Mosafer, and M. H. Moattar, "A Hybrid Gene Selection Approach for Microarray Data Classification using Cellular Learning Automata and Ant Colony Optimization," Genomics, Vol. 107, No. 6, pp. 231-238, 2016, doi: 10.1016/j.ygeno.2016.05.001.
 
[12] Y. Leung, and Y. Hung, "A Multiple-filter-multiple-wrapper Approach to Gene Selection and Microarray Data Classification," IEEE/ACM Transactions on Computational Biology and Bioinformatics, Vol. 7, No. 1, pp. 108-117, 2008, doi: 10.1109/TCBB.2008.46.
 
[13] Y. Leung, and Y. Hung, "A Multiple-filter-multiple-wrapper Approach to Gene Selection and Microarray Data Classification," IEEE/ACM Transactions on Computational Biology and Bioinformatics, Vol. 7(1), pp. 108-117, 2008, doi: 10.1109/TCBB.2008.46.
 
[14] S. Maldonado, R. Weber, and F. Famili, "Feature Selection for High-dimensional Class-imbalanced Data Sets using Support Vector Machines," Information Sciences, Vol. 286, pp. 228-246, 2014, doi: 10.1016/j.ins.2014.07.015.
 
[15] A. Castaño, F. Fernández-NavarroC. Hervás-Martínez, and P.A. Gutiérrez, "Neuro-logistic Models Based on Evolutionary Generalized Radial Basis Function for the Microarray Gene Expression Classification Problem," Neural Processing Letters, Vol. 34, pp. 117-131, 2011, doi: 10.1007/s11063-011-9187-8.
 
[16] J. Li, Y. Jia, and W. Li, "Adaptive Huberized Support Vector Machine and its Application to Microarray Classification," Neural Computing and Applications, Vol. 20, pp. 123-132, 2011, doi: 10.1007/s00521-010-0371-y.
 
[17] K.-H. Liu, Z.-H. Zeng, and V. T. Y. Ng, "A Hierarchical Ensemble of ECOC for Cancer Classification Based on Multi-class Microarray Data," Information Sciences, Vol. 349, pp. 102-118, 2016, doi: 10.1016/j.ins.2016.02.028.
 
[18] S. Karimi, and M. Farrokhnia, "Leukemia and Small Round Blue-cell Tumor Cancer Detection using Microarray Gene Expression Data Set: Combining Data Dimension Reduction and Variable Selection Technique," Chemometrics and Intelligent Laboratory Systems, Vol. 139, pp. 6-14, 2014, doi: 10.1016/j.chemolab.2014.09.003.
 
[19] H. Liu, L. Liu, and H. Zhang, "Ensemble Gene Selection by Grouping for Microarray Data Classification," Journal of Biomedical Informatics, Vol. 43, No. 1, pp. 81-87, 2010, doi: 10.1016/j.jbi.2009.08.010.
 
[20] P. Maji, "Fuzzy–rough Supervised Attribute Clustering Algorithm and Classification of Microarray Data," IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), Vol. 41, No. 1, pp. 222-233, 2010, doi: 10.1109/TSMCB.2010.2050684.
 
[21] C.-P. Lee, W.-S. Lin, Y.-M. Chen, and B.-J. Kuo, "Gene Selection and Sample Classification on Microarray Data Based on Adaptive Genetic Algorithm/k-nearest Neighbor Method," Expert Systems with Applications, Vol. 38, No. 5, pp. 4661-4667, 2011, doi: 10.1016/j.eswa.2010.07.053.
 
[22] D. Hernández-Lobato, J. M. Hernández-Lobato, and A. Suárez, "Expectation Propagation for Microarray Data Classification," Pattern Recognition Letters, Vol. 31, No. 12, pp. 1618-1626, 2010, doi: 10.1016/j.patrec.2010.05.007.
 
[23] P. Mohapatra, S. Chakravarty, and P. Dash, "Microarray Medical Data Classification using Kernel Ridge Regression and Modified Cat Swarm Optimization Based Gene Selection System," Swarm and Evolutionary Computation, Vol. 28, pp. 144-160, 2016, doi: 10.1016/j.swevo.2016.02.002.
 
[24] J. M. Cadenas, M. C. Garrido, and R. MartíNez, "Feature Subset Selection Filter–wrapper Based on Low Quality Data," Expert Systems with Applications, Vol. 40, No. 16, pp. 6241-6252, 2013, doi: 10.1016/j.eswa.2013.05.051.
 
[25] H. Deng, and G. Runger, "Gene Selection with Guided Regularized Random Forest," Pattern Recognition, Vol. 46, No. 12, pp. 3483-3489, 2013, doi: 10.1016/j.patcog.2013.05.018.
 
[26] F. Fernández-Navarro, C. Hervás-Martínez, R. Ruiz, and J.C. Riquelme, "Evolutionary Generalized Radial Basis Function Neural Networks for Improving Prediction Accuracy in Gene Classification using Feature Selection," Applied Soft Computing, Vol. 12, No. 6, pp. 1787-1800, 2012, doi: 10.1016/j.asoc.2012.01.008.
 
[27] S. Tabakhi, A. Najafi, R. Ranjbar, and P. Moradi, "Gene Selection for Microarray Data Classification using a Novel Ant Colony Optimization," Neurocomputing, Vol. 168, pp. 1024-1036, 2015, doi: 10.1016/j.neucom.2015.05.022.
 
[28] M. Czajkowski, M. Grześ, and M. Kretowski, "Multi-test Decision Tree and its Application to Microarray Data Classification," Artificial Intelligence in Medicine, Vol. 61, No. 1, pp. 35-44, 2014, doi: 10.1016/j.artmed.2014.01.005.
 
[29] H. Lu, J. Chen, K. YanQ. JinY. Xue, and Z. Gao, "A Hybrid Feature Selection Algorithm for Gene Expression Data Classification," Neurocomputing, Vol. 256, pp. 56-62, 2017, doi: 10.1016/j.neucom.2016.07.080.
 
[30] K. Lee, Z. Man, D. Wang, and Z. Cao, "Classification of Bioinformatics Dataset using Finite Impulse Response Extreme Learning Machine for Cancer Diagnosis," Neural Computing and Applications, Vol. 22, pp. 457-468, 2013, doi: 10.1007/s00521-012-0847-z.
[31] Z. Liu, D. Tang, Y. Cai, R. Wang, and F. Chen, "A Hybrid Method Based on Ensemble WELM for Handling Multi Class Imbalance in Cancer Microarray Data," Neurocomputing, Vol. 266, pp. 641-650, 2017, doi: 10.1016/j.neucom.2017.05.066.
 
[32] L. Fan, K.-L. Poh, and P. Zhou, "Partition-conditional ICA for Bayesian Classification of Microarray Data," Expert Systems with Applications, Vol. 37, No. 12, pp. 8188-8192, 2010, doi: 10.1016/j.eswa.2010.05.068.
 
[33] A. Wang, N. An, G. Chen, L. Li, and G. Alterovitz, "Improving PLS–RFE Based Gene Selection for Microarray Data Classification," Computers in Biology and Medicine, Vol. 62, pp. 14-24, 2015, doi: 10.1016/j.compbiomed.2015.04.011.
 
[34] C. Bielza, V. Robles, and P. Larrañaga, "Regularized Logistic Regression without a Penalty Term: An Application to Cancer Classification with Microarray Data," Expert Systems with Applications, Vol. 38, No. 5, pp. 5110-5118, 2011, doi: 10.1016/j.eswa.2010.09.140.
 
[35] M.-Y. Wu, D.-Q. Dai, Y. Shi, H. Yan, and X.-F. Zhang, "Biomarker Identification and Cancer Classification Based on Microarray Data using Laplace Naive Bayes Model with Mean Shrinkage," IEEE/ACM Transactions on Computational Biology and Bioinformatics, Vol. 9, No. 6, pp. 1649-1662, 2012, doi: 10.1109/TCBB.2012.105.
 
[36] S. Monti, K. Savage, J. L. Kutok, F. Feuerhake, P. Kurtin, M. Mihm, B. Wu, L. Pasqualucci, D. Neuberg, R. C. T. Aguiar, P. Dal Cin, C. Ladd, G. S. Pinkus, G. Salles, N. Lee Harris, R. Dalla-Favera, T. M. Habermann, J. C. Aster, T. R. Golub, and M. A. Shipp, "Molecular Profiling of Diffuse Large B Cell Lymphoma Reveals a Novel Disease Subtype with Brisk Host Inflammatory Response and Distinct Genetic Features," Blood, Vol. 105, No. 5, pp. 1851-1861, 2005, https://doi.org/10.1182/blood-2004-07-2947.
 
[37] T. R. Golub,  D.K. SlonimP. Tamayo, C. Huard, M. Gaasenbeek, J.P. Mesirov, H. Coller, M.L. Loh, J. R. Downing, M. A. Caligiuri, C. D. Bloomfield, and E. S. Lander, "Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring," Science, Vol. 286, No. 5439, pp. 531-537, 1999, doi: 10.1126/science.286.5439.531.
 
[38] D. Singh, P. G. Febbo, K. Ross, D. G. Jackson, J. Manola, C. Ladd, P. Tamayo, A. A. Renshaw, A. V. D'Amico, J. P. Richie, E. S. Lander, M. Loda, P, W. Kantoff, T. R. Golub, and W. R. Sellers, "Gene Expression Correlates of Clinical Prostate Cancer Behavior," Cancer Cell, Vol. 1, No. 2, pp. 203-209, 2002, doi: 10.1016/S1535-6108(02)00030-2.
 
[39] U. Alon, N. Barkai, D. A. Notterman, K. Gish, S. Ybarra, D. Mack, A. J. Levine, and A. J. Levine, "Broad Patterns of Gene Expression Revealed by Clustering Analysis of Tumor and Normal Colon Tissues Probed by Oligonucleotide Arrays," Proceedings of the National Academy of Sciences, Vol. 96, No. 12, pp. 6745-6750, 1999, doi: 10.1073/pnas.96.12.6745.
 
[40] N. Matamala, M.T. Vargas, R. Gonzalez-Campora, R. Minambres, J. I. Arias, P. Menendez, E. Andrés-León, G. Gómez-López, K. Yanowsky, J. Calvete-Candenas, L. Inglada-Pérez, B. Martínez-Delgado, and J. Benítez, "Tumor MicroRNA Expression Profiling Identifies Circulating MicroRNAs for Early Breast Cancer Detection," Clinical Chemistry, Vol. 61, No. 8, pp. 1098-1106, 2015, doi: 10.1373/clinchem.2015.238691.

  • تاریخ دریافت 08 خرداد 1402
  • تاریخ بازنگری 07 شهریور 1402
  • تاریخ پذیرش 15 مهر 1402