Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

A Machine Learning-Based Framework for Diagnosis of Breast Cancer

Ravi Kumar Sachdeva, Priyanka Bathla

Source Title: International Journal of Software Innovation (IJSI) 10(1)

DOI: 10.4018/IJSI.301221

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Machine learning is used in the health care sector due to its ability to make predictions. Nowadays major cause of death in women is due to breast cancer. In this paper, a machine learning-based framework for the diagnosis of breast cancer has been proposed. The authors have used different feature selection methods on Breast Cancer Wisconsin (Diagnostic) dataset i.e. Chi-square, Pearson correlation between features and Feature importance. The competency of the feature selection methods has been analyzed using different machine learning classifiers on different performance parameters like accuracy, sensitivity, specificity, precision, and F-measure. Random Forest (RF), Extra Tree Classifier (ETC), and Logistic Regression (LR) machine learning classifiers have been used by the authors. Results reveal that FI (Feature Importance) is the preeminent feature selection method among all others used when applied with different classifiers. Results also show that the ETC machine learning classifier gives the best accuracy result in comparison with RF and LR classifiers.

Article Preview

Top

Introduction

Nowadays breast cancer is one of the growing diseases among women. With an age-adjusted prevalence of 25.8 per 100,000 women, it has risen to the top among Indian women. Breast cancer cases among women are more in less developed regions in comparison with the more developed regions (Malvia et al., 2017).

The machine learning field is constantly evolving. It allows computers to learn automatically without human intervention. It helps computers in building models from sample data to make predictions. Supervised learning is used in classification and regression types of problems. In supervised learning, the program is trained using a set of predefined training data and later on, accuracy has been checked using test data (Simon et al., 2015).

While creating a predictive model, the process of reducing the number of features is called feature selection. It is also the process of identifying and selecting the relevant from the available features while developing a predictive model. The purpose of feature selection methods is to identify and delete unnecessary features from the input data that do not help the model perform better (Vanaja and Kumar, 2014). Filter, wrapper, and embedded are different categories of methods for feature selection. Filter method ranks features according to some criterion, and then the features having the highest rank are selected. Wrapper methods evaluate all possible combinations and produce the result. The embedded method performs feature selection during the model training (Miao and Nio, 2016).

In this paper, breast cancer has been diagnosed using different combinations of classifier and feature selection methods. The authors did this by comparing the performance of various feature selection methods with different classifiers in machine learning. Following are the research contributions of the paper:

1.
Find out the best method for feature selection
2.
Propose a methodology for diagnosis of breast cancer
3.
For diagnosis of breast cancer, find optimal mix of classifier and feature selection approach

The remainder of the paper is organized as follows. The literature review of the relevant work has been covered in Section 2. Section 3 contains the methodology for the anticipated methods. Experimental Results of different feature selection methods have been included in section 4. Section 5 describes the conclusions and recommendations.

Top

New systems for disease diagnosis have been developed as technology in the medical industry has advanced. The following is a list of research related to the topic of the paper:

On the Wisconsin breast cancer dataset, Islam et al. (Islam et al., 2020) compared the accuracy, specificity, sensitivity, precision, F1 score, false-positive rate, negative predictive value, and Matthews correlation coefficient of five machine learning techniques: K-nearest Neighbors (KNN), LR, RF, Support Vector Machine (SVM), and Artificial Neural Networks (ANNs). According to the authors, ANNs had the best precision, accuracy, and F1 score.

Alickovic and Subasi (Alickovic and Subasi, 2015) have used Genetic Algorithm (GA) as feature selection technique for eliminating insignificant features. The authors have used several machine learning techniques i.e. LR, Decision Tree (DT), RF, SVM etc. on the Wisconsin datasets, and found that RF and GA feature selection gives the highest accuracy score.

Fatih (Fatih, 2020) used LR, DT, KNN, Naive Bayes (NB), RF, Rotation Forest techniques of machine learning on the Wisconsin breast cancer dataset (WBCD). The author has implemented classification algorithms in three different types with first as ‘All features included’, second as ‘Highly correlated features included’, and 3rd as ‘Low correlated features included’. Results reveal that Logistic Regression had the best classification accuracy with all features.

Complete Article List

Search this Journal:

Reset

Volume 12: 1 Issue (2024)

Volume 11: 1 Issue (2023)

Volume 10: 4 Issues (2022): 2 Released, 2 Forthcoming

Volume 9: 4 Issues (2021)

Volume 8: 4 Issues (2020)

Volume 7: 4 Issues (2019)

Volume 6: 4 Issues (2018)

Volume 5: 4 Issues (2017)

Volume 4: 4 Issues (2016)

Volume 3: 4 Issues (2015)

Volume 2: 4 Issues (2014)

Volume 1: 4 Issues (2013)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

A Machine Learning-Based Framework for Diagnosis of Breast Cancer

Abstract

Introduction

Complete Article List

A Machine Learning-Based Framework for Diagnosis of Breast Cancer

Abstract

Introduction

Related Work

Complete Article List