TY - GEN
T1 - Software Estimation Process Improvement using Data Segmentation
AU - Ali, Syed Sarmad
AU - Jamal, Muhammad Asif
AU - Khan, Muhammad Yaseen
AU - Rizvi, Syed Muhammad Asim Ali
AU - Wang, Ziyu
PY - 2024/10/17
Y1 - 2024/10/17
N2 - Estimating software development effort holds pivotal significance in the domains of resource allocation, budgeting, and project planning. Despite this importance, achieving precise predictions remains a complex challenge. This study aims to refine software effort assessment precision by investigating the efficacy of a segmentation technique tailored for dataset analysis. This approach involves categorizing interconnected projects into distinct segments based on their size, complexity, and domain characteristics. The inherent assumption is that tasks sharing similar attributes will coalesce within segments. Through such delineation, patterns and trends within each segment can be illuminated. This segmentation technique is anticipated to enhance the accuracy of software effort estimation through meticulous intra-segment effort computations and accounting for variations across segment boundaries. To substantiate this approach, the dataset is partitioned into subsets based on productivity values. Employing machine learning models like Linear, Support Vector Regression (SVR), k-Nearest Neighbors (k-NN), Multilayer Perceptron (MLP), and an Artificial Neural Network (ANN) with Exponential Linear Unit (ELU) activation, the study transforms nominal data into numerical form and evaluates model performance using metrics like MMRE, MAE, MdMRE, MdAE, and PRED. The segmentation technique exhibits promising outcomes in elevating software effort estimation accuracy, achieving an average MAE reduction of approximately 6.79 units compared to prior research. This technique holds potential to inform resource allocation and project scheduling decisions in the software industry, advancing estimation processes and empowering more informed strategic choices.
AB - Estimating software development effort holds pivotal significance in the domains of resource allocation, budgeting, and project planning. Despite this importance, achieving precise predictions remains a complex challenge. This study aims to refine software effort assessment precision by investigating the efficacy of a segmentation technique tailored for dataset analysis. This approach involves categorizing interconnected projects into distinct segments based on their size, complexity, and domain characteristics. The inherent assumption is that tasks sharing similar attributes will coalesce within segments. Through such delineation, patterns and trends within each segment can be illuminated. This segmentation technique is anticipated to enhance the accuracy of software effort estimation through meticulous intra-segment effort computations and accounting for variations across segment boundaries. To substantiate this approach, the dataset is partitioned into subsets based on productivity values. Employing machine learning models like Linear, Support Vector Regression (SVR), k-Nearest Neighbors (k-NN), Multilayer Perceptron (MLP), and an Artificial Neural Network (ANN) with Exponential Linear Unit (ELU) activation, the study transforms nominal data into numerical form and evaluates model performance using metrics like MMRE, MAE, MdMRE, MdAE, and PRED. The segmentation technique exhibits promising outcomes in elevating software effort estimation accuracy, achieving an average MAE reduction of approximately 6.79 units compared to prior research. This technique holds potential to inform resource allocation and project scheduling decisions in the software industry, advancing estimation processes and empowering more informed strategic choices.
KW - Accuracy
KW - Estimation
KW - Machine Learning
KW - articifial neurak network
KW - Software
KW - Data models
KW - Numerical models
KW - planning
KW - Resource management
KW - Software development management
UR - http://www.scopus.com/inward/record.url?scp=85208120845&partnerID=8YFLogxK
U2 - 10.1109/IBCAST59916.2023.10712851
DO - 10.1109/IBCAST59916.2023.10712851
M3 - Conference proceeding
SN - 979-8-3503-0825-9
SN - 979-8-3503-0826-6
SP - 279
EP - 284
BT - 2023 20th International Bhurban Conference on Applied Sciences and Technology (IBCAST)
PB - IEEE
T2 - 20th International Bhurban Conference on Applied Sciences and Technology (IBCAST)
Y2 - 22 August 2023 through 25 August 2023
ER -