Abstract
The classification of software requirements is an essential task in software engineering. Manual classification requires a large amount of efforts, time and cost. Hence, automated techniques are required to classify software requirements. This work aims to develop requirement classification models based on extraction of relevant features from requirement documents and thereafter classifying requirement into functional and non-function requirements. In this paper, different word- embedding techniques to extract numerical features, feature selection to remove irrelevant feature, SMOTE to balance data, and six different classifiers for models training. The experiments have been conducted on PROMISE software engineering dataset. The experimental finding indicate that Word2vec is best way to extracting numerical features from requirement documents, RANK-SUM test is best way to find important features, and SVM-R was found as the best classifier.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Canedo, E.D., Mendes, B.C.: Software requirements classification using machine learning algorithms. Entropy 22(9), 1057 (2020)
Quba, G.Y., Al Qaisi, H., Althunibat, A., AlZu’bi, S.: Software requirements classification using machine learning algorithm’s. In: 2021 International Conference on Information Technology (ICIT), pp. 685–690 (2021)
Haque, M.A., Rahman, M.A., Siddik, M.S.: Non-functional requirements classification with feature extraction and machine learning: an empirical study. In: 2019 1st International Conference on Advances in Science, Engineering and Robotics Technology (ICASERT), pp. 1–5. IEEE (2019)
Cleland-Huang, J., Settimi, R., Zou, X., Solc, P.: Automated classification of non-functional requirements. Requir. Eng. 12(2), 103–120 (2007)
Lu, M., Liang, P.: Automatic classification of non-functional requirements from augmented app user reviews. In: Proceedings of the 21st International Conference on Evaluation and Assessment in Software Engineering, pp. 344–353 (2017)
O’brien, P.C.: Comparing two samples: extensions of the t, rank-sum, and log-rank tests. J. Am. Statist. Assoc. 83(401), 52–61 (1988)
Ringnér, M.: What is principal component analysis? Nat. Biotechnol. 26(3), 303–304 (2008)
Joachims, T.: Svmlight: support vector machine. In: SVM-Light Support Vector Machine, vol. 19, no. 4. University of Dortmund (1999). http://svmlight.joachims.org/
Fernández, A., Garcia, S., Herrera, F., Chawla, N.V.: Smote for learning from imbalanced data: progress and challenges, marking the 15-year anniversary. J Artif. Intell. Res. 61, 863–905 (2018)
Chatfield, M., Mander, A.: The Skillings-Mack test (Friedman test when there are missing data). Stata J. 9(2), 299–305 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Kumar, L., Baldwa, S., Jambavalikar, S.M., Murthy, L.B., Krishna, A. (2022). Software Functional and Non-function Requirement Classification Using Word-Embedding. In: Barolli, L., Hussain, F., Enokido, T. (eds) Advanced Information Networking and Applications. AINA 2022. Lecture Notes in Networks and Systems, vol 450. Springer, Cham. https://doi.org/10.1007/978-3-030-99587-4_15
Download citation
DOI: https://doi.org/10.1007/978-3-030-99587-4_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-99586-7
Online ISBN: 978-3-030-99587-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)