Semantic Information Extraction for Software Requirements using Semantic Role Labeling
摘要：Domain analysis is important to core assets development in software product line（SPL） engineering, and in software-as-a-service（Saa S） engineering. Traditional methods of domain analysis, however, are heavily based on manual labor. They depend on domain experts’ experience to analyze the commonality and variability of systems in a domain, which appears to be an obstacle for many organizations which tend to provide SaaS service or to launch a software product line（SPL）. In this paper, we propose a new approach to automatically extract the semantic information from software requirements specifications（SRSs）. We address this issue by combining techniques of semantic role labeling and domain knowledge modeling. In this approach, we selected frequent verbs from software requirement specification documents in the e-commerce domain, and built the semantic frames for those verbs. Then the selected sentences were labeled manually and the result was used as training examples for machine learning. To obtain the accurate features of the examples we correct the parsing result of Stanford Parser with the help of domain knowledge. During the labeling process, we adopt a sequential way in which the previous labeled results will be used to construct dynamic features for the identification of the subsequent semantic roles. The proposed approach was implemented and evaluated. The preliminary result shows the approach is effective and reliable. The extracted semantic information can be used to model the variability and commonality of functional requirements in the domain engineering.
2015 IEEE International Conference on Progress in Informatics and Computing（PIC 2015）