Machine Learning for Detecting Pronominal Anaphora Ambiguity in NL Requirements
- Resource Type
- Conference
- Authors
- Sharma, Richa; Sharma, Nidhi; Biswas, K.K.
- Source
- 2016 4th Intl Conf on Applied Computing and Information Technology/3rd Intl Conf on Computational Science/Intelligence and Applied Informatics/1st Intl Conf on Big Data, Cloud Computing, Data Science & Engineering (ACIT-CSII-BCD) ACIT-CSII-BCD Applied Computing and Information Technology/3rd Intl Conf on Computational Science/Intelligence and Applied Informatics/1st Intl Conf on Big Data, Cloud Computing, Data Science & Engineering (ACIT-CSII-BCD), 2016 4th Intl Conf on. :177-182 Dec, 2016
- Subject
- Computing and Processing
Classification algorithms
Pragmatics
Training
Software
Vegetation
Bayes methods
Business
requirements analysis
ambiguity
anaphora ambiguity
machine learning
- Language
Automated or semi-automated analysis of requirements specification documents, expressed in Natural Language (NL), has always been desirable. An important precursor to this goal is the identification and correction of potentially ambiguous requirements statements. Pronominal Anaphora ambiguity is one such type of pragmatic or referential ambiguity in NL requirements, which needs attention. However, identification of such ambiguous requirements statements is a challenging task since the count of such statements is relatively lower. We present a solution to this challenge by considering the task as that of a classification problem to classify ambiguous requirements statements having pronominal anaphora ambiguity from a corpus of potentially ambiguous requirements statements with pronominal anaphora ambiguity. We show how a classifier can be trained in semi-supervised manner to detect such instances of pronominal anaphoric ambiguous requirements statements. Our study indicates a recall of 95% with Bayesian network classification algorithm.