Tampere University of Technology

TUTCRIS Research Portal

Automatic requirements extraction, analysis and graph representation using an approach derived from computational linguistics

Research output: Contribution to journalArticleScientificpeer-review

Standard

Automatic requirements extraction, analysis and graph representation using an approach derived from computational linguistics. / Mokammel, Faisal; Coatanea, Eric; Coatanea, Joonas; Nenchev, Vladislav; Blanco, Eric; Pietola, Matti.

In: SYSTEMS ENGINEERING, 30.07.2018, p. 1-21.

Research output: Contribution to journalArticleScientificpeer-review

Harvard

APA

Vancouver

Author

Mokammel, Faisal ; Coatanea, Eric ; Coatanea, Joonas ; Nenchev, Vladislav ; Blanco, Eric ; Pietola, Matti. / Automatic requirements extraction, analysis and graph representation using an approach derived from computational linguistics. In: SYSTEMS ENGINEERING. 2018 ; pp. 1-21.

Bibtex - Download

@article{0db40582f98144ebb33980715e26bfa7,
title = "Automatic requirements extraction, analysis and graph representation using an approach derived from computational linguistics",
abstract = "The quality of requirements is fundamental in engineering projects. Requirements are usually expressed partly or totally in a natural language (NL) format and come from different documents.Their qualities are difficult to analyze manually, especially when hundreds of thousands of them have to be considered. The assistance of software tools is becoming a necessity. In this article, the goal was to develop a set of metrics supported by NL processing (NLP) methods supporting different types of analysis of requirements and especially the dependencies between requirements.An NLP approach is used to extract requirements from text; to analyze their quality, links, similarities, and contradictions; and to cluster them automatically. The analysis framework includes different combinations of methods such as cosine similarity, singular value decomposition, and K-means clustering. One objective is to assess the possible combinations and their impacts on detections to establish optimal metrics. Three case studies exemplify and support the validation of the work. Graphs are used to represent the automatically clustered requirements, as well as similarities and contradictions. A new contradiction analysis process that includes a rules-based approach is proposed. Finally, the combined results are presented as graphs, which unveil the semantic relationships between requirements. Subsection 4.8 compares the results provided by the tool and the results obtained from experts. The proposed methodology and network presentation not only support the understanding of the semantics of the requirements but also help requirements engineers to review the interconnections and consistency of requirements systems and manage traceability. The approach is valuable during the early phases of projects when requirements are evolving dynamically and rapidly.",
keywords = "contradictions analysis, network representation, requirements management, similarity",
author = "Faisal Mokammel and Eric Coatanea and Joonas Coatanea and Vladislav Nenchev and Eric Blanco and Matti Pietola",
year = "2018",
month = "7",
day = "30",
doi = "10.1002/sys.21461",
language = "English",
pages = "1--21",
journal = "SYSTEMS ENGINEERING",
issn = "1098-1241",
publisher = "Wiley",

}

RIS (suitable for import to EndNote) - Download

TY - JOUR

T1 - Automatic requirements extraction, analysis and graph representation using an approach derived from computational linguistics

AU - Mokammel, Faisal

AU - Coatanea, Eric

AU - Coatanea, Joonas

AU - Nenchev, Vladislav

AU - Blanco, Eric

AU - Pietola, Matti

PY - 2018/7/30

Y1 - 2018/7/30

N2 - The quality of requirements is fundamental in engineering projects. Requirements are usually expressed partly or totally in a natural language (NL) format and come from different documents.Their qualities are difficult to analyze manually, especially when hundreds of thousands of them have to be considered. The assistance of software tools is becoming a necessity. In this article, the goal was to develop a set of metrics supported by NL processing (NLP) methods supporting different types of analysis of requirements and especially the dependencies between requirements.An NLP approach is used to extract requirements from text; to analyze their quality, links, similarities, and contradictions; and to cluster them automatically. The analysis framework includes different combinations of methods such as cosine similarity, singular value decomposition, and K-means clustering. One objective is to assess the possible combinations and their impacts on detections to establish optimal metrics. Three case studies exemplify and support the validation of the work. Graphs are used to represent the automatically clustered requirements, as well as similarities and contradictions. A new contradiction analysis process that includes a rules-based approach is proposed. Finally, the combined results are presented as graphs, which unveil the semantic relationships between requirements. Subsection 4.8 compares the results provided by the tool and the results obtained from experts. The proposed methodology and network presentation not only support the understanding of the semantics of the requirements but also help requirements engineers to review the interconnections and consistency of requirements systems and manage traceability. The approach is valuable during the early phases of projects when requirements are evolving dynamically and rapidly.

AB - The quality of requirements is fundamental in engineering projects. Requirements are usually expressed partly or totally in a natural language (NL) format and come from different documents.Their qualities are difficult to analyze manually, especially when hundreds of thousands of them have to be considered. The assistance of software tools is becoming a necessity. In this article, the goal was to develop a set of metrics supported by NL processing (NLP) methods supporting different types of analysis of requirements and especially the dependencies between requirements.An NLP approach is used to extract requirements from text; to analyze their quality, links, similarities, and contradictions; and to cluster them automatically. The analysis framework includes different combinations of methods such as cosine similarity, singular value decomposition, and K-means clustering. One objective is to assess the possible combinations and their impacts on detections to establish optimal metrics. Three case studies exemplify and support the validation of the work. Graphs are used to represent the automatically clustered requirements, as well as similarities and contradictions. A new contradiction analysis process that includes a rules-based approach is proposed. Finally, the combined results are presented as graphs, which unveil the semantic relationships between requirements. Subsection 4.8 compares the results provided by the tool and the results obtained from experts. The proposed methodology and network presentation not only support the understanding of the semantics of the requirements but also help requirements engineers to review the interconnections and consistency of requirements systems and manage traceability. The approach is valuable during the early phases of projects when requirements are evolving dynamically and rapidly.

KW - contradictions analysis, network representation, requirements management, similarity

U2 - 10.1002/sys.21461

DO - 10.1002/sys.21461

M3 - Article

SP - 1

EP - 21

JO - SYSTEMS ENGINEERING

JF - SYSTEMS ENGINEERING

SN - 1098-1241

ER -