Publication Type: Journal Article
Source: Information & Security: An International Journal, Volume 47, Issue 2, p.187-202 (2020)
Keywords: data science, machine learning, semantic similarity search, text similarity
This paper describes an initial application of Natural Language Processing (NLP) techniques to a specific set of related NATO documents. In particular, text similarity techniques were applied to the document sets to capture the relationships between documents, or sections of documents, from both semantic and syntactic perspectives. Thesaurus and triple extraction techniques made it possible to interpret sentences beyond their syntactic structure, improving the accuracy with which similar content is identified across documents with diverse syntactic structures. The objective is to assess whether NLP tools can identify relationships and gaps in this kind of textual data. The work is intended to improve interoperability in NATO by supporting the development and application of the policies, directives and other documents that dictate how Consultation, Command and Control (C3) systems across the Alliance interoperate and support NATO's operational needs.
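
As an illustration only, the idea of thesaurus-assisted text similarity can be sketched as follows. This is not the paper's implementation: the bag-of-words cosine measure, the function names, the tiny THESAURUS mapping and the two sample sentences are all assumptions made up for this example. The sketch shows how mapping variant terms to one canonical term can raise the similarity score of two passages that use different wording.

```python
import math
import re
from collections import Counter

# Hypothetical mini-thesaurus (an assumption for this sketch):
# maps variant terms to one canonical term.
THESAURUS = {
    "guideline": "directive",
    "guidelines": "directive",
    "directives": "directive",
    "policies": "policy",
    "systems": "system",
    "interoperate": "interoperability",
}


def tokenize(text, thesaurus=None):
    """Lowercase the text, split it into alphanumeric tokens, and
    optionally replace each token by its canonical thesaurus term."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    if thesaurus:
        tokens = [thesaurus.get(token, token) for token in tokens]
    return tokens


def cosine_similarity(text_a, text_b, thesaurus=None):
    """Cosine similarity of the term-count vectors of two texts."""
    counts_a = Counter(tokenize(text_a, thesaurus))
    counts_b = Counter(tokenize(text_b, thesaurus))
    shared = counts_a.keys() & counts_b.keys()
    dot = sum(counts_a[t] * counts_b[t] for t in shared)
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text has no defined direction
    return dot / (norm_a * norm_b)


# Invented sample sentences, not taken from any NATO document.
section_a = "The directive defines how C3 systems interoperate."
section_b = "This guideline describes interoperability of the C3 system."

plain = cosine_similarity(section_a, section_b)
normalized = cosine_similarity(section_a, section_b, THESAURUS)

print(f"without thesaurus: {plain:.3f}")
print(f"with thesaurus:    {normalized:.3f}")
```

In this sketch the two sentences share only a few surface tokens, so the plain score is low, while the thesaurus-normalized score is higher because "guideline" and "directive", "systems" and "system", and "interoperate" and "interoperability" are treated as the same term. A count-based measure like this ignores word order and sentence structure, which is the gap that techniques such as triple extraction are meant to address.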