Algorithms and software for verification of scientific and technical text documents

Authors

DOI:

https://doi.org/10.15276/aait.06.2023.21

Keywords:

MS Word style, text analysis, document analysis, verification of documents, .doc, .docx

Abstract

The work provides a solution to the problem of verifying the design (formatting) of scientific and technical documents for compliance with the requirements of regulatory documents (the problem of document verification). The basis of the check is the analysis of the styles of the Word text editor, which are used to design the paragraphs of the document under study. For each element of the document (headings, annotations, main text, figures, signatures under figures, list of references and others) a reference style of their design was developed. Together, these styles form the set of allowed styles. There can be many sets of allowed styles, each edition has its own set of styles. Only the administrator has access to each of the sets, which can create new styles, new sets, and edit both individual styles and individual sets. Due to the peculiarities of style parsing, the document is treated as a combination of headers and footers and the body of the document. Algorithms for its verification were developed for this structure of the document: an algorithm for analyzing headers and footers, an algorithm for analyzing paragraphs of the main text, and an algorithm for updating style settings by the administrator. .Net, WPF, DocumentFormat.OpenXml technologies were used to implement the algorithms by software. Using DocumentFormat.OpenXml allows you to analyze styles in .doc/.docx format documents; the developed program accepts .doc or .docx format files as input and analyzes them for compliance with specified styles. The result of the analysis is returned in .txt or .doc/.docx format, indicating the detected deviations from the standards. The .txt format file is a list of found deviations, and in the .doc/.docx format files, the deviations are recorded in the form of comments to the original text. Using the program simplifies the process of checking documents, it allows you to identify all deviations from standards and reduce the time and resources spent on checking. .Net and WPF technologies were used to develop the user interface. The developed program was checked in the process of checking the explanatory notes of real bachelor's and master's qualification theses. The style analysis time was determined; the time does not exceed 3 seconds. The developed program can be useful for automating the process of checking documents, ensuring quality and compliance with the design standards of scientific and technical documentation, scientific and technical publications, and, first of all, in the educational process for checking the design of bachelor's and master's qualification works, as well as various reports

Downloads

Download data is not yet available.

Author Biographies

Valerii S. Hlukhov, Lviv Polytechnic National University, 12, Stepan Bandera Str. Lviv, 79013, Ukraine

Doctor of Engineering Sciences, Professor, Professor of the Department of Electronic Computing

Scopus Author ID: 56979360900

Dmytro S. Sydorko, Lviv Polytechnic National University, 12, Stepan Bandera Str. Lviv, 79013, Ukraine

Bachelor of Electronic Computing Department

Downloads

Published

2023-09-30

How to Cite

[1]
Hlukhov V.S., Sydorko D.S. “Algorithms and software for verification of scientific and technical text documents”. Applied Aspects of Information Technology. 2023; Vol. 6, No. 3: 304–317. DOI:https://doi.org/10.15276/aait.06.2023.21.