MARKUP ONTOLOGY DESIGN FOR A CONTENT MANAGEMENT SYSTEM

Authors

DOI:

https://doi.org/10.32782/IT/2023-1-9

Keywords:

сontent management system, ontology, markup, search robots

Abstract

Low standards of written content and poor standard of search engine optimization (SEO) are the main problems in the content management area, especially, when we are talking about huge sitemaps with millions of pages. For each webpage, the same content can be represented for two targets: for search robots which work with special markup schemas and for users. As usual, the information for search robots is limited by keywords which cannot describe clearly the meaning of textual messages on the webpage thanks for different reasons. The problem here is content meaning synchronization for both users and for search robots across all the millions of pages. The different components of the webpage have their own style and could contain something meaningful context which we are going to synchronize with page meaning too. The most famous markup schema was created by leaders of search engines in the market, namely: Google, Yahoo, Microsoft and others (Schema.org, 2022). It helps the search engine robots better understand the content of web pages. The schema can be extended by demand using a well-documented extension model including vocabularies which describe entities and relationships between them. We are going to resolve the problems of content synchronization and its single representation form for users and for search robots, therefore, the text generation techniques are not considered in the paper. Sure, a matter of control of the content through millions of pages is so hard to resolve even using modern content management systems (CMS) systems. Thus, we resolve the content management problems by installing semantics for webpage markup schemas, webpages and their content using a single knowledge representation in markup ontology. The proposed ontology-based approach is able to synchronize the meaning between content for users and for search robots of webpages and could be implemented as an extra plugin for CMS.

References

Welcome to Schema.org. 2022. URL: https://schema.org/

Deane Barker. Web Content Management. Systems, Features, and Best Practices: O’Reilly Media, 2016. p. 550.

Lyashkevych V., Oksana O., Mirosh O. Logic-Textual and Neuronet Approach to Search Information. Proceedings of the 5th IEEE International Workshop on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications 21-23 September 2009, Rende (Cosenza), Italy. – P.539-543. - DOI: https://doi.org/10.1109/IDAACS.2009.5342920

Lyashkevych V., Olar O., Liashkevych M. Software Ontology Subject Domain Intelligence Diagnostics of Computer Means. The 7th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications, 12-14 September 2013, Berlin, Germany. V.2. pp. 601-606. DOI: https://doi.org/10.1109/IDAACS.2013.6662995

Generating news image captions with semantic discourse extraction and contrastive style-coherent learning / Zhang Z., Zhang H., Wang J., Sun Z., Yang Z. Computers and Electrical Engineering. 2022. V. 104. Part A. DOI: https://doi.org/10.1016/j.compeleceng.2022.108429

Syntax Customized Video Captioning by Imitating Exemplar Sentences / Yuan, Yitian, Ma, Lin, Zhu, Wenwu. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2022. V. 44, I. 12. pp. 10209-10221. DOI: https://doi.org/10.1109/TPAMI.2021.3131618

Downloads

Published

2023-06-20