The term "Knowledge Management System (KMS)" is applied to a wide variety of methodological, software and organizational solutions. KMS as an IT system can be a corporate portal on SharePoint, a tool for information support of internal or external users, embedded in a task management system, or even a set of resources for self-learning. Such solutions are suitable for solving niche tasks, such as organizing the work of support or customer service, for training employees to perform typical business processes, but they are not effective for providing access to complex formalizable, heterogeneous and diverse knowledge required by middle and senior level employees. As a rule, classical KMS completely lacks the functionality of working with structured data (part of this functionality is provided by BI-systems). Another problem of most KMS is the need to manually structure the information placed in them.

How can these limitations of classic KMS be overcome? Every organization accumulates a huge amount of data in the course of its work. The data can be structured, in which case it is contained in databases of corporate systems, or unstructured, if it is in the form of documents. Solutions that seek to combine information from structured and unstructured sources into a single cohesive representation are called Enterprise Knowledge Graph (EKG). A well-developed EKG with good information retrieval tools can serve as a Knowledge Management System for non-typical tasks and decision support. It is important that the EKG design process prioritizes automated knowledge extraction from accumulated data instead of manual input or structuring of information.

We will compare the process of building a classic KMS with the steps of creating an Enterprise knowledge graph from a technological point of view to illustrate the differences in approaches and results.

Knowledge Management System (KMS)Enterprise knowledge graph (EKG)
Knowledge structuring

In most KMS the top level of the knowledge structure is the familiar to all PC users hierarchy of "folders" and documents, so modeling of the knowledge structure begins with building a single hierarchy for navigation through them. In practice, the knowledge structure of almost any company is so complex that it does not fit into a single hierarchy. Mixing different bases of knowledge classification in one tree leads to the loss of its logical structure and complicates navigation.

To get around this limitation, knowledge items are beginning to be tagged with tags - randomly selected words, each of which can stand for one of the main business objects or a term from the business glossary. Tagging can be automated using machine learning algorithms. The tags themselves are usually not structured in any way and form a simple list.

To build an EKG, first an ontology model of the subject area is created, which describes the types of business objects, their possible relationships and properties. For some industries (e.g. banking) such models exist ready-made, for others they need to be developed during the project.

A business glossary (e.g., built according to the SKOS model), which presents the terms used to refer to business objects, properties, and relationships, can also be part of the model.

Then catalogs of the main business objects are built, with which knowledge elements can be associated - these can be assets, customers, employees, etc. If the company has already implemented an MDM system, it can become a source of filling such catalogs.

Multiple hierarchical facets can be created in the model to categorize knowledge items.

Filling with information

Filling information in a classic KMS is most often done manually. The expert needs to find knowledge sources for each issue in the organization, process them, extract the necessary information, place it in the appropriate section of the KMS and provide search metadata, including tags.

Complex project practices are often used to organize the filling process: organizing working groups and committees, conducting interviews and surveys, and coordinating the information entered into the KMS. All of this consumes expensive work time of the most competent employees of the organization.

To populate the EKG, structured and unstructured data existing in the organization are primarily used. Only information that did not previously exist in electronic form is entered manually. To extract structured data, adapters are created that extract information from corporate systems using web services or by accessing copies of their databases. To operate the adapters, rules are created in the ontology model that describe the correspondence of model elements to data structure elements in the sources (paradigm OBDA, Ontology-based data access). Extracted data can be materialized in the EKG repository or retrieved from the source on request.

To index documents, a crawler is created that traverses all file stores. Along with the usual full-text index, it identifies the business objects mentioned in each document and links them.

Thus, the EKG creates a virtual representation of each business object, enriched with all available information about it. This representation can be considered as a kind of digital shadow of the business object.

Reliability of information

Incorrect or irrelevant data contained in a KMS can cause significant damage if used in decision-making. The vast majority of classic KMS systems do not have any tools to control the relevance and verification of data - these tasks are completely left to human experts. With rapid changes in the business environment, it is extremely difficult to control the relevance of all information in a KMS.

The "human factor" cannot be excluded as a source of errors when entering data into KMS.

Ontologies allow to record in machine-readable form the rules for validating information and the rules for drawing logical conclusions from the available facts. Such rules, constructed by the analyst, allow to automate the cleaning and enrichment of EKG content.

Any information entering the EKG retains a link to its source. When viewing any facts in the EKG, it is always possible to establish exactly when and from where they were obtained. The use of structured data, indexing of primary source documents reduces the probability of incorrect information entering the EKG. All this helps to increase the credibility of the information contained in the EKG.

Method of use

To find the desired information in a classic KMS, the user uses content tree navigation, tag-based search, and full-text search. Hyperlinks between content elements and widgets such as "Related articles" are also used, the content of which can be built using machine learning algorithms.

The knowledge item found is usually a text article that contains a pre-prepared answer to a specific question or a description of a particular situation.

The user can build complex queries to EKG including several interrelated conditions, for example: "In which factoring transactions with clients of industry N for the amount of more than X million $ was risk assessment according to M methodology applied? " It is important that EKG gives precise answers to such questions based on data, and not just selecting documents in which the relevant words occur. It is possible to use the results of some queries as a set of input parameters for others.

A found knowledge item is an object that has many properties and relationships with other objects. Objects are viewed and navigated in a wiki-like interface. The relationship graph itself can also be visualized - this gives an opportunity to quickly assess the full picture of information available for each object.

Auxiliary functions of EKG are searching for relations between objects, searching for similar objects.

Support

The process of maintaining a KMS differs little from the process of its initial content and is limited to adding new materials, correcting existing ones, and removing irrelevant ones. Significant expert labor is spent on support.

EKG automatically collects up-to-date information from source systems and documents and eliminates outdated data. Support is to extend the model as new knowledge domains are added to the EKG perimeter or as business processes change.

EKG can respond quickly to changes in requirements, processes, and source data structures because only changes to the model, not program code, need to be made to reflect them in the system.

Innovation and development

Innovative features of KMS are most often reduced to attempts to apply neural networks to improve content search practices. In general, this segment is characterized by rather slow development of both the paradigm as a whole and the functionality of specific products.

The development of Natural Language Understanding (NLU) technologies can be used to enrich EKG content with facts extracted from text, thus increasing the accessibility of information from unstructured sources. They can be used to enrich the content of the EKG with facts extracted from the text, thus increasing the accessibility of information from unstructured sources. They will also improve the user experience of interacting with the system: the ability to ask a question in plain language, rather than constructing a query in a search form, will speed up information retrieval. A chatbot or dialog assistant using information from the EKG will allow the user to directly receive answers to questions that are important to the user, rather than having to sort through an array of query results.

Several vendors are offering solutions for building knowledge management systems based on EKGs and ontologies. German company Coreon focuses on developing a multilingual knowledge management system designed to bridge the gap between data governance and the language in which business experts communicate. This is especially important in technologically complex subject areas. Coreon's customers include Liebherr.

EKG is widely used in the medical and pharmaceutical industry, where the knowledge structure is also very complex. Metaphactory talks about several EKG implementations in this area. In one of the articles, the company's employees talk in detail about how EKG is used in companies in various industries to search for data and discover complex relationships between objects.

The European company Ontotext describes the process of saturating the enterprise knowledge graph with structured and unstructured information on the example of creating a knowledge management system for a pharmaceutical company. Austria's The Semantic Web Company (PoolParty) describes the case of implementing EKG for knowledge consolidation of an engineering/construction corporation. The descriptions of almost all of these projects emphasize that, as a result of the project, the customer improved the efficiency of its business processes through better accessibility of knowledge to its staff.

To get a high-quality solution for knowledge management using EKG, it is not necessary to resort to the services of foreign vendors. Specialists of DataVera Kazakhstan have practical experience of successful creation of knowledge management systems based on EKG, realizing all the above advantages, including in a large oil and gas company and an engineering institute. We offer our clients to get the most out of the accumulated information, turn data into a directly monetizable asset through extensive use in business processes, including decision support. We are ready to share success stories of clients who have implemented such tools, and talk about specific ways to improve the operational efficiency of companies with the help of knowledge.