DataVera EKG Provider is a data virtualization platform intended for processing of the big data arrays gathered in the Enterprise Knowledge Graph and structured according to an ontology. See User Guide for more details.
EKG stands for "Enterprise Knowledge Graph". This term is used to describe the overall process of managing and governing an organization's data and knowledge assets. EKG platform serves as a centralized system for managing and governing data across an enterprise, ensuring data quality, completeness, consistency, and accuracy. It also provides data lineage control, data dictionary, data profiling, data validation, and other data governance features.
DataVera EKG Provider uses ontologies to represent a data model. Ontologies allows to process effectively the data of a complex structure, containing thousands of the entity types and properties. There is a distinct class of the graph databases, RDF triple stores, intended for ontologies processing. The logical rules (such as SHACL) processing engines can be plug in to execute rules on its content. Such databases are performing well with the data of complex structure, but not with the big data.
This requires data virtualization platforms implementation. Data virtualization allows to physically store data in the common relational or document-oriented database, but process it as if they are sutuated in a graph. The industrial ontology-based data processing framework has to implement the following functions:
The DataVera EKG Platform implements all these requirements. The platform architecture is presented on the next diagram:
The next diagram presents the architecture of the platform's cornerstone, DataVera EKG Provider:
DataVera EKG Platform Architecture includes the following components:
DataVera EKG Explorer exchanges data to DataVera EKG Provider via REST API. DataVera EKG Provider exchanges data to RDF triple store Apache Fuseki via SPARQL protocol.
DataVera EKG Provider and DataVera EKG Explorer can be run on Kubernetes and scaled as required.
DataVera EKG Provider offers the following integration interfaces:
REST API provides the next core functions:
One of the core DataVera EKG Provider features is the support of the data temporal aspect. Temporal data functions allows working with any past or future state of the data set. There are the following functions for temporal data manipulation:
DataVera EKG Provider as a data virtualization platform provides:
DataVera EKG Provider has the following deployment infrastructure integration features: