Data Services

From digitalplaybook.org
Jump to navigation Jump to search

Go to Data Acquisition and Ingestion sectionGo to Data Streaming sectionGo to Data Transformation and Wrangling sectionGo to Data Contextualization sectionGo to Batch Processing sectionGo to Real-time processing sectionGo to Data PubSub Push sectionGo to Data Aggregation sectionGo to Synthetic Data Generation sectionGo to Ontology Management sectionGo to Digital Twin Model Repository sectionGo to Digital Twin Instance Repository sectionGo to Temporal (Time Series) Data Store sectionGo to Data Storage and Archive Services sectionGo to Simulation Model Repository sectionGo to AI Model Repository sectionGo to IntegrationGo to IntelligenceGo to ManagementGo to TrustworthinessGo to UXDT CPT Data Service.png


Enables data access, ingestion and data management across the platform from the edge to the cloud. It establishes the physical to virtual connection and receives data directly from equipment sensors or control systems, performs localized processing, and distributes to other tiers


Data Acquisition and Ingestion

Ability

The ability to configure and acquire data from different data sources including control system, historians, IoT sensors, smart devices, engineering system, enterprise systems etc.

Purpose

The purpose is to acquire data from the physical world, engineering technology systems, and information technology systems to support subsequent processing and insight generation.

Data Streaming

Ability

The ability to transfer of large volumes of data continuously and incrementally between a source and a destination without having to access all data at the same time.

Purpose

The purpose is to acquire fast continuous packets of information which is changing at high speed to be able to get near real-time insights.

Data Transformation and Wrangling

Ability

The ability to convert data types and properties through cleaning, structuring and enriching raw data to make if suitable for further processing and analytics.

Purpose

The purpose is to make data useable in Digital Twins.

Data Contextualization

Ability

The ability to add language or meta data to enrich real time or transactional data

Purpose

The purpose is to combine data from different sources such as real-time and context to make it suitable for subsequent processing by the digital twin.

Batch Processing

Ability

The ability to execute against previously collected data in bulk form.

Purpose

The purpose is to provide is an efficient way of processing high volumes of data in batches or groups.


Real-time processing

Ability

The ability to manage and act on the captured data with minimal latency

Purpose

The purpose is to support immediate insights from the data.

Data PubSub Push

Ability

The ability to package filtered data to different services based on publish / subscribe model

Purpose

The purpose is to provide information to subscribed digital twin consumers.

Data Aggregation

Ability

The ability to gather raw data and express in a summary form.

Purpose

The purpose is to gather data from multiple sources with the intent of combining these data sources into a summary for data analysis.

Synthetic Data Generation

Ability

The ability to generate synthetic data based on patterns and rules in existing sources.

Purpose

The purpose is to create representative synthetic data that can used by the digital twin to train and score predictive models.

Ontology Management

Ability

The ability to manage knowledge graphs and ontologies.

Purpose

The purpose is to enable a digital twin to interpret data directly from knowledge graphs and ontologies.

Digital Twin Model Repository

Ability

The ability to store, manage and retrieve the meta data that describe the digital twin model. The model can include formal data names, comprehensive data definitions, proper data structures, and precise data integrity rules.

Purpose

The purpose is to register and manage a portfolio of Digital Twin models in a central repository to improve configuration management and model governance.

Digital Twin Instance Repository

Ability

The ability to store, manage and retrieve digital twin instance data that conforms to the requirements of the digital twin model.

Purpose

The purpose is to store, manage and retrieve Digital Twin instance state data.

Temporal (Time Series) Data Store

Ability

The ability to store, organize and retrieve data relating to time instances through temporal data types, and stores information relating to past, present and potentially future time.

Purpose

The purpose is to store, manage and retrieve temporal (timeseries) data.

Data Storage and Archive Services

Ability

The ability to store, organize and retrieve data based on how frequently it will be accessed and how long it will be retained.

Purpose

The purpose is to reduce the cost and effort of managing Digital Twin data by using hot, cold and archival data services.

Simulation Model Repository

Ability

The ability to store, manage and retrieve the algorithmic codebase, business rules and meta data that describe a simulation model.

Purpose

The purpose is to register and manage a portfolio of simulation models in a central repository to improve configuration management and model governance.

AI Model Repository

Ability

The ability to store, manage, search and retrieve the algorithmic codebase that describe an artificial intelligence (AI) model or machine learning (ML) model

Purpose

The purpose is to register and manage a portfolio of AI and machine learning models in a central repository to improve configuration management and model governance.