Data Integration

Eliminate Data Silos

Overview


What is quite common in many companies is data is everywhere and none of it is integrated. It is spread out on a spreadsheet on someone's computer, documents and a variety of apps. When this happens the company is full of data solos and they miss out on opportunities.

However, if you integrate your data you reap the rewards. Integrated data can give you a complete picture of your entire organization and how everything interacts together so you can seize upon opportunities as they present themselves.

Key Concepts


The most important attributes of data integration.

Integration Methods

Data Integration can be done via many methods. Point-to-point, Hub-and-spoke, Publish subscribe and more. The exact method depends upon project requirements.

Data Standards

Don't attempt to integrate data without standards. Standards give a level of consistency and reliability that data from multiple systems need.

Latency

Latency is the time it takes to integrate the data from when it was created. Does it need to be in real time or will an overnight batch job do? In most cases it would be the latter.

Mapping

Mapping matches the source data to a target destination. It would also consider ETL and any needed transformations.

Lineage

Lineage logs how data has changed over time from its creation to its final destination. Much of this would be stored as metadata.

Profile

Data profiling helps with integration as you better understand the structure and format of the data being integrated by examining what is assumed about data vs. what is actual.

Related Topics


The topics with the most overlap data integration are Data Engineering and Analytics.