Lab: Create an EDA Blueprint

Estimated time: 30 minutes

Learning objectives

After completing this lab, you will be able to:

Prerequisites:

You can complete your tasks using any text editor, such as Word, Google Docs, or Notepad.

Introduction

Reliable and accurate data is the foundation of effective strategy development and decision-making in today's data-driven landscape.

Enterprise Data Architecture (EDA) provides a structured framework for collecting, storing, integrating, and managing data. In a supply chain company, a well-designed EDA ensures:

By adopting a structured EDA approach, supply chain companies can enhance operations, make informed decisions, and stay competitive in a rapidly evolving market.

Scenario overview

A supply chain company aims to streamline operations and improve decision-making by developing an Enterprise Data Architecture (EDA) blueprint. The process begins with identifying key data sources, such as inventory systems, transportation logs, supplier databases, and customer orders. The team defines data flow processes, integrating real-time tracking and analytics platforms. Using cloud-based solutions, they establish a centralized data lake for seamless storage and retrieval. The blueprint outlines data governance policies, ensuring security and compliance. Advanced analytics tools are incorporated to forecast demand, optimize routes, and reduce costs. This comprehensive EDA blueprint becomes the foundation for the company's digital transformation journey.

Exercise: Create an Enterprise Data Architecture blueprint for a supply chain company

In this lab, you will design an enterprise data architecture blueprint for a supply chain company that provides raw materials for garment manufacturing. The blueprint will focus on core components: data sources, ingestion, storage, processing, analytics, and governance. 

You will be guided step by step to design a framework that ensures efficient data flow, scalability, and compliance. By the end of this lab, you will have a comprehensive and structured data architecture plan tailored to streamline supply chain operations.

Template to use

If you are using Microsoft Word, right-click the link and download the template document. You can use the template to perform the tasks outlined in the lab.

You can also use the PDF format for any other word-processing software.

Task 1: Understand business requirements: Identify the specific data requirements for optimizing supply chain operations.

Step 1: Compile a list of all critical data sources

Example

Key data resources
Supplier database

Step 2: Define practical applications for the data

Example

Critical use cases
Inventory forecasting

Step 3: Outline the data compliance standards and regulations that must be followed

Data compliance requirements Description
GDPR
  • Applies if the company handles personal data of EU citizens

  • Ensure data minimization, lawful processing, and user consent

  • Provide mechanisms for data access, rectification, and deletion

Task 2: Identify and define the core components of data architecture

Step 1: Data sources: Identify and categorize all internal and external data sources

Example:

Data source type Examples
Internal ERP systems, warehouse management systems, production logs
External Supplier APIs, weather data, market trends

Step 2: Data ingestion: Establish the data ingestion methods.

Example

Ingestion method Description Examples Tools
Batch ingestion Processes data at scheduled intervals Daily supplier reports Apache Kafka, AWS Kinesis, Talend

Step 3: Data storage: Define the data storage types

Example

Storage type Purpose Examples
Data Lake Stores raw, unprocessed data Hadoop, AWS S3

Step 4: Data processing: Define the data processing methods

Example:

Processing method Description Examples
ETL/ELT pipelines Transform raw data for further analysis Apache Spark, Talend, AWS Glue, Microsoft Azure Data Factory

Step 5: Analytics: Identify and report business intelligence capabilities

Example:

Analytics type Description Examples/Use cases Tools
Dashboards Visualize key metrics such as inventory levels, supplier performance, and cost analysis Inventory levels, supplier performance, cost analysis Tableau, Power BI

Step 6: Data governance: Define the data governance tasks.

Example:

Governance tasks Description Examples/Tools
Data ownership & stewardship Define roles and responsibilities for data management Collibra, Alation

Task 3: Create the blueprint using Lucidchart

Step 1: Set-up the core layers

Open Lucidchart and click the New button, then select Blank Document to create a new diagram. Add six rectangular shapes, each representing a core section: Data Sources, Data Ingestion, Data Storage, Data Processing, Analytics, and Governance. Finally, connect these sections with arrows to illustrate the data flow between them.

Step 2: Populate the data-sources

Drag and drop circular shapes to represent data sources (e.g., Supplier APIs, ERP system, IoT devices, Market Trends API) and connect them to the Data Ingestion Layer with arrows.

Step 3: Design data ingestion

List the tools and mechanisms (e.g., Kafka for real-time, Talend for batch) under the rectangle labeled Data Ingestion Layer and annotate the arrows to specify batch or real-time ingestion.

Step 4: Define the storage architecture

Add cylinder shapes to represent each storage tier: Raw Data for the Data Lake, Structured Data for the Data Warehouse, and Archival Storage. Then, establish connections from all storage tiers to the Data Storage box.

Step 5: Detail data-processing

Under the Data Processing Layer rectangle, list the processing methods, such as ETL pipelines for data cleaning and transformation and stream processing for real-time analysis. Then, connect the data lake to the processing layer to illustrate the data flow.

Step 6: Map the analytics

Create rectangular shapes to represent analytics tools, such as Tableau, Power BI, and Python scripts. Next, establish connections between the data warehouse and these analytics tools to illustrate how data flows for analysis. Finally, include an example output, such as an inventory optimization dashboard, to demonstrate the insights generated from the data.

Step 7: Illustrate data governance

Add a rectangle labeled Data Governance at the top and connect it to all layers.

Step 8: Finalize and share the blueprint

Export the blueprint in PDF or PNG format to ensure clarity and accessibility. Then, integrate the exported file into the documentation to support the implementation process, providing stakeholders with a clear visual reference.

Summary

Congratulations on completing the hands-on lab: Create an EDA Blueprint. In this lab, you explored how to design an enterprise data architecture blueprint for a supply chain company.

Here is the exemplar document for your reference.

Right-click to open the file. Please remember that the exemplar is just a point of view and not a definitive solution.