@@ -5,7 +5,7 @@ Costa Rica
55[ ![ GitHub] ( https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff )] ( https://github.com/ )
66[ brown9804] ( https://github.com/brown9804 )
77
8- Last updated: 2025-01-29
8+ Last updated: 2025-02-21
99
1010------------------------------------------
1111
@@ -73,34 +73,45 @@ Implementing a medallion architecture provides several benefits:
7373
7474### Step 1: Set Up Your Environment
7575
76+ > [ !NOTE]
77+ > If you need more visual help, check out the video that walks you through the steps. Below is a detailed step-by-step guide.
78+
76791 . ** Create a Fabric Workspace** : This will be your central hub for all activities.
77- - Navigate to the Fabric portal.
78- - Click on ` Create a resource ` and select ` Fabric Workspace ` .
80+ - Click on ` Create a resource ` and create your Fabric Capacity if you haven't done so already.
81+
82+ <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/8f259cac-1dcb-4129-9070-0b31899c4ab4 " >
83+
84+ - Go to the [ Fabric portal] ( https://app.fabric.microsoft.com/ ) .
85+ - Click on ` Workspaces ` , then select either your existing workspace or create a new one by clicking ` New Workspace ` :
7986 - Provide a name and other required details, then create the workspace.
8087
81- <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/8f259cac-1dcb-4129-9070-0b31899c4ab4 " >
8288
8389 <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/2f3225fc-6aa6-4eeb-8207-75038b36f18f " >
8490
91+ - Now, assign the Fabric Capacity to your workspace by clicking on ` Workspace settings ` and selecting the fabric capacity under the license.
92+
8593 <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/1831c97d-6b9a-4470-968d-e7803bc58b80 " >
8694
95+ https://github.com/user-attachments/assets/c524741c-be91-4fe4-82bc-c841fae8c6c9
96+
87972 . ** Create Lakehouses** : Set up three lakehouses for the Bronze, Silver, and Gold layers.
98+
8899 - In your Fabric workspace, create three lakehouses named ` raw_Bronze ` , ` cleansed_Silver ` , and ` curated_Gold ` .
89100
90- | Go to Data Engineering | Create the 3 lakehouses |
91- | --- | --- |
92- | <img src =" https://github.com/user-attachments/assets/f0f5baf2-9ad2-4422-9089-14aa28ae5181 " width =" 250 " > | <img src =" https://github.com/user-attachments/assets/5827c73a-1291-4fb6-a2c6-0c61f4808164 " width =" 550 " > |
101+ <img src =" https://github.com/user-attachments/assets/5827c73a-1291-4fb6-a2c6-0c61f4808164 " width =" 550 " >
93102
94103 <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/ca3c64c7-d414-4ec0-8cbc-cf39548bf758 " >
95104
96105 <img width =" 958 " alt =" image " src =" https://github.com/user-attachments/assets/828adf9d-8722-4bef-8694-8c22de330797 " >
97106
107+ https://github.com/user-attachments/assets/fdb64dd2-a6ec-4da0-a385-e55f875c8f8e
108+
98109### Step 2: Ingest Data into the Bronze Layer
991101 . ** Identify Data Sources** :
100111 - Determine the sources from which you'll ingest data.
101112 - List all the data sources such as databases, APIs, file systems, etc.
102113
103- > Let's suppose you have your data locally as csv, you can upload your data to the raw layer:
114+ > Let's suppose you have your data locally as csv, you can upload your data to the raw layer. Click [ to access sample files in case you don't have any data at the moment ] ( ./sample_files )
104115
105116 <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/0ebd4ca9-80f6-4d1c-a28c-fc7ab6988538 " >
106117
@@ -110,37 +121,41 @@ Implementing a medallion architecture provides several benefits:
110121 | --- | --- |
111122 | <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/09994e75-3029-4f61-aac8-b50f7c5fd2b1 " > | <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/2b25d187-85e2-48e7-9a97-e7549f28ed9c " > |
112123
113- 2 . ** Create Dataflows or Pipelines** : Use Data Factory to create dataflows or pipelines that ingest data into the Bronze lakehouse.
114- - In Data Factory, create a new pipeline.
115- - Add activities to the pipeline to ingest data from the identified sources.
124+ https://github.com/user-attachments/assets/56308a58-cf72-4f0f-bf3e-e9e1669fa0df
116125
117- > Image you want to extract data from your ` sql database `
118-
119- <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/1b186362-8041-4bd7-b797-ea2061d80b63 " >
120126
121- > Sample table:
122-
123- ``` sql
124- CREATE TABLE dbo .Employees (
125- EmployeeID INT PRIMARY KEY ,
126- FirstName VARCHAR (50 ),
127- LastName VARCHAR (50 ),
128- BirthDate DATE ,
129- HireDate DATE ,
130- JobTitle VARCHAR (50 ),
131- Salary DECIMAL (10 , 4 )
132- );
133-
134- INSERT INTO dbo .Employees (EmployeeID, FirstName, LastName, BirthDate, HireDate, JobTitle, Salary)
135- VALUES
136- (1 , ' John' , ' Doe' , ' 1985-11-15' , ' 2010-03-10' , ' Software Engineer' , 75000 .0000 ),
137- (2 , ' Jane' , ' Smith' , ' 1990-05-22' , ' 2012-07-18' , ' Project Manager' , 85000 .0000 ),
138- (3 , ' Emily' , ' Jones' , ' 1988-04-17' , ' 2014-06-25' , ' Data Analyst' , 65000 .0000 ),
139- (4 , ' Michael' , ' Brown' , ' 1982-06-21' , ' 2008-09-15' , ' HR Specialist' , 55000 .0000 ),
140- (5 , ' Sarah' , ' Davis' , ' 1995-09-30' , ' 2020-11-20' , ' Marketing Specialist' , 60000 .0000 );
141- ```
127+ > Image you want to extract data from your ` sql database `
142128
129+ <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/1b186362-8041-4bd7-b797-ea2061d80b63 " >
143130
131+ > [ !NOTE]
132+ > In case you don't have any data at the moment, please follow this video to create a sample:
133+
134+ ``` sql
135+ CREATE TABLE dbo .Employees (
136+ EmployeeID INT PRIMARY KEY ,
137+ FirstName VARCHAR (50 ),
138+ LastName VARCHAR (50 ),
139+ BirthDate DATE ,
140+ HireDate DATE ,
141+ JobTitle VARCHAR (50 ),
142+ Salary DECIMAL (10 , 4 )
143+ );
144+
145+ INSERT INTO dbo .Employees (EmployeeID, FirstName, LastName, BirthDate, HireDate, JobTitle, Salary)
146+ VALUES
147+ (1 , ' John' , ' Doe' , ' 1985-11-15' , ' 2010-03-10' , ' Software Engineer' , 75000 .0000 ),
148+ (2 , ' Jane' , ' Smith' , ' 1990-05-22' , ' 2012-07-18' , ' Project Manager' , 85000 .0000 ),
149+ (3 , ' Emily' , ' Jones' , ' 1988-04-17' , ' 2014-06-25' , ' Data Analyst' , 65000 .0000 ),
150+ (4 , ' Michael' , ' Brown' , ' 1982-06-21' , ' 2008-09-15' , ' HR Specialist' , 55000 .0000 ),
151+ (5 , ' Sarah' , ' Davis' , ' 1995-09-30' , ' 2020-11-20' , ' Marketing Specialist' , 60000 .0000 );
152+ ```
153+
154+ https://github.com/user-attachments/assets/357184bf-cc49-4311-84d4-6369514b3366
155+
156+ 2 . ** Create Dataflows or Pipelines** : Use Data Factory to create dataflows or pipelines that ingest data into the Bronze lakehouse.
157+ - In Data Factory, create a new pipeline.
158+ - Add activities to the pipeline to ingest data from the identified sources.
144159 - ** Configure Data Ingestion** : Set up the data ingestion process to load data into the Bronze layer in its raw format.
145160 - Configure the source and destination settings in the pipeline activities.
146161 - Ensure the data is being ingested into the ` raw_Bronze ` lakehouse.
0 commit comments