@@ -5,7 +5,7 @@ Costa Rica
55[ ![ GitHub] ( https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff )] ( https://github.com/ )
66[ brown9804] ( https://github.com/brown9804 )
77
8- Last updated: 2025-01-29
8+ Last updated: 2025-02-21
99
1010------------------------------------------
1111
@@ -73,34 +73,45 @@ Implementing a medallion architecture provides several benefits:
7373
7474### Step 1: Set Up Your Environment
7575
76+ > [ !NOTE]
77+ > If you need more visual help, check out the video that walks you through the steps. Below is a detailed step-by-step guide.
78+
76791 . ** Create a Fabric Workspace** : This will be your central hub for all activities.
77- - Navigate to the Fabric portal.
78- - Click on ` Create a resource ` and select ` Fabric Workspace ` .
80+ - Click on ` Create a resource ` and create your Fabric Capacity if you haven't done so already.
81+
82+ <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/8f259cac-1dcb-4129-9070-0b31899c4ab4 " >
83+
84+ - Go to the [ Fabric portal] ( https://app.fabric.microsoft.com/ ) .
85+ - Click on ` Workspaces ` , then select either your existing workspace or create a new one by clicking ` New Workspace ` :
7986 - Provide a name and other required details, then create the workspace.
8087
81- <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/8f259cac-1dcb-4129-9070-0b31899c4ab4 " >
8288
8389 <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/2f3225fc-6aa6-4eeb-8207-75038b36f18f " >
8490
91+ - Now, assign the Fabric Capacity to your workspace by clicking on ` Workspace settings ` and selecting the fabric capacity under the license.
92+
8593 <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/1831c97d-6b9a-4470-968d-e7803bc58b80 " >
8694
95+ https://github.com/user-attachments/assets/c524741c-be91-4fe4-82bc-c841fae8c6c9
96+
87972 . ** Create Lakehouses** : Set up three lakehouses for the Bronze, Silver, and Gold layers.
98+
8899 - In your Fabric workspace, create three lakehouses named ` raw_Bronze ` , ` cleansed_Silver ` , and ` curated_Gold ` .
89100
90- | Go to Data Engineering | Create the 3 lakehouses |
91- | --- | --- |
92- | <img src =" https://github.com/user-attachments/assets/f0f5baf2-9ad2-4422-9089-14aa28ae5181 " width =" 250 " > | <img src =" https://github.com/user-attachments/assets/5827c73a-1291-4fb6-a2c6-0c61f4808164 " width =" 550 " > |
101+ <img src =" https://github.com/user-attachments/assets/5827c73a-1291-4fb6-a2c6-0c61f4808164 " width =" 550 " >
93102
94103 <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/ca3c64c7-d414-4ec0-8cbc-cf39548bf758 " >
95104
96105 <img width =" 958 " alt =" image " src =" https://github.com/user-attachments/assets/828adf9d-8722-4bef-8694-8c22de330797 " >
97106
107+ https://github.com/user-attachments/assets/fdb64dd2-a6ec-4da0-a385-e55f875c8f8e
108+
98109### Step 2: Ingest Data into the Bronze Layer
991101 . ** Identify Data Sources** :
100111 - Determine the sources from which you'll ingest data.
101112 - List all the data sources such as databases, APIs, file systems, etc.
102113
103- > Let's suppose you have your data locally as csv, you can upload your data to the raw layer:
114+ > Let's suppose you have your data locally as csv, you can upload your data to the raw layer. Click [ to access sample files in case you don't have any data at the moment ] ( ./sample_files )
104115
105116 <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/0ebd4ca9-80f6-4d1c-a28c-fc7ab6988538 " >
106117
@@ -110,37 +121,45 @@ Implementing a medallion architecture provides several benefits:
110121 | --- | --- |
111122 | <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/09994e75-3029-4f61-aac8-b50f7c5fd2b1 " > | <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/2b25d187-85e2-48e7-9a97-e7549f28ed9c " > |
112123
113- 2 . ** Create Dataflows or Pipelines** : Use Data Factory to create dataflows or pipelines that ingest data into the Bronze lakehouse.
114- - In Data Factory, create a new pipeline.
115- - Add activities to the pipeline to ingest data from the identified sources.
124+ https://github.com/user-attachments/assets/56308a58-cf72-4f0f-bf3e-e9e1669fa0df
116125
117- > Image you want to extract data from your ` sql database `
118-
119- <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/1b186362-8041-4bd7-b797-ea2061d80b63 " >
126+ > Suppose you need to extract data from your ` sql database `
120127
121- > Sample table:
122-
123- ``` sql
124- CREATE TABLE dbo .Employees (
125- EmployeeID INT PRIMARY KEY ,
126- FirstName VARCHAR (50 ),
127- LastName VARCHAR (50 ),
128- BirthDate DATE ,
129- HireDate DATE ,
130- JobTitle VARCHAR (50 ),
131- Salary DECIMAL (10 , 4 )
132- );
133-
134- INSERT INTO dbo .Employees (EmployeeID, FirstName, LastName, BirthDate, HireDate, JobTitle, Salary)
135- VALUES
136- (1 , ' John' , ' Doe' , ' 1985-11-15' , ' 2010-03-10' , ' Software Engineer' , 75000 .0000 ),
137- (2 , ' Jane' , ' Smith' , ' 1990-05-22' , ' 2012-07-18' , ' Project Manager' , 85000 .0000 ),
138- (3 , ' Emily' , ' Jones' , ' 1988-04-17' , ' 2014-06-25' , ' Data Analyst' , 65000 .0000 ),
139- (4 , ' Michael' , ' Brown' , ' 1982-06-21' , ' 2008-09-15' , ' HR Specialist' , 55000 .0000 ),
140- (5 , ' Sarah' , ' Davis' , ' 1995-09-30' , ' 2020-11-20' , ' Marketing Specialist' , 60000 .0000 );
141- ```
128+ <img width =" 550 " alt =" image " src =" https://github.com/user-attachments/assets/1b186362-8041-4bd7-b797-ea2061d80b63 " >
129+
130+ > [ !NOTE]
131+ > In case you don't have any data at the moment, please follow this video to create a sample:
132+
133+ ``` sql
134+ CREATE TABLE dbo .Employees (
135+ EmployeeID INT PRIMARY KEY ,
136+ FirstName VARCHAR (50 ),
137+ LastName VARCHAR (50 ),
138+ BirthDate DATE ,
139+ HireDate DATE ,
140+ JobTitle VARCHAR (50 ),
141+ Salary DECIMAL (10 , 4 )
142+ );
143+
144+ INSERT INTO dbo .Employees (EmployeeID, FirstName, LastName, BirthDate, HireDate, JobTitle, Salary)
145+ VALUES
146+ (1 , ' John' , ' Doe' , ' 1985-11-15' , ' 2010-03-10' , ' Software Engineer' , 75000 .0000 ),
147+ (2 , ' Jane' , ' Smith' , ' 1990-05-22' , ' 2012-07-18' , ' Project Manager' , 85000 .0000 ),
148+ (3 , ' Emily' , ' Jones' , ' 1988-04-17' , ' 2014-06-25' , ' Data Analyst' , 65000 .0000 ),
149+ (4 , ' Michael' , ' Brown' , ' 1982-06-21' , ' 2008-09-15' , ' HR Specialist' , 55000 .0000 ),
150+ (5 , ' Sarah' , ' Davis' , ' 1995-09-30' , ' 2020-11-20' , ' Marketing Specialist' , 60000 .0000 );
151+ ```
152+
153+ https://github.com/user-attachments/assets/357184bf-cc49-4311-84d4-6369514b3366
142154
155+ > [ !IMPORTANT]
156+ > Besides using Data pipelines to bring your SQL information, you can also leverage Microsoft Fabric's mirrored SQL capability. This feature allows you to create a mirrored copy of your SQL database, improving data availability, reliability, and disaster recovery. By maintaining a synchronized copy of your database in a different location, it ensures that your data is always accessible, even in the event of a failure or outage.
157+
158+ https://github.com/user-attachments/assets/2a64762a-f120-4448-b0fb-7a49f4d1bedb
143159
160+ 2 . ** Create Dataflows or Pipelines** : Use Data Factory to create dataflows or pipelines that ingest data into the Bronze lakehouse.
161+ - In Data Factory, create a new pipeline.
162+ - Add activities to the pipeline to ingest data from the identified sources.
144163 - ** Configure Data Ingestion** : Set up the data ingestion process to load data into the Bronze layer in its raw format.
145164 - Configure the source and destination settings in the pipeline activities.
146165 - Ensure the data is being ingested into the ` raw_Bronze ` lakehouse.
@@ -190,7 +209,6 @@ Implementing a medallion architecture provides several benefits:
190209 ```
191210
192211 < img width= " 550" alt= " image" src= " https://github.com/user-attachments/assets/16181f57-67b6-4141-8259-cf2ca0fc072e" >
193-
194212
195213 > If you want see more, click [here](./ src/ 0_notebook_bronze_to_silver .ipynb) to see a sample of the notebook.
196214
0 commit comments