Skip to content

Commit 285b20c

Browse files
authored
Merge 464483f into effaaa9
2 parents effaaa9 + 464483f commit 285b20c

5 files changed

Lines changed: 3090 additions & 38 deletions

File tree

AzurePortal/1_MedallionArch/README.md

Lines changed: 50 additions & 35 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ Costa Rica
55
[![GitHub](https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff)](https://github.com/)
66
[brown9804](https://github.com/brown9804)
77

8-
Last updated: 2025-01-29
8+
Last updated: 2025-02-21
99

1010
------------------------------------------
1111

@@ -73,34 +73,45 @@ Implementing a medallion architecture provides several benefits:
7373

7474
### Step 1: Set Up Your Environment
7575

76+
> [!NOTE]
77+
> If you need more visual help, check out the video that walks you through the steps. Below is a detailed step-by-step guide.
78+
7679
1. **Create a Fabric Workspace**: This will be your central hub for all activities.
77-
- Navigate to the Fabric portal.
78-
- Click on `Create a resource` and select `Fabric Workspace`.
80+
- Click on `Create a resource` and create your Fabric Capacity if you haven't done so already.
81+
82+
<img width="550" alt="image" src="https://github.com/user-attachments/assets/8f259cac-1dcb-4129-9070-0b31899c4ab4">
83+
84+
- Go to the [Fabric portal](https://app.fabric.microsoft.com/).
85+
- Click on `Workspaces`, then select either your existing workspace or create a new one by clicking `New Workspace`:
7986
- Provide a name and other required details, then create the workspace.
8087

81-
<img width="550" alt="image" src="https://github.com/user-attachments/assets/8f259cac-1dcb-4129-9070-0b31899c4ab4">
8288

8389
<img width="550" alt="image" src="https://github.com/user-attachments/assets/2f3225fc-6aa6-4eeb-8207-75038b36f18f">
8490

91+
- Now, assign the Fabric Capacity to your workspace by clicking on `Workspace settings` and selecting the fabric capacity under the license.
92+
8593
<img width="550" alt="image" src="https://github.com/user-attachments/assets/1831c97d-6b9a-4470-968d-e7803bc58b80">
8694

95+
https://github.com/user-attachments/assets/c524741c-be91-4fe4-82bc-c841fae8c6c9
96+
8797
2. **Create Lakehouses**: Set up three lakehouses for the Bronze, Silver, and Gold layers.
98+
8899
- In your Fabric workspace, create three lakehouses named `raw_Bronze`, `cleansed_Silver`, and `curated_Gold`.
89100

90-
| Go to Data Engineering | Create the 3 lakehouses |
91-
| --- | --- |
92-
|<img src="https://github.com/user-attachments/assets/f0f5baf2-9ad2-4422-9089-14aa28ae5181" width="250"> | <img src="https://github.com/user-attachments/assets/5827c73a-1291-4fb6-a2c6-0c61f4808164" width="550"> |
101+
<img src="https://github.com/user-attachments/assets/5827c73a-1291-4fb6-a2c6-0c61f4808164" width="550">
93102

94103
<img width="550" alt="image" src="https://github.com/user-attachments/assets/ca3c64c7-d414-4ec0-8cbc-cf39548bf758">
95104

96105
<img width="958" alt="image" src="https://github.com/user-attachments/assets/828adf9d-8722-4bef-8694-8c22de330797">
97106

107+
https://github.com/user-attachments/assets/fdb64dd2-a6ec-4da0-a385-e55f875c8f8e
108+
98109
### Step 2: Ingest Data into the Bronze Layer
99110
1. **Identify Data Sources**:
100111
- Determine the sources from which you'll ingest data.
101112
- List all the data sources such as databases, APIs, file systems, etc.
102113

103-
> Let's suppose you have your data locally as csv, you can upload your data to the raw layer:
114+
> Let's suppose you have your data locally as csv, you can upload your data to the raw layer. Click [to access sample files in case you don't have any data at the moment](./sample_files)
104115
105116
<img width="550" alt="image" src="https://github.com/user-attachments/assets/0ebd4ca9-80f6-4d1c-a28c-fc7ab6988538">
106117

@@ -110,37 +121,41 @@ Implementing a medallion architecture provides several benefits:
110121
| --- | --- |
111122
| <img width="550" alt="image" src="https://github.com/user-attachments/assets/09994e75-3029-4f61-aac8-b50f7c5fd2b1"> | <img width="550" alt="image" src="https://github.com/user-attachments/assets/2b25d187-85e2-48e7-9a97-e7549f28ed9c"> |
112123

113-
2. **Create Dataflows or Pipelines**: Use Data Factory to create dataflows or pipelines that ingest data into the Bronze lakehouse.
114-
- In Data Factory, create a new pipeline.
115-
- Add activities to the pipeline to ingest data from the identified sources.
124+
https://github.com/user-attachments/assets/56308a58-cf72-4f0f-bf3e-e9e1669fa0df
116125

117-
> Image you want to extract data from your `sql database`
118-
119-
<img width="550" alt="image" src="https://github.com/user-attachments/assets/1b186362-8041-4bd7-b797-ea2061d80b63">
120126

121-
> Sample table:
122-
123-
```sql
124-
CREATE TABLE dbo.Employees (
125-
EmployeeID INT PRIMARY KEY,
126-
FirstName VARCHAR(50),
127-
LastName VARCHAR(50),
128-
BirthDate DATE,
129-
HireDate DATE,
130-
JobTitle VARCHAR(50),
131-
Salary DECIMAL(10, 4)
132-
);
133-
134-
INSERT INTO dbo.Employees (EmployeeID, FirstName, LastName, BirthDate, HireDate, JobTitle, Salary)
135-
VALUES
136-
(1, 'John', 'Doe', '1985-11-15', '2010-03-10', 'Software Engineer', 75000.0000),
137-
(2, 'Jane', 'Smith', '1990-05-22', '2012-07-18', 'Project Manager', 85000.0000),
138-
(3, 'Emily', 'Jones', '1988-04-17', '2014-06-25', 'Data Analyst', 65000.0000),
139-
(4, 'Michael', 'Brown', '1982-06-21', '2008-09-15', 'HR Specialist', 55000.0000),
140-
(5, 'Sarah', 'Davis', '1995-09-30', '2020-11-20', 'Marketing Specialist', 60000.0000);
141-
```
127+
> Image you want to extract data from your `sql database`
142128
129+
<img width="550" alt="image" src="https://github.com/user-attachments/assets/1b186362-8041-4bd7-b797-ea2061d80b63">
143130

131+
> [!NOTE]
132+
> In case you don't have any data at the moment, please follow this video to create a sample:
133+
134+
```sql
135+
CREATE TABLE dbo.Employees (
136+
EmployeeID INT PRIMARY KEY,
137+
FirstName VARCHAR(50),
138+
LastName VARCHAR(50),
139+
BirthDate DATE,
140+
HireDate DATE,
141+
JobTitle VARCHAR(50),
142+
Salary DECIMAL(10, 4)
143+
);
144+
145+
INSERT INTO dbo.Employees (EmployeeID, FirstName, LastName, BirthDate, HireDate, JobTitle, Salary)
146+
VALUES
147+
(1, 'John', 'Doe', '1985-11-15', '2010-03-10', 'Software Engineer', 75000.0000),
148+
(2, 'Jane', 'Smith', '1990-05-22', '2012-07-18', 'Project Manager', 85000.0000),
149+
(3, 'Emily', 'Jones', '1988-04-17', '2014-06-25', 'Data Analyst', 65000.0000),
150+
(4, 'Michael', 'Brown', '1982-06-21', '2008-09-15', 'HR Specialist', 55000.0000),
151+
(5, 'Sarah', 'Davis', '1995-09-30', '2020-11-20', 'Marketing Specialist', 60000.0000);
152+
```
153+
154+
https://github.com/user-attachments/assets/357184bf-cc49-4311-84d4-6369514b3366
155+
156+
2. **Create Dataflows or Pipelines**: Use Data Factory to create dataflows or pipelines that ingest data into the Bronze lakehouse.
157+
- In Data Factory, create a new pipeline.
158+
- Add activities to the pipeline to ingest data from the identified sources.
144159
- **Configure Data Ingestion**: Set up the data ingestion process to load data into the Bronze layer in its raw format.
145160
- Configure the source and destination settings in the pipeline activities.
146161
- Ensure the data is being ingested into the `raw_Bronze` lakehouse.

0 commit comments

Comments
 (0)