Commit eb6ef83

Merge pull request #7 from MicrosoftCloudEssentials-LearningHub/adding-notes-text
Notes for ws / implementation
2 parents effaaa9 + b2024bb commit eb6ef83

5 files changed

Lines changed: 3094 additions & 39 deletions

AzurePortal/1_MedallionArch/README.md

Lines changed: 54 additions & 36 deletions
@@ -5,7 +5,7 @@ Costa Rica
55
[![GitHub](https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff)](https://github.com/)
66
[brown9804](https://github.com/brown9804)
77

8-
Last updated: 2025-01-29
8+
Last updated: 2025-02-21
99

1010
------------------------------------------
1111

@@ -73,34 +73,45 @@ Implementing a medallion architecture provides several benefits:
7373

7474
### Step 1: Set Up Your Environment
7575

76+
> [!NOTE]
77+
> If you need more visual help, check out the video that walks you through the steps. Below is a detailed step-by-step guide.
78+
7679
1. **Create a Fabric Workspace**: This will be your central hub for all activities.
77-
- Navigate to the Fabric portal.
78-
- Click on `Create a resource` and select `Fabric Workspace`.
80+
- In the Azure portal, click on `Create a resource` and create your Fabric Capacity if you haven't done so already.
81+
82+
<img width="550" alt="image" src="https://github.com/user-attachments/assets/8f259cac-1dcb-4129-9070-0b31899c4ab4">
83+
84+
- Go to the [Fabric portal](https://app.fabric.microsoft.com/).
85+
- Click on `Workspaces`, then select either your existing workspace or create a new one by clicking `New Workspace`:
7986
- Provide a name and other required details, then create the workspace.
8087

81-
<img width="550" alt="image" src="https://github.com/user-attachments/assets/8f259cac-1dcb-4129-9070-0b31899c4ab4">
8288

8389
<img width="550" alt="image" src="https://github.com/user-attachments/assets/2f3225fc-6aa6-4eeb-8207-75038b36f18f">
8490

91+
- Now assign the Fabric Capacity to your workspace: click on `Workspace settings` and select the Fabric Capacity under the license.
92+
8593
<img width="550" alt="image" src="https://github.com/user-attachments/assets/1831c97d-6b9a-4470-968d-e7803bc58b80">
8694

95+
https://github.com/user-attachments/assets/c524741c-be91-4fe4-82bc-c841fae8c6c9
96+
8797
2. **Create Lakehouses**: Set up three lakehouses for the Bronze, Silver, and Gold layers.
98+
8899
- In your Fabric workspace, create three lakehouses named `raw_Bronze`, `cleansed_Silver`, and `curated_Gold`.
89100

90-
| Go to Data Engineering | Create the 3 lakehouses |
91-
| --- | --- |
92-
|<img src="https://github.com/user-attachments/assets/f0f5baf2-9ad2-4422-9089-14aa28ae5181" width="250"> | <img src="https://github.com/user-attachments/assets/5827c73a-1291-4fb6-a2c6-0c61f4808164" width="550"> |
101+
<img src="https://github.com/user-attachments/assets/5827c73a-1291-4fb6-a2c6-0c61f4808164" width="550">
93102

94103
<img width="550" alt="image" src="https://github.com/user-attachments/assets/ca3c64c7-d414-4ec0-8cbc-cf39548bf758">
95104

96105
<img width="958" alt="image" src="https://github.com/user-attachments/assets/828adf9d-8722-4bef-8694-8c22de330797">
97106

107+
https://github.com/user-attachments/assets/fdb64dd2-a6ec-4da0-a385-e55f875c8f8e
108+
98109
### Step 2: Ingest Data into the Bronze Layer
99110
1. **Identify Data Sources**:
100111
- Determine the sources from which you'll ingest data.
101112
- List all the data sources such as databases, APIs, file systems, etc.
102113

103-
> Let's suppose you have your data locally as csv, you can upload your data to the raw layer:
114+
> Suppose you have your data stored locally as CSV files. You can upload them to the raw layer. If you don't have any data at the moment, [sample files are available here](./sample_files).
104115
105116
<img width="550" alt="image" src="https://github.com/user-attachments/assets/0ebd4ca9-80f6-4d1c-a28c-fc7ab6988538">
106117

@@ -110,37 +121,45 @@ Implementing a medallion architecture provides several benefits:
110121
| --- | --- |
111122
| <img width="550" alt="image" src="https://github.com/user-attachments/assets/09994e75-3029-4f61-aac8-b50f7c5fd2b1"> | <img width="550" alt="image" src="https://github.com/user-attachments/assets/2b25d187-85e2-48e7-9a97-e7549f28ed9c"> |
112123

113-
2. **Create Dataflows or Pipelines**: Use Data Factory to create dataflows or pipelines that ingest data into the Bronze lakehouse.
114-
- In Data Factory, create a new pipeline.
115-
- Add activities to the pipeline to ingest data from the identified sources.
124+
https://github.com/user-attachments/assets/56308a58-cf72-4f0f-bf3e-e9e1669fa0df
116125

117-
> Image you want to extract data from your `sql database`
118-
119-
<img width="550" alt="image" src="https://github.com/user-attachments/assets/1b186362-8041-4bd7-b797-ea2061d80b63">
126+
> Suppose you need to extract data from your `sql database`
120127
121-
> Sample table:
122-
123-
```sql
124-
CREATE TABLE dbo.Employees (
125-
EmployeeID INT PRIMARY KEY,
126-
FirstName VARCHAR(50),
127-
LastName VARCHAR(50),
128-
BirthDate DATE,
129-
HireDate DATE,
130-
JobTitle VARCHAR(50),
131-
Salary DECIMAL(10, 4)
132-
);
133-
134-
INSERT INTO dbo.Employees (EmployeeID, FirstName, LastName, BirthDate, HireDate, JobTitle, Salary)
135-
VALUES
136-
(1, 'John', 'Doe', '1985-11-15', '2010-03-10', 'Software Engineer', 75000.0000),
137-
(2, 'Jane', 'Smith', '1990-05-22', '2012-07-18', 'Project Manager', 85000.0000),
138-
(3, 'Emily', 'Jones', '1988-04-17', '2014-06-25', 'Data Analyst', 65000.0000),
139-
(4, 'Michael', 'Brown', '1982-06-21', '2008-09-15', 'HR Specialist', 55000.0000),
140-
(5, 'Sarah', 'Davis', '1995-09-30', '2020-11-20', 'Marketing Specialist', 60000.0000);
141-
```
128+
<img width="550" alt="image" src="https://github.com/user-attachments/assets/1b186362-8041-4bd7-b797-ea2061d80b63">
129+
130+
> [!NOTE]
131+
> In case you don't have any data at the moment, please follow this video to create a sample:
132+
```sql
-- Sample table used to test ingestion from a SQL database
CREATE TABLE dbo.Employees (
    EmployeeID INT PRIMARY KEY,
    FirstName VARCHAR(50),
    LastName VARCHAR(50),
    BirthDate DATE,
    HireDate DATE,
    JobTitle VARCHAR(50),
    Salary DECIMAL(10, 4)
);

-- Five sample rows
INSERT INTO dbo.Employees (EmployeeID, FirstName, LastName, BirthDate, HireDate, JobTitle, Salary)
VALUES
(1, 'John', 'Doe', '1985-11-15', '2010-03-10', 'Software Engineer', 75000.0000),
(2, 'Jane', 'Smith', '1990-05-22', '2012-07-18', 'Project Manager', 85000.0000),
(3, 'Emily', 'Jones', '1988-04-17', '2014-06-25', 'Data Analyst', 65000.0000),
(4, 'Michael', 'Brown', '1982-06-21', '2008-09-15', 'HR Specialist', 55000.0000),
(5, 'Sarah', 'Davis', '1995-09-30', '2020-11-20', 'Marketing Specialist', 60000.0000);
```
152+
153+
https://github.com/user-attachments/assets/357184bf-cc49-4311-84d4-6369514b3366
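
If you have no SQL database available yet, the sample `Employees` table above can be reproduced locally and exported as CSV for upload to the raw layer. The following is a minimal sketch, not part of the original guide. It assumes an in-memory SQLite database as a stand-in for your SQL database, so the `dbo.` schema prefix is dropped and SQLite does not enforce the `DECIMAL(10, 4)` precision. The function names `build_sample_db` and `export_csv` are hypothetical.

```python
import csv
import io
import sqlite3

# Assumption: SQLite stands in for the SQL database, so the table is
# created without the `dbo.` schema prefix used in the sample above.
DDL = """
CREATE TABLE Employees (
    EmployeeID INT PRIMARY KEY,
    FirstName VARCHAR(50),
    LastName VARCHAR(50),
    BirthDate DATE,
    HireDate DATE,
    JobTitle VARCHAR(50),
    Salary DECIMAL(10, 4)
);
"""

# Same five rows as the sample INSERT statement.
ROWS = [
    (1, "John", "Doe", "1985-11-15", "2010-03-10", "Software Engineer", 75000.0),
    (2, "Jane", "Smith", "1990-05-22", "2012-07-18", "Project Manager", 85000.0),
    (3, "Emily", "Jones", "1988-04-17", "2014-06-25", "Data Analyst", 65000.0),
    (4, "Michael", "Brown", "1982-06-21", "2008-09-15", "HR Specialist", 55000.0),
    (5, "Sarah", "Davis", "1995-09-30", "2020-11-20", "Marketing Specialist", 60000.0),
]


def build_sample_db() -> sqlite3.Connection:
    """Create the sample table in memory and load the five rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(DDL)
    conn.executemany("INSERT INTO Employees VALUES (?, ?, ?, ?, ?, ?, ?)", ROWS)
    conn.commit()
    return conn


def export_csv(conn: sqlite3.Connection) -> str:
    """Return the table contents as CSV text with a header row."""
    cur = conn.execute("SELECT * FROM Employees ORDER BY EmployeeID")
    header = [col[0] for col in cur.description]
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(header)
    writer.writerows(cur)
    return buf.getvalue()


if __name__ == "__main__":
    print(export_csv(build_sample_db()))
```

Saving the printed text as a `.csv` file gives you something to upload to the `raw_Bronze` lakehouse while you set up a real source.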
142154

155+
> [!IMPORTANT]
156+
> Besides using data pipelines to bring in your SQL data, you can also use Microsoft Fabric's database mirroring capability. Mirroring continuously replicates your SQL database into OneLake, so a synchronized copy of the data is available for analytics in Fabric in near real time without building a separate ingestion pipeline.
157+
158+
https://github.com/user-attachments/assets/2a64762a-f120-4448-b0fb-7a49f4d1bedb
143159

160+
2. **Create Dataflows or Pipelines**: Use Data Factory to create dataflows or pipelines that ingest data into the Bronze lakehouse.
161+
- In Data Factory, create a new pipeline.
162+
- Add activities to the pipeline to ingest data from the identified sources.
144163
- **Configure Data Ingestion**: Set up the data ingestion process to load data into the Bronze layer in its raw format.
145164
- Configure the source and destination settings in the pipeline activities.
146165
- Ensure the data is being ingested into the `raw_Bronze` lakehouse.
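
The idea of landing data "in its raw format" can be illustrated outside Fabric. The sketch below is only an illustration under stated assumptions, not the pipeline configuration itself: a local folder stands in for the `raw_Bronze` lakehouse, and the function name `land_raw_file`, the `ingest_date=` folder layout, and the returned manifest fields are all hypothetical. It copies the source file byte for byte and records a checksum so later layers can verify that the Bronze copy was not altered.

```python
import hashlib
import shutil
from datetime import datetime, timezone
from pathlib import Path
from typing import Optional


def land_raw_file(source: Path, bronze_root: Path,
                  ingested_at: Optional[datetime] = None) -> dict:
    """Copy a source file unchanged into a dated Bronze folder.

    Assumption: `bronze_root` is a local directory standing in for the
    raw_Bronze lakehouse; the folder layout is illustrative only.
    """
    ingested_at = ingested_at or datetime.now(timezone.utc)

    # One folder per ingestion date keeps repeated loads separate.
    target_dir = bronze_root / f"ingest_date={ingested_at:%Y-%m-%d}"
    target_dir.mkdir(parents=True, exist_ok=True)

    # Byte-for-byte copy: no parsing, cleaning, or type conversion.
    target = target_dir / source.name
    shutil.copyfile(source, target)

    # The checksum lets downstream steps confirm the raw copy is intact.
    digest = hashlib.sha256(target.read_bytes()).hexdigest()
    return {
        "source": str(source),
        "target": str(target),
        "sha256": digest,
        "ingested_at": ingested_at.isoformat(),
    }
```

Keeping the Bronze copy untouched means any cleansing rule applied in the Silver layer can be changed and re-run later against the original data.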
@@ -190,7 +209,6 @@ Implementing a medallion architecture provides several benefits:
190209
```
191210

192211
<img width="550" alt="image" src="https://github.com/user-attachments/assets/16181f57-67b6-4141-8259-cf2ca0fc072e">
193-
194212

195213
> If you want to see more, click [here](./src/0_notebook_bronze_to_silver.ipynb) to see a sample of the notebook.
196214
