Commit efb5636

Merge pull request #42 from microsoft/readmeupdates
add information to read me for adding/modifying model deployments
2 parents f1ad895 + db8f471 commit efb5636

2 files changed

Lines changed: 38 additions & 3 deletions

README.md

Lines changed: 1 addition & 3 deletions
@@ -37,6 +37,7 @@ The diagram below illustrates the capabilities included in the template.
 
 - Provides the ability to [add additional Azure services during deployment](docs/add_additional_services.md), configured to connect via isolation to enrich your AI project.
 (API Management, CosmosDB, Azure SQL DB)
+- Provides the ability to [define the models deployed into the hub](docs/modify_deployed_models.md) when the deployment runs.
 
 ## Prerequisites and high-level steps
 

@@ -98,9 +99,6 @@ You can estimate the cost of this project's architecture with [Azure's pricing c
 
 This template has [Managed Identity](https://learn.microsoft.com/entra/identity/managed-identities-azure-resources/overview) built in to eliminate the need for developers to manage these credentials. Applications can use managed identities to obtain Microsoft Entra tokens without having to manage any credentials.
 
-<h2>
-Supporting documents
-</h2>
 
 ## Resources
 

docs/modify_deployed_models.md

Lines changed: 37 additions & 0 deletions
@@ -0,0 +1,37 @@
+## Update AI Model Deployments
+The AI Models that can be deployed and attached to the Foundry hub can be modified by changing the parameters within the [main.parameters.json](../infra/main.parameters.json) file.
+
+By modifying the parameters listed in the parameters.json 'aiModelDeployments' section, additional or different models can be deployed with the solution and ready for use after the deployment. Simply modify the values to your liking in each of the objects, or add additional objects to the array.
+```powershell
+
+
+    "aiModelDeployments": {
+        "value": [
+            {
+                "name": "textembed",
+                "model": {
+                    "name": modelName,
+                    "format": modelPublisherFormat,
+                    "version": modelVersion
+                },
+                "sku": {
+                    "name": skuName,
+                    "capacity": capacity
+                }
+            },
+            {
+                "name": "gpt",
+                "model": {
+                    "name": "gpt-4o",
+                    "version": "2024-05-13",
+                    "format": "OpenAI"
+                },
+                "sku": {
+                    "name": "GlobalStandard",
+                    "capacity": 10
+                }
+            }
+        ]
+    }
+```
+To find and validate additional model information, the [AI Foundry](https://ai.azure.com/explore/models) model page has the above parameters to refer to, as does the Microsoft Learn page for [Azure OpenAI Service Models](https://learn.microsoft.com/en-us/azure/ai-services/openai_) information.
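
Every object in `aiModelDeployments` follows the same shape, so a quick structural check before running a deployment can catch a missing key early. The sketch below is not part of this commit or of the template. `check_deployments`, `MODEL_KEYS`, `SKU_KEYS`, and `SAMPLE` are hypothetical names, and the required keys are an assumption inferred from the two example objects in the new document. It checks structure only and does not verify that a model name, version, or SKU actually exists in Azure.

```python
import json

# Keys inferred from the example objects in docs/modify_deployed_models.md
# (an assumption, not an official schema).
MODEL_KEYS = {"name", "format", "version"}
SKU_KEYS = {"name", "capacity"}

# The "gpt" entry from the document, wrapped in braces so it parses as JSON.
SAMPLE = """
{
  "aiModelDeployments": {
    "value": [
      {
        "name": "gpt",
        "model": {"name": "gpt-4o", "version": "2024-05-13", "format": "OpenAI"},
        "sku": {"name": "GlobalStandard", "capacity": 10}
      }
    ]
  }
}
"""


def check_deployments(parameters):
    """Return a list of problems found; an empty list means the shape looks right."""
    entries = parameters.get("aiModelDeployments", {}).get("value")
    if not isinstance(entries, list) or not entries:
        return ["aiModelDeployments.value is missing or empty"]

    problems = []
    for index, entry in enumerate(entries):
        label = entry.get("name", f"entry {index}")
        if "name" not in entry:
            problems.append(f"{label}: missing 'name'")
        for section, required in (("model", MODEL_KEYS), ("sku", SKU_KEYS)):
            present = entry.get(section)
            if not isinstance(present, dict):
                problems.append(f"{label}: missing '{section}' object")
                continue
            missing = sorted(required - set(present))
            if missing:
                problems.append(f"{label}: '{section}' is missing {missing}")
    return problems


if __name__ == "__main__":
    for line in check_deployments(json.loads(SAMPLE)) or ["no problems found"]:
        print(line)
```

To check a real parameters file, load it with `json.load` and pass in whichever object directly contains the `aiModelDeployments` key.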
