Skip to content

Commit e221614

Browse files
Changed the format of submodule 00
1 parent 7efe657 commit e221614

2 files changed

Lines changed: 84 additions & 4 deletions

File tree

GoogleCloud/Submodule_00_background.ipynb

Lines changed: 54 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@
1414
"id": "5e6d2086-4dbf-4a61-a5bb-8f08a269f3fa",
1515
"metadata": {},
1616
"source": [
17-
"## Welcome!\n",
17+
"## Overview\n",
1818
"\n",
1919
"This is a series of notebooks that allows you to explore the biological and computational process of the transcriptome assembly. Through these notebooks, you will also learn to leverage the powerful capabilities of tools such as Nextflow and Google Life Science API to bring your computational capabilities to the next level!\n",
2020
"\n",
@@ -25,6 +25,40 @@
2525
"Good luck, and have fun!"
2626
]
2727
},
28+
{
29+
"cell_type": "markdown",
30+
"id": "3518c1a9",
31+
"metadata": {},
32+
"source": [
33+
"## Learning Objectives:\n",
34+
"\n",
35+
"1. **Assess prior knowledge:** A pre-check quiz verifies foundational understanding of DNA, RNA, transcription, and gene expression.\n",
36+
"\n",
37+
"2. **Introduce transcriptome assembly:** Learners gain an understanding of what transcriptome assembly is, why RNA sequencing is performed, and the overall workflow involved.\n",
38+
"\n",
39+
"3. **Explain the process of transcriptome assembly:** This includes understanding preprocessing, sequence assembly using de Bruijn graphs, assembly assessment (internal and external consistency, BUSCO), and refinement techniques.\n",
40+
"\n",
41+
"4. **Introduce workflow management:** Learners are introduced to the concept of workflows/pipelines in bioinformatics and the role of workflow management systems like Nextflow.\n",
42+
"\n",
43+
"5. **Explain the use of Docker containers:** The notebook explains the purpose and benefits of using Docker containers for managing software dependencies in bioinformatics.\n",
44+
"\n",
45+
"6. **Introduce the Google Cloud Life Sciences API:** Learners are introduced to the Google Cloud Life Sciences API and its advantages for managing and executing workflows on cloud computing resources.\n",
46+
"\n",
47+
"7. **Familiarize learners with Jupyter Notebooks:** The notebook provides instructions on how to navigate and use Jupyter Notebooks, including cell types and execution order."
48+
]
49+
},
50+
{
51+
"cell_type": "markdown",
52+
"id": "6a23eec6",
53+
"metadata": {},
54+
"source": [
55+
"## Prerequisites\n",
56+
"\n",
57+
"* **Basic Biology Knowledge:** A foundational understanding of DNA, RNA, transcription, and gene expression is assumed. The notebook includes quizzes to assess this knowledge.\n",
58+
"* **Python Programming:** While the notebook itself doesn't contain complex Python code, familiarity with Python syntax and the Jupyter Notebook environment is helpful.\n",
59+
"* **Command Line Interface (CLI) Familiarity:** The notebook mentions using `pip` (a command-line package installer), indicating some CLI knowledge is beneficial, although not strictly required for completing the quizzes and reviewing the material."
60+
]
61+
},
2862
{
2963
"cell_type": "markdown",
3064
"id": "22b95a28-fad7-4b6c-99ae-093c323f769c",
@@ -383,14 +417,30 @@
383417
},
384418
{
385419
"cell_type": "markdown",
386-
"id": "489beca6-4a9e-4a2e-a646-6b276270d810",
420+
"id": "8d3cf5c9",
421+
"metadata": {},
422+
"source": [
423+
"## Conclusion\n",
424+
"\n",
425+
"This introductory Jupyter Notebook provided essential background information and a pre-requisite knowledge check on fundamental molecular biology concepts (DNA, RNA, transcription, gene expression) crucial for understanding transcriptome assembly. The notebook established the context for the subsequent modules, outlining the workflow involving RNA-seq data, transcriptome assembly techniques (including de Bruijn graphs, BUSCO analysis), and the use of Nextflow and Google Cloud Life Sciences API for efficient workflow execution and management. The inclusion of interactive quizzes and video resources enhanced learning and engagement, preparing learners for the practical applications and computational challenges presented in the following notebooks. Successful completion of the checkpoint quizzes demonstrates readiness to proceed to the next stage of the MDIBL Transcriptome Assembly Learning Module."
426+
]
427+
},
428+
{
429+
"cell_type": "markdown",
430+
"id": "421cebc3",
387431
"metadata": {},
388432
"source": [
389-
"## When you are ready, proceed to the next notebook: [`Submodule_01_prog_setup.ipynb`](./Submodule_01_prog_setup.ipynb)."
433+
"## Clean Up\n",
434+
"\n",
435+
"Remember to proceed to the next notebook [`Submodule_01_prog_setup.ipynb`](./Submodule_01_prog_setup.ipynb) or shut down your instance if you are finished."
390436
]
391437
}
392438
],
393-
"metadata": {},
439+
"metadata": {
440+
"language_info": {
441+
"name": "python"
442+
}
443+
},
394444
"nbformat": 4,
395445
"nbformat_minor": 5
396446
}

test.md

Lines changed: 30 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,30 @@
1+
This Jupyter notebook is an introductory module for a transcriptome assembly learning course. Here's a breakdown of its prerequisites, API requirements, and cloud access needs:
2+
3+
**Prerequisites:**
4+
5+
* **Basic Biology Knowledge:** A foundational understanding of DNA, RNA, transcription, and gene expression is assumed. The notebook includes quizzes to assess this knowledge.
6+
* **Python Programming:** While the notebook itself doesn't contain complex Python code, familiarity with Python syntax and the Jupyter Notebook environment is helpful.
7+
* **Command Line Interface (CLI) Familiarity:** The notebook mentions using `pip` (a command-line package installer), indicating some CLI knowledge is beneficial, although not strictly required for completing the quizzes and reviewing the material.
8+
9+
10+
**APIs:**
11+
12+
* **No APIs are directly used in this introductory notebook.** The notebook *mentions* the Google Cloud Life Sciences API as a tool that will be used in later modules, but it's not utilized within this specific file.
13+
14+
15+
**Cloud Platform Account Roles and Access:**
16+
17+
* **None required for this notebook.** The Google Cloud Life Sciences API is mentioned for later modules, implying that access to a Google Cloud Platform account with appropriate permissions will be needed later, but this introductory notebook only requires local execution and doesn't necessitate any cloud interaction.
18+
19+
20+
**Necessary Cloud Platform Access:**
21+
22+
* **None for this notebook.** This notebook focuses on background information and introductory concepts. Cloud access would be required for later parts of the course, based on its reference to Google Cloud Life Sciences API, but not for this particular Jupyter notebook.
23+
24+
**Software Installations (mentioned in notebook):**
25+
26+
* `jupyterquiz==2.0.7` (for the quizzes)
27+
* `jupytercards` (also for quizzes, though not used in this specific notebook)
28+
29+
30+
In summary, this notebook is a self-contained introductory lesson. While future modules will require cloud access and specific APIs, this particular file only requires basic biological knowledge, familiarity with Python and Jupyter, and installation of a couple of Python packages.

0 commit comments

Comments
 (0)