|
33 | 33 | " * Mambaforge (a package manager for bioinformatics tools).\n", |
34 | 34 | " * `sra-tools`, `perl-dbd-sqlite`, and `perl-dbi` (specific bioinformatics packages).\n", |
35 | 35 | " * Nextflow (a workflow management system).\n", |
36 | | - " * `gsutil` (for interacting with Google Cloud Storage).\n", |
| 36 | + " * `aws s3` (for interacting with Amazon S3 storage).\n", |
37 | 37 | "\n", |
38 | | - "3. **Download and organize necessary data:** Students will download the TransPi transcriptome assembly software and its associated resources (databases, scripts, configuration files) from a Google Cloud Storage bucket. This includes understanding the directory structure and file organization.\n", |
| 38 | + "3. **Download and organize necessary data:** Students will download the TransPi transcriptome assembly software and its associated resources (databases, scripts, configuration files) from an S3 bucket. This includes understanding the directory structure and file organization.\n", |
39 | 39 | "\n", |
40 | 40 | "4. **Manage file permissions:** Students will use the `chmod` command to set executable permissions for the necessary files and directories within the TransPi software.\n", |
41 | 41 | "\n", |
|
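Step 4's `chmod` usage can be sketched in plain Python if it helps to see what the executable bit actually does (the file name and path below are hypothetical stand-ins, not actual TransPi files):

```python
import os
import stat
import tempfile

# Create a placeholder script standing in for a TransPi file (hypothetical name).
path = os.path.join(tempfile.mkdtemp(), "run.sh")
with open(path, "w") as fh:
    fh.write("#!/bin/sh\necho ok\n")

# Equivalent of `chmod +x run.sh`: add the execute bits to the current mode.
mode = os.stat(path).st_mode
os.chmod(path, mode | stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH)

# The user-execute bit is now set.
print(bool(os.stat(path).st_mode & stat.S_IXUSR))  # True
```

From the shell, `chmod +x <file>` (or `chmod -R +x <dir>` for a directory tree) is the one-liner equivalent.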
53 | 53 | "* **Shell Access:** The ability to execute shell commands from within the Jupyter Notebook environment (using `!` and `%`).\n", |
54 | 54 | "* **Java Development Kit (JDK):** Required for Nextflow.\n", |
55 | 55 | "* **Miniforge:** A package manager for installing bioinformatics tools.\n",
56 | | - "* **`gsutil`:** The Google Cloud Storage command-line tool. This is crucial for downloading data from Google Cloud Storage." |
| 56 | + "* **`aws s3`:** The S3 commands of the AWS command-line tool (AWS CLI). These are crucial for downloading data from an S3 storage bucket."
57 | 57 | ] |
58 | 58 | }, |
59 | 59 | { |
|
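The "Shell Access" prerequisite above refers to Jupyter's `!` shell escape and `%` magics. Outside a notebook, the closest plain-Python analogue of `!` is `subprocess.run`; the command below is a trivial stand-in used only for illustration:

```python
import subprocess

# `! echo hello` in a notebook roughly corresponds to:
result = subprocess.run(["echo", "hello"], capture_output=True, text=True)

print(result.stdout.strip())  # hello
print(result.returncode)      # 0 on success
```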
104 | 104 | "## Time to begin!\n", |
105 | 105 | "\n", |
106 | 106 | "**Step 1:** To start, make sure that you are in the right starting place with a `cd`.\n", |
107 | | - "> `pwd` prints our current local working directory. Make sure the output from the command is: `/home/jupyter`" |
| 107 | + "> `pwd` prints our current local working directory. Make sure the output from the command is: `/home/ec2-user/SageMaker`" |
108 | 108 | ] |
109 | 109 | }, |
110 | 110 | { |
|
114 | 114 | "metadata": {}, |
115 | 115 | "outputs": [], |
116 | 116 | "source": [ |
117 | | - "%cd /home/jupyter" |
| 117 | + "%cd /home/ec2-user/SageMaker" |
118 | 118 | ] |
119 | 119 | }, |
120 | 120 | { |
|
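The `%cd` magic changes the notebook's working directory. The same effect, plus the `pwd`-style check from Step 1, can be sketched with `os`; the target here is a throwaway temporary directory, not the real SageMaker path:

```python
import os
import tempfile

# Stand-in for /home/ec2-user/SageMaker; any writable directory works.
target = tempfile.mkdtemp()

os.chdir(target)    # equivalent of `%cd target`
print(os.getcwd())  # equivalent of `pwd`

# Verify we landed where we intended (realpath handles symlinked tmp dirs).
assert os.getcwd() == os.path.realpath(target)
```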
147 | 147 | "! java -version" |
148 | 148 | ] |
149 | 149 | }, |
150 | | - { |
151 | | - "cell_type": "markdown", |
152 | | - "id": "7b3ffb16-3395-4c01-9774-ee568e815490", |
153 | | - "metadata": {}, |
154 | | - "source": [ |
155 | | - "**Step 3:** Install Miniforge (a package manager), which is needed to support the information held within the TransPi databases." |
156 | | - ] |
157 | | - }, |
158 | | - { |
159 | | - "cell_type": "code", |
160 | | - "execution_count": null, |
161 | | - "id": "ac5b204a-f0db-4ceb-bf37-57eca6d77974", |
162 | | - "metadata": {}, |
163 | | - "outputs": [], |
164 | | - "source": [ |
165 | | - "! curl -L -O https://github.com/conda-forge/miniforge/releases/latest/download/Miniforge3-$(uname)-$(uname -m).sh\n", |
166 | | - "! bash Miniforge3-$(uname)-$(uname -m).sh -b -p $HOME/miniforge" |
167 | | - ] |
168 | | - }, |
169 | | - { |
170 | | - "cell_type": "markdown", |
171 | | - "id": "c5584e2e", |
172 | | - "metadata": {}, |
173 | | - "source": [ |
174 | | - "Next, add it to the path." |
175 | | - ] |
176 | | - }, |
177 | | - { |
178 | | - "cell_type": "code", |
179 | | - "execution_count": null, |
180 | | - "id": "ad030cd1", |
181 | | - "metadata": {}, |
182 | | - "outputs": [], |
183 | | - "source": [ |
184 | | - "import os\n", |
185 | | - "os.environ[\"PATH\"] += os.pathsep + os.environ[\"HOME\"]+\"/miniforge/bin\"" |
186 | | - ] |
187 | | - }, |
188 | 150 | { |
189 | 151 | "cell_type": "markdown", |
190 | 152 | "id": "7b930ad7", |
191 | 153 | "metadata": {}, |
192 | 154 | "source": [ |
193 | | - "Next, using Miniforge and bioconda, install the tools that will be used in this tutorial." |
| 155 | + "**Step 3:** Using Mamba and bioconda, install the tools that will be used in this tutorial." |
194 | 156 | ] |
195 | 157 | }, |
196 | 158 | { |
|
239 | 201 | "metadata": {}, |
240 | 202 | "outputs": [], |
241 | 203 | "source": [ |
242 | | - "! gsutil -m cp -r gs://nigms-sandbox/nosi-inbremaine-storage/TransPi ./" |
| 204 | + "! aws s3 cp --recursive s3://nigms-sandbox/nosi-inbremaine-storage/TransPi ./TransPi" |
243 | 205 | ] |
244 | 206 | }, |
245 | 207 | { |
|
249 | 211 | "source": [ |
250 | 212 | "<div class=\"alert alert-block alert-success\">\n", |
251 | 213 | " <i class=\"fa fa-hand-paper-o\" aria-hidden=\"true\"></i>\n", |
252 | | - " <b>Note: </b> gsutil\n", |
| 214 | + " <b>Note: </b> aws s3\n",
253 | 215 | "</div>\n", |
254 | 216 | "\n", |
255 | | - ">`gsutil` is a tool allows you to interact with Google Cloud Storage through the command line." |
| 217 | + ">`aws s3` is a set of AWS CLI commands that allows you to interact with Amazon S3 storage through the command line."
256 | 218 | ] |
257 | 219 | }, |
258 | 220 | { |
|
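`aws s3 cp --recursive` mirrors every object under a bucket prefix into a local directory. A rough local analogy of that behavior, assuming nothing about the actual bucket layout (all names below are made up):

```python
import os
import shutil
import tempfile

# Fake "bucket prefix" with one nested object (hypothetical names).
src = tempfile.mkdtemp()
os.makedirs(os.path.join(src, "scripts"))
with open(os.path.join(src, "scripts", "run.sh"), "w") as fh:
    fh.write("echo hello\n")

# `aws s3 cp --recursive s3://bucket/prefix ./TransPi` behaves like a
# recursive copy: the whole tree reappears under the destination.
dst = os.path.join(tempfile.mkdtemp(), "TransPi")
shutil.copytree(src, dst)

print(sorted(os.listdir(dst)))  # ['scripts']
```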
277 | 239 | "metadata": {}, |
278 | 240 | "outputs": [], |
279 | 241 | "source": [ |
280 | | - "! gsutil -m cp -r gs://nigms-sandbox/nosi-inbremaine-storage/resources ./" |
| 242 | + "! aws s3 cp --recursive s3://nigms-sandbox/nosi-inbremaine-storage/resources ./resources" |
281 | 243 | ] |
282 | 244 | }, |
283 | 245 | { |
|
302 | 264 | "> - They can also be stacked so `../../` will take you two layers up.\n", |
303 | 265 | ">\n", |
304 | 266 | ">- If you were to type `!ls ./nextWeek/` it would return the contents of the `nextWeek` directory which is one layer down from the current directory, so it would return `manyThings.txt`.\n", |
305 | | - ">\n", |
306 | | - ">**This means that in the second line of the code cell above, the file `TransPi.nf` will be copied from the Google Cloud Storage bucket to the current directory.**" |
| 267 | + ">" |
307 | 268 | ] |
308 | 269 | }, |
309 | 270 | { |
|
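The relative-path rules quoted above (`./` for the current directory, `../` for one layer up, stacked `../../` for two) are purely lexical, which `os.path.normpath` makes easy to check; the directory names here are invented for illustration:

```python
import os.path

cwd = "/home/user/project"  # pretend current working directory (hypothetical)

down   = os.path.normpath(os.path.join(cwd, "nextWeek"))  # one layer down
up_one = os.path.normpath(os.path.join(cwd, ".."))        # one layer up
up_two = os.path.normpath(os.path.join(cwd, "../.."))     # stacked: two up

print(down)    # /home/user/project/nextWeek
print(up_one)  # /home/user
print(up_two)  # /home
```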
377 | 338 | "outputs": [], |
378 | 339 | "source": [ |
379 | 340 | "from jupyterquiz import display_quiz\n", |
380 | | - "display_quiz(\"../quiz-material/01-cp1.json\", shuffle_questions = True)" |
| 341 | + "display_quiz(\"Transcriptome-Assembly-Refinement-and-Applications/quiz-material/01-cp1.json\", shuffle_questions = True)" |
381 | 342 | ] |
382 | 343 | }, |
383 | 344 | { |
|
401 | 362 | ] |
402 | 363 | } |
403 | 364 | ], |
404 | | - "metadata": {}, |
| 365 | + "metadata": { |
| 366 | + "kernelspec": { |
| 367 | + "display_name": "conda_python3", |
| 368 | + "language": "python", |
| 369 | + "name": "conda_python3" |
| 370 | + }, |
| 371 | + "language_info": { |
| 372 | + "codemirror_mode": { |
| 373 | + "name": "ipython", |
| 374 | + "version": 3 |
| 375 | + }, |
| 376 | + "file_extension": ".py", |
| 377 | + "mimetype": "text/x-python", |
| 378 | + "name": "python", |
| 379 | + "nbconvert_exporter": "python", |
| 380 | + "pygments_lexer": "ipython3", |
| 381 | + "version": "3.10.16" |
| 382 | + } |
| 383 | + }, |
405 | 384 | "nbformat": 4, |
406 | 385 | "nbformat_minor": 5 |
407 | 386 | } |