BANKSY notebook text: PR 1/2 by sjspielman · Pull Request #1000 · AlexsLemonade/training-modules

sjspielman · 2026-05-20T16:05:42Z

This the first of 2 PRs to flesh out the text in the BANKSY notebook. I'm particularly looking for feedback on the level of details I presented about how BANKSY itself works. I do figure some of the finer points will appear in forthcoming slides as well, e.g. how exactly lambda is used, so I didn't want to get into all of the weeds here (just some weeds!), so I tried to achieve a middle ground - let me know what you might add or remove!

The next PR will pick up from the "with AGF" section and go through the end, but please let me know if you'd prefer I split that up for review ease.

As needed for reference:

BANKSY paper https://www.nature.com/articles/s41588-024-01664-3
BANKSY docs https://prabhakarlab.github.io/Banksy/index.html

sjspielman · 2026-05-20T17:33:47Z

Happy to see whatever was going on with #991 seems to have sorted itself out with no intervention.

jashapiro

Overall this looks good, and I think you have the level of detail just about right.

As far as text goes, I think the biggest thing I suggested is removing the interpretation of the non-spatial plots/clustering. Part of that is that I somewhat disagree with one of your statements, but the bigger thing is that I don't think we really need to say anything definitive there; we can try to make it more interactive. For the in-person training, I expect that to work well, though it is always a bit more a challenge in virtual trainings!

The other thing that we might want to explore just a bit more is how the feature selection affects results. I would suggest we should try to do the same thing for both PCA/clustering analyses so we can have a more apples-to-apples comparison. I'd probably try both not filtering the non-spatial PCA (just pass in the full gene list, I think?) and filtering the BANKSY input. For either case, you probably want to note that what we are doing is "non-standard" for one or the other, but it makes the comparison more fair.

…ap and export accordingly, but still subject to review

Co-authored-by: Joshua Shapiro <josh.shapiro@ccdatalab.org>

…e some notebook items accordingly including return to nonsptial vs AGF comparison

sjspielman · 2026-05-22T14:12:03Z

Thank you for the thorough review, including noting parameters like jaccard which I had been forgetting to include! With this added in finally, I am quite pleased with where this is landing with HVGs as well. Should be ready for another look.

Here's the HTML to see the plots, but again the coordindate plots tend to land at different sizing in the Rmd view vs rendered view (🤷‍♀️) and I optimized plot sizing for Rmd viewing. 02-spatial_clustering.nb.html

Some notes:

Right now for BANKSY feature selection, I am wholesale subsetting the SPE and resaving it to spe, which was a choice! The honest answer for why I saved to the same variable is for typo avoidance, but there's a great argument to put it in spe_subset or so and I will work on typing skills ;). To pair with this choice, I changed the export to be a TSV with columns for clusters only. Let me know if you'd prefer an RDS export (or a TSV with more columns?) and/or different variable handling.
Not specifically part of this PR, but wanted to bring it up: As part of revising this to include jaccard & svgs, we now have a few more clusters in the output and the heatmap is less straightforward to interpret just because there are more rows/cols and more blobs of color. How would you feel about just plotting the col1a1 expression and removing the heatmap, or alternatively any other (non-tricky) heatmap plot ideas?
I plan to final.finalize the section headers in the next PR.

jashapiro · 2026-05-22T16:07:29Z

Quick response before looking at code:

Right now for BANKSY feature selection, I am wholesale subsetting the SPE and resaving it to spe, which was a choice! The honest answer for why I saved to the same variable is for typo avoidance, but there's a great argument to put it in spe_subset or so and I will work on typing skills ;).

You already know what I am going to say here. How about hvg_spe for this?

To pair with this choice, I changed the export to be a TSV with columns for clusters only. Let me know if you'd prefer an RDS export (or a TSV with more columns?) and/or different variable handling.

I think a TSV output is fine here. Haven't looked specifically at the code, but I might just save out the whole colData to make reading it in and joining with an existing spe as simple as possible?

sjspielman · 2026-05-22T16:12:15Z

You already know what I am going to say here. How about hvg_spe for this?

😂 facts

I think a TSV output is fine here. Haven't looked specifically at the code, but I might just save out the whole colData to make reading it in and joining with an existing spe as simple as possible?

this plus renamed SPE incoming!

sjspielman added 8 commits May 20, 2026 09:53

text in the intro sections

8406477

typos

7138952

nonspatial clustering text

6f3733c

text through non-agf

1cc8bab

couple text cleanups

52cf551

spelling

40c5a71

bit of spacing

dd29737

Merge branch 'master' into sjspielman/banksy-text

1b05d08

sjspielman requested a review from jashapiro May 20, 2026 17:34

jashapiro reviewed May 21, 2026

View reviewed changes

Merge branch 'master' into sjspielman/banksy-text

bafdbe2

jashapiro reviewed May 21, 2026

View reviewed changes

Comment thread spatial/02-spatial_clustering.Rmd Outdated

sjspielman mentioned this pull request May 21, 2026

Feature selection in the BANKSY notebook #1004

Closed

sjspielman and others added 9 commits May 22, 2026 08:28

Merge branch 'master' into sjspielman/banksy-text

24b9ef3

revise to always use HVGs based on discussion. Also updated the heatm…

1aa88ac

…ap and export accordingly, but still subject to review

Apply suggestions from code review

f152124

Co-authored-by: Joshua Shapiro <josh.shapiro@ccdatalab.org>

add in more hvg text and use jaccard which i had forgotten, and updat…

0104bc8

…e some notebook items accordingly including return to nonsptial vs AGF comparison

some opening text and plot styling as requested in review

dc179fe

use fig.height to remove the extra whitespace in patchworked plots

ca4d7b8

open questions instead of interpretation

7571e3d

few more plot sizings

e0a12ef

speeling

916bf9e

sjspielman requested a review from jashapiro May 22, 2026 14:12

sjspielman added 2 commits May 22, 2026 12:29

hvg_spe, clean up comments, and export all of colData

c2ba8fb

subset spe, silly

b56f70a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

BANKSY notebook text: PR 1/2#1000

BANKSY notebook text: PR 1/2#1000
sjspielman wants to merge 20 commits into
masterfrom
sjspielman/banksy-text

sjspielman commented May 20, 2026

Uh oh!

sjspielman commented May 20, 2026

Uh oh!

jashapiro left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sjspielman commented May 22, 2026

Uh oh!

jashapiro commented May 22, 2026

Uh oh!

sjspielman commented May 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

sjspielman commented May 20, 2026

Uh oh!

sjspielman commented May 20, 2026

Uh oh!

jashapiro left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sjspielman commented May 22, 2026

Uh oh!

jashapiro commented May 22, 2026

Uh oh!

sjspielman commented May 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants