Adapt Azimuth code for Wilms label transfer #843

sjspielman · 2024-10-29T19:50:26Z

Towards #810

This PR does the crux of the work to swap over label transfer strategies. There are a lot of code changes, but I think they are all ok for 1 PR since they are all highly related (please let me know if you want me to cherry pick some things out!).

This PR implements the following changes:

Add script notebook_template/utils/label-transfer-functions.R. This script contains functions adapted from Azimuth (with source links) to prepare a query for label transfer and perform label transfer
Update notebooks (02a and 02b) that perform label transfer to use these new functions rather than Azimuth
- I also added a param for CI to this notebook so we can set k.weight accordingly
Update 00_run_workflow.sh (now a shell script; see Create shell script for wilms-06 workflow #817 & issue Use shell script for cell-type-wilms-tumor-06 workflow #816) so these steps are not run in CI, but use the CI param

There are also some additional smol changes:

A minor tweak to one of the .gitleaks.toml regex's for efficiency
Remove code that installs the fetus reference from scripts/prepare-fetal-references.R, since this data is now tracked with renv (Add fetusref.SeuratData to renv #831)

I also added a new directory supplemental-notebooks which contains a README and 1 notebook (here is the HTML for review though it's also committed: compare-label-transfer-approaches.nb.html.zip) to compare these results to Azimuth. Before running this notebook, I re-ran the main branch workflow up through label transfer (including object Seurat preparation) to ensure results being compared use the current data release. Then, I ran this notebook to compare results between Azimuth and the code here which is adapted from Azimuth over 5 samples. Results are consistent enough that I feel comfortable with this code!

Note that the pre-commit hook styled a lot of this code, so you may want to turn off whitespace when reviewing!

Opening this as a draft since we're running in CI for the first time!
EDIT: Several samples have now successfully undergone label transfer in CI; it's still running at the time of writing this, but seems enough to be be ready for review!

… for space

…zimuth

jaclyn-taroni

I reviewed the results, but I am most curious about what happens when we use the adaptation with all the other results (e.g., inferCNV) to generate labels! We may not know that until next week (at the earliest).

The only question I had was about whether or not we should be setting query.assay = NULL.

jaclyn-taroni · 2024-11-01T18:46:32Z

analyses/cell-type-wilms-tumor-06/notebook_template/utils/label-transfer-functions.R

+    k.filter = NA,
+    reference.neighbors = "refdr.annoy.neighbors",
+    reference.assay = "refAssay",
+    query.assay = NULL,


Do we want to be doing this instead of accepting an argument to transfer_labels?

The NULL here is based on the source code, and to be honest I was hesitant to rock the boat on it...

This is defined in Seurat with this assay argument, and we previously didn't override the NULL default.

I'm ~sure the assay should be RNA, but I'd want to run it through to see if specifying* "RNA" changes the results. Do you think I should do that?

I'm ~sure the assay should be RNA, but I'd want to run it through to see if specifying* "RNA" changes the results. Do you think I should do that?

I too am ~sure, so I say go for it; it will bring us greater understanding at the very least!

sjspielman · 2024-11-01T20:13:39Z

I re-ran this with RNA as the assay, and interestingly (though I don't fully understand why, may be part of the query prep actually?)...

the fetal full results were the same
the fetal kidney results were not, but they were "improved" - cell type "disagree" score distributions were shifted much lower (yay!), and there were way fewer cell type differences (yay!). There were no more meshenchymal swaps, either.

Here's the updated supplemental notebook:
compare-label-transfer-approaches.nb.html.zip

Given this increased agreement with Azimuth inferences, I think specifying "RNA" is indeed the move. I updated the actual label transfer notebooks accordingly.

…' into sjspielman/wilms-run-azimuth

jaclyn-taroni

👍🏻

sjspielman added 17 commits October 17, 2024 13:13

add functions for label transfer

9a28271

move functions to utils folder

70e75ba

Update 02a notebook to use new functions, and style

de20e42

use 0/1 for testing variable

2eabc3d

update param usage for label transfer notebooks

c3fc4c1

Update 02b notebook to use new functions, and style

9abbee9

label transfer does not need to be skipped in CI anymore

7693571

merge base

6cee2f5

remove install fetusref code from script

788927f

fix notebook name

1778edd

use separate query variable to prevent feature loss, and rm when done…

4aa0eec

… for space

parameter fixes

78dfcdc

use is_ci

d9a6feb

revert testing code

c9b33fc

remove outdated comments

f7f9686

Add supplemental notebook with results comparing azimuth to adapted a…

d4a20f5

…zimuth

little regex tweak

76d9ace

sjspielman marked this pull request as ready for review October 29, 2024 20:12

sjspielman requested a review from jaclyn-taroni as a code owner October 29, 2024 20:12

jaclyn-taroni reviewed Nov 1, 2024

View reviewed changes

sjspielman added 3 commits November 1, 2024 15:57

add query assay arguments to functions

0433245

Update notebook using RNA as the query assay

90694c4

Specify RNA assay in the actual label transfer notebooks

69a8a7a

why was s still there? and fix a typo

6b3d7ed

sjspielman requested a review from jaclyn-taroni November 1, 2024 20:37

sjspielman added 3 commits November 4, 2024 09:42

Merge remote-tracking branch 'upstream/feature/wilms-tumor-06-azimuth…

813090a

…' into sjspielman/wilms-run-azimuth

fix typo - output for 02b needs to be 02b

3a0ba1e

names are hard

7522c56

jaclyn-taroni approved these changes Nov 5, 2024

View reviewed changes

sjspielman merged commit 3e2f90c into AlexsLemonade:feature/wilms-tumor-06-azimuth Nov 5, 2024
3 checks passed

sjspielman deleted the sjspielman/wilms-run-azimuth branch November 5, 2024 15:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adapt Azimuth code for Wilms label transfer #843

Adapt Azimuth code for Wilms label transfer #843

sjspielman commented Oct 29, 2024 •

edited

Loading

jaclyn-taroni left a comment

jaclyn-taroni Nov 1, 2024

sjspielman Nov 1, 2024 •

edited

Loading

jaclyn-taroni Nov 1, 2024

sjspielman commented Nov 1, 2024

jaclyn-taroni left a comment

Adapt Azimuth code for Wilms label transfer #843

Adapt Azimuth code for Wilms label transfer #843

Conversation

sjspielman commented Oct 29, 2024 • edited Loading

jaclyn-taroni left a comment

Choose a reason for hiding this comment

jaclyn-taroni Nov 1, 2024

Choose a reason for hiding this comment

sjspielman Nov 1, 2024 • edited Loading

Choose a reason for hiding this comment

jaclyn-taroni Nov 1, 2024

Choose a reason for hiding this comment

sjspielman commented Nov 1, 2024

jaclyn-taroni left a comment

Choose a reason for hiding this comment

sjspielman commented Oct 29, 2024 •

edited

Loading

sjspielman Nov 1, 2024 •

edited

Loading