Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

why multi_label=True? #24

Open
Miaoyuanyuan777 opened this issue Apr 27, 2024 · 3 comments
Open

why multi_label=True? #24

Miaoyuanyuan777 opened this issue Apr 27, 2024 · 3 comments

Comments

@Miaoyuanyuan777
Copy link

model.compile(loss=loss_fn, optimizer=optimizer,
metrics=[tf.keras.metrics.AUC(curve='ROC', multi_label=True),
tf.keras.metrics.AUC(curve='PR', multi_label=True)])

I would like to ask if multi_label=True is used in the optimization of the model, which is equivalent to calculating AUC per cell and optimizing it? And then calculated the per peak and per cell AUC of the test dataset directly with the optimized model? And the subsequent analysis is using this per cell AUC optimized model? (Why not set multi_label=False to optimize, that is, calculate the overall AUC of all samples, or optimize with the AUC of per peak?)

@hy395
Copy link
Collaborator

hy395 commented Apr 29, 2024

Hi,
The model is optimizing on cross-entropy loss. multi-label AUC is the metric we used to track performance and used as the criteria for early-stop. multi-label AUC is calculating AUC per cell, and taking an unweighted average. We give each cell equal weight in this way. Since cells have different depth, and different ratio of pos/neg, I chose to track multilabel AUC.

@Miaoyuanyuan777
Copy link
Author

Hi,
Thanks for your response!it has been very helpful to me~
And I have another question: Why, when evaluating a model's generalization performance, do we calculate AUC separately for "per cell" and "per peak" rather than using a single overall AUC? Is this a common practice? I have noticed this approach in other articles as well.
Wish you all the best!

@hy395
Copy link
Collaborator

hy395 commented Oct 14, 2024

Hi,

Sorry for missing this comment. Per cell AUC quantifies whether the model can distinguish accessibility across different genomic regions. Per peak AUC quantifies whether the model can distinguish accessibility across different cell types. They quantify different properties of the model.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants