404 error - "https://s3-us-west-2.amazonaws.com/pubmed-rct/model_crf.tar.gz" #25

Open
itsmemala opened this issue Jun 29, 2020 · 10 comments

Comments

@itsmemala

The model link results in a 404 error. Has it been moved to a different bucket, or is it no longer hosted?

@itsmemala changed the title from 202 error - "https://s3-us-west-2.amazonaws.com/pubmed-rct/model_crf.tar.gz" to 404 error - "https://s3-us-west-2.amazonaws.com/pubmed-rct/model_crf.tar.gz" on Jun 29, 2020
@titipata
Owner

Hi @itsmemala, yes, it seems the host where I stored the data removed it. I will try to put it back up early next month!

@itsmemala
Author

Thanks!

@laviniaflorentina

I'm also looking for this to be solved

@laviniaflorentina

Links that need fixing:

https://s3-us-west-2.amazonaws.com/pubmed-rct/train.json
https://s3-us-west-2.amazonaws.com/pubmed-rct/dev.json
https://s3-us-west-2.amazonaws.com/pubmed-rct/test.txt
https://s3-us-west-2.amazonaws.com/pubmed-rct/model.tar.gz

Thank you 🙏

@titipata
Owner

@laviniaflorentina thanks so much for the notice! @daniel-acuna, can I check with you here whether you stored the deleted S3 files somewhere?

@titipata
Owner

@laviniaflorentina @itsmemala I put temporary model paths here: https://github.com/titipata/detecting-scientific-claim/blob/master/main.py#L37-L38. You can now run it. I will update the links for train.json, dev.json and test.txt later. In short, they are post-processed files from the dataset folder.

@Shiyun-W

Hi, I am facing this problem as well. Is there a way to solve it?
I would be very grateful if somebody could help!

@titipata
Owner

Hi @Shiyun-W, unfortunately the model checkpoint was deleted. I might have to check if it's somewhere on my computer. In addition, the code is outdated and relies on an old version of AllenNLP.

@vibhor98

Hi @titipata, I am also unable to access the model and the annotated dataset using the provided S3 bucket links. Can you please share the annotated labels (claims and non-claims) for the PubMedRCT dataset? I assume sharing this small dataset is easier than sharing the model weights. Thank you!

@titipata
Owner

titipata commented Jul 27, 2023

@vibhor98 yes, the dataset is available here: https://github.com/titipata/detecting-scientific-claim/tree/master/dataset. I couldn't find the trained model since the bucket was deleted. I hope the provided notebook is sufficient for training the model. This codebase is kinda outdated.
