Skip to content
This repository has been archived by the owner on Jan 10, 2025. It is now read-only.

Unable to download tokenizer. S3 link seems to be inaccessible #41

Open
reevr opened this issue Sep 2, 2021 · 15 comments
Open

Unable to download tokenizer. S3 link seems to be inaccessible #41

reevr opened this issue Sep 2, 2021 · 15 comments

Comments

@reevr
Copy link

reevr commented Sep 2, 2021

I tried installing the npm package. One of its step includes download of the tokenizer from https://tokenizers-releases.s3.amazonaws.com/node/0.7.0/index-v0.7.0-node-v83-linux-x64-glibc.tar.gz
This seems to be inaccessible.

I am using node v12.
Can you please help me with this?

@cabernal
Copy link

Bumping this issue, getting the same issue:

 ERR! command failed
npm ERR! command C:\WINDOWS\system32\cmd.exe /d /s /c node-pre-gyp install
npm ERR! 403 status code downloading tarball https://tokenizers-releases.s3.amazonaws.com/node/0.7.0/index-v0.7.0-node-v102-win32-x64-unknown.tar.gz
npm ERR! node-pre-gyp info it worked if it ends with ok
npm ERR! node-pre-gyp info using [email protected]
npm ERR! node-pre-gyp info using [email protected] | win32 | x64
npm ERR! node-pre-gyp WARN Using needle for node-pre-gyp https download
npm ERR! node-pre-gyp info check checked for "C:\Users\admin\git\huggingface-qa\node_modules\tokenizers\bin-package\index.node" (not found)
npm ERR! node-pre-gyp http GET https://tokenizers-releases.s3.amazonaws.com/node/0.7.0/index-v0.7.0-node-v102-win32-x64-unknown.tar.gz
npm ERR! node-pre-gyp http 403 https://tokenizers-releases.s3.amazonaws.com/node/0.7.0/index-v0.7.0-node-v102-win32-x64-unknown.tar.gz
npm ERR! node-pre-gyp ERR! install error

@DhairyaBahl
Copy link

Any updates with this issue ? I am facing the same issue. Kindly update the package with the new link.

@julien-c
Copy link
Member

@SaulLu @Narsil Do you know if Node is still supported as a compilation target from huggingface/tokenizers? Also we might want to transfer this issue to that repo.

@SaulLu
Copy link

SaulLu commented Feb 11, 2022

Unfortunately I don't know either 😞

@Narsil
Copy link

Narsil commented Feb 15, 2022

tokenizers does support and provide build targets for node, 12 and 14 (not 16 atm iirc linked to us needing to do a big upgrade on the bindings side).

Everything is available through npm install tokenizers normally. Not sure what that old s3 link is.

@creatorrr
Copy link

creatorrr commented Jul 18, 2022

Downgrading to node 12 does seem to work but, as @Narsil mentioned, newer versions don't work. I also get the following deprecation warning:
npm WARN deprecated [email protected]: This version has been yanked and won't be supported

Can someone provide an update on this package's future?

Correction: Node 14 also didn't work. Only 12 seems to be working

@Narsil
Copy link

Narsil commented Jul 18, 2022

@creatorrr ,

PRs are welcome. but updating to new node versions require updating to neon==latest which requires a lot of changes(didn't try yet, but it felt like a rewrite basically).

I am not going to lie, no one is dedicated specifically to tokenizers in general (as it works mostly as intended for transformers which is our biggest user). So spending time rewriting for node is not super high on the prio list right now.

PRs are very welcome if you feel like it !

@gouldcs
Copy link

gouldcs commented Mar 21, 2023

hey is this repo still active? Would love to help get this working again. Trying to get an application working with this and jumping through hoops with Node/Python just to get stuff installed.

@Narsil
Copy link

Narsil commented Mar 21, 2023

Last commit is from 3 years ago...

@gouldcs
Copy link

gouldcs commented Mar 21, 2023

@Narsil i noticed, but I see activity in issues as recent as last year. Is this package considered completely abandoned at this point?

@Narsil
Copy link

Narsil commented Mar 21, 2023

Definitely not actively maintained. tokenizers and its node bindings are, but they are in maintenance mode only (until someone wants to help redo the bindings with latest neon)

@julien-c
Copy link
Member

we can archive this repo btw maybe

@Narsil
Copy link

Narsil commented Mar 22, 2023

@Pierrci Are you ok with archiving this ?

@gouldcs
Copy link

gouldcs commented Mar 23, 2023

if this repo is being archived, is there a newer, more reliable open source tool I can use to achieve the same behavior?

@gouldcs
Copy link

gouldcs commented Mar 23, 2023

For those looking for an alternative, I found @tensorflow-models/qna. Does the same thing!

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants