You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Arxiv provides static html version of most papers using LateXML. The html contents are well structured by rich ltx_xxxx CSS classnames. It should be lightning fast parsing those paper htmls and get very precise info.
It would be cool to support arxiv html parsing, as a much faster branch or a strong hint for the pipeline.
The text was updated successfully, but these errors were encountered:
Arxiv provides static html version of most papers using LateXML. The html contents are well structured by rich ltx_xxxx CSS classnames. It should be lightning fast parsing those paper htmls and get very precise info.
It would be cool to support arxiv html parsing, as a much faster branch or a strong hint for the pipeline.
The text was updated successfully, but these errors were encountered: