Skip to content

Script to help automate pulling documents for research purposes

License

Notifications You must be signed in to change notification settings

sciserver/wiley-tdm-api-protoype

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Wiley TDM Download and Extract Using Gemini

This notebook demonstrates how to download PDFs using the Wiley TDM API and extract the text from the PDFs using the Gemini API.

To run the example, you need both a Wiley TDM API key and a Gemini API key.

You can pass the API keys as CLI arguments of set them as environment variables.

export WILEY_TDM_API_KEY=<your_wiley_tdm_api_key>
export GEMINI_API_KEY=<your_gemini_api_key>

To run, setup a Python environment (3.12 tested) and install the requirements:

python -m pip install -r requirements.txt

The run the following commands:

python -m src.download_articles --start_year 2023
python -m src.apply_gemini
python -m src.aggregate_gemini_out

To see more options run:

python -m src.download_articles --help
python -m src.apply_gemini --help
python -m src.aggregate_gemini_out --help

About

Script to help automate pulling documents for research purposes

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages