Skip to content

Latest commit

 

History

History
44 lines (38 loc) · 1.55 KB

File metadata and controls

44 lines (38 loc) · 1.55 KB

End2End Serverless Transformers On AWS Lambda for NLP 🚀

You need no servers

Deploy transformers with ease 💆‍♂️

Go through this deck for full info

Current available pipelines

  1. classification
  2. translation (coming soon)
  3. token classification (need contribution)
  4. text generation (need contribution)
  5. zero shot classification (need contribution)

What you get with this?

  • ability to run transformers without servers
  • complete CI/CD
  • concurrency upto 1000 (default AWS limit)

How to use this?

  • clone the repo
  • keep the pipeline folder you want to use
  • modify the source and tests
  • keep the corresponding github action in .github/workflows
  • modify directory, registry and lambda function name in workflow
  • create repository in AWS ECR
  • set up secrets in repo (needed for access to AWS; this creds should have access to ECR and Lambda)
    • AWS_ACCESS_KEY_ID
    • AWS_SECRET_ACCESS_KEY
  • push the code
  • create PR
    • this will build the container
    • run all the tests
    • push container to ECR registry
    • update lambda with the new container (this will not happen when you push the first time)
  • create lambda function if it does not exist
    • give appropriate IAM role
    • set timeout and RAM
  • create API in API gateway and link to lambda

Done! Now you can call the lambda using the API