Releases: aws-samples/foundation-model-benchmarking-tool
SGLang, s3 fix
What's Changed
- Update README.md with new python version by @madhurprash in #284
- Custom predictor: using an Ollama custom server for TinyLlama 1.1b as an example by @madhurprash in #285
- Customize config files doc update by @madhurprash in #286
- SGLang integration by @madhurprash in #287
Full Changelog: v2.1.2...v2.1.3
Updated deepseek prompt, bug fixes
What's Changed
- pricing fallback bug fix by @madhurprash in #280
- bug fix in the template.yml file by @madhurprash in #282
Full Changelog: v2.1.1...v2.1.2
Deepseek, uv, Qwen2.5, ConvFinQA, chores
Full Changelog: v2.1.0...v2.1.1
Deepseek, uv, Qwen2.5, chores
What's Changed
- qwen generic config file + prompt template by @madhurprash in #266
- Qwen + llama3.3 tests for accuracy & readme + doc updates by @madhurprash in #269
- Adding sonnet3.5 & llama3.3 as judges by @madhurprash in #272
- Ollama end to end support by @dheerajoruganty in #271
- Ollama fix + deepseek prompt & config file by @madhurprash in #273
- prompt template naming fix for deepseek by @madhurprash in #274
- chore: improve no latency metrics data available messaging by @tonyksong in #275
- VLLM GPU support by @dheerajoruganty in #277
- Deepseek and uv changes by @aarora79 in #279
Full Changelog: v2.0.26...v2.1.0
Bug fixes, new config file options
What's Changed
- Fix for the hf token content by @madhurprash in #265
- parameterizing tp degree, instance type and batch size by @madhurprash in #264. See https://aws-samples.github.io/foundation-model-benchmarking-tool/cli.html.
Full Changelog: v2.0.25...v2.0.26
Graviton 4, SageMaker fixes
Full Changelog: v2.0.23...v2.0.24
SageMaker BYOE metrics fix
What's Changed
- Summarization Prompt Template by @dheerajoruganty in #255
- HF dataset loading bug fixes + new config file by @madhurprash in #256
- llama3.2 1b g6e.2xl config file for smaller prompts by @madhurprash in #257
Full Changelog: v2.0.22...v2.0.23
Amazon Nova models, multi-modal
What's Changed
- NOVA models config file + bedrock converseAPI predictor by @madhurprash in #253
- Update template.yml by @antara678 in #254
Full Changelog: v2.0.21...v2.0.22
EC2 pricing through API, misc. config file changes
What's Changed
- copy s3 content update + llama3-8b triton g5.48xl config by @madhurprash in #244
- HF token bug fix + docs update for copy s3 content command by @madhurprash in #245
- EC2 instance utilization metrics by @madhurprash in #248
- Fallback pricing for EC2 by @dheerajoruganty in #249
- Update EC2 Metrics by @dheerajoruganty in #250
- Update Fallback Pricing YAML Path by @dheerajoruganty in #252
- chore(ec2): update byoe yml and docs by @tonyksong in #251
New Contributors
- @tonyksong made their first contribution in #251
Full Changelog: v2.0.20...v2.0.21
embeddings models on SageMaker
What's Changed
- Update config-ec2-llama3-1-70b-inf2-48xl-deploy-ec2-djl.yml by @aarora79 in #229
- Update config-ec2-llama3-1-70b-inf2-48xl-deploy-ec2-djl.yml by @aarora79 in #230
- fix for triton ep names by @madhurprash in #231
- Add Initial support for bge-base-en-v1-5 embedding model and Llama 3.2 11b-Vision-Instruct on FMBench by @dheerajoruganty in #227
Full Changelog: v2.0.16...v2.0.17