Skip to content

Releases: aws-samples/foundation-model-benchmarking-tool

torch version 2.4

29 Oct 01:16
Compare
Choose a tag to compare

What's Changed

Full Changelog: v2.0.15...v2.0.16

Ollama support

27 Oct 23:57
Compare
Choose a tag to compare

What's Changed

Full Changelog: v2.0.14...v2.0.15

FMBench orchestrator

25 Oct 18:32
Compare
Choose a tag to compare

What's Changed

  • Configuration files for llama3.1 70b on large prompt payloads + longbench dataset by @madhurprash in #216
  • adding support for llama3 summarization prompt by @madhurprash in #217
  • changing file name for llama3 summarization prompt by @madhurprash in #218
  • Config files for llama3.1 8b instruct on g6e instances by @madhurprash in #219
  • All config files for llama3.1 8b on g6e instances using DJL by @madhurprash in #220
  • make config file naming convention consistent for llama3.1 8b/70b on g6e by @madhurprash in #221
  • Config files for all llama3.2 models - tested by @madhurprash in #222

Full Changelog: v2.0.13...v2.0.14

pricing.yml updates

10 Oct 21:37
Compare
Choose a tag to compare

What's Changed

  • Update pricing.yml by @aarora79 in #210
  • Rename config-llama3-8b-g6e.4xl-tp-2-mc-max-djl-ec2.yml to config-lla… by @aarora79 in #212
  • add mixtral config file for AWQ version - g6e.48xl by @madhurprash in #214
  • pricing update + retry logic added to bedrock predictor by @madhurprash in #215

Full Changelog: v2.0.11...v2.0.13

Llama3 with Triton+DJL on Neuron

04 Oct 02:13
Compare
Choose a tag to compare

Llama3 on g6e

03 Oct 22:13
Compare
Choose a tag to compare

What's Changed

Full Changelog: v2.0.9...v2.0.10

Triton-DJL support, Tokenizer from HF

01 Oct 18:01
Compare
Choose a tag to compare

What's Changed

  • Contains a configuration file trn and doc fix for triton on AWS chips by @madhurprash in #201
  • Integration triton inference server with djl by @madhurprash in #204

Full Changelog: v2.0.8...v2.0.9

v2.0.8

26 Sep 15:25
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v2.0.7...v2.0.8

Triton inference server

26 Sep 13:32
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v2.0.6...v2.0.7

Multiple model copies on a single EC2 instance

10 Sep 00:34
Compare
Choose a tag to compare

What's Changed

Full Changelog: v2.0.5...v2.0.6