I wonder if I should add support for recently released
Llama 3.1, which is now leading in multiple tests/categories?
Here's API vendor with 450B support I found so far
https://llama-api.com, there's probably more.
Price table from
https://console.llama-api.com/account/usage, pricing is based on the number of parameters of the model:
<7b = $ 0.0004 / 1K Tokens
between 7b and 34b = $ 0.0016 / 1K Tokens
>34b = $ 0.0032 / 1K Tokens (I assuming this will be covering 450b model)
Which makes is a lot cheaper than Chat-GPT 4o
https://openai.com/api/pricing/
gpt-4o
$5.00 / 1M input tokens (
$0.5 / 1K Tokens)
$15.00 / 1M output tokens (
$1.5 / 1K Tokens)