114.1K 1 year ago

This model extends Llama 3 8B's context length from 8K to over 1M tokens.

8b 70b

35 models