A 7.3B-parameter model that outperforms Llama 2 13B on all benchmarks, with optimizations for speed and context length.
Modalities
Context: 4K
Released: Sep 28, 2023
Knowledge Cutoff: Sep 2023
Token volume and request traffic to this model over time.