Expand description
Llama inference implementation.
See “LLaMA: Open and Efficient Foundation Language Models”
Implementation based on Hugging Face’s transformers
Llama inference implementation.
See “LLaMA: Open and Efficient Foundation Language Models”
Implementation based on Hugging Face’s transformers