Expand description
Llama2 inference implementation.
See “LLaMA 2: Open Foundation and Fine-Tuned Chat Models”
Based on the llama2.c implementation
Llama2 inference implementation.
See “LLaMA 2: Open Foundation and Fine-Tuned Chat Models”
Based on the llama2.c implementation