Module llama

Llama inference implementation.

See “LLaMA: Open and Efficient Foundation Language Models”.

The implementation is based on Hugging Face’s transformers library.

Structs

Cache
Config
Llama
Llama3RopeConfig
LlamaConfig

Enums

Llama3RopeType
LlamaEosToks

Constants

DEFAULT_MAX_SEQ_LEN