E

E
Exllama

exllama is a memory-efficient tool for executing Hugging Face transformers with the LLaMA models using quantized weights, enabling high-performance NLP tasks on modern GPUs while minimizing memory usage and supporting various hardware configurations.

4.4
Rating
179865
Likes
12821924
Users
#free #Llm AI

Content is being generated for this tool. Please check back soon!