llama-bench (llama.cpp)
LLM Size
Prompt processing
Token generation