Options for Rerank context creation
Path to reranker .gguf model
Optional
Context window size (default: 4096)
Max prompts per GPU dispatch (default: 8)
KV cache key quantization (default: 'q4_0')
KV cache value quantization (default: 'q4_0')
Options for Rerank context creation