Interface ContextOptions

Options for creating an inference context

interface ContextOptions {
    modelPath: string;
    embeddings?: boolean;
    nCtx?: number;
    nSeqMax?: number;
    nThreads?: number;
    poolingType?: PoolingType;
}

Index

Properties

modelPath embeddings? nCtx? nSeqMax? nThreads? poolingType?

Properties

modelPath

modelPath: string

Path to .gguf model file

`Optional`embeddings

embeddings?: boolean

Enable embedding extraction mode

When true, context is optimized for embedding extraction. Use with encode() and getEmbeddings() methods. Default: false (text generation mode)

`Optional`nCtx

nCtx?: number

Context size (default: 2048)

`Optional`nSeqMax

nSeqMax?: number

Maximum number of sequences for multi-sequence support

Set > 1 to enable multiple independent KV cache sequences. Useful for parallel decoding or conversation branching. Default: 1 (single sequence)

`Optional`nThreads

nThreads?: number

Number of threads (default: 4)

`Optional`poolingType

poolingType?: PoolingType

Pooling type for embedding extraction

Only relevant when embeddings=true. Default: MEAN for embedding contexts, NONE otherwise

Interface ContextOptions

Index

Properties

Properties

modelPath

`Optional`embeddings

`Optional`nCtx

`Optional`nSeqMax

`Optional`nThreads

`Optional`poolingType

Settings

On This Page

Interface ContextOptions

Index

Properties

Properties

modelPath

Optionalembeddings

OptionalnCtx

OptionalnSeqMax

OptionalnThreads

OptionalpoolingType

Settings

On This Page

`Optional`embeddings

`Optional`nCtx

`Optional`nSeqMax

`Optional`nThreads

`Optional`poolingType