Class ChatLlamaCpp

To use this model you need to have the node-llama-cpp module installed. This can be installed using npm install -S node-llama-cpp and the minimum version supported in version 2.0.0. This also requires that have a locally built version of Llama2 installed.

Hierarchy

SimpleChatModel<LlamaCppCallOptions>
- ChatLlamaCpp

Constructors

constructor

new ChatLlamaCpp(inputs): ChatLlamaCpp
Parameters
- inputs: LlamaCppInputs
Returns ChatLlamaCpp
Overrides SimpleChatModel.constructor
- Defined in langchain/src/chat_models/llama_cpp.ts:66

Properties

CallOptions

CallOptions: LlamaCppCallOptions

ParsedCallOptions

ParsedCallOptions: Omit<LlamaCppCallOptions, never>

caller

caller: AsyncCaller

The async caller should be used by subclasses to make any async calls, which will thus benefit from the concurrency and retry logic.

verbose

verbose: boolean

Whether to print out response text.

`Optional` cache

cache?: BaseCache<Generation[]>

`Optional` callbacks

callbacks?: Callbacks

`Optional` maxTokens

maxTokens?: number

`Optional` metadata

metadata?: Record<string, unknown>

`Optional` tags

tags?: string[]

`Optional` temperature

temperature?: number

`Optional` topK

topK?: number

`Optional` topP

topP?: number

`Optional` trimWhitespaceSuffix

trimWhitespaceSuffix?: boolean

`Static` inputs

inputs: LlamaCppInputs

`Protected` lc_runnable

lc_runnable: boolean = true

Accessors

callKeys

get callKeys(): string[]
Keys that the language model accepts as call options.

Returns string[]
Inherited from SimpleChatModel.callKeys
- Defined in langchain/src/base_language/index.ts:145

Methods

batch

batch(inputs, options?, batchOptions?): Promise<BaseMessageChunk[]>
Default implementation of batch, which calls invoke N times. Subclasses should override this method if they can batch more efficiently.
Parameters
- inputs: BaseLanguageModelInput[]
  
  Array of inputs to each batch call.
- Optional options: Partial<LlamaCppCallOptions> | Partial<LlamaCppCallOptions>[]
  
  Either a single call options object to apply to each batch call or an array for each call.
- Optional batchOptions: RunnableBatchOptions & {
  returnExceptions?: false;
  }
Returns Promise<BaseMessageChunk[]>
An array of RunOutputs, or mixed RunOutputs and errors if batchOptions.returnExceptions is set
Inherited from SimpleChatModel.batch
- Defined in langchain/src/schema/runnable/base.ts:157
batch(inputs, options?, batchOptions?): Promise<(BaseMessageChunk | Error)[]>
Parameters
- inputs: BaseLanguageModelInput[]
- Optional options: Partial<LlamaCppCallOptions> | Partial<LlamaCppCallOptions>[]
- Optional batchOptions: RunnableBatchOptions & {
  returnExceptions: true;
  }
Returns Promise<(BaseMessageChunk | Error)[]>
Inherited from SimpleChatModel.batch
- Defined in langchain/src/schema/runnable/base.ts:163
batch(inputs, options?, batchOptions?): Promise<(BaseMessageChunk | Error)[]>
Parameters
- inputs: BaseLanguageModelInput[]
- Optional options: Partial<LlamaCppCallOptions> | Partial<LlamaCppCallOptions>[]
- Optional batchOptions: RunnableBatchOptions
Returns Promise<(BaseMessageChunk | Error)[]>
Inherited from SimpleChatModel.batch
- Defined in langchain/src/schema/runnable/base.ts:169

bind

bind(kwargs): Runnable<BaseLanguageModelInput, BaseMessageChunk, LlamaCppCallOptions>
Bind arguments to a Runnable, returning a new Runnable.
Parameters
- kwargs: Partial<LlamaCppCallOptions>
Returns Runnable<BaseLanguageModelInput, BaseMessageChunk, LlamaCppCallOptions>
A new RunnableBinding that, when invoked, will apply the bound args.
Inherited from SimpleChatModel.bind
- Defined in langchain/src/schema/runnable/base.ts:66

call

call(messages, options?, callbacks?): Promise<BaseMessage>
Makes a single call to the chat model.
Parameters
- messages: BaseMessageLike[]
  
  An array of BaseMessage instances.
- Optional options: string[] | LlamaCppCallOptions
  
  The call options or an array of stop sequences.
- Optional callbacks: Callbacks
  
  The callbacks for the language model.
Returns Promise<BaseMessage>
A Promise that resolves to a BaseMessage.
Inherited from SimpleChatModel.call
- Defined in langchain/src/chat_models/base.ts:431

callPrompt

callPrompt(promptValue, options?, callbacks?): Promise<BaseMessage>
Makes a single call to the chat model with a prompt value.
Parameters
- promptValue: BasePromptValue
  
  The value of the prompt.
- Optional options: string[] | LlamaCppCallOptions
  
  The call options or an array of stop sequences.
- Optional callbacks: Callbacks
  
  The callbacks for the language model.
Returns Promise<BaseMessage>
A Promise that resolves to a BaseMessage.
Inherited from SimpleChatModel.callPrompt
- Defined in langchain/src/chat_models/base.ts:452

generate

generate(messages, options?, callbacks?): Promise<LLMResult>
Generates chat based on the input messages.
Parameters
- messages: BaseMessageLike[][]
  
  An array of arrays of BaseMessage instances.
- Optional options: string[] | LlamaCppCallOptions
  
  The call options or an array of stop sequences.
- Optional callbacks: Callbacks
  
  The callbacks for the language model.
Returns Promise<LLMResult>
A Promise that resolves to an LLMResult.
Inherited from SimpleChatModel.generate
- Defined in langchain/src/chat_models/base.ts:305

generatePrompt

generatePrompt(promptValues, options?, callbacks?): Promise<LLMResult>
Generates a prompt based on the input prompt values.
Parameters
- promptValues: BasePromptValue[]
  
  An array of BasePromptValue instances.
- Optional options: string[] | LlamaCppCallOptions
  
  The call options or an array of stop sequences.
- Optional callbacks: Callbacks
  
  The callbacks for the language model.
Returns Promise<LLMResult>
A Promise that resolves to an LLMResult.
Inherited from SimpleChatModel.generatePrompt
- Defined in langchain/src/chat_models/base.ts:407

getNumTokens

getNumTokens(content): Promise<number>
Parameters
- content: MessageContent
Returns Promise<number>
Inherited from SimpleChatModel.getNumTokens
- Defined in langchain/src/base_language/index.ts:200

invocationParams

invocationParams(): {
    maxTokens: undefined | number;
    temperature: undefined | number;
    topK: undefined | number;
    topP: undefined | number;
    trimWhitespaceSuffix: undefined | boolean;
}
Get the parameters used to invoke the model

Returns {
    maxTokens: undefined | number;
    temperature: undefined | number;
    topK: undefined | number;
    topP: undefined | number;
    trimWhitespaceSuffix: undefined | boolean;
}
- maxTokens: undefined | number
- temperature: undefined | number
- topK: undefined | number
- topP: undefined | number
- trimWhitespaceSuffix: undefined | boolean
Overrides SimpleChatModel.invocationParams
- Defined in langchain/src/chat_models/llama_cpp.ts:88

invoke

invoke(input, options?): Promise<BaseMessageChunk>
Invokes the chat model with a single input.
Parameters
- input: BaseLanguageModelInput
  
  The input for the language model.
- Optional options: LlamaCppCallOptions
  
  The call options.
Returns Promise<BaseMessageChunk>
A Promise that resolves to a BaseMessageChunk.
Inherited from SimpleChatModel.invoke
- Defined in langchain/src/chat_models/base.ts:121

map

map(): Runnable<BaseLanguageModelInput[], BaseMessageChunk[], LlamaCppCallOptions>
Return a new Runnable that maps a list of inputs to a list of outputs, by calling invoke() with each input.

Returns Runnable<BaseLanguageModelInput[], BaseMessageChunk[], LlamaCppCallOptions>
Inherited from SimpleChatModel.map
- Defined in langchain/src/schema/runnable/base.ts:77

pipe

pipe<NewRunOutput>(coerceable): RunnableSequence<BaseLanguageModelInput, Exclude<NewRunOutput, Error>>
Create a new runnable sequence that runs each individual runnable in series, piping the output of one runnable into another runnable or runnable-like.
Type Parameters
- NewRunOutput
Parameters
- coerceable: RunnableLike<BaseMessageChunk, NewRunOutput>
  
  A runnable, function, or object whose values are functions or runnables.
Returns RunnableSequence<BaseLanguageModelInput, Exclude<NewRunOutput, Error>>
A new runnable sequence.
Inherited from SimpleChatModel.pipe
- Defined in langchain/src/schema/runnable/base.ts:456

predict

predict(text, options?, callbacks?): Promise<string>
Predicts the next message based on a text input.
Parameters
- text: string
  
  The text input.
- Optional options: string[] | LlamaCppCallOptions
  
  The call options or an array of stop sequences.
- Optional callbacks: Callbacks
  
  The callbacks for the language model.
Returns Promise<string>
A Promise that resolves to a string.
Inherited from SimpleChatModel.predict
- Defined in langchain/src/chat_models/base.ts:483

predictMessages

predictMessages(messages, options?, callbacks?): Promise<BaseMessage>
Predicts the next message based on the input messages.
Parameters
- messages: BaseMessage[]
  
  An array of BaseMessage instances.
- Optional options: string[] | LlamaCppCallOptions
  
  The call options or an array of stop sequences.
- Optional callbacks: Callbacks
  
  The callbacks for the language model.
Returns Promise<BaseMessage>
A Promise that resolves to a BaseMessage.
Inherited from SimpleChatModel.predictMessages
- Defined in langchain/src/chat_models/base.ts:468

serialize

serialize(): SerializedLLM
Returns SerializedLLM

Deprecated
Return a json-like object representing this LLM.
Inherited from SimpleChatModel.serialize
- Defined in langchain/src/chat_models/base.ts:392

stream

stream(input, options?): Promise<IterableReadableStream<BaseMessageChunk>>
Stream output in chunks.
Parameters
- input: BaseLanguageModelInput
- Optional options: Partial<LlamaCppCallOptions>
Returns Promise<IterableReadableStream<BaseMessageChunk>>
A readable stream that is also an iterable.
Inherited from SimpleChatModel.stream
- Defined in langchain/src/schema/runnable/base.ts:223

streamLog

streamLog(input, options?, streamOptions?): AsyncGenerator<RunLogPatch, any, unknown>
Stream all output from a runnable, as reported to the callback system. This includes all inner runs of LLMs, Retrievers, Tools, etc. Output is streamed as Log objects, which include a list of jsonpatch ops that describe how the state of the run has changed in each step, and the final state of the run. The jsonpatch ops can be applied in order to construct state.
Parameters
- input: BaseLanguageModelInput
- Optional options: Partial<LlamaCppCallOptions>
- Optional streamOptions: Omit<LogStreamCallbackHandlerInput, "autoClose">
Returns AsyncGenerator<RunLogPatch, any, unknown>
Inherited from SimpleChatModel.streamLog
- Defined in langchain/src/schema/runnable/base.ts:502

toJSON

toJSON(): Serialized
Returns Serialized
Inherited from SimpleChatModel.toJSON
- Defined in langchain/src/load/serializable.ts:147

toJSONNotImplemented

toJSONNotImplemented(): SerializedNotImplemented
Returns SerializedNotImplemented
Inherited from SimpleChatModel.toJSONNotImplemented
- Defined in langchain/src/load/serializable.ts:221

transform

transform(generator, options): AsyncGenerator<BaseMessageChunk, any, unknown>
Default implementation of transform, which buffers input and then calls stream. Subclasses should override this method if they can start producing output while input is still being generated.
Parameters
- generator: AsyncGenerator<BaseLanguageModelInput, any, unknown>
- options: Partial<LlamaCppCallOptions>
Returns AsyncGenerator<BaseMessageChunk, any, unknown>
Inherited from SimpleChatModel.transform
- Defined in langchain/src/schema/runnable/base.ts:473

withConfig

withConfig(config): RunnableBinding<BaseLanguageModelInput, BaseMessageChunk, LlamaCppCallOptions>
Bind config to a Runnable, returning a new Runnable.
Parameters
- config: BaseCallbackConfig
  
  New configuration parameters to attach to the new runnable.
Returns RunnableBinding<BaseLanguageModelInput, BaseMessageChunk, LlamaCppCallOptions>
A new RunnableBinding with a config matching what's passed.
Inherited from SimpleChatModel.withConfig
- Defined in langchain/src/schema/runnable/base.ts:106

withFallbacks

withFallbacks(fields): RunnableWithFallbacks<BaseLanguageModelInput, BaseMessageChunk>
Create a new runnable from the current one that will try invoking other passed fallback runnables if the initial invocation fails.
Parameters
- fields: {
  fallbacks: Runnable<BaseLanguageModelInput, BaseMessageChunk, BaseCallbackConfig>[];
  }
  - fallbacks: Runnable<BaseLanguageModelInput, BaseMessageChunk, BaseCallbackConfig>[]
    
    Other runnables to call if the runnable errors.
Returns RunnableWithFallbacks<BaseLanguageModelInput, BaseMessageChunk>
A new RunnableWithFallbacks.
Inherited from SimpleChatModel.withFallbacks
- Defined in langchain/src/schema/runnable/base.ts:123

withRetry

withRetry(fields?): RunnableRetry<BaseLanguageModelInput, BaseMessageChunk, LlamaCppCallOptions>
Add retry logic to an existing runnable.
Parameters
- Optional fields: {
  onFailedAttempt?: RunnableRetryFailedAttemptHandler;
  stopAfterAttempt?: number;
  }
  - Optional onFailedAttempt?: RunnableRetryFailedAttemptHandler
  - Optional stopAfterAttempt?: number
Returns RunnableRetry<BaseLanguageModelInput, BaseMessageChunk, LlamaCppCallOptions>
A new RunnableRetry that, when invoked, will retry according to the parameters.
Inherited from SimpleChatModel.withRetry
- Defined in langchain/src/schema/runnable/base.ts:87

`Static` deserialize

deserialize(data): Promise<BaseLanguageModel<any, BaseLanguageModelCallOptions>>
Parameters
- data: SerializedLLM
Returns Promise<BaseLanguageModel<any, BaseLanguageModelCallOptions>>

Deprecated
Load an LLM from a json-like object describing it.
Inherited from SimpleChatModel.deserialize
- Defined in langchain/src/base_language/index.ts:292

`Static` isRunnable

isRunnable(thing): thing is Runnable<any, any, BaseCallbackConfig>
Parameters
- thing: any
Returns thing is Runnable<any, any, BaseCallbackConfig>
Inherited from SimpleChatModel.isRunnable
- Defined in langchain/src/schema/runnable/base.ts:552

`Protected` _buildSession

_buildSession(messages): string
Parameters
- messages: BaseMessage[]
Returns string
- Defined in langchain/src/chat_models/llama_cpp.ts:137

`Protected` _callWithConfig

_callWithConfig<T>(func, input, options?): Promise<BaseMessageChunk>
Type Parameters
- T extends BaseLanguageModelInput
Parameters
- func: ((input) => Promise<BaseMessageChunk>) | ((input, config?, runManager?) => Promise<BaseMessageChunk>)
- input: T
- Optional options: Partial<LlamaCppCallOptions> & {
  runType?: string;
  }
Returns Promise<BaseMessageChunk>
Inherited from SimpleChatModel._callWithConfig
- Defined in langchain/src/schema/runnable/base.ts:249

`Protected` _convertMessagesToInteractions

_convertMessagesToInteractions(messages): ConversationInteraction[]
Parameters
- messages: BaseMessage[]
Returns ConversationInteraction[]
- Defined in langchain/src/chat_models/llama_cpp.ts:224

`Protected` _getOptionsList

_getOptionsList(options, length?): Partial<LlamaCppCallOptions & {
runType?: string;
}>[]
Parameters
- options: Partial<LlamaCppCallOptions> | Partial<LlamaCppCallOptions>[]
- length: number = 0
Returns Partial<LlamaCppCallOptions & {
runType?: string;
}>[]
Inherited from SimpleChatModel._getOptionsList
- Defined in langchain/src/schema/runnable/base.ts:133

`Protected` _getSerializedCacheKeyParametersForCall

_getSerializedCacheKeyParametersForCall(callOptions): string
Create a unique cache key for a specific call to a specific language model.
Parameters
- callOptions: LlamaCppCallOptions
  
  Call options for the model
Returns string
A unique cache key.
Inherited from SimpleChatModel._getSerializedCacheKeyParametersForCall
- Defined in langchain/src/base_language/index.ts:256

`Protected` _separateRunnableConfigFromCallOptions

_separateRunnableConfigFromCallOptions(options?): [BaseCallbackConfig, Omit<LlamaCppCallOptions, never>]
Parameters
- Optional options: Partial<LlamaCppCallOptions>
Returns [BaseCallbackConfig, Omit<LlamaCppCallOptions, never>]
Inherited from SimpleChatModel._separateRunnableConfigFromCallOptions
- Defined in langchain/src/chat_models/base.ts:104

`Protected` _transformStreamWithConfig

_transformStreamWithConfig<I, O>(inputGenerator, transformer, options?): AsyncGenerator<O, any, unknown>
Helper method to transform an Iterator of Input values into an Iterator of Output values, with callbacks. Use this to implement stream() or transform() in Runnable subclasses.
Type Parameters
- I extends BaseLanguageModelInput
- O extends BaseMessageChunk<O>
Parameters
- inputGenerator: AsyncGenerator<I, any, unknown>
- transformer: ((generator, runManager?, options?) => AsyncGenerator<O, any, unknown>)
  - - (generator, runManager?, options?): AsyncGenerator<O, any, unknown>
    - Parameters
      
      generator: AsyncGenerator<I, any, unknown>
      
      Optional runManager: CallbackManagerForChainRun
      
      Optional options: Partial<LlamaCppCallOptions>
      
      Returns AsyncGenerator<O, any, unknown>
- Optional options: LlamaCppCallOptions & {
  runType?: string;
  }
Returns AsyncGenerator<O, any, unknown>
Inherited from SimpleChatModel._transformStreamWithConfig
- Defined in langchain/src/schema/runnable/base.ts:343

`Static` `Protected` _convertInputToPromptValue

_convertInputToPromptValue(input): BasePromptValue
Parameters
- input: BaseLanguageModelInput
Returns BasePromptValue
Inherited from SimpleChatModel._convertInputToPromptValue
- Defined in langchain/src/base_language/index.ts:230

Class ChatLlamaCpp

Hierarchy

Index

Constructors

Properties

Accessors

Methods

Constructors

constructor

Parameters

inputs: LlamaCppInputs

Returns ChatLlamaCpp

Properties

CallOptions

ParsedCallOptions

caller

verbose

Optional cache

Optional callbacks

Optional maxTokens

Optional metadata

Optional tags

Optional temperature

Optional topK

Optional topP

Optional trimWhitespaceSuffix

Static inputs

Protected lc_runnable

Accessors

callKeys

Returns string[]

Methods

batch

Parameters

inputs: BaseLanguageModelInput[]

Optional options: Partial<LlamaCppCallOptions> | Partial<LlamaCppCallOptions>[]

Optional batchOptions: RunnableBatchOptions & { returnExceptions?: false; }

Returns Promise<BaseMessageChunk[]>

Parameters

inputs: BaseLanguageModelInput[]

Optional options: Partial<LlamaCppCallOptions> | Partial<LlamaCppCallOptions>[]

Optional batchOptions: RunnableBatchOptions & { returnExceptions: true; }

Returns Promise<(BaseMessageChunk | Error)[]>

Parameters

inputs: BaseLanguageModelInput[]

Optional options: Partial<LlamaCppCallOptions> | Partial<LlamaCppCallOptions>[]

Optional batchOptions: RunnableBatchOptions

Returns Promise<(BaseMessageChunk | Error)[]>

bind

Parameters

kwargs: Partial<LlamaCppCallOptions>

Returns Runnable<BaseLanguageModelInput, BaseMessageChunk, LlamaCppCallOptions>

call

Parameters

messages: BaseMessageLike[]

Optional options: string[] | LlamaCppCallOptions

Optional callbacks: Callbacks

Returns Promise<BaseMessage>

callPrompt

Parameters

promptValue: BasePromptValue

Optional options: string[] | LlamaCppCallOptions

Optional callbacks: Callbacks

Returns Promise<BaseMessage>

generate

Parameters

messages: BaseMessageLike[][]

Optional options: string[] | LlamaCppCallOptions

Optional callbacks: Callbacks

Returns Promise<LLMResult>

generatePrompt

Parameters

promptValues: BasePromptValue[]

Optional options: string[] | LlamaCppCallOptions

Optional callbacks: Callbacks

Returns Promise<LLMResult>

getNumTokens

Parameters

content: MessageContent

Returns Promise<number>

`Optional` cache

`Optional` callbacks

`Optional` maxTokens

`Optional` metadata

`Optional` tags

`Optional` temperature

`Optional` topK

`Optional` topP

`Optional` trimWhitespaceSuffix

`Static` inputs

`Protected` lc_runnable

`Optional` options: Partial<LlamaCppCallOptions> | Partial<LlamaCppCallOptions>[]

`Optional` batchOptions: RunnableBatchOptions & {
returnExceptions?: false;
}

`Optional` options: Partial<LlamaCppCallOptions> | Partial<LlamaCppCallOptions>[]

`Optional` batchOptions: RunnableBatchOptions & {
returnExceptions: true;
}

`Optional` options: Partial<LlamaCppCallOptions> | Partial<LlamaCppCallOptions>[]

`Optional` batchOptions: RunnableBatchOptions

`Optional` options: string[] | LlamaCppCallOptions

`Optional` callbacks: Callbacks

`Optional` options: string[] | LlamaCppCallOptions

`Optional` callbacks: Callbacks

`Optional` options: string[] | LlamaCppCallOptions

`Optional` callbacks: Callbacks

`Optional` options: string[] | LlamaCppCallOptions

`Optional` callbacks: Callbacks

Returns {
maxTokens: undefined | number;
temperature: undefined | number;
topK: undefined | number;
topP: undefined | number;
trimWhitespaceSuffix: undefined | boolean;
}

`Optional` options: LlamaCppCallOptions

`Optional` options: string[] | LlamaCppCallOptions

`Optional` callbacks: Callbacks

`Optional` options: string[] | LlamaCppCallOptions

`Optional` callbacks: Callbacks

`Optional` options: Partial<LlamaCppCallOptions>

`Optional` options: Partial<LlamaCppCallOptions>

`Optional` streamOptions: Omit<LogStreamCallbackHandlerInput, "autoClose">

fields: {
fallbacks: Runnable<BaseLanguageModelInput, BaseMessageChunk, BaseCallbackConfig>[];
}

`Optional` fields: {
onFailedAttempt?: RunnableRetryFailedAttemptHandler;
stopAfterAttempt?: number;
}

`Optional` onFailedAttempt?: RunnableRetryFailedAttemptHandler

`Optional` stopAfterAttempt?: number