Description
It looks like the batch.ToNativeBatch function returns an nb object containing no tokens, which leads to an access violation in llama_decode. This happens with the raw example chat program and the Semantic Kernel chat example. Ironically, the web project works fine.
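For reference, a minimal stateless-executor repro along the lines of the failing path in the stack trace would look roughly like this (a sketch of standard LLamaSharp usage, not the exact example code; the model path and parameter values are placeholders):

using System;
using LLama;
using LLama.Common;

// Sketch of the failing path: StatelessExecutor.InferAsync eventually calls
// SafeLLamaContextHandle.Decode with the batch built from the prompt tokens.
var parameters = new ModelParams("path/to/model.gguf") { ContextSize = 2048 };
using var weights = LLamaWeights.LoadFromFile(parameters);
var executor = new StatelessExecutor(weights, parameters);

await foreach (var text in executor.InferAsync("Hello", new InferenceParams { MaxTokens = 64 }))
{
    Console.Write(text);
}

The crash surfaces inside SafeLLamaContextHandle.Decode: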
/// <summary>
/// </summary>
/// <param name="batch"></param>
/// <returns>Positive return values do not mean a fatal error, but rather a warning:<br />
/// - 0: success<br />
/// - 1: could not find a KV slot for the batch (try reducing the size of the batch or increase the context)<br />
/// - < 0: error<br />
/// </returns>
public DecodeResult Decode(LLamaBatch batch)
{
    lock (GlobalInferenceLock)
        using (batch.ToNativeBatch(out var nb))
            return (DecodeResult)llama_decode(this, nb);
}
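A possible mitigation would be to bail out before the native call when the batch is empty. This is only a minimal sketch, not the actual library code; it assumes LLamaBatch exposes a TokenCount property and that DecodeResult has an Ok member:

public DecodeResult Decode(LLamaBatch batch)
{
    // Assumption: LLamaBatch.TokenCount reflects the number of tokens queued in the batch.
    // An empty batch would otherwise reach llama_decode and trigger the access violation.
    if (batch.TokenCount == 0)
        return DecodeResult.Ok;

    lock (GlobalInferenceLock)
        using (batch.ToNativeBatch(out var nb))
            return (DecodeResult)llama_decode(this, nb);
}

Even with such a guard, the open question is why the batch ends up empty in the chat examples but not in the web project.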

Call Stack: [Managed to Native Transition]
LLamaSharp.dll!LLama.Native.SafeLLamaContextHandle.Decode(LLama.Native.LLamaBatch batch) Line 373 C#
() Unknown
LLamaSharp.dll!LLama.Native.SafeLLamaContextHandle.Decode(System.Collections.Generic.List<LLama.Native.LLamaToken> tokens, LLama.Native.LLamaSeqId id, LLama.Native.LLamaBatch batch, ref int n_past) Line 400 C#
LLamaSharp.dll!LLama.StatelessExecutor.InferAsync(string prompt, LLama.Abstractions.IInferenceParams inferenceParams, System.Threading.CancellationToken cancellationToken) Line 98 C#
[Resuming Async Method]
System.Private.CoreLib.dll!System.Runtime.CompilerServices.AsyncMethodBuilderCore.Start<LLama.StatelessExecutor.<InferAsync>d__21>(ref LLama.StatelessExecutor.<InferAsync>d__21 stateMachine) Unknown
LLamaSharp.dll!LLama.LLamaTransforms.KeywordTextOutputStreamTransform.TransformAsync(System.Collections.Generic.IAsyncEnumerable<string> tokens) Line 232 C#
[Resuming Async Method]
System.Private.CoreLib.dll!System.Runtime.CompilerServices.AsyncMethodBuilderCore.Start<LLama.LLamaTransforms.KeywordTextOutputStreamTransform.<TransformAsync>d__12>(ref LLama.LLamaTransforms.KeywordTextOutputStreamTransform.<TransformAsync>d__12 stateMachine) Unknown
LLamaSharp.SemanticKernel.dll!LLamaSharp.SemanticKernel.ChatCompletion.LLamaSharpChatCompletion.GetChatMessageContentsAsync(Microsoft.SemanticKernel.ChatCompletion.ChatHistory chatHistory, Microsoft.SemanticKernel.PromptExecutionSettings executionSettings, Microsoft.SemanticKernel.Kernel kernel, System.Threading.CancellationToken cancellationToken) Line 77 C#
Microsoft.SemanticKernel.Abstractions.dll!Microsoft.SemanticKernel.ChatCompletion.ChatCompletionServiceExtensions.GetChatMessageContentAsync(Microsoft.SemanticKernel.ChatCompletion.IChatCompletionService chatCompletionService, Microsoft.SemanticKernel.ChatCompletion.ChatHistory chatHistory, Microsoft.SemanticKernel.PromptExecutionSettings executionSettings, Microsoft.SemanticKernel.Kernel kernel, System.Threading.CancellationToken cancellationToken) Unknown
LlamaSharpExample.dll!LLama.Examples.Examples.SemanticKernelChat.Main() Line 34 C#
LlamaSharpExample.dll!LLama.Examples.Examples.SemanticKernelChat.