Background
Previously we implemented token counting in the AI Assistant in order to track usage. This is now done by the inference plugin, so we no longer have to handle it ourselves.
One difference to call out: the AI Assistant counts the number of tokens used per conversation and persists that count in the conversations index. This enables users to track the token count per conversation. We have not exposed or documented this in any way, and I don't think the (unused) functionality justifies the added complexity.
Solution
Remove the custom StreamingChatResponseEventType.TokenCount event as well as token counting per conversation.
Technical background
The inference plugin emits the event InferenceChatCompletionEventType.ChatCompletionTokenCount that contains the number of tokens used for the LLM call. The Obs AI Assistant converts this event to StreamingChatResponseEventType.TokenCount:
kibana/x-pack/platform/plugins/shared/observability_solution/observability_ai_assistant/server/service/client/operators/convert_inference_events_to_streaming_events.ts
Lines 45 to 54 in c4cf9fe
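Conceptually, the conversion is just a re-mapping of the event type. The sketch below is illustrative only: the enum values, the tokens payload shape, and the operator name are assumptions standing in for the real definitions in the linked file.

```ts
import { map, OperatorFunction } from 'rxjs';

// Stand-ins for the real enums; the actual members live in the inference
// plugin and the Obs AI Assistant common package (values here are assumed).
enum InferenceChatCompletionEventType {
  ChatCompletionTokenCount = 'chatCompletionTokenCount',
}
enum StreamingChatResponseEventType {
  TokenCount = 'tokenCount',
}

// Assumed token count payload shape.
interface TokenCount {
  prompt: number;
  completion: number;
  total: number;
}

interface ChatCompletionTokenCountEvent {
  type: InferenceChatCompletionEventType.ChatCompletionTokenCount;
  tokens: TokenCount;
}

interface TokenCountEvent {
  type: StreamingChatResponseEventType.TokenCount;
  tokens: TokenCount;
}

// Sketch of the conversion: the inference plugin's token count event is
// re-emitted as the assistant's own TokenCount streaming event.
function convertTokenCountEvent(): OperatorFunction<ChatCompletionTokenCountEvent, TokenCountEvent> {
  return map(
    (event): TokenCountEvent => ({
      type: StreamingChatResponseEventType.TokenCount,
      tokens: event.tokens,
    })
  );
}
```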
All the token count events are accumulated into a single result:
kibana/x-pack/platform/plugins/shared/observability_solution/observability_ai_assistant/server/service/client/index.ts
Lines 324 to 328 in c4cf9fe
kibana/x-pack/platform/plugins/shared/observability_solution/observability_ai_assistant/server/service/client/operators/extract_token_count.ts
Lines 25 to 33 in c4cf9fe
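The extraction essentially folds every TokenCount event in the stream into one running total. The operator below is a minimal sketch of that idea, not the code from the linked file; the event and count shapes are assumed.

```ts
import { Observable, filter, scan } from 'rxjs';

// Assumed shapes, standing in for the real streaming event types.
interface TokenCount {
  prompt: number;
  completion: number;
  total: number;
}
interface TokenCountEvent {
  type: 'tokenCount';
  tokens: TokenCount;
}
type StreamingEvent =
  | TokenCountEvent
  | { type: 'chatCompletionChunk' }
  | { type: 'messageAdd' };

// Sketch of the accumulation: keep only TokenCount events and sum them into
// a single aggregate as they arrive.
const extractTokenCount =
  () =>
  (source$: Observable<StreamingEvent>): Observable<TokenCount> =>
    source$.pipe(
      filter((event): event is TokenCountEvent => event.type === 'tokenCount'),
      scan(
        (acc, event) => ({
          prompt: acc.prompt + event.tokens.prompt,
          completion: acc.completion + event.tokens.completion,
          total: acc.total + event.tokens.total,
        }),
        { prompt: 0, completion: 0, total: 0 }
      )
    );
```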
The total token count across all LLM calls within a conversation is persisted on the conversation:
kibana/x-pack/platform/plugins/shared/observability_solution/observability_ai_assistant/server/service/client/index.ts
Lines 377 to 386 in c4cf9fe
kibana/x-pack/platform/plugins/shared/observability_solution/observability_ai_assistant/common/types.ts
Lines 60 to 65 in c4cf9fe
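The persisted shape is roughly the following (paraphrased for illustration; only token_count is the field under discussion, everything else is abbreviated). Removing per-conversation counting means dropping this field and the code that sums the per-call counts into it.

```ts
// Illustrative shape of the persisted token count on a conversation document.
interface TokenCount {
  prompt: number;
  completion: number;
  total: number;
}

interface Conversation {
  '@timestamp': string;
  conversation: {
    id: string;
    title: string;
    last_updated: string;
    token_count?: TokenCount; // per-conversation total slated for removal
  };
  // ...messages, labels, namespace, etc.
}

// Hypothetical helper showing how a new total would be folded into what is
// already persisted when a conversation is updated.
function addTokenCount(persisted: TokenCount | undefined, latest: TokenCount): TokenCount {
  const base = persisted ?? { prompt: 0, completion: 0, total: 0 };
  return {
    prompt: base.prompt + latest.prompt,
    completion: base.completion + latest.completion,
    total: base.total + latest.total,
  };
}
```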
In many cases we have to manually filter out the token count event:
kibana/x-pack/platform/plugins/shared/observability_solution/observability_ai_assistant/server/service/client/operators/continue_conversation.ts
Lines 331 to 333 in c4cf9fe
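A typical call site has to drop the event explicitly before handing the stream on, along the lines of the sketch below (the helper name and event shapes are illustrative, not the code in the linked file). Removing the event type removes the need for this kind of filtering everywhere it occurs.

```ts
import { Observable, filter } from 'rxjs';

// Assumed event union; only the 'tokenCount' discriminator matters here.
type StreamingEvent =
  | { type: 'tokenCount'; tokens: { prompt: number; completion: number; total: number } }
  | { type: 'chatCompletionChunk'; content: string }
  | { type: 'messageAdd'; message: unknown };

// Illustrative helper: strip TokenCount events so downstream consumers only
// see chunk/message events.
const withoutTokenCountEvents = (source$: Observable<StreamingEvent>) =>
  source$.pipe(
    filter(
      (event): event is Exclude<StreamingEvent, { type: 'tokenCount' }> =>
        event.type !== 'tokenCount'
    )
  );
```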