
[BUG] Problematic token usage result in the trace #1193

Open
Howie-Arup opened this issue Dec 3, 2024 · 3 comments
Labels
backlog · bug (Something isn't working) · instrumentation (Adding instrumentations to open source packages) · language: python (Related to Python integration)

Comments

@Howie-Arup

Hi, I found that the token counting seems problematic. When calculating the total tokens in Image 1 below, it appears to simply add up the token counts shown in the trace detail in Image 2 below. However, the same operation seems to be counted twice: in Image 2, the OpenAI.Chat and ChatCompletion spans in the red rectangle report the same number of tokens, and when I clicked them the input and output messages are identical, yet both numbers are added (i.e., 580 + 580 in this case) when calculating the total tokens in Image 1.

Image 1: [screenshot of the total token count]
Image 2: [screenshot of the trace detail with the OpenAI.Chat and ChatCompletion spans]

Environment:

  • OS: Windows 11
  • Notebook Runtime: Jupyter
  • Browser: Chrome
  • Version:
arize-phoenix             4.36.0                   pypi_0    pypi
arize-phoenix-evals       0.15.1                   pypi_0    pypi
arize-phoenix-otel        0.5.0                    pypi_0    pypi
Howie-Arup added the bug and triage labels on Dec 3, 2024
@mikeldking
Contributor

Hey @Howie-Arup, thanks for the report. The double counting is definitely incorrect, but from your screenshot above I believe it's due to having both the OpenAI instrumentation and the LlamaIndex instrumentation enabled. At the moment these two instrumentations don't compose, as we wanted LlamaIndex to be fully instrumented (including LLM calls). In this scenario we recommend using only the LlamaIndex instrumentation, since using both does double count.

We will explore better solutions for composition in the near future but hopefully this is enough to unblock you for now. Thanks for using Phoenix!
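
As a reference, a minimal sketch of that suggested setup (assuming the Phoenix and OpenInference LlamaIndex instrumentor packages are installed; the project name is just an example) registers only the LlamaIndex instrumentor so each LLM call is traced once:

from phoenix.otel import register
from openinference.instrumentation.llama_index import LlamaIndexInstrumentor

# Register a Phoenix tracer provider and apply only the LlamaIndex
# instrumentation; the underlying OpenAI calls are then captured once
# through the LlamaIndex spans instead of being double counted.
tracer_provider = register(project_name="my-llm-app")
LlamaIndexInstrumentor().instrument(tracer_provider=tracer_provider)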

mikeldking added the instrumentation and backlog labels and removed the triage label on Dec 3, 2024

Howie-Arup commented Dec 4, 2024

@mikeldking Thanks for your reply! You are right, I had both the OpenAI and LlamaIndex instrumentations in the LlamaIndex workflow. I have removed the OpenAI instrumentation and it works fine now.

But in the LlamaIndex workflow I have one step with a LlamaIndex ReAct agent and another step with a CrewAI agent. I used the code below for CrewAI, following the documentation, but it turns out there is double counting, as shown in the screenshot. I tried deleting the LiteLLMInstrumentor().instrument(tracer_provider=tracer_provider) line, but the result is the same. Is there something wrong? Thanks!

from phoenix.otel import register
from openinference.instrumentation.llama_index import LlamaIndexInstrumentor
from openinference.instrumentation.crewai import CrewAIInstrumentor
from openinference.instrumentation.litellm import LiteLLMInstrumentor

tracer_provider = register(project_name="my-llm-app")
LlamaIndexInstrumentor().instrument(tracer_provider=tracer_provider)
CrewAIInstrumentor().instrument(tracer_provider=tracer_provider)
LiteLLMInstrumentor().instrument(tracer_provider=tracer_provider)  # I am using CrewAI >= 0.63.0

[screenshot of the trace showing double-counted tokens]


Howie-Arup commented Dec 4, 2024


@mikeldking Sorry, I found that it's because I ran this in a Jupyter notebook, so the LiteLLMInstrumentor may have still been active from the earlier run. After I re-opened the notebook and used the code below, there is no double counting.

LlamaIndexInstrumentor().instrument(tracer_provider=tracer_provider)
CrewAIInstrumentor().instrument(tracer_provider=tracer_provider)
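
(As a side note, a minimal sketch of an alternative to re-opening the notebook, assuming the OpenInference instrumentors expose the standard OpenTelemetry uninstrument() method, would be to remove the previously applied instrumentation before instrumenting again:)

from openinference.instrumentation.litellm import LiteLLMInstrumentor

# Remove the LiteLLM instrumentation that was applied earlier in the
# long-lived Jupyter kernel session, so it no longer emits extra spans.
LiteLLMInstrumentor().uninstrument()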

But in this case I actually didn't follow the recommendation in the documentation, since I removed the LiteLLMInstrumentor, and the trace for the CrewAI step looks like the screenshot below, which doesn't include the LLM call, so I can't see the input and output messages. Is that expected?

[screenshot of the CrewAI trace without an LLM span]

mikeldking transferred this issue from Arize-ai/phoenix on Jan 13, 2025
dosubot added the language: python label on Jan 13, 2025