Replies: 6 comments
-
Thanks for the report @njedema. I'm able to reproduce on my end; will keep you updated on a solution.
-
After thinking about this a bit more, this will actually be an issue with all Drivers. It just so happens that Llama has a relatively small context window. Do we want to truncate prompt input to models? It might hide an underlying issue with the input data being too large.
-
If we do, then we need to log a warning. I am on the fence about this, leaning towards not truncating and explicitly failing; IMO, it should be on the user to truncate. Another option is adding an optional truncation parameter.
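To make the trade-off concrete, here is a hypothetical sketch of that option. This is not Griptape's actual API; the class name, `truncate_input`, and `max_input_tokens` are invented for illustration. The default fails loudly, and opting into truncation logs a warning:

```python
import logging
from dataclasses import dataclass
from typing import Callable

logger = logging.getLogger(__name__)


@dataclass
class TruncatingPromptDriver:
    """Hypothetical driver option; names are illustrative, not Griptape's API."""

    max_input_tokens: int = 2048
    truncate_input: bool = False  # opt-in, so oversized input data isn't silently hidden

    def prepare_prompt(self, prompt: str, count_tokens: Callable[[str], int]) -> str:
        token_count = count_tokens(prompt)
        if token_count <= self.max_input_tokens:
            return prompt
        if not self.truncate_input:
            # Default behavior: fail explicitly rather than mask the problem.
            raise ValueError(
                f"Prompt is {token_count} tokens; the model accepts at most "
                f"{self.max_input_tokens}."
            )
        logger.warning(
            "Truncating prompt from %d to %d tokens", token_count, self.max_input_tokens
        )
        # Crude proportional cut for the sketch; a real implementation
        # would truncate on token boundaries via the tokenizer.
        keep_chars = int(len(prompt) * self.max_input_tokens / token_count)
        return prompt[:keep_chars]
```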
-
Migrated to a discussion to decide whether Prompt Drivers should have input truncation.
-
I think a
-
Describe the bug
BedrockLlamaPromptModelDriver() does not enforce truncation for any PromptTask. If the rendered task prompt is longer than 2048 tokens, a ValidationError is raised repeatedly until the code exits.
To Reproduce
Steps to reproduce the behavior:
The following snippet is sufficient to reproduce the error:
First, verify that you are able to invoke Llama 2 on Bedrock; ensure that you change the PROFILE_NAME.
Now run the code with a really long prompt; fetching the content of a really long Wiki page via GET will do. Again, be sure to change PROFILE_NAME.
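The original snippet did not survive the migration. The following is a minimal sketch of what the reproduction might look like, assuming the griptape 0.2x-era Bedrock API; the model ID, Wikipedia URL, and boto3 session wiring are illustrative assumptions:

```python
import boto3
import requests

from griptape.structures import Agent
from griptape.drivers import AmazonBedrockPromptDriver, BedrockLlamaPromptModelDriver

PROFILE_NAME = "<your-aws-profile>"  # change this: a profile with Bedrock Llama 2 access

agent = Agent(
    prompt_driver=AmazonBedrockPromptDriver(
        model="meta.llama2-13b-chat-v1",  # illustrative Bedrock model ID
        session=boto3.Session(profile_name=PROFILE_NAME),
        prompt_model_driver=BedrockLlamaPromptModelDriver(),
    )
)

# Any sufficiently long page pushes the rendered prompt past 2048 tokens.
long_text = requests.get(
    "https://en.wikipedia.org/wiki/Python_(programming_language)"
).text

agent.run(f"Summarize the following page:\n{long_text}")
```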
Expected behavior
I expect Griptape to truncate the prompt input to the maximum number of tokens allowed by Bedrock Llama 2. Griptape does this for other Bedrock prompt model drivers, such as BedrockClaudePromptModelDriver().
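Until the driver-side behavior is settled, truncating on the caller's side works around the error. This is a sketch assuming the Hugging Face `transformers` tokenizer as a stand-in for whatever tokenizer the driver uses internally; the gated `meta-llama` model name and the placeholder prompt are illustrative:

```python
from transformers import AutoTokenizer

MAX_INPUT_TOKENS = 2048  # the Bedrock Llama 2 limit from the report above


def truncate_prompt(prompt: str, tokenizer, max_tokens: int = MAX_INPUT_TOKENS) -> str:
    """Cut the prompt on a token boundary so the model never sees oversized input."""
    token_ids = tokenizer.encode(prompt)
    if len(token_ids) <= max_tokens:
        return prompt
    return tokenizer.decode(token_ids[:max_tokens], skip_special_tokens=True)


# Requires access to the gated meta-llama repo; any Llama 2 tokenizer works.
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-13b-chat-hf")
very_long_prompt = "..."  # e.g. the text of a long Wiki page
safe_prompt = truncate_prompt(very_long_prompt, tokenizer)
```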