
How to configure and avoid Open AI rate limiting when ingesting files? #1601

Open
emahpour opened this issue Nov 17, 2024 · 4 comments

@emahpour

Describe the problem
I uploaded a JSON file with around 4,000 entries. While monitoring the processes, I realized OpenAI was enforcing rate limits and the application became unresponsive as it kept retrying the failed calls to OpenAI.
What is the recommended way to avoid running into this problem?

To Reproduce
Create a JSON file with a large number of entries (e.g., 4,000 rows).

  1. Upload the file as a document
  2. Initiate Graph Creation Process

Expected behavior
Graph generation should be processed with consideration of OpenAI API rate limits.

Screenshots
(screenshot: OpenAI rate-limit errors during graph creation)

@NolanTrem
Collaborator

NolanTrem commented Nov 17, 2024

Edit: I didn't realize this was a graph process. If you're using the full version and a single job fails, you can retry that job; look for the orchestration cookbook in the docs.

Given that this is a JSON file, it might make sense for you to upload entries as chunks rather than as a single document. The embedding requests are sent in batches with exponential backoff, though, so I suspect that this will eventually succeed. If you're using the full version and it fails, you can always retry the job, which is especially helpful when you've broken the file up or have many files.
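
Roughly, the batching-plus-backoff behavior looks like the sketch below. This is only illustrative, written directly against the OpenAI Python SDK rather than the actual ingestion code, and the model name, batch size, and retry parameters are placeholders you'd tune to your own rate limits:

```python
import time

from openai import OpenAI, RateLimitError

client = OpenAI()  # reads OPENAI_API_KEY from the environment


def embed_in_batches(texts, batch_size=256, model="text-embedding-3-small",
                     max_retries=6, base_delay=1.0):
    """Embed a list of strings in batches, backing off exponentially on rate limits."""
    embeddings = []
    for start in range(0, len(texts), batch_size):
        batch = texts[start:start + batch_size]
        for attempt in range(max_retries):
            try:
                resp = client.embeddings.create(model=model, input=batch)
                embeddings.extend(item.embedding for item in resp.data)
                break
            except RateLimitError:
                # Sleep 1s, 2s, 4s, ... before retrying the same batch.
                time.sleep(base_delay * (2 ** attempt))
        else:
            raise RuntimeError(
                f"Batch starting at index {start} was still rate limited after {max_retries} retries"
            )
    return embeddings
```

The key point is that a rate-limited batch is retried with increasing delays rather than immediately, so the provider's per-minute request/token limits have time to clear.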

@emahpour
Author

Even using Hatchet with smaller chunks, it can technically run into the same rate limit issue, no?
Is there any configuration to apply rate limiting in the Hatchet queues?

@NolanTrem
Collaborator

I think what you're looking for then is the batch_size parameter in the configuration file. The default is 256. Changing this would only impact future graphs, though.
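
Roughly, that would look something like this in the config file. This is a sketch assuming batch_size lives under the embedding section of an r2r.toml-style config; check your own configuration file for the exact section name and location:

```toml
[embedding]
# Default is 256; smaller values send fewer texts per embedding request,
# at the cost of more requests overall.
batch_size = 64
```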

@emahpour
Author

Unfortunately, lowering batch_size did not help; instead it overloaded the container's CPU with a flood of retry attempts that kept failing. There should be a better approach than brute-forcing and hoping everything eventually gets processed.

(screenshot: container CPU usage spiking during repeated retries)
