
handle changes in case of output token limit exceeded #979

Open · Anonymousinterpares opened this issue Jan 2, 2025 · 0 comments

@Anonymousinterpares

Is your feature request related to a problem? Please describe:

The files I work on are very often significantly larger than the maximum output token limit of even the newest Gemini models (and that is while keeping the code as modular as possible). Because of that, a given file is never finished within the project structure, and the only workaround I see is to ask the LLM to answer directly in chat.

Describe the solution you'd like:

Consider adjusting the LLM prompt, together with the app code, so that when the output token limit is about to be reached, the LLM leaves a defined markdown marker. The application could then detect that marker and let the LLM continue from that point in the next output; a sketch of such a loop is below.
It might also be possible for the app itself to mark the unfinished file, for example by defining a rule that if a file stops changing while the "create" spinner is still running, the app marks that spot in a specific way so the LLM can resume from that exact point.
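
A minimal sketch of what the marker-based continuation loop could look like, assuming a generic `generate(prompt)` callable that returns the model's text. The marker string, the `generate_full_file` name, and the prompt wording are all illustrative assumptions, not part of any existing API:

```python
# Hypothetical continuation loop: keep requesting output until the
# file no longer ends with the agreed-upon marker.
CONTINUE_MARKER = "<!-- CONTINUE -->"  # marker the LLM is prompted to emit

def generate_full_file(generate, initial_prompt, max_rounds=10):
    """Stitch together a file from repeated LLM calls.

    `generate` is any callable taking a prompt string and returning
    the model's text output (an assumption for this sketch).
    """
    parts = []
    prompt = initial_prompt
    for _ in range(max_rounds):
        chunk = generate(prompt)
        if chunk.rstrip().endswith(CONTINUE_MARKER):
            # Strip the marker and ask the model to resume from the
            # exact tail of what has been produced so far.
            chunk = chunk.rstrip()[: -len(CONTINUE_MARKER)]
            parts.append(chunk)
            tail = "".join(parts)[-500:]  # last few hundred chars as context
            prompt = (
                "Continue the file exactly from where this excerpt ends, "
                "without repeating it:\n" + tail
            )
        else:
            parts.append(chunk)
            break
    return "".join(parts)
```

A robustness note: where the provider's API exposes a truncation signal (e.g. a finish reason indicating the token limit was hit), the app could use that instead of, or in addition to, the marker, since the model may not always remember to emit the marker before being cut off.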

Labels: None yet
Projects: None yet
Development: No branches or pull requests
1 participant