I’m sorry you’re hitting this, I’d like to understand how you got to this point so quickly, but in the meantime, a temporary workaround involves using a tool to reduce the input token count. A community-provided workaround is available at Temporary workaround for error due to exceeding input token count (unofficial) . A more permanent solution is in development.