Hi,
The system does not explicitly filter generations longer than 100 tokens. Instead, it evaluates the generated text to check whether any flagged content was produced. You can read more about the behavior of the system here: https://video2.skills-academy.com/en-us/azure/cognitive-services/openai/concepts/content-filter.
In general, I'd recommend looking at the reason for the filtering to determine the next steps:
- If the prompt is filtered, the service returns a 400 error. You could consider a flow that suggests the user change their input.
- If the generated content is filtered (as in the example above, where the finish_reason is content_filter), I'd recommend adding retry logic, since the filtering may apply only to that specific generation. You will also want to set a maximum number of retries.
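To make the retry suggestion concrete, here is a minimal sketch in Python. It is not tied to any particular SDK: `generate` is a hypothetical callable you would supply to wrap your own request, `PromptFilteredError` is a hypothetical exception standing in for the 400 error on a filtered prompt, and the response is assumed to be a dict shaped like a Completions response (`choices[0]` with `finish_reason` and `text`). The cap of 3 retries is also just an assumption; adjust all of these to your setup.

```python
from typing import Callable, Optional

MAX_RETRIES = 3  # assumed cap; tune for your application


class PromptFilteredError(Exception):
    """Hypothetical wrapper for the 400 error returned when the prompt is filtered."""


def generate_with_retry(
    generate: Callable[[str], dict],
    prompt: str,
    max_retries: int = MAX_RETRIES,
) -> Optional[str]:
    """Return the generated text, or None if every attempt was content-filtered.

    `generate` is assumed to raise PromptFilteredError when the prompt itself
    is rejected. That error is not retried here, because resending the same
    prompt would be rejected again; let the caller ask the user to rephrase.
    """
    # One initial attempt plus up to `max_retries` retries.
    for _attempt in range(1 + max_retries):
        response = generate(prompt)
        choice = response["choices"][0]
        if choice.get("finish_reason") != "content_filter":
            return choice.get("text")
    return None
```

If this returns None, the output was filtered on every attempt, so it is probably worth surfacing a message to the user rather than retrying further. A caller would catch `PromptFilteredError` separately and prompt the user to change the input.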
We did ship a change to the content filters earlier this month to reduce false positives in the filtering. Are you still observing this behavior?