Hello
How are you?
Thanks for contributing to this project.
I made my own AutoClipper class based on your code.

Could you check whether it has any problems?
I have a question about the buffer length for the gradient history.
You discussed only the effect of the percentile value on training performance.
What about the effect of the history buffer length?
For example, what if we set the buffer length to the number of steps in one epoch?
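
To make the question concrete, a bounded gradient-norm history could be sketched roughly as below. This is a minimal, framework-free illustration and not the original implementation: the names `AutoClipperSketch`, `history_size`, `threshold`, and `clip_coefficient` are hypothetical, and the nearest-rank percentile is an assumption chosen to keep the example dependency-free.

```python
from collections import deque
import math


class AutoClipperSketch:
    """Hypothetical sketch: keep recent gradient norms, clip to a percentile of them."""

    def __init__(self, percentile=10.0, history_size=1000):
        if not 0.0 <= percentile <= 100.0:
            raise ValueError("percentile must be in [0, 100]")
        self.percentile = percentile
        # A deque with maxlen silently drops the oldest norm once it is full,
        # so only the last `history_size` steps influence the threshold.
        self.history = deque(maxlen=history_size)

    def threshold(self, grad_norm):
        """Record grad_norm and return the clipping threshold for this step."""
        self.history.append(float(grad_norm))
        ordered = sorted(self.history)
        # Nearest-rank percentile over the buffered norms (assumption).
        rank = max(1, math.ceil(self.percentile / 100.0 * len(ordered)))
        return ordered[rank - 1]

    def clip_coefficient(self, grad_norm):
        """Return the factor to scale gradients by (1.0 means no clipping)."""
        limit = self.threshold(grad_norm)
        if grad_norm <= limit or grad_norm == 0.0:
            return 1.0
        return limit / grad_norm
```

With this shape, the idea in the question would correspond to something like `AutoClipperSketch(percentile=10.0, history_size=steps_per_epoch)`, where `steps_per_epoch` is a placeholder for the number of optimizer steps in one epoch. A shorter buffer tracks recent gradient norms more closely, while a longer one gives a smoother threshold. Note that re-sorting the buffer on every step costs more as the buffer grows.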