Looking for a benchmark index over time

I'm wondering if there is some sort of AI model benchmark that is run periodically so we can monitor current model performance vs past model performance? I'm asking this because i do notice a significant decrease in opus 4.6 performance and i simply want to