When new cookbook code is released and introduces problems in one or more environments, it would be helpful to know if the multiple of failing infrastructure is happening for the same reasons or different reasons. Right now, to view errors you have to click through each server individually. It makes it difficult to grasp the scope of a particular issue across the fleet of servers. Gathering this data is helpful though when writing up defects / issue tickets to track the problem and the resolution.
It would be great if we could introduce a new view for Infrastructure where instead of showing nodes, you saw a list of unique error messages and the number of servers impacted by that error. Then you can drill into each error to view the list of nodes impacted.
There is an API endpoint for this: https://docs.chef.io/automate/api/#operation/ConfigMgmt_GetErrors
This is incredibly helpful. I've been thinking about this feature for a while and only recently discovered the API for this today, so that's great. However it would be good through this API to also see the nodes for those errors in an array.
Not everyone is going to use the API, so it would be good to expose this to other users through the Automate UI as well.