> In summary, I don't think we should solve the problem of correlation > here, given it is not straightforward. I just want to tag that the > hardware got an error while the kernel was running, and the operator can > use this information the way they want. > > Am I on the right track? It seems that Rafael has just applied your patch for taint with the machine check option. So you've got what you originally asked for. If you want pursue the idea of a taint for GHES warnings, then create a new patch that does that to spark discussion. Your case would be helped if you have some data to back up the need for this. E.g. we have observed "X% of recovered GHES errors are followed by a system crash within Y minutes". If you don't have hard numbers, then at least some "We often/sometimes see a crash shortly after a recovered GHES error that appears related." There are only a few unused capital letters for the taint summary: H, Q, V, Y, Z. None super-intuitive. Either pick one, or move into uncharted territory of using lower case ('g'?). -Tony