The general advice for suspected hardware issues is to remove as much hardware as possible and see if the problem still exists. If it does, swap the remaining components one by one until you find the faulty one.
Since this seems to be a sporadic problem, it would probably help to find a way to trigger it more reliably. Maybe write a script that constantly writes random files, or something like that.
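Something along these lines might work as a starting point. It's only a rough sketch: the function name `stress_write`, the file naming, and the default sizes are all made up for illustration, and it assumes the problem shows up as either an I/O error or data that reads back differently from what was written.

```python
import hashlib
import os
import tempfile


def stress_write(directory, iterations=100, file_size=1024 * 1024):
    """Write random files, read each one back, and return the failed iterations.

    Hypothetical helper: name, defaults, and file naming are illustrative only.
    """
    failures = []
    for i in range(iterations):
        data = os.urandom(file_size)
        expected = hashlib.sha256(data).hexdigest()
        path = os.path.join(directory, f"stress_{i:06d}.bin")
        try:
            with open(path, "wb") as f:
                f.write(data)
                f.flush()
                os.fsync(f.fileno())  # ask the OS to push the data to the device
            with open(path, "rb") as f:
                actual = hashlib.sha256(f.read()).hexdigest()
            if actual != expected:
                print(f"iteration {i}: checksum mismatch")
                failures.append(i)
        except OSError as exc:
            # An I/O error is itself a useful signal when hunting flaky hardware.
            print(f"iteration {i}: I/O error: {exc}")
            failures.append(i)
        finally:
            if os.path.exists(path):
                os.remove(path)
    return failures


if __name__ == "__main__":
    # A temporary directory is used here only so the example runs anywhere.
    with tempfile.TemporaryDirectory() as target:
        bad = stress_write(target, iterations=20)
        print(f"{len(bad)} failed iterations: {bad}")
```

To actually test anything, you'd point `directory` at a path on the suspect drive and run it in a loop for a long time. Keep in mind the read-back may be served from the OS page cache rather than the disk, so a clean run doesn't prove the hardware is fine; using files larger than your RAM, or dropping caches between write and read, would make it a stronger test.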