I personally lean more towards humans for moderation, as words alone dont convey the full intent and meaning. And this cuts both ways, benign words can be used to harass.
But of course, humans are expensive, and recordings of voice chat have privacy implications.