@BBC_News_Labs I read the blog post and saw nothing about how they are measuring the performance of LLMs at this task, or how they are validating the results.
There's also no info about the resources it would take to catalog this data the "old-fashioned way" with human indexers using a controlled vocabulary, or with a "formerly novel" approach like crowdsourced tags.