The text mining project, delivered as part of the LRF Discovering Safety Programme, aims to build on state-of-the-art text mining and natural language processing to develop a suite of tools and techniques for specific use on unstructured health and safety datasets. These tools and techniques will enable the Discovering Safety Programme to generate new health and safety insights and learning from the HSE datasets available to it. The intention is also to make the tools available to industry for use on its own datasets and for its own purposes, further extending the benefits of the work undertaken on the programme.
Aims and objectives
The core aim of the Phase 1 work has been to convert the HSE reports archive into a format more amenable to collective analysis and to demonstrate how it might be put to practical use.