Service: How we used deep learning to structure GOV.UK's content

Responsible organisation: Government Digital Service (Central-Government)

We used Natural Language Processing to convert the text content of each page into a machine-readable form. We combined this with the page metadata (such as date published and department) to learn patterns that predict which sub-branches an untagged page should be organised under. Because a GOV.UK page can be tagged to more than one sub-branch, we implemented a multilabel model, which can assign several sub-branches to a single page.
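The multilabel setup described above can be illustrated with a minimal sketch. It is not the model GDS built: the label names, the vocabulary, and the helper functions (tokenize, bag_of_words, multi_hot, predict_tags) are all hypothetical, and a simple bag-of-words count stands in for whatever text representation the deep learning model used. The point it shows is the multilabel decision rule: each sub-branch gets its own independent sigmoid score, so a page can be assigned to several sub-branches at once.

```python
import math
import re
from collections import Counter

# Hypothetical sub-branch names, for illustration only.
LABELS = ["education/schools", "education/funding", "health/public-health"]


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def bag_of_words(text, vocabulary):
    """Count how often each vocabulary word occurs in the text."""
    counts = Counter(tokenize(text))
    return [counts[word] for word in vocabulary]


def multi_hot(tags, labels=LABELS):
    """Encode a set of tags as a 0/1 vector with one position per label."""
    return [1 if label in tags else 0 for label in labels]


def sigmoid(z):
    """Map a raw score to a value between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_tags(scores, labels=LABELS, threshold=0.5):
    """Keep every label whose independent sigmoid score reaches the threshold.

    Each label is judged on its own, so zero, one, or many labels can be
    returned. A softmax followed by argmax would always return exactly one.
    """
    return [label for label, s in zip(labels, scores) if sigmoid(s) >= threshold]


# Assumed toy vocabulary and page text.
vocabulary = ["school", "funding", "vaccine"]
features = bag_of_words("School funding: how school budgets are set", vocabulary)

# Training target for a page tagged to two sub-branches.
target = multi_hot({"education/schools", "education/funding"})

# Raw per-label scores as a trained model might produce them (made up here).
predicted = predict_tags([2.0, 0.3, -1.5])
```

In a real pipeline the feature vector would be extended with encoded metadata, such as the publishing department, and the per-label scores would come from a trained model rather than being written by hand.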

Additional information

Source Open Innovation Regione Lombardia
Web site https://www.gov.uk/
Start/end date 2018 -
Still active?
