Names are a common feature of interest when working with texts at scale. We can use a search function to locate names that we expect to find, but how do we go about searching for all names in the text – even those we do not know to look for?

Named entity recognition (NER) is a natural language processing technique that identifies words and phrases in unstructured text that are likely to be names of people, places, or organizations. In this workshop, we will explore how NER works and apply it to a text corpus using a Python library named spaCy.
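As a rough preview of what this looks like in practice, the sketch below runs a spaCy pipeline over a sentence and collects the entities it finds. The helper names `extract_entities` and `demo`, the sample sentence, and the choice of the `en_core_web_sm` model are illustrative assumptions, not necessarily what the workshop itself uses.

```python
import spacy


def extract_entities(text, nlp):
    """Return (entity text, entity label) pairs that the pipeline finds in `text`."""
    doc = nlp(text)
    # doc.ents holds the spans the pipeline tagged as named entities
    return [(ent.text, ent.label_) for ent in doc.ents]


def demo():
    # Assumption: the small English model was installed beforehand with
    #   python -m spacy download en_core_web_sm
    nlp = spacy.load("en_core_web_sm")
    # Hypothetical sample sentence, chosen only for illustration
    sample = "Ada Lovelace worked with Charles Babbage in London."
    for entity_text, label in extract_entities(sample, nlp):
        print(entity_text, label)
```

Calling `demo()` prints each detected span alongside its label. Which spans are found, and how they are labelled, depends on the model and its version, so results should be checked rather than taken as given.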

