Creating machine learning-based text extraction models

Use Pega Platform machine-learning capabilities to create text extraction models for named entity recognition.

  • Make sure that you can access the Analytics Center. You can do this by starting the pyDecisionAnalytics portal. Add this portal to the list of portals in your access group. For more information see, Access Group form - Completing the Definition tab.
  • Make sure that the system locale language settings are set to UTF-8.
By using models that are based on the Conditional Random Fields (CRF) algorithm, you can extract information from unstructured data and label it as belonging to a particular group. For example, if the document that you want to analyze mentions Galaxy S8, the text extraction model classifies that as Phone.