Abstract
A method of constructing a dataset for identifying a plurality of latent concepts in a Natural Language Processing model is provided. The method includes executing a clustering process on a first dataset, preparing a second dataset, defining a hierarchical concept tag-set from the second dataset, and annotating the hierarchical concept tag-set.
| Original language | English |
|---|---|
| Patent number | US2023325426 |
| IPC | G06F 16/ 38 A I |
| Priority date | 7/04/23 |
| Publication status | Published - 12 Oct 2023 |