You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The set of entities corresponding to wires in a DisCoCirc diagram should be controlled in some way to keep the size of the diagram manageable and to avoid the inclusion of insignificant entities that could appear in the document. Using a simple metric such as TF-IDF might be a good first step. The metric can apply on simple noun tokens, or on dependencies derived from a dependency parser for more robust coverage.
The text was updated successfully, but these errors were encountered:
@AnnaNPearson suggested that the nouns that do not make the threshold shouldn't be discarded completely but it would be useful to appear in the diagram as single boxes, interacting with their context. This feature can be offered to the user as an extra option.
The set of entities corresponding to wires in a DisCoCirc diagram should be controlled in some way to keep the size of the diagram manageable and to avoid the inclusion of insignificant entities that could appear in the document. Using a simple metric such as TF-IDF might be a good first step. The metric can apply on simple noun tokens, or on dependencies derived from a dependency parser for more robust coverage.
The text was updated successfully, but these errors were encountered: