Migrate Cineast's VisualTextCoEmbedding #101

ppanopticon · 2024-08-26T07:24:18Z

Task Description

In #94 we discussed features that should be ported over from Cineast. One of these features was the VisualTextCoEmbedding feature that can be used to query image and video content using textual queries.

While the current implementation is arguably worse than OpenCLIP, it has one advantage: it uses information not just from the keyframe but also from the surrounding frames and may therefore spot things that other models miss.

The goal of this issue is therefore to migrate the VisualTextCoEmbedding feature so that it can be used until we have a better alternative.

Dependencies

In light of #19, we ought to check whether this should be part of the FES infrastructure or TorchServe.

Boundary Conditions

None

ppanopticon added the enhancement (New feature or request) label Aug 26, 2024

ppanopticon added this to the Release Candidate #2 milestone Aug 26, 2024

ppanopticon mentioned this issue Aug 26, 2024

Migration of Cineast Features #94

Closed

25 tasks
