News
Large language models (LLMs) excel at using textual reasoning to understand the context of a document and provide a logical ...
Video action segmentation aims to identify and localize actions. Existing models have achieved impressive performance with pre-extracted frame-level features, but this may limit zero-shot learning and ...