结合关键帧提取的视频-文本跨模态实体分辨双重编码方法
Dual Encoding Integrating Key Frame Extraction for Video-text Cross-modal Entity Resolution
{{custom_ref.label}} |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
/
〈 |
|
〉 |