Probabilistic von Mises–Fisher Representation Learning for Few-Shot Remote Sensing Scene Classification
Multimodal Model Based on Contrastive Language-Image Pretraining for Micro-Expression Recognition
Enhancing Multimodal Recommendation via Contrastive Self-Supervised Modality-Preserving Learning
Optimising Few-Shot Class-Incremental Learning for Fine-Grained Visual Recognition
DCA-CL: Enhancing Multimodal Emotion Recognition via Dual Cross Attention and Contrastive Learning
Enhancing Action Recognition via Dynamic Cross-Frame Differential Modeling
FiCo-ITR: bridging fine-grained and coarse-grained image-text retrieval for comparative performance analysis
Pedestrian Face Recognition using CC Footage
Multimodal scene-graph matching for cheapfakes detection
Concept-Based and Embedding-Based Models in Lifelog Retrieval: An Empirical Comparison of Performance
Learn more about International Journal of Multimedia Information Retrieval