multimodal classification python