A Multimodal Deep Learning Framework