Similar Items: Voice Command and Gesture Recognition using Multimodal Deep Learning /