Robust And Efficient Deep Learning For Multimedia Generation And Recognition