Modeling execution and predicting performance in multi-GPU environments