Scalable heterogeneous CPU-GPU computations for unstructured tetrahedral meshes