Mint: Realizing CUDA Performance in 3D Stencil Methods With Annotated C