Challenges of MPI Processing on Many Core Architectures We are looking at the problem of MPI processing on the Xeon Phi. Because these cores run at a lower clock speed, with a reduced feature set and MPI thread-multiple actually reduces performance, MPI performance drops when running on the Xeon Phi. We examine this problem, and propose a novel solution to this problem using replication.