Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

A Survey on Vectorization on Intel Xeon Phi Coprocessor: Technique for Performance Optimization


Affiliations
1 SP Pune University, India
2 Department of Computer Technology, PICT, Pune, India
3 CAE Group, CDAC, Pune, India
     

   Subscribe/Renew Journal


In computer science, vectorization is the process of converting an algorithm from a scalar implementation to a vector process. It does an operation on all the pairs of operands stored in SIMD registers at a time. This is totally different from task parallelism using MPI, OpenMP or alternative parallel libraries wherever extra cores or nodes are added to take care of information belonging to separate tasks placed on completely different cores or nodes. It adds a form of parallelism to software. It is a well-known technique for performance optimization. It makes the full use of the features provided by the hardware for parallelism. We discuss two vectorization techniques in this paper which are The Loop Unrolling technique and the Two Way Vectorization technique. The aim of this work is to study the work done until now on vectorization and its related techniques for performance optimization.

Keywords

Vectorization, SIMD, Parallelism.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 157

PDF Views: 2




  • A Survey on Vectorization on Intel Xeon Phi Coprocessor: Technique for Performance Optimization

Abstract Views: 157  |  PDF Views: 2

Authors

Niraj J. Tiwari
SP Pune University, India
Pujashree S. Vidap
Department of Computer Technology, PICT, Pune, India
Pallavi G. Gavali
CAE Group, CDAC, Pune, India

Abstract


In computer science, vectorization is the process of converting an algorithm from a scalar implementation to a vector process. It does an operation on all the pairs of operands stored in SIMD registers at a time. This is totally different from task parallelism using MPI, OpenMP or alternative parallel libraries wherever extra cores or nodes are added to take care of information belonging to separate tasks placed on completely different cores or nodes. It adds a form of parallelism to software. It is a well-known technique for performance optimization. It makes the full use of the features provided by the hardware for parallelism. We discuss two vectorization techniques in this paper which are The Loop Unrolling technique and the Two Way Vectorization technique. The aim of this work is to study the work done until now on vectorization and its related techniques for performance optimization.

Keywords


Vectorization, SIMD, Parallelism.